One of the grad students and I continue to struggle with finding the best way to represent her results.
The results take the form of gel images - autoradiograms of positions of bands in the gel, representing the sizes of radioactive RNA fragments. But which kinds of bands convey significant information depends on the enzyme that's been used and on the pattern seen in the 'ladder' (unfolded RNA, the control) lane. But the images of the gels are too complex for the reader to make sense of without a lot of effort, and we present a typical one only to reassure the reader that the analysis was competently done. Thus we need to develop a representation that summarizes the gel results.
Sometimes the presence of a band in the 'structure' (folded RNA, the test) lane is significant information, telling us that the folding of the RNA lets the enzyme cut at a place that it couldn't in the unfolded RNA. Other times (different position or different enzyme) the presence of a band tells us nothing. Sometimes the absence of a band tells us that the folding makes that part of the RNA inaccessible, but sometimes it is just because that enzyme can't ever cut at that kind of position. To make matters worse, the gel results, though reasonably reproducible, are not always exactly the same, probably due to slight and unavoidable fluctuations in temperatures and volume measurements.
So we want our representation to convey both positive information and lack of information. One representation needs to say "These bases of the RNA have this cutting pattern with this enzyme, and these other bases have that pattern" and another needs to say "These bases of the RNA have this structure and these other bases have that pattern".
We're still working on it.
Modular drug design software?
7 hours ago in The Curious Wavefunction