Change of address
3 months ago in Variety of Life
Not your typical science blog, but an 'open science' research blog. Watch me fumbling my way towards understanding how and why bacteria take up DNA, and getting distracted by other cool questions.
"I know that most men, including those at ease with problems of the greatest complexity, can seldom accept even the simplest and most obvious truth if it be such as would oblige them to admit the falsity of conclusions which they have delighted in explaining to colleagues, which they have proudly taught to others, and which they have woven, thread by thread, into the fabric of their lives."How are we testing this assumption? By looking for evidence (at the molecular level) of how selection has shaped the processes that cause recombination. There are three such processes, but two of them, conjugation and transduction, can be easily shown to cause recombination only as side effects of genetic parasitism by plasmids and phages respectively. That leaves natural competence (DNA uptake) and its genetic consequence (transformation), which is what we work on. Transformation itself arises when the so-called recombination machinery in the cell acts on DNA the cell has taken up. But this machinery exists not because of selection for making new combinations of genes using DNA brought in from outside, but because of selection for the ability to repair and replicate the cell's own DNA. Because transformation itself is an unselected side effect of the replication and repair machinery, we concentrate on understanding how natural selection has acted on natural competence (the DNA uptake process).Leo Tolstoy
 The Gibbs analysis assigns a score to each site it finds.  With 'expect 2000' most sites have scores close to 1.0, but there are always some (~40 in the set I counted) with scores of zero and a few with scores between 0.5 and 0.9.
The Gibbs analysis assigns a score to each site it finds.  With 'expect 2000' most sites have scores close to 1.0, but there are always some (~40 in the set I counted) with scores of zero and a few with scores between 0.5 and 0.9.  checking out how bad these sites are.
checking out how bad these sites are. I'm working on the revisions for the USS paper, writing up the new analysis of reading frames and constraints.  I put the data in the table on the left; the "Relative to best codons" values estimate how easily the USS-encoded tripeptide would be translated. I'm puzzling over why only 49 USSs are in reading frame A.
I'm working on the revisions for the USS paper, writing up the new analysis of reading frames and constraints.  I put the data in the table on the left; the "Relative to best codons" values estimate how easily the USS-encoded tripeptide would be translated. I'm puzzling over why only 49 USSs are in reading frame A. would expect given how often the tripeptide it specifies appears in the proteome.
 would expect given how often the tripeptide it specifies appears in the proteome. Many USSs are in the protein-coding parts of the genome, and these can be sorted by which of the 6 possible reading frames the respective proteins are encoded.  The first two figures show the relationships of the USSs to the reading frames.
Many USSs are in the protein-coding parts of the genome, and these can be sorted by which of the 6 possible reading frames the respective proteins are encoded.  The first two figures show the relationships of the USSs to the reading frames. lection.
lection. The blue symbols in the graph show the results of the other analysis.  To look for a correlation between codon bias and USS reading frame usage, I needed a crude score indicating how  easily each USS-encoded tripeptide would be translated.  I was able to get a table of codon usage for the H. influenzae proteome from the TIGR website; this gave the percent usage of each codon.  So for each tripeptide I calculated a 'USS-codons' score as the sum of the percentages of the codons specified by USSs in that reading frame, and a 'best-codons' score as the sum of the percentages of the most commonly used codons for its three amino acids.  Then I calculated a 'codon cost' as the ratios of these scores, and multiplied it by 100 so it would fit neatly on the graph.
The blue symbols in the graph show the results of the other analysis.  To look for a correlation between codon bias and USS reading frame usage, I needed a crude score indicating how  easily each USS-encoded tripeptide would be translated.  I was able to get a table of codon usage for the H. influenzae proteome from the TIGR website; this gave the percent usage of each codon.  So for each tripeptide I calculated a 'USS-codons' score as the sum of the percentages of the codons specified by USSs in that reading frame, and a 'best-codons' score as the sum of the percentages of the most commonly used codons for its three amino acids.  Then I calculated a 'codon cost' as the ratios of these scores, and multiplied it by 100 so it would fit neatly on the graph.

 Here's the data.  The top graph shows culture density as a function of time for the three strains I tested.  The first points (t=0) are before the cells were diluted 300-fold.  You can see the 'lag' for the first ~30 minutes, then the cells begin growing exponentially. (You can tell that they're doubling at a constant rate because the points fall on a straight line on this log scale.)  After about 200 minutes growth slows.  The lines aren't joined to the last points because there's a 700 minute gap separating them (this part of the graph isn't to scale).  You can also see that one strain grows slower than the others (red line and points); this is
Here's the data.  The top graph shows culture density as a function of time for the three strains I tested.  The first points (t=0) are before the cells were diluted 300-fold.  You can see the 'lag' for the first ~30 minutes, then the cells begin growing exponentially. (You can tell that they're doubling at a constant rate because the points fall on a straight line on this log scale.)  After about 200 minutes growth slows.  The lines aren't joined to the last points because there's a 700 minute gap separating them (this part of the graph isn't to scale).  You can also see that one strain grows slower than the others (red line and points); this is  the strain whose crp gene is knocked out.
the strain whose crp gene is knocked out.