The postdoc derailed our consideration of contamination in his 'uptake' pool of DNA fragments by raising the issue of errors in the Illumina sequencing. We had discussed this issue long ago, before we had any data, and then forgot about it in the rush to analyze the results. How embarassing!
The expected level of sequencing errors is somewhere between 0.1 and 1%. We have two sets of estimates from our data, but they're very discordant.
One set of estimates comes from the frequency of sequences in the uptake pool that differ from the 225158 perfect consensus sequences at only one of the 31 degenerate positions. At the positions that are most important for uptake, positions 7 and 8, there are only 215 and 156 such fragments. If we make the extreme assumption that they all arose by sequencing errors of perfect-consensus fragments, the error rate must be less than 0.1% (If we allow some contamination and/or some uptake of the mismatched fragments the error rate would be even lower.
The other set of estimates comes from the control non-degenerate bases that precede (4) and follow (5) the degenerate sequence. We know what the base should be at these positions, so we can just count the differences. These are shockingly high; for different positions they range from 0.5% to 9.1%. Because of a weird pattern in the identity of the error bases, we suspect that these values have been confounded by misalignment problems, arising because the oligo synthesis or the sequencing erroneously skipped one or more positions. We'll try to sort this out this morning by looking directly at the non-degenerate positions in a few of these reads. If the differences at the 10 control positions are really due to base-identification errors we should see them in almost half of the reads.
- Home
- Angry by Choice
- Catalogue of Organisms
- Chinleana
- Doc Madhattan
- Games with Words
- Genomics, Medicine, and Pseudoscience
- History of Geology
- Moss Plants and More
- Pleiotropy
- Plektix
- RRResearch
- Skeptic Wonder
- The Culture of Chemistry
- The Curious Wavefunction
- The Phytophactor
- The View from a Microbiologist
- Variety of Life
Field of Science
-
-
-
Political pollsters are pretending they know what's happening. They don't.5 weeks ago in Genomics, Medicine, and Pseudoscience
-
-
Course Corrections6 months ago in Angry by Choice
-
-
The Site is Dead, Long Live the Site2 years ago in Catalogue of Organisms
-
The Site is Dead, Long Live the Site2 years ago in Variety of Life
-
Does mathematics carry human biases?4 years ago in PLEKTIX
-
-
-
-
A New Placodont from the Late Triassic of China5 years ago in Chinleana
-
Posted: July 22, 2018 at 03:03PM6 years ago in Field Notes
-
Bryophyte Herbarium Survey7 years ago in Moss Plants and More
-
Harnessing innate immunity to cure HIV8 years ago in Rule of 6ix
-
WE MOVED!8 years ago in Games with Words
-
-
-
-
post doc job opportunity on ribosome biochemistry!9 years ago in Protein Evolution and Other Musings
-
Growing the kidney: re-blogged from Science Bitez9 years ago in The View from a Microbiologist
-
Blogging Microbes- Communicating Microbiology to Netizens10 years ago in Memoirs of a Defective Brain
-
-
-
The Lure of the Obscure? Guest Post by Frank Stahl12 years ago in Sex, Genes & Evolution
-
-
Lab Rat Moving House13 years ago in Life of a Lab Rat
-
Goodbye FoS, thanks for all the laughs13 years ago in Disease Prone
-
-
Slideshow of NASA's Stardust-NExT Mission Comet Tempel 1 Flyby13 years ago in The Large Picture Blog
-
in The Biology Files
Not your typical science blog, but an 'open science' research blog. Watch me fumbling my way towards understanding how and why bacteria take up DNA, and getting distracted by other cool questions.
1 comment:
Markup Key:
- <b>bold</b> = bold
- <i>italic</i> = italic
- <a href="http://www.fieldofscience.com/">FoS</a> = FoS
Subscribe to:
Post Comments (Atom)
Rosie, I haven't quite understand what you are doing enough to comment specifically but I would direct you to this paper http://nar.oxfordjournals.org/content/early/2011/05/14/nar.gkr344.full and ensure you take into account Illumina systematic sequencing errors that were present until the very latest chemistry update with GGC / GGCxG motifs.
ReplyDeleteMore info here http://seqanswers.com/forums/showthread.php?t=4883