Yesterday I worked out a way to nudge the Gibbs motif sampler into finding the Neisseria meningitidis DUS (their term for their uptake signal sequence). Even though the DUS is present in Neisserial genomes even more frequently than the H. influenzae USS is in its genome, the sampler couldn't find it without prompting. This may be because it's much shorter than the USS (only 12 contiguous bp vs 22 bp spread over 29 positions), or for some other reason I don't understand.
I didn't want to give the sampler a prior file specifying the pattern to look for, so instead I added two lines of fake sequence with a very high frequency of the DUS to the start of the genome file. This 'seed' was enough to get the sampler started on the right motif. Once it's started it has no trouble finding the DUS, and I can later delete the seeded DUSs from the list it generates.
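The seeding trick can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the exact procedure used here: the function names, the number of copies per line, the spacer length, and the toy genome are all hypothetical, and the 12-mer below is the commonly reported DUS, which this post does not spell out.

```python
import random

def make_seed_lines(motif, copies_per_line=5, spacer_len=4, n_lines=2, rng_seed=0):
    """Build fake sequence lines densely packed with copies of `motif`.

    Each copy is followed by a short random spacer so the seed is not a
    perfect tandem repeat. All defaults are illustrative guesses.
    """
    rng = random.Random(rng_seed)
    lines = []
    for _ in range(n_lines):
        chunks = []
        for _ in range(copies_per_line):
            spacer = "".join(rng.choice("ACGT") for _ in range(spacer_len))
            chunks.append(motif + spacer)
        lines.append("".join(chunks))
    return lines

def add_seed(fasta_lines, seed_lines):
    """Put the seed lines at the start of the sequence.

    Assumes a single-record FASTA file: if the first line is a '>' header,
    the seed goes right after it; otherwise it goes at the very top.
    """
    fasta_lines = list(fasta_lines)
    if fasta_lines and fasta_lines[0].startswith(">"):
        return fasta_lines[:1] + list(seed_lines) + fasta_lines[1:]
    return list(seed_lines) + fasta_lines

# Assumption: the commonly reported 12-bp DUS; swap in whatever motif you need.
DUS = "ATGCCGTCTGAA"
seed = make_seed_lines(DUS)

# Toy stand-in for a genome file, one string per line.
genome = [">toy_genome", "ACGTACGTACGT", "TTGACCA"]
seeded = add_seed(genome, seed)
```

Because the seed occupies a known stretch at the start of the sequence (here the first two lines), any sites the sampler reports inside that stretch can be recognised by position and dropped from the final list.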
This morning I obtained the A. pleuropneumoniae genome sequence my collaborators have been working with and split it into pieces. I then generated reverse complements of both it and the N. meningitidis genome, and combined each genome's forward and reverse-complement sequences into a single 'F+RC' file for searching. I did this because I need the sampler to search both strands, and (I think) I have better control if I tell it to search just the sequence I've given it. The A. pleuropneumoniae USS is very similar to the H. influenzae USS but even longer, so I did test runs with the 'prior' masking file I'd used for H. influenzae to make sure everything worked.
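For anyone wanting to reproduce the 'F+RC' preparation, here is a minimal Python sketch. It assumes the genome is already in memory as a plain string; the piece length, the record naming scheme, and the function names are hypothetical choices for illustration, not what was actually used.

```python
# Translation table for complementing DNA; N (unknown base) maps to itself.
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")

def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]

def split_into_pieces(seq, piece_len):
    """Cut a sequence into consecutive pieces of at most `piece_len` bases."""
    return [seq[i:i + piece_len] for i in range(0, len(seq), piece_len)]

def forward_plus_rc(seq, piece_len):
    """Return (name, sequence) records: all forward pieces, then all pieces
    of the reverse-complement strand, ready to write out as one file."""
    records = []
    for i, piece in enumerate(split_into_pieces(seq, piece_len), start=1):
        records.append((f"F_{i}", piece))
    for i, piece in enumerate(split_into_pieces(reverse_complement(seq), piece_len), start=1):
        records.append((f"RC_{i}", piece))
    return records

def to_fasta(records):
    """Format (name, sequence) records as FASTA text."""
    return "".join(f">{name}\n{seq}\n" for name, seq in records)

# Toy example with a 10-base 'genome' cut into 4-base pieces.
toy = "AACGTTGCAT"
fasta_text = to_fasta(forward_plus_rc(toy, piece_len=4))
```

Giving the sampler both strands explicitly means every site it reports maps to a named forward or reverse-complement piece, rather than depending on how the program itself handles strand orientation.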
I did this and all my other tests using only 10% of the genome and only one orientation, because I wanted them to run very fast and because the guys who manage the computer cluster want all long runs to be entered through their 'Fair Share' queueing system. I've now successfully queued requests for full-genome searches, but I don't expect to get the results until tonight or tomorrow.
I also emailed my collaborators to let them know I'm finally back working on this project. The PI is on vacation, but the bioinformatician has been taking advantage of his absence to work full time on it! She's going to send me her new data and rewrite in a few days, so I'm not going to do any work on the manuscript until then. I could go ahead and do Gibbs analysis of all the genomes we might want to consider, but I think I should wait to see how the three main foci of our work (H. influenzae, A. pleuropneumoniae and N. meningitidis) fit into the manuscript.
1 comment:
the bioinformatician has been taking advantage of his absence to work full time on it
Oh, that sounds so familiar. Bioinformaticians enjoy working on numerous, fun side-projects far more than what the stuffy old PI thinks is important :)