The third question I asked about the USS motif was whether there is evidence for interactions. My query to the EvolDir list produced three applicable programs. One looked difficult so I left it as a last resort. A second had been written by a colleague (in Fortran! He's an old-fashioned guy (we were post-docs together)). He kindly offered to try running our preliminary sequence set for us, and sent a monster Excel file full of the statistical results, with the 24 significant ones highlighted. There's a strong risk of spurious correlations in this kind of analysis, but the ones he found seem likely to be genuine, as they are almost all between adjacent positions.
In the meantime I'd also been trying out a program that had a lovely simple web interface. But it found only two covarying positions, and these seemed very weak (i.e. their squares on the matrix were only a tiny bit darker than the background. I was attracted to this web program because its matrix display of the results seemed so intuitive, but quickly realized that this simplicity was failing to tell me what I need to know. After a lot of back and forth with a helpful expert (= person who let his email address be linked to the web page) I now have a folder full of the software and associated files (ReadMe, Help), and can begin working out how to run it for myself.
Aaarrgghhhh! It's written in a programming language called GAWK/NAWK. Wikipedia says AWK was a precursor to Perl, and runs in Unix; GAWK is GNU-AWK. Thanks, that's a big help. Mac OS 10.4 doesn't have GAWK, just AWK. I hope Westgrid has GAWK.
- Home
- Angry by Choice
- Catalogue of Organisms
- Chinleana
- Doc Madhattan
- Games with Words
- Genomics, Medicine, and Pseudoscience
- History of Geology
- Moss Plants and More
- Pleiotropy
- Plektix
- RRResearch
- Skeptic Wonder
- The Culture of Chemistry
- The Curious Wavefunction
- The Phytophactor
- The View from a Microbiologist
- Variety of Life
Field of Science
-
-
-
Political pollsters are pretending they know what's happening. They don't.5 weeks ago in Genomics, Medicine, and Pseudoscience
-
-
Course Corrections6 months ago in Angry by Choice
-
-
The Site is Dead, Long Live the Site2 years ago in Catalogue of Organisms
-
The Site is Dead, Long Live the Site2 years ago in Variety of Life
-
Does mathematics carry human biases?4 years ago in PLEKTIX
-
-
-
-
A New Placodont from the Late Triassic of China5 years ago in Chinleana
-
Posted: July 22, 2018 at 03:03PM6 years ago in Field Notes
-
Bryophyte Herbarium Survey7 years ago in Moss Plants and More
-
Harnessing innate immunity to cure HIV8 years ago in Rule of 6ix
-
WE MOVED!8 years ago in Games with Words
-
-
-
-
post doc job opportunity on ribosome biochemistry!9 years ago in Protein Evolution and Other Musings
-
Growing the kidney: re-blogged from Science Bitez9 years ago in The View from a Microbiologist
-
Blogging Microbes- Communicating Microbiology to Netizens10 years ago in Memoirs of a Defective Brain
-
-
-
The Lure of the Obscure? Guest Post by Frank Stahl12 years ago in Sex, Genes & Evolution
-
-
Lab Rat Moving House13 years ago in Life of a Lab Rat
-
Goodbye FoS, thanks for all the laughs13 years ago in Disease Prone
-
-
Slideshow of NASA's Stardust-NExT Mission Comet Tempel 1 Flyby13 years ago in The Large Picture Blog
-
in The Biology Files
Not your typical science blog, but an 'open science' research blog. Watch me fumbling my way towards understanding how and why bacteria take up DNA, and getting distracted by other cool questions.
2 comments:
Markup Key:
- <b>bold</b> = bold
- <i>italic</i> = italic
- <a href="http://www.fieldofscience.com/">FoS</a> = FoS
Subscribe to:
Post Comments (Atom)
Is it definitely necessary to use both programs to test for linkage, or could you just use the results that you already have?
ReplyDeleteI have been pondering something you mentioned about strong consensus sites effacing evidence of linkage - can we have confidence in linkage searches like this across a sequence where strength of motif varies so greatly from position to position?
I think that the difference between AWK and GAWK isn't so crucial. Actually I believe that GNU awk is the prevalent implementation of awk nowadays, and that GNU awk certainly implements all the features of original awk (with the help of special options or just out-of-the box), so it shouldn't represent any problem.
ReplyDeleteI must confess that you have astounding level of interaction with a computer in your research. Here in Russia (at least in the city where I live, ca. 2000 km from Moscow) no head of medical/biological lab would EVER use blogs, let alone Unix command-line tools.