but only in Statistics. (I promise, this is the last Bayesian post, at least for a while.)
I've always thought that 'probability' and 'likelihood' were synonyms, but yesterday I learned that in Statistics they have distinct and complementary meanings. Unfortunately it's hard to define either of them without using the other, so I'll use the word 'chance' and clarify with examples.
Consider that you have some data and that you have a hypothesis about the reality that produced this data. For example, the data could be that plating the same volume of bacteria on a novobiocinagar plate and a plainagar plate gave 43 and 321 colonies respectively, and your hypothesis about reality is that 15.0% of the cells in the culture are able to grow in the presence of novobiocin (are NovR).
Likelihood (as defined for statistical work) is the chance that a real culture with 15.0% NovR cells would have given these numbers of cells when that volume was plated. More generally, it's the chance that the reality you've hypothesized (often your 'null hypothesis') could have produced the particular data you got. This is what classic 'frequentist' statistical methods deal with. Phylogenetic methods using 'maximum likelihood' presumably take this approach.
Probability (as defined for statistical work) reasons the other way around. It's the chance that the culture really has 15.0% NovR cells, given that your plating experiment produced 43 NovR colonies out of 321 total colonies. More generally it's the chance that the reality you're considering is true, given the data you have. This is what Bayesian methods deal with. The phylogenetic software 'Mr. Bayes' presumably takes this approach.
For now I'm not going to worry about why this might matter.
 Home
 Angry by Choice
 Catalogue of Organisms
 Chinleana
 Doc Madhattan
 Games with Words
 Genomics, Medicine, and Pseudoscience
 History of Geology
 Moss Plants and More
 Pleiotropy
 Plektix
 RRResearch
 Skeptic Wonder
 The Culture of Chemistry
 The Curious Wavefunction
 The Phytophactor
 The View from a Microbiologist
 Variety of Life
Field of Science





FTC steps in where FDA fears to tread  on homeopathy1 week ago in Genomics, Medicine, and Pseudoscience






Does variation in sequencing coverage help explain apparent variation in recombination?3 months ago in RRResearch


Harnessing innate immunity to cure HIV4 months ago in Rule of 6ix

WE MOVED!4 months ago in Games with Words


Aetosaurs: New Phylogenetic Analysis, New Taxon; and New Technique to Analyze Incongruent Character Datasets10 months ago in Chinleana





post doc job opportunity on ribosome biochemistry!1 year ago in Protein Evolution and Other Musings

Growing the kidney: reblogged from Science Bitez1 year ago in The View from a Microbiologist

Blogging Microbes Communicating Microbiology to Netizens2 years ago in Memoirs of a Defective Brain



The Lure of the Obscure? Guest Post by Frank Stahl4 years ago in Sex, Genes & Evolution


Lab Rat Moving House5 years ago in Life of a Lab Rat

Goodbye FoS, thanks for all the laughs5 years ago in Disease Prone


Slideshow of NASA's StardustNExT Mission Comet Tempel 1 Flyby5 years ago in The Large Picture Blog

in The Biology Files
Not your typical science blog, but an 'open science' research blog. Watch me fumbling my way towards understanding how and why bacteria take up DNA, and getting distracted by other cool questions.
4 comments:
Markup Key:
 <b>bold</b> = bold
 <i>italic</i> = italic
 <a href="http://www.fieldofscience.com/">FoS</a> = FoS
Subscribe to:
Post Comments (Atom)
"For now I'm not going to worry about why this might matter."
ReplyDeleteIt may matter even less than you think, at least in phylogenetics. MrBayes does use Bayesian statistics to infer phylogenetic relationships among taxa. However, most people give the program a "flat" prior, which basically makes it a Maximum Likelihood analysis.
So why even use MrBayes? Why not just use a ML method? ML analyses are notoriously computationally intensive and that is just to produce one tree and it is best to resample the data and produce a distribution of trees to calculate statistical support for branches (bootstrap).
However, bootstrapping is not needed for MrBayes because the program outputs the probability of each node. So basically, MrBayes saves time and produces similar results to ML. And you can use a lot of sophisticated evolutionary models in your analysis.
Sorry if this bores you......but now you have me thinking of Bayesian versus Maximum Likelihood and since I am publishing a paper using these methods, I keep asking.........what is the point?
I think you could put your general definition even ore simply: probability is the chance of an event/data given certain values of parameters and likelihood is the chance of certain values of parameters given the event/data.
ReplyDeleteJust to add, likelihood function need not define a probability measure i.e. the integral need not add up to 1.
ReplyDeleteI think your definitions are the wrong way around. From wikipedia:
ReplyDelete"In nontechnical parlance, "likelihood" is usually a synonym for "probability" but in statistical usage, a clear technical distinction is made. One may ask "If I were to flip a fair coin 100 times, what is the probability of it landing headsup every time?" or "Given that I have flipped a coin 100 times and it has landed headsup 100 times, what is the likelihood that the coin is fair?" but it would be improper to switch "likelihood" and "probability" in the two sentences."
Probability talks about the chances of the observation, likelihood refers to the chances of the parameters being correct given an observation.