The theory that would not die

And you thought that Bayesian statistics were just for inferring phylogenies, species distribution, population genetics parameters, and various kinds of equally arcane social science type stuff. Well, think again.

The review in the New Scientist confuses Bayes' Theorem[1] and Bayesian inference in a way that I presume the book doesn't, but it concludes:

[T]o have crafted a page-turner out of the history of statistics is an impressive feat.
Indeed. I can imagine it being a page-turner for a stats geek like me, but David Robson doesn't sound like a stats geek. If Sharon Bertsch McGrayne really turned a history of Bayesian inference into something that more normal people find interesting, she's accomplished a task many of us would envy.

If you're wondering what the review (I hope not the book) got wrong about Bayes' Theorem versus Bayesian inference, read on.
Bayes' Theorem is exactly what it says it is: a theorem that no one disputes. It concerns conditional probabilities. Here's how it goes. The probability that event B happens, given that we already know event A occurred, is

P(B|A) = P(B and A)/P(A)
That's the definition of conditional probability. But if that's true for B given A, this is true for A given B:

P(A|B) = P(A and B)/P(B)
Since P(A and B) = P(B and A),

P(B|A)P(A) = P(A|B)P(B)
and

P(B|A) = P(A|B)P(B)/P(A)
Everybody who understands probability would agree with that. That's why they call it a theorem.
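
For anyone who would rather check the algebra with numbers, here is a minimal sketch in Python. The joint probabilities are made up purely for illustration; the only point is that the definition of conditional probability and Bayes' Theorem give the same answer.

```python
from fractions import Fraction

# Hypothetical joint probabilities for two events A and B (made-up numbers).
p_a_and_b = Fraction(3, 20)   # P(A and B)
p_a_not_b = Fraction(5, 20)   # P(A and not B)
p_b_not_a = Fraction(2, 20)   # P(B and not A)

# Marginal probabilities, obtained by summing the joint probabilities.
p_a = p_a_and_b + p_a_not_b   # P(A)
p_b = p_a_and_b + p_b_not_a   # P(B)

# Definition of conditional probability, applied in both directions.
p_b_given_a = p_a_and_b / p_a   # P(B|A) = P(B and A)/P(A)
p_a_given_b = p_a_and_b / p_b   # P(A|B) = P(A and B)/P(B)

# Bayes' Theorem: P(B|A) = P(A|B)P(B)/P(A)
bayes_p_b_given_a = p_a_given_b * p_b / p_a

print(p_b_given_a, bayes_p_b_given_a)   # both are 3/8
```

Exact fractions are used instead of floating-point numbers so that the two routes to P(B|A) agree exactly rather than approximately.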

So what's Bayesian inference, and why is it controversial?

Well, suppose we collect a bunch of data, call it X. R.A. Fisher pointed out a long time ago that we can calculate a quantity called the likelihood that is, roughly, the probability of getting that data given a particular probability distribution and the (unknown) parameters that govern that probability distribution, like the mean and the variance of a normal distribution. That's P(X|theta), where theta are the unknown parameters.

Fisher proposed to estimate theta by finding the value(s) of theta that maximize P(X|theta), i.e., that make the data more likely than they would be under any other possible value of theta. That's what we mean when we say that we have a maximum-likelihood estimate of some parameter.
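
As an illustration of maximum likelihood, here is a small Python sketch for the normal-distribution case mentioned above. The data values are invented, and the function names are my own. For a normal distribution the maximizing values have a well-known closed form (the sample mean, and the variance computed with a divisor of n), so the sketch computes those and then confirms that nudging either parameter lowers the log-likelihood. Strictly speaking, for continuous data the likelihood is a probability density rather than a probability, which is part of what "roughly" is covering for.

```python
import math

def normal_log_likelihood(data, mu, sigma2):
    """Log of P(X|theta) for independent normal data, theta = (mu, sigma2)."""
    n = len(data)
    squared_error = sum((x - mu) ** 2 for x in data)
    return -0.5 * n * math.log(2 * math.pi * sigma2) - squared_error / (2 * sigma2)

def normal_mle(data):
    """Closed-form maximum-likelihood estimates of the mean and variance."""
    n = len(data)
    mu_hat = sum(data) / n
    sigma2_hat = sum((x - mu_hat) ** 2 for x in data) / n   # divisor n, not n - 1
    return mu_hat, sigma2_hat

# Hypothetical measurements (made-up numbers).
data = [4.1, 5.3, 4.8, 6.0, 5.2, 4.6]

mu_hat, sigma2_hat = normal_mle(data)
best = normal_log_likelihood(data, mu_hat, sigma2_hat)

# Any other parameter values make the observed data less likely.
for mu, sigma2 in [(mu_hat + 0.1, sigma2_hat), (mu_hat, sigma2_hat * 1.2)]:
    print(normal_log_likelihood(data, mu, sigma2) < best)   # True
```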

Bayesians like me use the same likelihood, but we get our estimates from it in a different way. We think it's more natural and informative to think about the probability that the unknown parameters take on a particular value given the data we already have than it is to think about the probability of getting data we already have given some unknown parameters. To "invert" the likelihood (the probability of the data given the parameter) into the more natural probability of the parameter given the data, we use Bayes' Theorem:

P(theta|X) = P(X|theta)P(theta)/P(X)
That looks pretty easy. So why is it controversial? Because we have to specify P(theta), the prior probability. We have to say what the possible (or likely) values of theta are before we look at any data. That's where the subjectivity (apparently) comes in. I say "apparently" because I think the subjectivity is more apparent than real. Andrew Gelman has a series of papers that delve into this much more deeply and authoritatively than I can. If you're interested, click on the link below.
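
To make the role of the prior concrete, here is a minimal Python sketch that applies the formula above on a grid of theta values. Everything in it is an assumption chosen for illustration: the data are 7 successes in 10 trials with theta as the unknown success probability, and the two priors (one flat, one concentrated near 0.5) are just examples. The same likelihood combined with different priors gives different posteriors, which is exactly where the argument about subjectivity starts.

```python
def grid_posterior(successes, trials, prior):
    """Approximate P(theta|X) on a grid: likelihood times prior, divided by P(X)."""
    thetas = [i / 100 for i in range(1, 100)]
    # Binomial likelihood P(X|theta); the binomial coefficient is omitted
    # because it does not depend on theta and cancels in the division below.
    likelihood = [t ** successes * (1 - t) ** (trials - successes) for t in thetas]
    unnormalized = [lik * prior(t) for lik, t in zip(likelihood, thetas)]
    p_x = sum(unnormalized)   # plays the role of P(X) on this grid
    return thetas, [u / p_x for u in unnormalized]

def flat_prior(theta):
    """Every value of theta is equally plausible before seeing data."""
    return 1.0

def skeptical_prior(theta):
    """Hypothetical prior concentrated near 0.5 (shape of a Beta(10, 10))."""
    return theta ** 9 * (1 - theta) ** 9

def posterior_mean(thetas, posterior):
    return sum(t * p for t, p in zip(thetas, posterior))

# Hypothetical data: 7 successes in 10 trials.
thetas, post_flat = grid_posterior(7, 10, flat_prior)
_, post_skeptical = grid_posterior(7, 10, skeptical_prior)

mean_flat = posterior_mean(thetas, post_flat)
mean_skeptical = posterior_mean(thetas, post_skeptical)
print(mean_flat > mean_skeptical)   # True: the skeptical prior pulls theta toward 0.5
```

With the flat prior the posterior peaks at the same place as the maximum-likelihood estimate (0.7 here), while the prior concentrated near 0.5 pulls the posterior toward 0.5. As more data come in, the likelihood dominates and the two posteriors move closer together.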

Related articles

[1] As you can see, the book refers to "Bayes' rule". Not having read the book yet (I will place an order on Amazon.com to have it delivered to my Kindle as soon as I finish writing this), I presume that by "Bayes' rule" the author means what is more commonly referred to in my world as Bayes' Theorem.

This blog is licensed under a Creative Commons License.

About this Entry

This page contains a single entry by Kent published on July 7, 2011 6:00 AM.
