« Data and Web 2.0: "many eyes" from IBM | Main | Summarizing election results using average district vote or total vote »
January 24, 2007
A new causality blog
A group of from University of California in Los Angeles, including the popular author of books on Bayesian networks (sometimes referred to as belief networks or as graphical models, as they aren't Bayesian in the Bayesian statistics sense) and causality Judea Pearl, have set up a new blog on causality. Their approach to causality is based on probability theory with random variables and operators. For a taste of it, see "Causality is undefinable" or "The meaning of counterfactuals".
While it takes the form of a blog, the system is more like a help line. The good stuff is often in the comments.
Posted by Aleks at January 24, 2007 8:33 PM
Trackback Pings
TrackBack URL for this entry:
http://www.stat.columbia.edu/~cook/movabletype/mt-tb.cgi/782
Comments
Posted by: Andrew
at January 24, 2007 9:34 PM.
Nice link. BTW, Alexs, what do you mean by "they aren't Bayesian in the Bayesian statistics sense" ? Thanks.
Posted by: PierreD at January 29, 2007 4:26 AM.
Pierre, Bayesian statistics is usually about P(Model|Data) \propto P(Model)P(Data|Model). A lot of Bayesian non-statistics stuff is in the context of already having a probabilistic model, or structuring it by factorizing the joint probabilistic model. On the other hand, you could view Bayesian statistics as a special case of Bayes rule applications.
What one should note is that not all people mean the same thing when they say "Bayesian".
Posted by: Aleks
at January 30, 2007 8:21 PM.
Aleks,
I disagree with your comment. I think Bayesian statistics is usually about p(parameters|data) and p(predictions|data), conditional on an assumed model, with the model occasionally improved (for example, after seeing a problem via predictive checking).
Posted by: Andrew
at January 30, 2007 8:32 PM.
Andrew, by "Model" I do actually mean the parameters and assumptions associated with it. I agree with the rest of your comment.
Posted by: Aleks at January 30, 2007 8:43 PM.
Thanks. Actually I was asking this question to better understand the point of view of your community (statisticians). In mine (robotics), a lot of researchers are updating probabilities without the P(param|data, model) framework in mind. They often do *not* consider parameters as random variables of the model.
As regards the different definitions of bayesians, We discussed this last year in a post :
http://emotion.inrialpes.fr/~dangauthier/blog/2006/03/14/which-bayesian-are-you/
Posted by: PierreD at January 31, 2007 5:01 AM.
Pierre, actually, another concise definition I've heard from Neil Lawrence and liked very much but forgotten to mention was precisely that Bayesians consider the parameters to be random variables.
Posted by: Aleks
at January 31, 2007 9:40 AM.
Yes, indeed this a definition I like, especially because it underlines the subjective interpretation of probabilities (degree of belief of an agent). Thank you.
Posted by: PierreD at February 1, 2007 4:47 AM.
There is an 'objective' way of interpreting those probabilities too, and I've written about it in my Modelling Modelled: it would be unscientific to reject a model that could plausibly explain the data. For that matter, many probabilistic models explain the data, so we cannot zero in on the true one: we can only guess about what could be the true one. And picking the best one would be quite arbitrary and easily affected by coincidences in the data.
In the second step, when you're making predictions and no longer being a scientist, you do have to pick what you're going to predict them with. The idea of averaging over the posterior is one approach, but I can understand those who don't like this and swear by optimization and regularization approaches to finding the equivalent of a MAP.
Posted by: Aleks at February 1, 2007 9:35 AM.
