The KDD 2011 Tutorial on Topic Modeling
First off, what on earth is a topic model? Wikipedia puts it this way:
In machine learning and natural language processing, a topic model is a type of statistical model for discovering the abstract “topics” that occur in a collection of documents. An early topic model was probabilistic latent semantic indexing (PLSI), created by Thomas Hofmann in 1999.[1] Latent Dirichlet allocation (LDA), perhaps the most common topic model currently in use, is a generalization of PLSI developed by David Blei, Andrew Ng, and Michael Jordan in 2002.
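This same David Blei gave a tutorial at this year's KDD, and the slides are available and remarkably fresh... have a look. (I should say I don't understand this topic very well myself; if anyone who does would like to volunteer to write a tutorial, please go ahead...)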