oO(ML Discuss)
Talking about ICML 2010
Spherical Topic Models
by Joseph Reisinger , Bryan Silverthorn , Austin Waters , Raymond Mooney , at ICML 2010
We introduce the Spherical Admixture Model (SAM), a Bayesian topic model for arbitrary L2 normalized data. SAM maintains the same hierarchical structure as Latent Dirichlet Allocation (LDA), but models documents as points on a high-dimensional spherical manifold, allowing a natural likelihood parameterization in terms of cosine distance. Furthermore, SAM topics are capable of assigning negative weight to terms and can model word absence/presence unlike previous models. Performance is evaluated empirically both subjectively as a topic model using human raters and across several disparate classification tasks, from natural language processing and computer vision.
Download PDF
blog comments powered by Disqus