0
Views
0
CrossRef citations to date
0
Altmetric
Review Article

Hidden Markov model with Pitman-Yor priors for probabilistic topic model

ORCID Icon, , &
Received 26 Jun 2023, Accepted 17 Jun 2024, Published online: 29 Jul 2024
 

Abstract.

Empirical studies of natural language have demonstrated that word frequencies follow power law distributions. However, standard statistical models often fail to capture this property. The Pitman-Yor process (PYP), a Bayesian non parametric model capable of generating power law distributions, has been widely used in probabilistic topic models to handle data with an infinite number of components. However, existing PYP topic models rarely account for the relationships between topics. Hidden Markov models (HMMs) are popular models for modeling topic relationships. To address this limitation, we propose a probabilistic topic model that combines HMM with Pitman-Yor priors. The posterior inference was performed by using variational Bayes methods. We applied our method to text categorization and compared it with two related topic models: the hidden Markov topic model and hierarchical PYP topic model.

Acknowledgments

The authors thank the editor, the associate editor, and a referee for their constructive comments and suggestions that helped to improve the paper.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Notes

Additional information

Funding

Jianjie Guo’s research was partially supported by Shanghai Philosophy and Social Science Planning Project (Grant No. 2023BGL014). Wenchao Xu’s research was partially supported by National Natural Science Foundation of China (Grant Nos. 12101591 and 12201006).

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 1,069.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.