On the use of Bernoulli mixture models for text classification

Author:Juan, A; Vidal, E

Article Title:On the use of Bernoulli mixture models for text classification

Abstract:
Mixture modelling of class-conditional densities is a standard pattern recognition technique. Although most research on mixture models has concentrated on mixtures for continuous data, emerging pattern recognition applications demand extending research efforts to other data types. This paper focuses on the application of mixtures of multivariate Bernoulli distributions to binary data. More concretely, a text classification task aimed at improving language modelling for machine translation is considered. (C) 2002 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.

Keywords: mixture models; EM algorithm; data categorization; multivariate binary data; text classification; multivariate Bernoulli distribution

DOI: 10.1016/S0031-3203(01)00242-4

Source:PATTERN RECOGNITION

Welcome to correct the error, please contact email: humanisticspider@gmail.com