mixdir: Cluster High Dimensional Categorical Datasets
Scalable Bayesian clustering of categorical datasets. The package implements a hierarchical Dirichlet
(Process) mixture of multinomial distributions. It is thus a probabilistic latent class model (LCM) and can be used
to reduce the dimensionality of hierarchical data and cluster individuals into latent classes. It can automatically
infer an appropriate number of latent classes or find k classes, as defined by the user. The model is based on a
paper by Dunson and Xing (2009) <doi:10.1198/jasa.2009.tm08439>, but implements a scalable variational inference algorithm so that it is
applicable to large datasets. It is described and tested in the accompanying paper by
Ahlmann-Eltze and Yau (2018) <doi:10.1109/DSAA.2018.00068>.
Version: |
0.3.0 |
Depends: |
R (≥ 2.10) |
Imports: |
extraDistr, Rcpp |
LinkingTo: |
Rcpp |
Suggests: |
testthat, tibble, purrr, dplyr, rmutil, pheatmap, mcclust, ggplot2, tidyr, utils |
Published: |
2019-09-20 |
DOI: |
10.32614/CRAN.package.mixdir |
Author: |
Constantin Ahlmann-Eltze
[aut, cre],
Christopher Yau
[ths] |
Maintainer: |
Constantin Ahlmann-Eltze <artjom31415 at googlemail.com> |
License: |
GPL-3 |
URL: |
https://github.com/const-ae/mixdir |
NeedsCompilation: |
yes |
Citation: |
mixdir citation info |
Materials: |
README NEWS |
CRAN checks: |
mixdir results |
Documentation:
Downloads:
Linking:
Please use the canonical form
https://CRAN.R-project.org/package=mixdir
to link to this page.