An R package for detecting multimodality in discrete data using Bayesian techniques. The approach works in two stages. First, a mixture distribution is fitted on the data using a Sparse Finite Mixture (SFM) MCMC algorithm. The number of mixture components does not have to be specified; it is estimated simultaneously with the mixture weights and components through the SFM approach. Second, the resulting MCMC output is used to calculate the number of modes and their locations. Posterior probabilities are retrieved for both of these quantities providing a powerful tool for mode inference. Currently the package supports a flexible mixture of shifted Poisson distributions. The shifted Poisson is a two-parameter generalisation of the Poisson distribution where the mean can diverge from the variance. More distributions (discrete and continuous) are in the pipeline.
# Installing from github :
# install.packages("devtools") # if devtools is not installed
::install_github("paullabonne/BayesMultiMode") devtools
library(BayesMultiMode)
set.seed(1)
= 0.3
p1 = 1-p1
p2 = 3
kap1 = 0
kap2 = 1
lam1 = 0.5
lam2 = 70
length_data <- c(rpois(length_data*p1, lam1)+kap1, rpois(length_data*p2, lam2)+kap2) simulated_data
# Select DNA data :
data("d4z4")
= d4z4
y
# Or select simulated data :
# y = simulated_data
# Number of MCMC iterations
= 5000
M
# Proportion of draws to discard as burnin
= 0.5
S
# Maximum number of mixture components
= 6 Jmax
#Bayesian estimation
= sfm_mcmc_spmix(y=y,Jmax=Jmax, M=M) sfm_mcmc
## 10 % draws finished. Accept. prob of e0 = 9 percent
## 20 % draws finished. Accept. prob of e0 = 8 percent
## 30 % draws finished. Accept. prob of e0 = 8 percent
## 40 % draws finished. Accept. prob of e0 = 9 percent
## 50 % draws finished. Accept. prob of e0 = 9 percent
## 60 % draws finished. Accept. prob of e0 = 9 percent
## 70 % draws finished. Accept. prob of e0 = 9 percent
## 80 % draws finished. Accept. prob of e0 = 9 percent
## 90 % draws finished. Accept. prob of e0 = 9 percent
## 100 % draws finished. Accept. prob of e0 = 9 percent
#Plots of the estimation output
= plots_mcmc(sfm_mcmc,S)
graphs 3]] graphs[[
4]] graphs[[
## Warning: Removed 41 rows containing missing values (position_stack).
= post_sfm_mcmc(sfm_mcmc,S) post_sfmmcmc
= bayes_mode(post_sfmmcmc$theta_draws_slim,y)
sfm_mcmc_modes $graphs sfm_mcmc_modes
(Basturk, Hoogerheide, and Dijk 2021) (Malsiner-Walli, Fruhwirth-Schnatter, and Grun 2016) (Schaap et al. 2013)