dnbc-scala

Parallel implementation of dynamic naive Bayesian classifier

Download the paper.

Data set type	Average success rate [%]
Discrete	65
Continuous	42
Bivariate	76
Gaussian mixture (without hint)	96
Gaussian mixture (with hint)	99

The average success rate means the average percentage of hidden states inferred correctly.

There are two main reasons for relatively low overall sucess rate:

Property	Value
Number of hidden states	10
Sequence length	200
Observed discrete variables	5
Observed continuous variables	5
Learning set length (#sequences)	1000
Testing set length (#sequences)	200
Max Gaussians per mixture	3
Transitions per hidden state	5

Property	Workers=1	Workers=2	Workers=4	Workers=8	Workers=15
Learning time speed up	1	1.3	1.5	1.8	2