BayesianGaussianMixture fit takes 1000 times as long on a subset - why? #25476
russell-at-qudo
started this conversation in
General
Replies: 0 comments
I'm using BayesianGaussianMixture to segment my data, and it works really well. The issue is that I want to test relabel consistency, which I am doing by creating a stratified train/test split (stratified by segment label). When I do this, fitting the training set takes around 1000 times as long as fitting the full dataset with the same model parameters, and I don't know why.
Fitting the full dataset takes around 1.2 seconds for my data.
Fitting the training set takes about 1400 seconds.
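For anyone who wants to try the comparison, here is a minimal sketch of the setup described above. It is not my actual code or data: the synthetic `make_blobs` data, the `timed_fit` helper, and every parameter value (`n_components=4`, `max_iter=200`, `test_size=0.25`) are assumptions chosen for illustration. It also prints `n_iter_` and `converged_` for both fits, since a large difference in iteration count is one thing worth ruling out, though it may not be the cause here.

```python
import time
import warnings

import numpy as np
from sklearn.datasets import make_blobs
from sklearn.mixture import BayesianGaussianMixture
from sklearn.model_selection import train_test_split


def timed_fit(X, **params):
    """Hypothetical helper: fit a BayesianGaussianMixture and time the fit."""
    model = BayesianGaussianMixture(**params)
    start = time.perf_counter()
    with warnings.catch_warnings():
        # Silence ConvergenceWarning so a non-converged fit still reports timing.
        warnings.simplefilter("ignore")
        model.fit(X)
    return model, time.perf_counter() - start


# Synthetic stand-in for the real dataset (assumption).
X, _ = make_blobs(n_samples=600, centers=4, n_features=5, random_state=0)

# Illustrative model parameters (assumption); both fits share them.
params = dict(n_components=4, max_iter=200, random_state=0)

# 1) Fit on the full dataset and derive segment labels.
full_model, full_time = timed_fit(X, **params)
labels = full_model.predict(X)

# Stratification needs at least 2 members per label, so drop any singleton
# segments first (a guard added for this sketch only).
counts = np.bincount(labels)
keep = counts[labels] >= 2
X_kept, labels_kept = X[keep], labels[keep]

# 2) Stratified train/test split by segment label.
X_train, X_test, y_train, y_test = train_test_split(
    X_kept, labels_kept, test_size=0.25, stratify=labels_kept, random_state=0
)

# 3) Fit on the training subset with the same parameters.
train_model, train_time = timed_fit(X_train, **params)

for name, model, elapsed in [
    ("full", full_model, full_time),
    ("train", train_model, train_time),
]:
    print(
        f"{name}: {elapsed:.3f}s, n_iter_={model.n_iter_}, "
        f"converged_={model.converged_}"
    )
```

If the training fit reports `converged_=False` or a much larger `n_iter_`, the time difference is coming from the number of iterations rather than from the cost of each one.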
Any ideas?