Elbow Test using AIC/BIC for identifying number of clusters using GMM
Clash Royale CLAN TAG#URR8PPP
.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty margin-bottom:0;
up vote
4
down vote
favorite
How to select number of clusters using GMM when the elbow test (AIC/BIC vs n_components) results in a graph like this?
aic bic generalized-moments
New contributor
psangam is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
add a comment |Â
up vote
4
down vote
favorite
How to select number of clusters using GMM when the elbow test (AIC/BIC vs n_components) results in a graph like this?
aic bic generalized-moments
New contributor
psangam is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
6 seems to be what you are looking for, if you are looking for a test there are a few, for ex. using clusGap statistic.
– user2974951
2 hours ago
add a comment |Â
up vote
4
down vote
favorite
up vote
4
down vote
favorite
How to select number of clusters using GMM when the elbow test (AIC/BIC vs n_components) results in a graph like this?
aic bic generalized-moments
New contributor
psangam is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
How to select number of clusters using GMM when the elbow test (AIC/BIC vs n_components) results in a graph like this?
aic bic generalized-moments
aic bic generalized-moments
New contributor
psangam is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
New contributor
psangam is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
edited 1 hour ago
Ferdi
3,39342049
3,39342049
New contributor
psangam is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
asked 2 hours ago
psangam
232
232
New contributor
psangam is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
New contributor
psangam is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
psangam is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
6 seems to be what you are looking for, if you are looking for a test there are a few, for ex. using clusGap statistic.
– user2974951
2 hours ago
add a comment |Â
6 seems to be what you are looking for, if you are looking for a test there are a few, for ex. using clusGap statistic.
– user2974951
2 hours ago
6 seems to be what you are looking for, if you are looking for a test there are a few, for ex. using clusGap statistic.
– user2974951
2 hours ago
6 seems to be what you are looking for, if you are looking for a test there are a few, for ex. using clusGap statistic.
– user2974951
2 hours ago
add a comment |Â
1 Answer
1
active
oldest
votes
up vote
2
down vote
accepted
Welcome to CV!
This plot shows how the AIC and BIC change as a function of the number of clusters. While the AIC continuous to decrease with a larger number of clusters, you can see that the BIC stops decreasing after $k=6$ clusters. For this reason, you could choose $k = 6$.
Another way to choose the 'best' number of clusters is by considering the elbow(s) of the figure. The elbow of a function is a point after which the decrease becomes notably smaller. An elbow is a heuristic, so there is no exact way to determine which value best describes this point. For example, one could argue that the AIC & BIC both stop decreasing as much after $k = 5$ clusters, while someone else might argue that this is after $k = 6$ clusters. You could even argue that the biggest decrease has already happened after $k = 2$ clusters.
Lastly, you don't have to choose any number of clusters just because AIC/BIC/whatever suggested you do so. If you have some a priori reason to assume that there should be $k = 3$ clusters, then that might be a better choice.
In short: An elbow in this context is a heuristic guide to decide the number of clusters if you have no other reason to assume a certain number of clusters.
add a comment |Â
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
up vote
2
down vote
accepted
Welcome to CV!
This plot shows how the AIC and BIC change as a function of the number of clusters. While the AIC continuous to decrease with a larger number of clusters, you can see that the BIC stops decreasing after $k=6$ clusters. For this reason, you could choose $k = 6$.
Another way to choose the 'best' number of clusters is by considering the elbow(s) of the figure. The elbow of a function is a point after which the decrease becomes notably smaller. An elbow is a heuristic, so there is no exact way to determine which value best describes this point. For example, one could argue that the AIC & BIC both stop decreasing as much after $k = 5$ clusters, while someone else might argue that this is after $k = 6$ clusters. You could even argue that the biggest decrease has already happened after $k = 2$ clusters.
Lastly, you don't have to choose any number of clusters just because AIC/BIC/whatever suggested you do so. If you have some a priori reason to assume that there should be $k = 3$ clusters, then that might be a better choice.
In short: An elbow in this context is a heuristic guide to decide the number of clusters if you have no other reason to assume a certain number of clusters.
add a comment |Â
up vote
2
down vote
accepted
Welcome to CV!
This plot shows how the AIC and BIC change as a function of the number of clusters. While the AIC continuous to decrease with a larger number of clusters, you can see that the BIC stops decreasing after $k=6$ clusters. For this reason, you could choose $k = 6$.
Another way to choose the 'best' number of clusters is by considering the elbow(s) of the figure. The elbow of a function is a point after which the decrease becomes notably smaller. An elbow is a heuristic, so there is no exact way to determine which value best describes this point. For example, one could argue that the AIC & BIC both stop decreasing as much after $k = 5$ clusters, while someone else might argue that this is after $k = 6$ clusters. You could even argue that the biggest decrease has already happened after $k = 2$ clusters.
Lastly, you don't have to choose any number of clusters just because AIC/BIC/whatever suggested you do so. If you have some a priori reason to assume that there should be $k = 3$ clusters, then that might be a better choice.
In short: An elbow in this context is a heuristic guide to decide the number of clusters if you have no other reason to assume a certain number of clusters.
add a comment |Â
up vote
2
down vote
accepted
up vote
2
down vote
accepted
Welcome to CV!
This plot shows how the AIC and BIC change as a function of the number of clusters. While the AIC continuous to decrease with a larger number of clusters, you can see that the BIC stops decreasing after $k=6$ clusters. For this reason, you could choose $k = 6$.
Another way to choose the 'best' number of clusters is by considering the elbow(s) of the figure. The elbow of a function is a point after which the decrease becomes notably smaller. An elbow is a heuristic, so there is no exact way to determine which value best describes this point. For example, one could argue that the AIC & BIC both stop decreasing as much after $k = 5$ clusters, while someone else might argue that this is after $k = 6$ clusters. You could even argue that the biggest decrease has already happened after $k = 2$ clusters.
Lastly, you don't have to choose any number of clusters just because AIC/BIC/whatever suggested you do so. If you have some a priori reason to assume that there should be $k = 3$ clusters, then that might be a better choice.
In short: An elbow in this context is a heuristic guide to decide the number of clusters if you have no other reason to assume a certain number of clusters.
Welcome to CV!
This plot shows how the AIC and BIC change as a function of the number of clusters. While the AIC continuous to decrease with a larger number of clusters, you can see that the BIC stops decreasing after $k=6$ clusters. For this reason, you could choose $k = 6$.
Another way to choose the 'best' number of clusters is by considering the elbow(s) of the figure. The elbow of a function is a point after which the decrease becomes notably smaller. An elbow is a heuristic, so there is no exact way to determine which value best describes this point. For example, one could argue that the AIC & BIC both stop decreasing as much after $k = 5$ clusters, while someone else might argue that this is after $k = 6$ clusters. You could even argue that the biggest decrease has already happened after $k = 2$ clusters.
Lastly, you don't have to choose any number of clusters just because AIC/BIC/whatever suggested you do so. If you have some a priori reason to assume that there should be $k = 3$ clusters, then that might be a better choice.
In short: An elbow in this context is a heuristic guide to decide the number of clusters if you have no other reason to assume a certain number of clusters.
answered 2 hours ago
Frans Rodenburg
3,000423
3,000423
add a comment |Â
add a comment |Â
psangam is a new contributor. Be nice, and check out our Code of Conduct.
psangam is a new contributor. Be nice, and check out our Code of Conduct.
psangam is a new contributor. Be nice, and check out our Code of Conduct.
psangam is a new contributor. Be nice, and check out our Code of Conduct.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f368560%2felbow-test-using-aic-bic-for-identifying-number-of-clusters-using-gmm%23new-answer', 'question_page');
);
Post as a guest
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
6 seems to be what you are looking for, if you are looking for a test there are a few, for ex. using clusGap statistic.
– user2974951
2 hours ago