Parametric tests for non-normal data?

The name of the pictureThe name of the pictureThe name of the pictureClash Royale CLAN TAG#URR8PPP





.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty margin-bottom:0;







up vote
2
down vote

favorite












I'm trying to polish my stats skills and it seems to me that you have either parametric test for normal data, or non-parametric tests for non-normal data.



Looking at the t-test for instance, I don't really see the reason why a similar derivation could not be done for other known distributions. I guess some of them might be hard to do analytically, but since we have computers anyway, that should not be really a problem.



So I'm guessing that there are such tests, but they are not really taught and/or are impractical for one or more reasons . Can someone enlighten me?










share|cite|improve this question







New contributor




fbence is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.















  • 1




    I don't think this is a black or white type of problem. Many parametric tests are robust to minor or even moderate violations of the normality assumption of the data or model residuals and can still be used. It's a matter of quantum!
    – Isabella Ghement
    1 hour ago

















up vote
2
down vote

favorite












I'm trying to polish my stats skills and it seems to me that you have either parametric test for normal data, or non-parametric tests for non-normal data.



Looking at the t-test for instance, I don't really see the reason why a similar derivation could not be done for other known distributions. I guess some of them might be hard to do analytically, but since we have computers anyway, that should not be really a problem.



So I'm guessing that there are such tests, but they are not really taught and/or are impractical for one or more reasons . Can someone enlighten me?










share|cite|improve this question







New contributor




fbence is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.















  • 1




    I don't think this is a black or white type of problem. Many parametric tests are robust to minor or even moderate violations of the normality assumption of the data or model residuals and can still be used. It's a matter of quantum!
    – Isabella Ghement
    1 hour ago













up vote
2
down vote

favorite









up vote
2
down vote

favorite











I'm trying to polish my stats skills and it seems to me that you have either parametric test for normal data, or non-parametric tests for non-normal data.



Looking at the t-test for instance, I don't really see the reason why a similar derivation could not be done for other known distributions. I guess some of them might be hard to do analytically, but since we have computers anyway, that should not be really a problem.



So I'm guessing that there are such tests, but they are not really taught and/or are impractical for one or more reasons . Can someone enlighten me?










share|cite|improve this question







New contributor




fbence is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.











I'm trying to polish my stats skills and it seems to me that you have either parametric test for normal data, or non-parametric tests for non-normal data.



Looking at the t-test for instance, I don't really see the reason why a similar derivation could not be done for other known distributions. I guess some of them might be hard to do analytically, but since we have computers anyway, that should not be really a problem.



So I'm guessing that there are such tests, but they are not really taught and/or are impractical for one or more reasons . Can someone enlighten me?







parametric






share|cite|improve this question







New contributor




fbence is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.











share|cite|improve this question







New contributor




fbence is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.









share|cite|improve this question




share|cite|improve this question






New contributor




fbence is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.









asked 1 hour ago









fbence

1112




1112




New contributor




fbence is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.





New contributor





fbence is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.






fbence is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.







  • 1




    I don't think this is a black or white type of problem. Many parametric tests are robust to minor or even moderate violations of the normality assumption of the data or model residuals and can still be used. It's a matter of quantum!
    – Isabella Ghement
    1 hour ago













  • 1




    I don't think this is a black or white type of problem. Many parametric tests are robust to minor or even moderate violations of the normality assumption of the data or model residuals and can still be used. It's a matter of quantum!
    – Isabella Ghement
    1 hour ago








1




1




I don't think this is a black or white type of problem. Many parametric tests are robust to minor or even moderate violations of the normality assumption of the data or model residuals and can still be used. It's a matter of quantum!
– Isabella Ghement
1 hour ago





I don't think this is a black or white type of problem. Many parametric tests are robust to minor or even moderate violations of the normality assumption of the data or model residuals and can still be used. It's a matter of quantum!
– Isabella Ghement
1 hour ago











1 Answer
1






active

oldest

votes

















up vote
2
down vote













If you know that the distribution is in fact normal, then tests derived from normal data will be optimal. The Z-test (with known variance) achieves just such a property for the 1 parameter normal distribution.



Parametric tests can be derived for any old distribution with maximum likelihood. If data are Poisson, Exponential, etc., a likelihood ratio test with 1 degree of freedom can be done as a two sample test. The link between T-tests and regression with adjustment for a binary group variable can be extended to generalized linear models for two-sample tests with known but not normal data.



It's way more interesting to think about when we don't know the distribution of the data. I mean, if you don't know the mean, what sense is to say, "I know this is a 3 parameter bimodal normal mixture model!"



The T-test has some interesting properties that it is also efficient even for a general class of finite-variance distributions. This is because of the central limit theorem. The sampling distribution of the mean converges to normal even in very small samples. Another way of describing the T-test is an asymptotic test because you are approximating the long-run sampling distribution of the mean.



Some test statistics, especially mins and maxes, do not converge to normal distributions, so tests based on their limiting distributions are actually compared to exponential (Huzurbazar), Gumbell, or extreme value distributions as $n rightarrow infty$.



In general, we would never apply a parametric test to the data of the wrong parametric form, it's just obviously the less optimal solution.






share|cite|improve this answer




















  • So essentially, you are saying, that a) they do exist they just don't specifically have a name and b) if the data is not normal, usually the distribution is not something I know anyway, and c) if it kind of is normal, than the t-test will be okay anyway assuming I have enough data?
    – fbence
    1 hour ago











  • @fbence a) some are named, e.g. Pearson chi-square test, but certainly not for all cases b) we rarely know any distribution in the wild, but the context/science gives us some background, like I know the distribution of neutrophils is well modeled by Negative binomial because it is a concentration. c) Usually yes. The normal approximation converges fast, and even faster when the sample(s) are unimodel, symmetric, concave, and normokurtic.
    – AdamO
    54 mins ago










Your Answer




StackExchange.ifUsing("editor", function ()
return StackExchange.using("mathjaxEditing", function ()
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
);
);
, "mathjax-editing");

StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "65"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
convertImagesToLinks: false,
noModals: false,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);



);






fbence is a new contributor. Be nice, and check out our Code of Conduct.









 

draft saved


draft discarded


















StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f368478%2fparametric-tests-for-non-normal-data%23new-answer', 'question_page');

);

Post as a guest






























1 Answer
1






active

oldest

votes








1 Answer
1






active

oldest

votes









active

oldest

votes






active

oldest

votes








up vote
2
down vote













If you know that the distribution is in fact normal, then tests derived from normal data will be optimal. The Z-test (with known variance) achieves just such a property for the 1 parameter normal distribution.



Parametric tests can be derived for any old distribution with maximum likelihood. If data are Poisson, Exponential, etc., a likelihood ratio test with 1 degree of freedom can be done as a two sample test. The link between T-tests and regression with adjustment for a binary group variable can be extended to generalized linear models for two-sample tests with known but not normal data.



It's way more interesting to think about when we don't know the distribution of the data. I mean, if you don't know the mean, what sense is to say, "I know this is a 3 parameter bimodal normal mixture model!"



The T-test has some interesting properties that it is also efficient even for a general class of finite-variance distributions. This is because of the central limit theorem. The sampling distribution of the mean converges to normal even in very small samples. Another way of describing the T-test is an asymptotic test because you are approximating the long-run sampling distribution of the mean.



Some test statistics, especially mins and maxes, do not converge to normal distributions, so tests based on their limiting distributions are actually compared to exponential (Huzurbazar), Gumbell, or extreme value distributions as $n rightarrow infty$.



In general, we would never apply a parametric test to the data of the wrong parametric form, it's just obviously the less optimal solution.






share|cite|improve this answer




















  • So essentially, you are saying, that a) they do exist they just don't specifically have a name and b) if the data is not normal, usually the distribution is not something I know anyway, and c) if it kind of is normal, than the t-test will be okay anyway assuming I have enough data?
    – fbence
    1 hour ago











  • @fbence a) some are named, e.g. Pearson chi-square test, but certainly not for all cases b) we rarely know any distribution in the wild, but the context/science gives us some background, like I know the distribution of neutrophils is well modeled by Negative binomial because it is a concentration. c) Usually yes. The normal approximation converges fast, and even faster when the sample(s) are unimodel, symmetric, concave, and normokurtic.
    – AdamO
    54 mins ago














up vote
2
down vote













If you know that the distribution is in fact normal, then tests derived from normal data will be optimal. The Z-test (with known variance) achieves just such a property for the 1 parameter normal distribution.



Parametric tests can be derived for any old distribution with maximum likelihood. If data are Poisson, Exponential, etc., a likelihood ratio test with 1 degree of freedom can be done as a two sample test. The link between T-tests and regression with adjustment for a binary group variable can be extended to generalized linear models for two-sample tests with known but not normal data.



It's way more interesting to think about when we don't know the distribution of the data. I mean, if you don't know the mean, what sense is to say, "I know this is a 3 parameter bimodal normal mixture model!"



The T-test has some interesting properties that it is also efficient even for a general class of finite-variance distributions. This is because of the central limit theorem. The sampling distribution of the mean converges to normal even in very small samples. Another way of describing the T-test is an asymptotic test because you are approximating the long-run sampling distribution of the mean.



Some test statistics, especially mins and maxes, do not converge to normal distributions, so tests based on their limiting distributions are actually compared to exponential (Huzurbazar), Gumbell, or extreme value distributions as $n rightarrow infty$.



In general, we would never apply a parametric test to the data of the wrong parametric form, it's just obviously the less optimal solution.






share|cite|improve this answer




















  • So essentially, you are saying, that a) they do exist they just don't specifically have a name and b) if the data is not normal, usually the distribution is not something I know anyway, and c) if it kind of is normal, than the t-test will be okay anyway assuming I have enough data?
    – fbence
    1 hour ago











  • @fbence a) some are named, e.g. Pearson chi-square test, but certainly not for all cases b) we rarely know any distribution in the wild, but the context/science gives us some background, like I know the distribution of neutrophils is well modeled by Negative binomial because it is a concentration. c) Usually yes. The normal approximation converges fast, and even faster when the sample(s) are unimodel, symmetric, concave, and normokurtic.
    – AdamO
    54 mins ago












up vote
2
down vote










up vote
2
down vote









If you know that the distribution is in fact normal, then tests derived from normal data will be optimal. The Z-test (with known variance) achieves just such a property for the 1 parameter normal distribution.



Parametric tests can be derived for any old distribution with maximum likelihood. If data are Poisson, Exponential, etc., a likelihood ratio test with 1 degree of freedom can be done as a two sample test. The link between T-tests and regression with adjustment for a binary group variable can be extended to generalized linear models for two-sample tests with known but not normal data.



It's way more interesting to think about when we don't know the distribution of the data. I mean, if you don't know the mean, what sense is to say, "I know this is a 3 parameter bimodal normal mixture model!"



The T-test has some interesting properties that it is also efficient even for a general class of finite-variance distributions. This is because of the central limit theorem. The sampling distribution of the mean converges to normal even in very small samples. Another way of describing the T-test is an asymptotic test because you are approximating the long-run sampling distribution of the mean.



Some test statistics, especially mins and maxes, do not converge to normal distributions, so tests based on their limiting distributions are actually compared to exponential (Huzurbazar), Gumbell, or extreme value distributions as $n rightarrow infty$.



In general, we would never apply a parametric test to the data of the wrong parametric form, it's just obviously the less optimal solution.






share|cite|improve this answer












If you know that the distribution is in fact normal, then tests derived from normal data will be optimal. The Z-test (with known variance) achieves just such a property for the 1 parameter normal distribution.



Parametric tests can be derived for any old distribution with maximum likelihood. If data are Poisson, Exponential, etc., a likelihood ratio test with 1 degree of freedom can be done as a two sample test. The link between T-tests and regression with adjustment for a binary group variable can be extended to generalized linear models for two-sample tests with known but not normal data.



It's way more interesting to think about when we don't know the distribution of the data. I mean, if you don't know the mean, what sense is to say, "I know this is a 3 parameter bimodal normal mixture model!"



The T-test has some interesting properties that it is also efficient even for a general class of finite-variance distributions. This is because of the central limit theorem. The sampling distribution of the mean converges to normal even in very small samples. Another way of describing the T-test is an asymptotic test because you are approximating the long-run sampling distribution of the mean.



Some test statistics, especially mins and maxes, do not converge to normal distributions, so tests based on their limiting distributions are actually compared to exponential (Huzurbazar), Gumbell, or extreme value distributions as $n rightarrow infty$.



In general, we would never apply a parametric test to the data of the wrong parametric form, it's just obviously the less optimal solution.







share|cite|improve this answer












share|cite|improve this answer



share|cite|improve this answer










answered 1 hour ago









AdamO

30.7k255128




30.7k255128











  • So essentially, you are saying, that a) they do exist they just don't specifically have a name and b) if the data is not normal, usually the distribution is not something I know anyway, and c) if it kind of is normal, than the t-test will be okay anyway assuming I have enough data?
    – fbence
    1 hour ago











  • @fbence a) some are named, e.g. Pearson chi-square test, but certainly not for all cases b) we rarely know any distribution in the wild, but the context/science gives us some background, like I know the distribution of neutrophils is well modeled by Negative binomial because it is a concentration. c) Usually yes. The normal approximation converges fast, and even faster when the sample(s) are unimodel, symmetric, concave, and normokurtic.
    – AdamO
    54 mins ago
















  • So essentially, you are saying, that a) they do exist they just don't specifically have a name and b) if the data is not normal, usually the distribution is not something I know anyway, and c) if it kind of is normal, than the t-test will be okay anyway assuming I have enough data?
    – fbence
    1 hour ago











  • @fbence a) some are named, e.g. Pearson chi-square test, but certainly not for all cases b) we rarely know any distribution in the wild, but the context/science gives us some background, like I know the distribution of neutrophils is well modeled by Negative binomial because it is a concentration. c) Usually yes. The normal approximation converges fast, and even faster when the sample(s) are unimodel, symmetric, concave, and normokurtic.
    – AdamO
    54 mins ago















So essentially, you are saying, that a) they do exist they just don't specifically have a name and b) if the data is not normal, usually the distribution is not something I know anyway, and c) if it kind of is normal, than the t-test will be okay anyway assuming I have enough data?
– fbence
1 hour ago





So essentially, you are saying, that a) they do exist they just don't specifically have a name and b) if the data is not normal, usually the distribution is not something I know anyway, and c) if it kind of is normal, than the t-test will be okay anyway assuming I have enough data?
– fbence
1 hour ago













@fbence a) some are named, e.g. Pearson chi-square test, but certainly not for all cases b) we rarely know any distribution in the wild, but the context/science gives us some background, like I know the distribution of neutrophils is well modeled by Negative binomial because it is a concentration. c) Usually yes. The normal approximation converges fast, and even faster when the sample(s) are unimodel, symmetric, concave, and normokurtic.
– AdamO
54 mins ago




@fbence a) some are named, e.g. Pearson chi-square test, but certainly not for all cases b) we rarely know any distribution in the wild, but the context/science gives us some background, like I know the distribution of neutrophils is well modeled by Negative binomial because it is a concentration. c) Usually yes. The normal approximation converges fast, and even faster when the sample(s) are unimodel, symmetric, concave, and normokurtic.
– AdamO
54 mins ago










fbence is a new contributor. Be nice, and check out our Code of Conduct.









 

draft saved


draft discarded


















fbence is a new contributor. Be nice, and check out our Code of Conduct.












fbence is a new contributor. Be nice, and check out our Code of Conduct.











fbence is a new contributor. Be nice, and check out our Code of Conduct.













 


draft saved


draft discarded














StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f368478%2fparametric-tests-for-non-normal-data%23new-answer', 'question_page');

);

Post as a guest













































































Comments

Popular posts from this blog

What does second last employer means? [closed]

Installing NextGIS Connect into QGIS 3?

One-line joke