To choose between linear or generalised mixed effects model, what is the most important thing to consider?

The name of the pictureThe name of the pictureThe name of the pictureClash Royale CLAN TAG#URR8PPP





.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty margin-bottom:0;







up vote
2
down vote

favorite












Linear mixed effects models are for continuous variables.
Generalised ones are for non continuous, e.g., binomial.



We have a task in which subjects can get each item correct or incorrect. I'd say that is binomial at the level of subject at least.



Other member of the teams says the most important thing to make this decision is the research question, which is "how many items out of N they will get correct at each test", and suggested to treat the variable as continuous and use a linear model.



Also, we got many 0s (almost 80% in the last of 3 tests), so maybe we shouldn't even use binomial but zero inflated binomial. This would be important if we decide to use the brms package for R. Meanwhile, we will use lme4.



What do you think?










share|cite





























    up vote
    2
    down vote

    favorite












    Linear mixed effects models are for continuous variables.
    Generalised ones are for non continuous, e.g., binomial.



    We have a task in which subjects can get each item correct or incorrect. I'd say that is binomial at the level of subject at least.



    Other member of the teams says the most important thing to make this decision is the research question, which is "how many items out of N they will get correct at each test", and suggested to treat the variable as continuous and use a linear model.



    Also, we got many 0s (almost 80% in the last of 3 tests), so maybe we shouldn't even use binomial but zero inflated binomial. This would be important if we decide to use the brms package for R. Meanwhile, we will use lme4.



    What do you think?










    share|cite

























      up vote
      2
      down vote

      favorite









      up vote
      2
      down vote

      favorite











      Linear mixed effects models are for continuous variables.
      Generalised ones are for non continuous, e.g., binomial.



      We have a task in which subjects can get each item correct or incorrect. I'd say that is binomial at the level of subject at least.



      Other member of the teams says the most important thing to make this decision is the research question, which is "how many items out of N they will get correct at each test", and suggested to treat the variable as continuous and use a linear model.



      Also, we got many 0s (almost 80% in the last of 3 tests), so maybe we shouldn't even use binomial but zero inflated binomial. This would be important if we decide to use the brms package for R. Meanwhile, we will use lme4.



      What do you think?










      share|cite















      Linear mixed effects models are for continuous variables.
      Generalised ones are for non continuous, e.g., binomial.



      We have a task in which subjects can get each item correct or incorrect. I'd say that is binomial at the level of subject at least.



      Other member of the teams says the most important thing to make this decision is the research question, which is "how many items out of N they will get correct at each test", and suggested to treat the variable as continuous and use a linear model.



      Also, we got many 0s (almost 80% in the last of 3 tests), so maybe we shouldn't even use binomial but zero inflated binomial. This would be important if we decide to use the brms package for R. Meanwhile, we will use lme4.



      What do you think?







      r mixed-model lme4-nlme binomial zero-inflation






      share|cite















      share|cite













      share|cite




      share|cite








      edited 1 hour ago

























      asked 3 hours ago









      Lili

      134




      134




















          2 Answers
          2






          active

          oldest

          votes

















          up vote
          2
          down vote



          accepted










          The number of successes out of N trials is a Binomial distribution. Hence, it seems that you should go for a mixed-effects logistic regression.






          share|cite|improve this answer




















          • But considering the number of zeroes, could it be a zero-inflated binomial? If we decided to use brms we need to figure that out. Thanks.
            – Lili
            1 hour ago

















          up vote
          1
          down vote














          Linear mixed effects models are for continuous variables. Generalised ones are for non continuous, e.g., binomial.




          This is not true. See the wiki page for generalized linear models. E.g., the gamma and exponential distribution are generalized linaer models and both are continuous. The difference is that you allow for other distribution than the normal distribution with generalized linear models.




          We have a task in which subjects can get each item correct or incorrect. I'd say that is binomial at the level of subject at least.




          Yes that is binomial data.




          Other member of the teams says the most important thing to make this decision is the research question, which is "how many items out of N they will get correct at each test", and suggested to treat the variable as continuous and use a linear model.




          I assume that you have fixed number of $n_i$ trials for each subject $i$. In that case it is a fraction that can have value $0, 1/n_i, 2/n_i, dots, 1$. So you should use the binomial distribution as Dimitris Rizopoulos writes.




          Also, we got many 0s (almost 80% in the last of 3 tests), so maybe we shouldn't even use binomial but zero inflated binomial. This would be important if we decide to use the brms package for R.




          As far as I gather, you have some number of subjects, $k$, who each make some number of guesses, $n_1,dots,n_k$. Then you model $E(y_i/n_i)$ where $y_i$ is the number of correct guesses from subject $i$. Assuming that you have no covaraites then the model with random effects could be



          $$g(E(y_i/n_i)) = mu + epsilon_i,qquad epsilon_isim N(0,sigma^2)$$



          where $g$ is a link function (e.g., logit), $mu$ is logit of the probability of a subject guessing correct when the random effect is zero, and $epsilon_i$ is the random effect of subject $i$. Notice that this model easily yield "a lot of zeroes" if the $mu$ is sufficiently small and you use the logit link function. Hence, a lot zeroes may not be good argument to use a zero inflated binomial in this case.






          share|cite|improve this answer




















            Your Answer




            StackExchange.ifUsing("editor", function ()
            return StackExchange.using("mathjaxEditing", function ()
            StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
            StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
            );
            );
            , "mathjax-editing");

            StackExchange.ready(function()
            var channelOptions =
            tags: "".split(" "),
            id: "65"
            ;
            initTagRenderer("".split(" "), "".split(" "), channelOptions);

            StackExchange.using("externalEditor", function()
            // Have to fire editor after snippets, if snippets enabled
            if (StackExchange.settings.snippets.snippetsEnabled)
            StackExchange.using("snippets", function()
            createEditor();
            );

            else
            createEditor();

            );

            function createEditor()
            StackExchange.prepareEditor(
            heartbeatType: 'answer',
            convertImagesToLinks: false,
            noModals: false,
            showLowRepImageUploadWarning: true,
            reputationToPostImages: null,
            bindNavPrevention: true,
            postfix: "",
            onDemand: true,
            discardSelector: ".discard-answer"
            ,immediatelyShowMarkdownHelp:true
            );



            );













             

            draft saved


            draft discarded


















            StackExchange.ready(
            function ()
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f367050%2fto-choose-between-linear-or-generalised-mixed-effects-model-what-is-the-most-im%23new-answer', 'question_page');

            );

            Post as a guest






























            2 Answers
            2






            active

            oldest

            votes








            2 Answers
            2






            active

            oldest

            votes









            active

            oldest

            votes






            active

            oldest

            votes








            up vote
            2
            down vote



            accepted










            The number of successes out of N trials is a Binomial distribution. Hence, it seems that you should go for a mixed-effects logistic regression.






            share|cite|improve this answer




















            • But considering the number of zeroes, could it be a zero-inflated binomial? If we decided to use brms we need to figure that out. Thanks.
              – Lili
              1 hour ago














            up vote
            2
            down vote



            accepted










            The number of successes out of N trials is a Binomial distribution. Hence, it seems that you should go for a mixed-effects logistic regression.






            share|cite|improve this answer




















            • But considering the number of zeroes, could it be a zero-inflated binomial? If we decided to use brms we need to figure that out. Thanks.
              – Lili
              1 hour ago












            up vote
            2
            down vote



            accepted







            up vote
            2
            down vote



            accepted






            The number of successes out of N trials is a Binomial distribution. Hence, it seems that you should go for a mixed-effects logistic regression.






            share|cite|improve this answer












            The number of successes out of N trials is a Binomial distribution. Hence, it seems that you should go for a mixed-effects logistic regression.







            share|cite|improve this answer












            share|cite|improve this answer



            share|cite|improve this answer










            answered 2 hours ago









            Dimitris Rizopoulos

            6667




            6667











            • But considering the number of zeroes, could it be a zero-inflated binomial? If we decided to use brms we need to figure that out. Thanks.
              – Lili
              1 hour ago
















            • But considering the number of zeroes, could it be a zero-inflated binomial? If we decided to use brms we need to figure that out. Thanks.
              – Lili
              1 hour ago















            But considering the number of zeroes, could it be a zero-inflated binomial? If we decided to use brms we need to figure that out. Thanks.
            – Lili
            1 hour ago




            But considering the number of zeroes, could it be a zero-inflated binomial? If we decided to use brms we need to figure that out. Thanks.
            – Lili
            1 hour ago












            up vote
            1
            down vote














            Linear mixed effects models are for continuous variables. Generalised ones are for non continuous, e.g., binomial.




            This is not true. See the wiki page for generalized linear models. E.g., the gamma and exponential distribution are generalized linaer models and both are continuous. The difference is that you allow for other distribution than the normal distribution with generalized linear models.




            We have a task in which subjects can get each item correct or incorrect. I'd say that is binomial at the level of subject at least.




            Yes that is binomial data.




            Other member of the teams says the most important thing to make this decision is the research question, which is "how many items out of N they will get correct at each test", and suggested to treat the variable as continuous and use a linear model.




            I assume that you have fixed number of $n_i$ trials for each subject $i$. In that case it is a fraction that can have value $0, 1/n_i, 2/n_i, dots, 1$. So you should use the binomial distribution as Dimitris Rizopoulos writes.




            Also, we got many 0s (almost 80% in the last of 3 tests), so maybe we shouldn't even use binomial but zero inflated binomial. This would be important if we decide to use the brms package for R.




            As far as I gather, you have some number of subjects, $k$, who each make some number of guesses, $n_1,dots,n_k$. Then you model $E(y_i/n_i)$ where $y_i$ is the number of correct guesses from subject $i$. Assuming that you have no covaraites then the model with random effects could be



            $$g(E(y_i/n_i)) = mu + epsilon_i,qquad epsilon_isim N(0,sigma^2)$$



            where $g$ is a link function (e.g., logit), $mu$ is logit of the probability of a subject guessing correct when the random effect is zero, and $epsilon_i$ is the random effect of subject $i$. Notice that this model easily yield "a lot of zeroes" if the $mu$ is sufficiently small and you use the logit link function. Hence, a lot zeroes may not be good argument to use a zero inflated binomial in this case.






            share|cite|improve this answer
























              up vote
              1
              down vote














              Linear mixed effects models are for continuous variables. Generalised ones are for non continuous, e.g., binomial.




              This is not true. See the wiki page for generalized linear models. E.g., the gamma and exponential distribution are generalized linaer models and both are continuous. The difference is that you allow for other distribution than the normal distribution with generalized linear models.




              We have a task in which subjects can get each item correct or incorrect. I'd say that is binomial at the level of subject at least.




              Yes that is binomial data.




              Other member of the teams says the most important thing to make this decision is the research question, which is "how many items out of N they will get correct at each test", and suggested to treat the variable as continuous and use a linear model.




              I assume that you have fixed number of $n_i$ trials for each subject $i$. In that case it is a fraction that can have value $0, 1/n_i, 2/n_i, dots, 1$. So you should use the binomial distribution as Dimitris Rizopoulos writes.




              Also, we got many 0s (almost 80% in the last of 3 tests), so maybe we shouldn't even use binomial but zero inflated binomial. This would be important if we decide to use the brms package for R.




              As far as I gather, you have some number of subjects, $k$, who each make some number of guesses, $n_1,dots,n_k$. Then you model $E(y_i/n_i)$ where $y_i$ is the number of correct guesses from subject $i$. Assuming that you have no covaraites then the model with random effects could be



              $$g(E(y_i/n_i)) = mu + epsilon_i,qquad epsilon_isim N(0,sigma^2)$$



              where $g$ is a link function (e.g., logit), $mu$ is logit of the probability of a subject guessing correct when the random effect is zero, and $epsilon_i$ is the random effect of subject $i$. Notice that this model easily yield "a lot of zeroes" if the $mu$ is sufficiently small and you use the logit link function. Hence, a lot zeroes may not be good argument to use a zero inflated binomial in this case.






              share|cite|improve this answer






















                up vote
                1
                down vote










                up vote
                1
                down vote










                Linear mixed effects models are for continuous variables. Generalised ones are for non continuous, e.g., binomial.




                This is not true. See the wiki page for generalized linear models. E.g., the gamma and exponential distribution are generalized linaer models and both are continuous. The difference is that you allow for other distribution than the normal distribution with generalized linear models.




                We have a task in which subjects can get each item correct or incorrect. I'd say that is binomial at the level of subject at least.




                Yes that is binomial data.




                Other member of the teams says the most important thing to make this decision is the research question, which is "how many items out of N they will get correct at each test", and suggested to treat the variable as continuous and use a linear model.




                I assume that you have fixed number of $n_i$ trials for each subject $i$. In that case it is a fraction that can have value $0, 1/n_i, 2/n_i, dots, 1$. So you should use the binomial distribution as Dimitris Rizopoulos writes.




                Also, we got many 0s (almost 80% in the last of 3 tests), so maybe we shouldn't even use binomial but zero inflated binomial. This would be important if we decide to use the brms package for R.




                As far as I gather, you have some number of subjects, $k$, who each make some number of guesses, $n_1,dots,n_k$. Then you model $E(y_i/n_i)$ where $y_i$ is the number of correct guesses from subject $i$. Assuming that you have no covaraites then the model with random effects could be



                $$g(E(y_i/n_i)) = mu + epsilon_i,qquad epsilon_isim N(0,sigma^2)$$



                where $g$ is a link function (e.g., logit), $mu$ is logit of the probability of a subject guessing correct when the random effect is zero, and $epsilon_i$ is the random effect of subject $i$. Notice that this model easily yield "a lot of zeroes" if the $mu$ is sufficiently small and you use the logit link function. Hence, a lot zeroes may not be good argument to use a zero inflated binomial in this case.






                share|cite|improve this answer













                Linear mixed effects models are for continuous variables. Generalised ones are for non continuous, e.g., binomial.




                This is not true. See the wiki page for generalized linear models. E.g., the gamma and exponential distribution are generalized linaer models and both are continuous. The difference is that you allow for other distribution than the normal distribution with generalized linear models.




                We have a task in which subjects can get each item correct or incorrect. I'd say that is binomial at the level of subject at least.




                Yes that is binomial data.




                Other member of the teams says the most important thing to make this decision is the research question, which is "how many items out of N they will get correct at each test", and suggested to treat the variable as continuous and use a linear model.




                I assume that you have fixed number of $n_i$ trials for each subject $i$. In that case it is a fraction that can have value $0, 1/n_i, 2/n_i, dots, 1$. So you should use the binomial distribution as Dimitris Rizopoulos writes.




                Also, we got many 0s (almost 80% in the last of 3 tests), so maybe we shouldn't even use binomial but zero inflated binomial. This would be important if we decide to use the brms package for R.




                As far as I gather, you have some number of subjects, $k$, who each make some number of guesses, $n_1,dots,n_k$. Then you model $E(y_i/n_i)$ where $y_i$ is the number of correct guesses from subject $i$. Assuming that you have no covaraites then the model with random effects could be



                $$g(E(y_i/n_i)) = mu + epsilon_i,qquad epsilon_isim N(0,sigma^2)$$



                where $g$ is a link function (e.g., logit), $mu$ is logit of the probability of a subject guessing correct when the random effect is zero, and $epsilon_i$ is the random effect of subject $i$. Notice that this model easily yield "a lot of zeroes" if the $mu$ is sufficiently small and you use the logit link function. Hence, a lot zeroes may not be good argument to use a zero inflated binomial in this case.







                share|cite|improve this answer












                share|cite|improve this answer



                share|cite|improve this answer










                answered 43 mins ago









                Benjamin Christoffersen

                928417




                928417



























                     

                    draft saved


                    draft discarded















































                     


                    draft saved


                    draft discarded














                    StackExchange.ready(
                    function ()
                    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f367050%2fto-choose-between-linear-or-generalised-mixed-effects-model-what-is-the-most-im%23new-answer', 'question_page');

                    );

                    Post as a guest













































































                    Comments

                    Popular posts from this blog

                    Long meetings (6-7 hours a day): Being “babysat” by supervisor

                    Is the Concept of Multiple Fantasy Races Scientifically Flawed? [closed]

                    Confectionery