State of the art results of mnist digits dataset

The name of the pictureThe name of the pictureThe name of the pictureClash Royale CLAN TAG#URR8PPP





.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty margin-bottom:0;







up vote
1
down vote

favorite












I am doing some research work regarding deep learning, I have a model and now I want to know and compare how my model performs as compared to others existing model.(like capsule networks, CNN ets..)



I have no idea of about how to evaluate model performance:-



Que 1. For how many epochs and for what batch size(lets say for mnist digits dataset) do I need to train my model?



Que 2. Is there any time condition to train my model, or I can train my model for same no of epochs as other models? Like lets say comparisons have been made after training models for 1 hour. Or is there any condition that I have to train my model for particular amount of time(like 1 hour)?



Que 3. What is the state of the art accuracy and error for mnist dataset today?



Que 4. How do we calculate error?



Thanks for helping :)










share|cite|improve this question







New contributor




Rishik is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.

























    up vote
    1
    down vote

    favorite












    I am doing some research work regarding deep learning, I have a model and now I want to know and compare how my model performs as compared to others existing model.(like capsule networks, CNN ets..)



    I have no idea of about how to evaluate model performance:-



    Que 1. For how many epochs and for what batch size(lets say for mnist digits dataset) do I need to train my model?



    Que 2. Is there any time condition to train my model, or I can train my model for same no of epochs as other models? Like lets say comparisons have been made after training models for 1 hour. Or is there any condition that I have to train my model for particular amount of time(like 1 hour)?



    Que 3. What is the state of the art accuracy and error for mnist dataset today?



    Que 4. How do we calculate error?



    Thanks for helping :)










    share|cite|improve this question







    New contributor




    Rishik is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
    Check out our Code of Conduct.





















      up vote
      1
      down vote

      favorite









      up vote
      1
      down vote

      favorite











      I am doing some research work regarding deep learning, I have a model and now I want to know and compare how my model performs as compared to others existing model.(like capsule networks, CNN ets..)



      I have no idea of about how to evaluate model performance:-



      Que 1. For how many epochs and for what batch size(lets say for mnist digits dataset) do I need to train my model?



      Que 2. Is there any time condition to train my model, or I can train my model for same no of epochs as other models? Like lets say comparisons have been made after training models for 1 hour. Or is there any condition that I have to train my model for particular amount of time(like 1 hour)?



      Que 3. What is the state of the art accuracy and error for mnist dataset today?



      Que 4. How do we calculate error?



      Thanks for helping :)










      share|cite|improve this question







      New contributor




      Rishik is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.











      I am doing some research work regarding deep learning, I have a model and now I want to know and compare how my model performs as compared to others existing model.(like capsule networks, CNN ets..)



      I have no idea of about how to evaluate model performance:-



      Que 1. For how many epochs and for what batch size(lets say for mnist digits dataset) do I need to train my model?



      Que 2. Is there any time condition to train my model, or I can train my model for same no of epochs as other models? Like lets say comparisons have been made after training models for 1 hour. Or is there any condition that I have to train my model for particular amount of time(like 1 hour)?



      Que 3. What is the state of the art accuracy and error for mnist dataset today?



      Que 4. How do we calculate error?



      Thanks for helping :)







      deep-learning model-comparison






      share|cite|improve this question







      New contributor




      Rishik is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.











      share|cite|improve this question







      New contributor




      Rishik is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.









      share|cite|improve this question




      share|cite|improve this question






      New contributor




      Rishik is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.









      asked 5 hours ago









      Rishik

      62




      62




      New contributor




      Rishik is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.





      New contributor





      Rishik is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.






      Rishik is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.




















          1 Answer
          1






          active

          oldest

          votes

















          up vote
          2
          down vote














          Que 1. For how many epochs and for what batch size(lets say for mnist
          digits dataset) do I need to train my model?




          This depends on your model. Different models may perform differently when trained on different number of epochs, with different batch sizes. Basically, batch size, learning rate, dropout and other regularization methods all have some regularizing effect and interact with each other. They also interact with number of epochs needed to train the model (more regularization needs more training epochs).




          Que 2. Is there any time condition to train my model, or I can train
          my model for same no of epochs as other models? Like lets say
          comparisons have been made after training models for 1 hour. Or is
          there any condition that I have to train my model for particular
          amount of time(like 1 hour)?




          No, unless you want to make comparison under time restriction. Notice that this would need you to re-run all the other models in the same conditions as your model, since the results will vary a lot depending on the computational resources that were used by different authors. If you won't re-train the models, you wouldn't know if the results are due to having better (or worse) computational resources or using better model.




          Que 3. What is the state of the art accuracy and error for mnist
          dataset today?




          You can find such results (with references) on LeCun's page on MINST or here.




          Que 4. How do we calculate error?




          Error rate is 1-accuracy, but accuracy is not the best measure of performance, so you should consider other measures as well.



          As a comment, MINST is not a great benchmark since everyone trains on it, we probably have NN architectures that overfit to MINST.






          share|cite|improve this answer






















            Your Answer




            StackExchange.ifUsing("editor", function ()
            return StackExchange.using("mathjaxEditing", function ()
            StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
            StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
            );
            );
            , "mathjax-editing");

            StackExchange.ready(function()
            var channelOptions =
            tags: "".split(" "),
            id: "65"
            ;
            initTagRenderer("".split(" "), "".split(" "), channelOptions);

            StackExchange.using("externalEditor", function()
            // Have to fire editor after snippets, if snippets enabled
            if (StackExchange.settings.snippets.snippetsEnabled)
            StackExchange.using("snippets", function()
            createEditor();
            );

            else
            createEditor();

            );

            function createEditor()
            StackExchange.prepareEditor(
            heartbeatType: 'answer',
            convertImagesToLinks: false,
            noModals: false,
            showLowRepImageUploadWarning: true,
            reputationToPostImages: null,
            bindNavPrevention: true,
            postfix: "",
            onDemand: true,
            discardSelector: ".discard-answer"
            ,immediatelyShowMarkdownHelp:true
            );



            );






            Rishik is a new contributor. Be nice, and check out our Code of Conduct.









             

            draft saved


            draft discarded


















            StackExchange.ready(
            function ()
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f369560%2fstate-of-the-art-results-of-mnist-digits-dataset%23new-answer', 'question_page');

            );

            Post as a guest






























            1 Answer
            1






            active

            oldest

            votes








            1 Answer
            1






            active

            oldest

            votes









            active

            oldest

            votes






            active

            oldest

            votes








            up vote
            2
            down vote














            Que 1. For how many epochs and for what batch size(lets say for mnist
            digits dataset) do I need to train my model?




            This depends on your model. Different models may perform differently when trained on different number of epochs, with different batch sizes. Basically, batch size, learning rate, dropout and other regularization methods all have some regularizing effect and interact with each other. They also interact with number of epochs needed to train the model (more regularization needs more training epochs).




            Que 2. Is there any time condition to train my model, or I can train
            my model for same no of epochs as other models? Like lets say
            comparisons have been made after training models for 1 hour. Or is
            there any condition that I have to train my model for particular
            amount of time(like 1 hour)?




            No, unless you want to make comparison under time restriction. Notice that this would need you to re-run all the other models in the same conditions as your model, since the results will vary a lot depending on the computational resources that were used by different authors. If you won't re-train the models, you wouldn't know if the results are due to having better (or worse) computational resources or using better model.




            Que 3. What is the state of the art accuracy and error for mnist
            dataset today?




            You can find such results (with references) on LeCun's page on MINST or here.




            Que 4. How do we calculate error?




            Error rate is 1-accuracy, but accuracy is not the best measure of performance, so you should consider other measures as well.



            As a comment, MINST is not a great benchmark since everyone trains on it, we probably have NN architectures that overfit to MINST.






            share|cite|improve this answer


























              up vote
              2
              down vote














              Que 1. For how many epochs and for what batch size(lets say for mnist
              digits dataset) do I need to train my model?




              This depends on your model. Different models may perform differently when trained on different number of epochs, with different batch sizes. Basically, batch size, learning rate, dropout and other regularization methods all have some regularizing effect and interact with each other. They also interact with number of epochs needed to train the model (more regularization needs more training epochs).




              Que 2. Is there any time condition to train my model, or I can train
              my model for same no of epochs as other models? Like lets say
              comparisons have been made after training models for 1 hour. Or is
              there any condition that I have to train my model for particular
              amount of time(like 1 hour)?




              No, unless you want to make comparison under time restriction. Notice that this would need you to re-run all the other models in the same conditions as your model, since the results will vary a lot depending on the computational resources that were used by different authors. If you won't re-train the models, you wouldn't know if the results are due to having better (or worse) computational resources or using better model.




              Que 3. What is the state of the art accuracy and error for mnist
              dataset today?




              You can find such results (with references) on LeCun's page on MINST or here.




              Que 4. How do we calculate error?




              Error rate is 1-accuracy, but accuracy is not the best measure of performance, so you should consider other measures as well.



              As a comment, MINST is not a great benchmark since everyone trains on it, we probably have NN architectures that overfit to MINST.






              share|cite|improve this answer
























                up vote
                2
                down vote










                up vote
                2
                down vote










                Que 1. For how many epochs and for what batch size(lets say for mnist
                digits dataset) do I need to train my model?




                This depends on your model. Different models may perform differently when trained on different number of epochs, with different batch sizes. Basically, batch size, learning rate, dropout and other regularization methods all have some regularizing effect and interact with each other. They also interact with number of epochs needed to train the model (more regularization needs more training epochs).




                Que 2. Is there any time condition to train my model, or I can train
                my model for same no of epochs as other models? Like lets say
                comparisons have been made after training models for 1 hour. Or is
                there any condition that I have to train my model for particular
                amount of time(like 1 hour)?




                No, unless you want to make comparison under time restriction. Notice that this would need you to re-run all the other models in the same conditions as your model, since the results will vary a lot depending on the computational resources that were used by different authors. If you won't re-train the models, you wouldn't know if the results are due to having better (or worse) computational resources or using better model.




                Que 3. What is the state of the art accuracy and error for mnist
                dataset today?




                You can find such results (with references) on LeCun's page on MINST or here.




                Que 4. How do we calculate error?




                Error rate is 1-accuracy, but accuracy is not the best measure of performance, so you should consider other measures as well.



                As a comment, MINST is not a great benchmark since everyone trains on it, we probably have NN architectures that overfit to MINST.






                share|cite|improve this answer















                Que 1. For how many epochs and for what batch size(lets say for mnist
                digits dataset) do I need to train my model?




                This depends on your model. Different models may perform differently when trained on different number of epochs, with different batch sizes. Basically, batch size, learning rate, dropout and other regularization methods all have some regularizing effect and interact with each other. They also interact with number of epochs needed to train the model (more regularization needs more training epochs).




                Que 2. Is there any time condition to train my model, or I can train
                my model for same no of epochs as other models? Like lets say
                comparisons have been made after training models for 1 hour. Or is
                there any condition that I have to train my model for particular
                amount of time(like 1 hour)?




                No, unless you want to make comparison under time restriction. Notice that this would need you to re-run all the other models in the same conditions as your model, since the results will vary a lot depending on the computational resources that were used by different authors. If you won't re-train the models, you wouldn't know if the results are due to having better (or worse) computational resources or using better model.




                Que 3. What is the state of the art accuracy and error for mnist
                dataset today?




                You can find such results (with references) on LeCun's page on MINST or here.




                Que 4. How do we calculate error?




                Error rate is 1-accuracy, but accuracy is not the best measure of performance, so you should consider other measures as well.



                As a comment, MINST is not a great benchmark since everyone trains on it, we probably have NN architectures that overfit to MINST.







                share|cite|improve this answer














                share|cite|improve this answer



                share|cite|improve this answer








                edited 2 mins ago

























                answered 4 hours ago









                Tim♦

                53.1k9120205




                53.1k9120205




















                    Rishik is a new contributor. Be nice, and check out our Code of Conduct.









                     

                    draft saved


                    draft discarded


















                    Rishik is a new contributor. Be nice, and check out our Code of Conduct.












                    Rishik is a new contributor. Be nice, and check out our Code of Conduct.











                    Rishik is a new contributor. Be nice, and check out our Code of Conduct.













                     


                    draft saved


                    draft discarded














                    StackExchange.ready(
                    function ()
                    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f369560%2fstate-of-the-art-results-of-mnist-digits-dataset%23new-answer', 'question_page');

                    );

                    Post as a guest













































































                    Comments

                    Popular posts from this blog

                    What does second last employer means? [closed]

                    List of Gilmore Girls characters

                    Confectionery