Why Standard Deviation is more popular than Mean Absolute Deviation?

The name of the pictureThe name of the pictureThe name of the pictureClash Royale CLAN TAG#URR8PPP





.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty margin-bottom:0;







up vote
1
down vote

favorite
1












$$
Mean;Absolute;Deviation = frac1nsum_i=1^n |x_i-mean(X)|
$$

Almost all textbooks and papers are using Standard Deviation as a measurement of dispersion. And of course, almost all the models are built based on Standard Deviation.



But I don't understand how Standard Deviation has gained such popularity. I mean, Mean Absolute Deviation is a very intuitive measurement of dispersion. It tells you exact average distant that each value deviates from their mean. Standard Deviation, on the other hand, makes the result more sensitive to outliers. Why we need this sensitivity. Is there any historical reason leads us to use SD way more often than MAD?










share|cite|improve this question







New contributor




27182818 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.















  • 1




    Something something absolute function not differentiable.
    – user2974951
    4 hours ago










  • Both are used in empirical research. Standard deviation is more convenient mathematically, probably among other reasons.
    – Xiaomi
    3 hours ago










  • A few potential reasons: it's differentiable, the Normal Distribution naturally parammetrises in terms of its standard deviation, the true mean minimises the expected variance (whereas the true median minimises the expected MAD)
    – gazza89
    3 hours ago










  • Possible duplicate of Is minimizing squared error equivalent to minimizing absolute error? Why squared error is more popular than the latter?
    – Martijn Weterings
    1 hour ago










  • I am voting to close this question because it has been asked several times before. However, it is useful and it could serve some purpose as helping in searches. For instance I seem not to find the post that I had in mind when marking this as duplicate (although several other posts came up that are obvious duplicates as well).
    – Martijn Weterings
    1 hour ago

















up vote
1
down vote

favorite
1












$$
Mean;Absolute;Deviation = frac1nsum_i=1^n |x_i-mean(X)|
$$

Almost all textbooks and papers are using Standard Deviation as a measurement of dispersion. And of course, almost all the models are built based on Standard Deviation.



But I don't understand how Standard Deviation has gained such popularity. I mean, Mean Absolute Deviation is a very intuitive measurement of dispersion. It tells you exact average distant that each value deviates from their mean. Standard Deviation, on the other hand, makes the result more sensitive to outliers. Why we need this sensitivity. Is there any historical reason leads us to use SD way more often than MAD?










share|cite|improve this question







New contributor




27182818 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.















  • 1




    Something something absolute function not differentiable.
    – user2974951
    4 hours ago










  • Both are used in empirical research. Standard deviation is more convenient mathematically, probably among other reasons.
    – Xiaomi
    3 hours ago










  • A few potential reasons: it's differentiable, the Normal Distribution naturally parammetrises in terms of its standard deviation, the true mean minimises the expected variance (whereas the true median minimises the expected MAD)
    – gazza89
    3 hours ago










  • Possible duplicate of Is minimizing squared error equivalent to minimizing absolute error? Why squared error is more popular than the latter?
    – Martijn Weterings
    1 hour ago










  • I am voting to close this question because it has been asked several times before. However, it is useful and it could serve some purpose as helping in searches. For instance I seem not to find the post that I had in mind when marking this as duplicate (although several other posts came up that are obvious duplicates as well).
    – Martijn Weterings
    1 hour ago













up vote
1
down vote

favorite
1









up vote
1
down vote

favorite
1






1





$$
Mean;Absolute;Deviation = frac1nsum_i=1^n |x_i-mean(X)|
$$

Almost all textbooks and papers are using Standard Deviation as a measurement of dispersion. And of course, almost all the models are built based on Standard Deviation.



But I don't understand how Standard Deviation has gained such popularity. I mean, Mean Absolute Deviation is a very intuitive measurement of dispersion. It tells you exact average distant that each value deviates from their mean. Standard Deviation, on the other hand, makes the result more sensitive to outliers. Why we need this sensitivity. Is there any historical reason leads us to use SD way more often than MAD?










share|cite|improve this question







New contributor




27182818 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.











$$
Mean;Absolute;Deviation = frac1nsum_i=1^n |x_i-mean(X)|
$$

Almost all textbooks and papers are using Standard Deviation as a measurement of dispersion. And of course, almost all the models are built based on Standard Deviation.



But I don't understand how Standard Deviation has gained such popularity. I mean, Mean Absolute Deviation is a very intuitive measurement of dispersion. It tells you exact average distant that each value deviates from their mean. Standard Deviation, on the other hand, makes the result more sensitive to outliers. Why we need this sensitivity. Is there any historical reason leads us to use SD way more often than MAD?







mathematical-statistics standard-deviation






share|cite|improve this question







New contributor




27182818 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.











share|cite|improve this question







New contributor




27182818 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.









share|cite|improve this question




share|cite|improve this question






New contributor




27182818 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.









asked 4 hours ago









27182818

61




61




New contributor




27182818 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.





New contributor





27182818 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.






27182818 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.







  • 1




    Something something absolute function not differentiable.
    – user2974951
    4 hours ago










  • Both are used in empirical research. Standard deviation is more convenient mathematically, probably among other reasons.
    – Xiaomi
    3 hours ago










  • A few potential reasons: it's differentiable, the Normal Distribution naturally parammetrises in terms of its standard deviation, the true mean minimises the expected variance (whereas the true median minimises the expected MAD)
    – gazza89
    3 hours ago










  • Possible duplicate of Is minimizing squared error equivalent to minimizing absolute error? Why squared error is more popular than the latter?
    – Martijn Weterings
    1 hour ago










  • I am voting to close this question because it has been asked several times before. However, it is useful and it could serve some purpose as helping in searches. For instance I seem not to find the post that I had in mind when marking this as duplicate (although several other posts came up that are obvious duplicates as well).
    – Martijn Weterings
    1 hour ago













  • 1




    Something something absolute function not differentiable.
    – user2974951
    4 hours ago










  • Both are used in empirical research. Standard deviation is more convenient mathematically, probably among other reasons.
    – Xiaomi
    3 hours ago










  • A few potential reasons: it's differentiable, the Normal Distribution naturally parammetrises in terms of its standard deviation, the true mean minimises the expected variance (whereas the true median minimises the expected MAD)
    – gazza89
    3 hours ago










  • Possible duplicate of Is minimizing squared error equivalent to minimizing absolute error? Why squared error is more popular than the latter?
    – Martijn Weterings
    1 hour ago










  • I am voting to close this question because it has been asked several times before. However, it is useful and it could serve some purpose as helping in searches. For instance I seem not to find the post that I had in mind when marking this as duplicate (although several other posts came up that are obvious duplicates as well).
    – Martijn Weterings
    1 hour ago








1




1




Something something absolute function not differentiable.
– user2974951
4 hours ago




Something something absolute function not differentiable.
– user2974951
4 hours ago












Both are used in empirical research. Standard deviation is more convenient mathematically, probably among other reasons.
– Xiaomi
3 hours ago




Both are used in empirical research. Standard deviation is more convenient mathematically, probably among other reasons.
– Xiaomi
3 hours ago












A few potential reasons: it's differentiable, the Normal Distribution naturally parammetrises in terms of its standard deviation, the true mean minimises the expected variance (whereas the true median minimises the expected MAD)
– gazza89
3 hours ago




A few potential reasons: it's differentiable, the Normal Distribution naturally parammetrises in terms of its standard deviation, the true mean minimises the expected variance (whereas the true median minimises the expected MAD)
– gazza89
3 hours ago












Possible duplicate of Is minimizing squared error equivalent to minimizing absolute error? Why squared error is more popular than the latter?
– Martijn Weterings
1 hour ago




Possible duplicate of Is minimizing squared error equivalent to minimizing absolute error? Why squared error is more popular than the latter?
– Martijn Weterings
1 hour ago












I am voting to close this question because it has been asked several times before. However, it is useful and it could serve some purpose as helping in searches. For instance I seem not to find the post that I had in mind when marking this as duplicate (although several other posts came up that are obvious duplicates as well).
– Martijn Weterings
1 hour ago





I am voting to close this question because it has been asked several times before. However, it is useful and it could serve some purpose as helping in searches. For instance I seem not to find the post that I had in mind when marking this as duplicate (although several other posts came up that are obvious duplicates as well).
– Martijn Weterings
1 hour ago











2 Answers
2






active

oldest

votes

















up vote
2
down vote













Historically, Laplace started with the expected absolute deviation from the expectation and got mired into computational issues, beyond the Laplace (or double exponential) distribution, while Legendre and Gauss advocated the expected square difference from the expectation, which is more naturally connected with the Normal or Gaussian distribution. Portnoy and Koenker wrote a nice paper called the Gaussian Hare and the Laplacian Tortoise (!) on that issue, including a parody of the Hare and the Tortoise with Laplace's and Gauss' heads:



enter image description here



The issue is covered in depth in this earlier (2015) X Validated question. (Which makes the current one a potential duplicate.)






share|cite|improve this answer





























    up vote
    1
    down vote













    One possible answer, among many others, is that the mean is precisely defined from the standard deviation as



    $meanleft( X right) = mathop arg min limits_x frac1nsumlimits_i = 1^n left( x_i - x right)^2 $



    Hence, mean and standard deviation come together. This is not the case for the MAD.






    share|cite|improve this answer




















      Your Answer




      StackExchange.ifUsing("editor", function ()
      return StackExchange.using("mathjaxEditing", function ()
      StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
      StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
      );
      );
      , "mathjax-editing");

      StackExchange.ready(function()
      var channelOptions =
      tags: "".split(" "),
      id: "65"
      ;
      initTagRenderer("".split(" "), "".split(" "), channelOptions);

      StackExchange.using("externalEditor", function()
      // Have to fire editor after snippets, if snippets enabled
      if (StackExchange.settings.snippets.snippetsEnabled)
      StackExchange.using("snippets", function()
      createEditor();
      );

      else
      createEditor();

      );

      function createEditor()
      StackExchange.prepareEditor(
      heartbeatType: 'answer',
      convertImagesToLinks: false,
      noModals: false,
      showLowRepImageUploadWarning: true,
      reputationToPostImages: null,
      bindNavPrevention: true,
      postfix: "",
      onDemand: true,
      discardSelector: ".discard-answer"
      ,immediatelyShowMarkdownHelp:true
      );



      );






      27182818 is a new contributor. Be nice, and check out our Code of Conduct.









       

      draft saved


      draft discarded


















      StackExchange.ready(
      function ()
      StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f371112%2fwhy-standard-deviation-is-more-popular-than-mean-absolute-deviation%23new-answer', 'question_page');

      );

      Post as a guest






























      2 Answers
      2






      active

      oldest

      votes








      2 Answers
      2






      active

      oldest

      votes









      active

      oldest

      votes






      active

      oldest

      votes








      up vote
      2
      down vote













      Historically, Laplace started with the expected absolute deviation from the expectation and got mired into computational issues, beyond the Laplace (or double exponential) distribution, while Legendre and Gauss advocated the expected square difference from the expectation, which is more naturally connected with the Normal or Gaussian distribution. Portnoy and Koenker wrote a nice paper called the Gaussian Hare and the Laplacian Tortoise (!) on that issue, including a parody of the Hare and the Tortoise with Laplace's and Gauss' heads:



      enter image description here



      The issue is covered in depth in this earlier (2015) X Validated question. (Which makes the current one a potential duplicate.)






      share|cite|improve this answer


























        up vote
        2
        down vote













        Historically, Laplace started with the expected absolute deviation from the expectation and got mired into computational issues, beyond the Laplace (or double exponential) distribution, while Legendre and Gauss advocated the expected square difference from the expectation, which is more naturally connected with the Normal or Gaussian distribution. Portnoy and Koenker wrote a nice paper called the Gaussian Hare and the Laplacian Tortoise (!) on that issue, including a parody of the Hare and the Tortoise with Laplace's and Gauss' heads:



        enter image description here



        The issue is covered in depth in this earlier (2015) X Validated question. (Which makes the current one a potential duplicate.)






        share|cite|improve this answer
























          up vote
          2
          down vote










          up vote
          2
          down vote









          Historically, Laplace started with the expected absolute deviation from the expectation and got mired into computational issues, beyond the Laplace (or double exponential) distribution, while Legendre and Gauss advocated the expected square difference from the expectation, which is more naturally connected with the Normal or Gaussian distribution. Portnoy and Koenker wrote a nice paper called the Gaussian Hare and the Laplacian Tortoise (!) on that issue, including a parody of the Hare and the Tortoise with Laplace's and Gauss' heads:



          enter image description here



          The issue is covered in depth in this earlier (2015) X Validated question. (Which makes the current one a potential duplicate.)






          share|cite|improve this answer














          Historically, Laplace started with the expected absolute deviation from the expectation and got mired into computational issues, beyond the Laplace (or double exponential) distribution, while Legendre and Gauss advocated the expected square difference from the expectation, which is more naturally connected with the Normal or Gaussian distribution. Portnoy and Koenker wrote a nice paper called the Gaussian Hare and the Laplacian Tortoise (!) on that issue, including a parody of the Hare and the Tortoise with Laplace's and Gauss' heads:



          enter image description here



          The issue is covered in depth in this earlier (2015) X Validated question. (Which makes the current one a potential duplicate.)







          share|cite|improve this answer














          share|cite|improve this answer



          share|cite|improve this answer








          edited 1 hour ago

























          answered 2 hours ago









          Xi'an

          50.5k686335




          50.5k686335






















              up vote
              1
              down vote













              One possible answer, among many others, is that the mean is precisely defined from the standard deviation as



              $meanleft( X right) = mathop arg min limits_x frac1nsumlimits_i = 1^n left( x_i - x right)^2 $



              Hence, mean and standard deviation come together. This is not the case for the MAD.






              share|cite|improve this answer
























                up vote
                1
                down vote













                One possible answer, among many others, is that the mean is precisely defined from the standard deviation as



                $meanleft( X right) = mathop arg min limits_x frac1nsumlimits_i = 1^n left( x_i - x right)^2 $



                Hence, mean and standard deviation come together. This is not the case for the MAD.






                share|cite|improve this answer






















                  up vote
                  1
                  down vote










                  up vote
                  1
                  down vote









                  One possible answer, among many others, is that the mean is precisely defined from the standard deviation as



                  $meanleft( X right) = mathop arg min limits_x frac1nsumlimits_i = 1^n left( x_i - x right)^2 $



                  Hence, mean and standard deviation come together. This is not the case for the MAD.






                  share|cite|improve this answer












                  One possible answer, among many others, is that the mean is precisely defined from the standard deviation as



                  $meanleft( X right) = mathop arg min limits_x frac1nsumlimits_i = 1^n left( x_i - x right)^2 $



                  Hence, mean and standard deviation come together. This is not the case for the MAD.







                  share|cite|improve this answer












                  share|cite|improve this answer



                  share|cite|improve this answer










                  answered 2 hours ago









                  Fabrice Pautot

                  664




                  664




















                      27182818 is a new contributor. Be nice, and check out our Code of Conduct.









                       

                      draft saved


                      draft discarded


















                      27182818 is a new contributor. Be nice, and check out our Code of Conduct.












                      27182818 is a new contributor. Be nice, and check out our Code of Conduct.











                      27182818 is a new contributor. Be nice, and check out our Code of Conduct.













                       


                      draft saved


                      draft discarded














                      StackExchange.ready(
                      function ()
                      StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f371112%2fwhy-standard-deviation-is-more-popular-than-mean-absolute-deviation%23new-answer', 'question_page');

                      );

                      Post as a guest













































































                      Comments

                      Popular posts from this blog

                      White Anglo-Saxon Protestant

                      BuddyTV

                      Conflict (narrative)