What stopwords list is the Wolfram Language using?

The documentation of DeleteStopwords only says that it "uses a standard, built-in list of stopwords".



So what is it exactly?

















      string-manipulation text implementation-details linguistics














      edited 48 mins ago by J. M. is somewhat okay.♦

      asked 49 mins ago by Αλέξανδρος Ζεγγ




















          1 Answer













          A little spelunking of the code for DeleteStopwords yields the internally used stopword list:



          DeleteStopwords; (* force auto-load *)
          AlphabeticSort[List @@ TextProcessing`TextModificationDump`$stopWords["English"]] // Short

          {"a", "A", "about", "above", "across", "after", "again", "against", "all", "almost",
           "alone", "along", "already", "also", "although", <<240>>,
           "within", "without", "won't", "would", "wouldn't", "yet", "you", "you'd", "you'll",
           "you're", "you've", "your", "yours", "yourself", "yourselves"}
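
          As a rough sketch of how one might use this, the internal list can be set beside a documented source of stopwords. This assumes that WordList["Stopwords"] is available in your version (it may need to fetch data on first use) and makes no claim that the two lists coincide. The variable names internalStopwords and documentedStopwords are purely illustrative, and the internal symbol is undocumented and may change between versions.

```mathematica
DeleteStopwords;  (* evaluate the symbol once so that its implementation auto-loads *)

(* Undocumented internal symbol quoted in the answer; it may change between versions *)
internalStopwords = List @@ TextProcessing`TextModificationDump`$stopWords["English"];

(* Documented word list; assumption: it need not be identical to the internal one *)
documentedStopwords = WordList["Stopwords"];

(* Compare the sizes of the two lists *)
Length /@ {internalStopwords, documentedStopwords}

(* Entries that appear only in the internal list *)
Complement[internalStopwords, documentedStopwords]

(* Ordinary documented usage on a sample string *)
DeleteStopwords["The quick brown fox jumps over the lazy dog"]
```

          If the two lists differ, the internal one is presumably the one DeleteStopwords consults, going by the answer above.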













          edited 31 mins ago

          answered 42 mins ago by J. M. is somewhat okay.♦











          • Wow, so these are undocumented functions?
            – Αλέξανδρος Ζεγγ
            40 mins ago

          • But your code merely returns "English" on my machine. Did I miss something?
            – Αλέξανδρος Ζεγγ
            38 mins ago

          • @ΑλέξανδροςΖεγγ One has to evaluate DeleteStopwords first. The actual code for DeleteStopwords and TextProcessing`TextModificationDump`$stopWords is stored in a file and is loaded only after DeleteStopwords is evaluated in the current session.
            – Henrik Schumacher
            33 mins ago










          • Ah, forgot the autoload. Thanks @Henrik!
            – J. M. is somewhat okay.♦
            31 mins ago










          • @J.M. Always at your service! =)
            – Henrik Schumacher
            31 mins ago
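
          The auto-loading behaviour described in the comments can be checked with a small experiment in a fresh kernel. This is only a sketch: the names before and after are illustrative, and which symbols (if any) exist in the internal context before loading may depend on the version.

```mathematica
(* Run in a fresh kernel. Names takes a string pattern, so this query does not
   itself create any symbol in the internal context *)
before = Names["TextProcessing`TextModificationDump`*"];

DeleteStopwords;  (* evaluating the symbol triggers loading of its definitions *)

after = Names["TextProcessing`TextModificationDump`*"];

(* Number of symbols in the internal context before and after loading *)
{Length[before], Length[after]}

(* Look for the stopword symbol among the names present after loading *)
Select[after, StringEndsQ[#, "$stopWords"] &]
```

          If the first count is zero and the second is not, that is consistent with the definitions being read from a file only once DeleteStopwords is first evaluated.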















