What stopwords list is Wolfram language using?

Clash Royale CLAN TAG#URR8PPP
up vote
2
down vote
favorite
The documentation of DeleteStopwords only says that it "uses a standard, built-in list of stopwords".
So what is it exactly?
string-manipulation text implementation-details linguistics
add a comment |Â
up vote
2
down vote
favorite
The documentation of DeleteStopwords only says that it "uses a standard, built-in list of stopwords".
So what is it exactly?
string-manipulation text implementation-details linguistics
add a comment |Â
up vote
2
down vote
favorite
up vote
2
down vote
favorite
The documentation of DeleteStopwords only says that it "uses a standard, built-in list of stopwords".
So what is it exactly?
string-manipulation text implementation-details linguistics
The documentation of DeleteStopwords only says that it "uses a standard, built-in list of stopwords".
So what is it exactly?
string-manipulation text implementation-details linguistics
string-manipulation text implementation-details linguistics
edited 48 mins ago
J. M. is somewhat okay.â¦
92.7k10286440
92.7k10286440
asked 49 mins ago
ÃÂûÃÂþñýôÃÂÿàÃÂõóó
2,089721
2,089721
add a comment |Â
add a comment |Â
1 Answer
1
active
oldest
votes
up vote
3
down vote
A little spelunking of the code for DeleteStopwords yields the internally used stopword list:
DeleteStopwords; (* force auto-load *)
AlphabeticSort[List @@ TextProcessing`TextModificationDump`$stopWords["English"]] // Short
"a", "A", "about", "above", "across", "after", "again", "against", "all", "almost",
"alone", "along", "already", "also", "although", <<240>>,
"within", "without", "won't", "would", "wouldn't", "yet", "you", "you'd", "you'll",
"you're", "you've", "your", "yours", "yourself", "yourselves"
Wow, these are some undocumented functions?
â ÃÂûÃÂþñýôÃÂÿàÃÂõóó
40 mins ago
But your codes merely return"English"as the result on my machine. Did I miss something?
â ÃÂûÃÂþñýôÃÂÿàÃÂõóó
38 mins ago
1
@ÃÂûÃÂþñýôÃÂÿÃÂÃÂõóó One has to evaluateDeleteStopwordsfirst. The actual code forDeleteStopwordsandTextProcessing`TextModificationDump`$stopWordsis stored in a file and is loaded only afterDeleteStopwordsis evaluated in the current session.
â Henrik Schumacher
33 mins ago
Ah, forgot the autoload. Thanks @Henrik!
â J. M. is somewhat okay.â¦
31 mins ago
@J.M. Always at your service! =)
â Henrik Schumacher
31 mins ago
add a comment |Â
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
up vote
3
down vote
A little spelunking of the code for DeleteStopwords yields the internally used stopword list:
DeleteStopwords; (* force auto-load *)
AlphabeticSort[List @@ TextProcessing`TextModificationDump`$stopWords["English"]] // Short
"a", "A", "about", "above", "across", "after", "again", "against", "all", "almost",
"alone", "along", "already", "also", "although", <<240>>,
"within", "without", "won't", "would", "wouldn't", "yet", "you", "you'd", "you'll",
"you're", "you've", "your", "yours", "yourself", "yourselves"
Wow, these are some undocumented functions?
â ÃÂûÃÂþñýôÃÂÿàÃÂõóó
40 mins ago
But your codes merely return"English"as the result on my machine. Did I miss something?
â ÃÂûÃÂþñýôÃÂÿàÃÂõóó
38 mins ago
1
@ÃÂûÃÂþñýôÃÂÿÃÂÃÂõóó One has to evaluateDeleteStopwordsfirst. The actual code forDeleteStopwordsandTextProcessing`TextModificationDump`$stopWordsis stored in a file and is loaded only afterDeleteStopwordsis evaluated in the current session.
â Henrik Schumacher
33 mins ago
Ah, forgot the autoload. Thanks @Henrik!
â J. M. is somewhat okay.â¦
31 mins ago
@J.M. Always at your service! =)
â Henrik Schumacher
31 mins ago
add a comment |Â
up vote
3
down vote
A little spelunking of the code for DeleteStopwords yields the internally used stopword list:
DeleteStopwords; (* force auto-load *)
AlphabeticSort[List @@ TextProcessing`TextModificationDump`$stopWords["English"]] // Short
"a", "A", "about", "above", "across", "after", "again", "against", "all", "almost",
"alone", "along", "already", "also", "although", <<240>>,
"within", "without", "won't", "would", "wouldn't", "yet", "you", "you'd", "you'll",
"you're", "you've", "your", "yours", "yourself", "yourselves"
Wow, these are some undocumented functions?
â ÃÂûÃÂþñýôÃÂÿàÃÂõóó
40 mins ago
But your codes merely return"English"as the result on my machine. Did I miss something?
â ÃÂûÃÂþñýôÃÂÿàÃÂõóó
38 mins ago
1
@ÃÂûÃÂþñýôÃÂÿÃÂÃÂõóó One has to evaluateDeleteStopwordsfirst. The actual code forDeleteStopwordsandTextProcessing`TextModificationDump`$stopWordsis stored in a file and is loaded only afterDeleteStopwordsis evaluated in the current session.
â Henrik Schumacher
33 mins ago
Ah, forgot the autoload. Thanks @Henrik!
â J. M. is somewhat okay.â¦
31 mins ago
@J.M. Always at your service! =)
â Henrik Schumacher
31 mins ago
add a comment |Â
up vote
3
down vote
up vote
3
down vote
A little spelunking of the code for DeleteStopwords yields the internally used stopword list:
DeleteStopwords; (* force auto-load *)
AlphabeticSort[List @@ TextProcessing`TextModificationDump`$stopWords["English"]] // Short
"a", "A", "about", "above", "across", "after", "again", "against", "all", "almost",
"alone", "along", "already", "also", "although", <<240>>,
"within", "without", "won't", "would", "wouldn't", "yet", "you", "you'd", "you'll",
"you're", "you've", "your", "yours", "yourself", "yourselves"
A little spelunking of the code for DeleteStopwords yields the internally used stopword list:
DeleteStopwords; (* force auto-load *)
AlphabeticSort[List @@ TextProcessing`TextModificationDump`$stopWords["English"]] // Short
"a", "A", "about", "above", "across", "after", "again", "against", "all", "almost",
"alone", "along", "already", "also", "although", <<240>>,
"within", "without", "won't", "would", "wouldn't", "yet", "you", "you'd", "you'll",
"you're", "you've", "your", "yours", "yourself", "yourselves"
edited 31 mins ago
answered 42 mins ago
J. M. is somewhat okay.â¦
92.7k10286440
92.7k10286440
Wow, these are some undocumented functions?
â ÃÂûÃÂþñýôÃÂÿàÃÂõóó
40 mins ago
But your codes merely return"English"as the result on my machine. Did I miss something?
â ÃÂûÃÂþñýôÃÂÿàÃÂõóó
38 mins ago
1
@ÃÂûÃÂþñýôÃÂÿÃÂÃÂõóó One has to evaluateDeleteStopwordsfirst. The actual code forDeleteStopwordsandTextProcessing`TextModificationDump`$stopWordsis stored in a file and is loaded only afterDeleteStopwordsis evaluated in the current session.
â Henrik Schumacher
33 mins ago
Ah, forgot the autoload. Thanks @Henrik!
â J. M. is somewhat okay.â¦
31 mins ago
@J.M. Always at your service! =)
â Henrik Schumacher
31 mins ago
add a comment |Â
Wow, these are some undocumented functions?
â ÃÂûÃÂþñýôÃÂÿàÃÂõóó
40 mins ago
But your codes merely return"English"as the result on my machine. Did I miss something?
â ÃÂûÃÂþñýôÃÂÿàÃÂõóó
38 mins ago
1
@ÃÂûÃÂþñýôÃÂÿÃÂÃÂõóó One has to evaluateDeleteStopwordsfirst. The actual code forDeleteStopwordsandTextProcessing`TextModificationDump`$stopWordsis stored in a file and is loaded only afterDeleteStopwordsis evaluated in the current session.
â Henrik Schumacher
33 mins ago
Ah, forgot the autoload. Thanks @Henrik!
â J. M. is somewhat okay.â¦
31 mins ago
@J.M. Always at your service! =)
â Henrik Schumacher
31 mins ago
Wow, these are some undocumented functions?
â ÃÂûÃÂþñýôÃÂÿàÃÂõóó
40 mins ago
Wow, these are some undocumented functions?
â ÃÂûÃÂþñýôÃÂÿàÃÂõóó
40 mins ago
But your codes merely return
"English" as the result on my machine. Did I miss something?â ÃÂûÃÂþñýôÃÂÿàÃÂõóó
38 mins ago
But your codes merely return
"English" as the result on my machine. Did I miss something?â ÃÂûÃÂþñýôÃÂÿàÃÂõóó
38 mins ago
1
1
@ÃÂûÃÂþñýôÃÂÿÃÂÃÂõóó One has to evaluate
DeleteStopwords first. The actual code for DeleteStopwords and TextProcessing`TextModificationDump`$stopWords is stored in a file and is loaded only after DeleteStopwords is evaluated in the current session.â Henrik Schumacher
33 mins ago
@ÃÂûÃÂþñýôÃÂÿÃÂÃÂõóó One has to evaluate
DeleteStopwords first. The actual code for DeleteStopwords and TextProcessing`TextModificationDump`$stopWords is stored in a file and is loaded only after DeleteStopwords is evaluated in the current session.â Henrik Schumacher
33 mins ago
Ah, forgot the autoload. Thanks @Henrik!
â J. M. is somewhat okay.â¦
31 mins ago
Ah, forgot the autoload. Thanks @Henrik!
â J. M. is somewhat okay.â¦
31 mins ago
@J.M. Always at your service! =)
â Henrik Schumacher
31 mins ago
@J.M. Always at your service! =)
â Henrik Schumacher
31 mins ago
add a comment |Â
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmathematica.stackexchange.com%2fquestions%2f182531%2fwhat-stopwords-list-is-wolfram-language-using%23new-answer', 'question_page');
);
Post as a guest
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
