Wolfram Function Repository
Instant-use add-on functions for the Wolfram Language
Function Repository Resource: LinguaStopwords
Stopwords for multiple languages
ResourceFunction["LinguaStopwords"][lang] gives the stopwords of the language lang.
The basic examples retrieve the Armenian, Bulgarian, Hindi, and Zulu stopwords.
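The four basic lookups might be written as in the sketch below. The original input cells were images, so the exact argument form is an assumption: here each language is passed by its English name as a string, and each result is assumed to be a list of words.

```wolfram
(* Assumption: languages are specified by English name strings *)
stopwordsByLanguage = AssociationMap[
  ResourceFunction["LinguaStopwords"],
  {"Armenian", "Bulgarian", "Hindi", "Zulu"}
];

(* Number of stopwords returned per language, assuming each result is a list *)
Length /@ stopwordsByLanguage
```

Collecting the results in an association keyed by language name keeps the four word lists available for later comparison.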
LinguaStopwords works with entities. Here is an example with a language entity:
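A language entity call could look like the following sketch. The specific entity is a hypothetical choice for illustration; the original input was an image and is not recoverable.

```wolfram
(* Assumption: a "Language" entity is accepted in place of a language name *)
ResourceFunction["LinguaStopwords"][Entity["Language", "Bulgarian"]]
```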
Here is an example with a country entity:
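A country entity call might be sketched as follows. The country is a hypothetical choice, and how the function maps a country to one or more languages is not stated in the surviving text, so the shape of the result is an open assumption.

```wolfram
(* Assumption: a "Country" entity is resolved to a language associated with it *)
ResourceFunction["LinguaStopwords"][Entity["Country", "Bulgaria"]]
```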
The stopwords for all languages can be obtained with the argument All:
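A sketch of the All form is shown here. The variable name allStopwords is introduced only for illustration, and the structure of the returned value (for example, an association keyed by language) is an assumption.

```wolfram
(* Retrieve the stopword collections for every supported language *)
allStopwords = ResourceFunction["LinguaStopwords"][All];

(* Abbreviated view, since the full result is likely large *)
Short[allStopwords, 5]
```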
Remove the stopwords from a text and show the top word counts:
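One way to do this is sketched below. The sample text (Alice in Wonderland from ExampleData), the cutoff of 12 words, and the assumption that the English stopwords come back as a flat list of lowercase strings are all illustrative choices, not taken from the original input cell.

```wolfram
(* Sample text; any string would do *)
text = ExampleData[{"Text", "AliceInWonderland"}];

(* Assumption: the result is a flat list of lowercase strings *)
stopwords = ResourceFunction["LinguaStopwords"]["English"];

(* Count lowercase word occurrences, then drop the stopword keys *)
wordCounts = Counts[TextWords[ToLowerCase[text]]];
TakeLargest[KeyDrop[wordCounts, stopwords], 12]
```

Dropping keys from the counts association avoids scanning the full word list against the stopword list.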
Show the top word counts without stopword removal:
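For comparison, the unfiltered counts can be sketched the same way. The sample text and the cutoff of 12 are again assumptions made for illustration.

```wolfram
(* Same sample text, with no stopword filtering *)
text = ExampleData[{"Text", "AliceInWonderland"}];
TakeLargest[Counts[TextWords[ToLowerCase[text]]], 12]
```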
Here is a Pareto principle adherence plot of the sizes of stopword collections:
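A plot of this kind can be approximated by hand, as in the sketch below. It is not the original input, which may have used a dedicated plotting function; it assumes the All form returns an association whose values are word lists, and it draws the cumulative share of stopwords against collections sorted from largest to smallest, with guide lines at 20% of the collections and 80% of the total.

```wolfram
(* Assumption: an association of language -> list of stopwords *)
allStopwords = ResourceFunction["LinguaStopwords"][All];

(* Collection sizes, largest first *)
sizes = Sort[Length /@ Values[allStopwords], Greater];

(* Running fraction of all stopwords covered by the largest collections *)
cumulativeShare = N[Accumulate[sizes]/Total[sizes]];

ListLinePlot[cumulativeShare,
 PlotRange -> {0, 1},
 GridLines -> {{Ceiling[0.2 Length[sizes]]}, {0.8}},
 AxesLabel -> {"collections (largest first)", "cumulative share"}]
```

If the curve passes near the intersection of the two guide lines, the collection sizes roughly follow the 80/20 pattern.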
This work is licensed under a Creative Commons Attribution 4.0 International License