Wolfram Function Repository
Instant-use add-on functions for the Wolfram Language
Function Repository Resource:
Stopwords for multiple languages
ResourceFunction["LinguaStopwords"][lang] gives stopwords of the language lang. |
Armenian stopwords:
| In[1]:= |
| Out[1]= | ![]() |
Bulgarian stopwords:
| In[2]:= |
| Out[2]= | ![]() |
Hindi stopwords:
| In[3]:= |
| Out[3]= | ![]() |
Zulu stopwords:
| In[4]:= |
| Out[4]= |
LinguaStopwords works with entities. Here is an example with a language entity:
| In[5]:= |
| Out[5]= |
Here is an example with a country entity:
| In[6]:= |
| Out[6]= |
The stopwords for all languages can be obtained with the argument All:
| In[7]:= |
| Out[7]= | ![]() |
Remove the stopwords from a text and show the top word counts:
| In[8]:= | ![]() |
| Out[8]= | ![]() |
Show the top word counts without stopword removal:
| In[9]:= |
| Out[9]= |
Here is a Pareto principle adherence plot of the sizes of stopword collections:
| In[10]:= | ![]() |
| Out[10]= | ![]() |
This work is licensed under a Creative Commons Attribution 4.0 International License