Wolfram Language Paclet Repository
Community-contributed installable additions to the Wolfram Language
Creation of- and classification with ensembles of classifiers
Contributed by: Anton Antonov
Functions for creating Machine Learning classifier ensembles and making classifications using averaged probabilities, votes, and thresholds.
To install this paclet in your Wolfram Language environment,
evaluate this code:
PacletInstall["AntonAntonov/ClassifierEnsembles"]
To load the code after installation, evaluate this code:
Needs["AntonAntonov`ClassifierEnsembles`"]
Get training data:
| In[1]:= | ![]() |
Get testing data:
| In[2]:= | ![]() |
Summarize the training and testing data:
| In[3]:= |
| Out[3]= | ![]() |
Create a classifier ensemble using Classify method names:
| In[4]:= | ![]() |
| Out[5]= | ![]() |
Classify a record with the ensemble:
| In[6]:= |
| Out[6]= |
Classify a record using classifier votes and specifying 2 to be the threshold for the label "survived":
| In[7]:= |
| Out[7]= |
Classify a record using the mean of the probabilities given by each classifier and a threshold for "survived":
| In[8]:= |
| Out[8]= |
Classify a list of records and return "survived" if it gets at least two votes:
| In[9]:= |
| Out[9]= |
Return "survived" if its average probability is at least 0.7:
| In[10]:= |
| Out[10]= |
Get the probabilities:
| In[11]:= |
| Out[11]= |
Get the votes:
| In[12]:= |
| Out[12]= |
Compute classifier measurements:
| In[13]:= | ![]() |
| Out[14]= |
Compute Receiver Operating Characteristic (ROC) for a range of thresholds for the label "survived" (using the paclet "ROCFunctions"):
| In[15]:= | ![]() |
| Out[8]= | ![]() |
EnsembleClassifier can take data arguments that Classify can take. Here is a Dataset object for the built-in Titanic data:
| In[16]:= |
Here we split that Dataset object into two (training and testing):
| In[17]:= |
| Out[8]= | ![]() |
Here we make a classifier ensemble with the training dataset:
| In[18]:= |
| Out[18]= | ![]() |
Here we classify the records of the testing dataset:
| In[19]:= |
| Out[20]= |
Confusion matrix can be computed and plotted using functions of the paclet "ROCFunctions". Here is an example:
| In[21]:= |
| Out[21]= | ![]() |
Here is a flowchart that summarizes the classification process (made with Mermaid-JS):
| In[22]:= | ![]() |
| Out[22]= | ![]() |