Wolfram Function Repository
Instant-use add-on functions for the Wolfram Language
Function Repository Resource:
Search the PubChem database for similar compounds
ResourceFunction["PubChemSimilaritySearch"][mol] returns a list of "PubChemCompoundID" identifiers for compounds similar to the molecule or chemical entity mol. | |
ResourceFunction["PubChemSimilaritySearch"][mol,"Molecule"] returns a list of Molecule objects constructed from the external identifiers similar to mol. |
| "Similarity2DSearch" | Tanimoto similarity of topological fingerprints |
| "Similarity3DSearch" | Tanimoto similarity of 3D shape fingerprints |
| "Original" | exact match to input |
| "Parent" | parent compound |
| "SameStereo" | same stereo |
| "SameIsotopes" | same isotopes |
| "SameConnectivity" | same connectivity |
| "SameFormula" | same molecular formula |
| "SameTautomer" | same tautomer |
| "SameParent" | same parent |
| "SameParentStereo" | same parent stereo |
| "SameParentIsotopes" | same parent isotopes |
| "SameParentConnectivity" | same parent connectivity |
| "SameParentTautomer" | same parent tautomer |
Find the compound ID for a similar molecules:
| In[1]:= |
| Out[1]= | ![]() |
Perform the same search, but return the results as Molecule objects:
| In[2]:= |
| Out[2]= | ![]() |
Visualize the results using MoleculePlot:
| In[3]:= |
| Out[3]= | ![]() |
Search PubChem for tautomers:
| In[4]:= | ![]() |
| Out[4]= |
Get a list of molecules with the same connectivity as adenosine:
| In[5]:= | ![]() |
| Out[5]= | ![]() |
Visualize the molecules using MoleculePlot:
| In[6]:= |
| Out[6]= | ![]() |
Get a list of IDs for compounds with the same parent:
| In[7]:= | ![]() |
| Out[7]= |
Get a list of isomers by using the "SameFormula" search type:
| In[8]:= | ![]() |
| Out[8]= | ![]() |
PubChemSimilaritySearch can be used with Entity or Molecule objects:
| In[9]:= | ![]() |
| Out[9]= |
PubChemSimilaritySearch will automatically thread over lists:
| In[10]:= | ![]() |
| Out[11]= |
By adjusting the Tanimoto threshold, the number of similar compounds returned can be controlled:
| In[12]:= | ![]() |
| Out[12]= |
Use ListLogPlot to visualize the relationship between threshold and the size of the similarity space:
| In[13]:= |
| Out[13]= | ![]() |
This work is licensed under a Creative Commons Attribution 4.0 International License