Wolfram Language Paclet Repository

Community-contributed installable additions to the Wolfram Language

Primary Navigation

    • Cloud & Deployment
    • Core Language & Structure
    • Data Manipulation & Analysis
    • Engineering Data & Computation
    • External Interfaces & Connections
    • Financial Data & Computation
    • Geographic Data & Computation
    • Geometry
    • Graphs & Networks
    • Higher Mathematical Computation
    • Images
    • Knowledge Representation & Natural Language
    • Machine Learning
    • Notebook Documents & Presentation
    • Scientific and Medical Data & Computation
    • Social, Cultural & Linguistic Data
    • Strings & Text
    • Symbolic & Numeric Computation
    • System Operation & Setup
    • Time-Related Computation
    • User Interface Construction
    • Visualization & Graphics
    • Random Paclet
    • Alphabetical List
  • Using Paclets
    • Get Started
    • Download Definition Notebook
  • Learn More about Wolfram Language

TCGADataTool

Guides

  • TCGA Data Tool

Tech Notes

  • Custom Entities
  • Data Exploration
  • Data Modeling
  • Data Visualization
  • Genomic Data
  • Images Download
  • Property Standard Name
  • User Interface

Symbols

  • buildDesignMatrix
  • buildModel
  • cleanRawData
  • columnHeaderRiskClassSummary
  • downloadGenomicData
  • dynamicallyExploreThreshold
  • exampleDataTCGA
  • getHistologicalImages
  • getPotentialPredictors
  • importGenomicDataFile
  • inspectPotentialPredictors
  • modelMeasurementsAtThreshold
  • overallSurvivalPlot
  • progressionFreeSurvivalPlot
  • pullDataSlice
  • radiologicalImagesBatchProcessing
  • swimmerPlot
  • TCGADataToolUserInterface
Data Visualization
Wolfram language statistical visualizations
TCGADataTool visualizations
It is possible to create various visualizations of the data downloaded using the
TCGADataToolUserInterface
.
This loads the paclet.
In[1]:=
Needs["JaneShenGunther`TCGADataTool`"]
Wolfram language statistical visualizations
TCGA project data is downloaded from the
TCGADataToolUserInterface
in the form of a list of
Association
, one for each patient in the project. It is possible to use built-in Wolfram language statistical visualization functions to explore the data.
Load TCGA-CESC example data structure and its description.
In[2]:=
dataStructure=
exampleDataTCGA
[{"TCGAProjectData","TCGACESCExceptGenomicDataAllPatients"}];

Basic examples

Histogram
of property Clinical::Patient::age_began_smoking_in_years
In[36]:=
ageBeginSmoking=
pullDataSlice
[dataStructure,"Clinical::Patient::age_began_smoking_in_years","DeleteMissing"True,"ListHandling"(If[ListQ[#],First[#],#]&)];
In[38]:=
Histogram[Flatten@Values[ageBeginSmoking],Automatic,"Probability",PlotRangeAll,FrameTrue,GridLinesAutomatic,PlotLabel"Age began smoking\n(years)"]
Out[38]=
PieChart
of patient vital_status
In[86]:=
vitalStatusData=Flatten@Values@
pullDataSlice
[dataStructure,"Clinical::Patient::vital_status","DeleteMissing"True,"ListHandling"(If[ListQ[#],First[#],#]&)];
In[88]:=
vitalStatusData〚;;3〛
Out[88]=
{Alive,Dead,Alive}
In[90]:=
PieChart[​​ Counts[vitalStatusData],​​ SectorOriginTop,LabelingFunction"RadialCenter",​​ PlotLabel"Clinical::Patient::vital_status",​​ ChartLabelsNone,ChartLegendsAutomatic​​]
Out[90]=
Alive
Dead
Plot age of initial pathologic diagnosis vs age began smoking.
In[94]:=
ageProperties=
pullDataSlice
[dataStructure,{"Clinical::Patient::age_at_initial_pathologic_diagnosis","Clinical::Patient::age_began_smoking_in_years"},"DeleteMissing"True,"ListHandling"(If[ListQ[#],First[#],#]&)];​​ageProperties〚;;2〛
Out[95]=
{Clinical::Patient::age_at_initial_pathologic_diagnosis34,Clinical::Patient::age_began_smoking_in_years21,Clinical::Patient::age_at_initial_pathologic_diagnosis50,Clinical::Patient::age_began_smoking_in_years12}
In[103]:=
ListPlot[ageProperties,PlotRangeAll,FrameTrue,FrameLabel{"age_at_initial_pathologic_diagnosis","age_began_smoking_in_years"},GridLinesAutomatic,GridLinesStyleLightGray]
Out[103]=

Additional examples

Distribution of the difference between age at diagnosis and age began smoking.
In[34]:=
ageProperties=
pullDataSlice
[dataStructure,{"Clinical::Patient::age_at_initial_pathologic_diagnosis","Clinical::Patient::age_began_smoking_in_years"},"DeleteMissing"True,"ListHandling"(If[ListQ[#],First[#],#]&)];
In[51]:=
ageProperties〚;;2〛
Out[51]=
{Clinical::Patient::age_at_initial_pathologic_diagnosis34,Clinical::Patient::age_began_smoking_in_years21,Clinical::Patient::age_at_initial_pathologic_diagnosis50,Clinical::Patient::age_began_smoking_in_years12}
In[52]:=
ageProperties=Merge[ageProperties,Identity]
Out[52]=
Clinical::Patient::age_at_initial_pathologic_diagnosis{34,50,43,63,51,25,41,34,80,49,42,37,56,51,35,64,55,33,56,49,35,36,45,57,27,37,78,69,61,54,47,47,50,43,36,58,46,44,65,34,68,47,65,21,85,48,33,38,51,40,29,63,34,42,46,24,42,46,38,44,33,53,54,26,32,31,58,79,45,35,24,48,38,38,60,55,37,53,74,43,54,46,32,47,51},Clinical::Patient::age_began_smoking_in_years{21,12,16,24,14,13,21,21,27,24,22,17,39,27,25,18,15,14,24,20,23,20,26,43,14,17,20,29,30,18,15,17,15,34,16,33,13,18,12,24,20,16,15,13,23,16,19,36,15,18,14,15,27,23,16,16,12,16,20,35,16,17,20,15,14,16,27,23,17,14,11,29,18,27,37,30,16,22,41,33,18,24,16,44,16}
In[53]:=
ageDifference=ageProperties["Clinical::Patient::age_at_initial_pathologic_diagnosis"]-ageProperties["Clinical::Patient::age_began_smoking_in_years"]
Out[53]=
{13,38,27,39,37,12,20,13,53,25,20,20,17,24,10,46,40,19,32,29,12,16,19,14,13,20,58,40,31,36,32,30,35,9,20,25,33,26,53,10,48,31,50,8,62,32,14,2,36,22,15,48,7,19,30,8,30,30,18,9,17,36,34,11,18,15,31,56,28,21,13,19,20,11,23,25,21,31,33,10,36,22,16,3,35}
In[57]:=
Histogram[ageDifference,Automatic,"Probability",PlotRangeAll,FrameTrue,GridLinesAutomatic,PlotLabel"Difference between\nage at diagnosis and age began smoking\n(years)"]
Out[57]=
Distribution of the difference between age at diagnosis and age began smoking, color coded by eastern cancer oncology group.
In[114]:=
patientData=
pullDataSlice
[dataStructure,{"Clinical::Patient::age_at_initial_pathologic_diagnosis","Clinical::Patient::age_began_smoking_in_years","Clinical::Patient::eastern_cancer_oncology_group"},"DeleteMissing"True,"ListHandling"(If[ListQ[#],First[#],#]&)];
In[115]:=
patientData〚;;2〛
Out[115]=
{Clinical::Patient::age_at_initial_pathologic_diagnosis50,Clinical::Patient::age_began_smoking_in_years12,Clinical::Patient::eastern_cancer_oncology_group1,Clinical::Patient::age_at_initial_pathologic_diagnosis43,Clinical::Patient::age_began_smoking_in_years16,Clinical::Patient::eastern_cancer_oncology_group2}
In[116]:=
groupedPatientData=GroupBy[patientData,#["Clinical::Patient::eastern_cancer_oncology_group"]&];
TCGADataTool visualizations
The TCGADataTool paclet includes custom visualizations for TCGA project data.
TCGADataTool specific visualization functions.
Create a swimmer plot for all patients treated with Cisplatin.
Plot overall survival rate.
Plot progression free survival rate.

© 2025 Wolfram. All rights reserved.

  • Legal & Privacy Policy
  • Contact Us
  • WolframAlpha.com
  • WolframCloud.com