Wolfram Language Paclet Repository

Community-contributed installable additions to the Wolfram Language

Primary Navigation

    • Cloud & Deployment
    • Core Language & Structure
    • Data Manipulation & Analysis
    • Engineering Data & Computation
    • External Interfaces & Connections
    • Financial Data & Computation
    • Geographic Data & Computation
    • Geometry
    • Graphs & Networks
    • Higher Mathematical Computation
    • Images
    • Knowledge Representation & Natural Language
    • Machine Learning
    • Notebook Documents & Presentation
    • Scientific and Medical Data & Computation
    • Social, Cultural & Linguistic Data
    • Strings & Text
    • Symbolic & Numeric Computation
    • System Operation & Setup
    • Time-Related Computation
    • User Interface Construction
    • Visualization & Graphics
    • Random Paclet
    • Alphabetical List
  • Using Paclets
    • Get Started
    • Download Definition Notebook
  • Learn More about Wolfram Language

TCGADataTool

Guides

  • TCGA Data Tool

Tech Notes

  • Custom Entities
  • Data Exploration
  • Data Modeling
  • Data Visualization
  • Genomic Data
  • Images Download
  • Property Standard Name
  • User Interface

Symbols

  • buildDesignMatrix
  • buildModel
  • cleanRawData
  • columnHeaderRiskClassSummary
  • downloadGenomicData
  • dynamicallyExploreThreshold
  • exampleDataTCGA
  • getHistologicalImages
  • getPotentialPredictors
  • importGenomicDataFile
  • inspectPotentialPredictors
  • modelMeasurementsAtThreshold
  • overallSurvivalPlot
  • progressionFreeSurvivalPlot
  • pullDataSlice
  • radiologicalImagesBatchProcessing
  • swimmerPlot
  • TCGADataToolUserInterface
JaneShenGunther`TCGADataTool`
buildDesignMatrix
​
buildDesignMatrix
[dataStructure,predictors,outcome]
creates a design matrix with data from
dataStructure
for the input
predictors
and
outcome
.
​
Details and Options

Examples  
(9)
Basic Examples  
(1)
Given the example data structure for project TCGA-CESC:
In[1]:=
dataStructure=
exampleDataTCGA
[{"TCGAProjectData","TCGACESCExceptGenomicDataAllPatients"}];
In[2]:=
dataStructure//Length
Out[2]=
307
Create the design matrix:
In[3]:=
outcomeProperty="Clinical::Patient::vital_status";​​predictors={"Clinical::Patient::person_neoplasm_cancer_status","Clinical::Patient::eastern_cancer_oncology_group","Biospecimen::Analyte::analyte_type"};
In[4]:=
designMatrix=
buildDesignMatrix
[dataStructure,predictors,outcomeProperty];
In[5]:=
designMatrix//Length
Out[5]=
307
In[6]:=
designMatrix〚;;3〛//Dataset
Out[6]=
bcr_patient_barcode
bcr_patient_uuid
Biospecimen::Analyte::analyte_type
Clinical::Patient::eastern_cancer_oncology_group
Clinical::Patient::person_neoplasm_cancer_status
Clinical::Patient::vital_status
TCGA-ZJ-AAXN
5f22e050-c172-4f2d-a1f9-6cb749b6ef98
{DNA,RNA,DNA}
{Missing[NotAvailable]}
{Missing[Unknown]}
{Alive}
TCGA-HM-A3JJ
f150d999-cdec-427e-b8b2-00731c648989
{DNA,RNA,Repli-G (Qiagen) DNA,DNA,Repli-G (Qiagen) DNA,DNA,RNA,Repli-G (Qiagen) DNA}
{0}
{WITH TUMOR}
{Dead}
TCGA-HG-A9SC
cf87812f-6db6-4592-9842-c62b3e4ff03f
{DNA,RNA,DNA}
{0}
{TUMOR FREE}
{Alive}
Print statements are generated if properties are discarded from the design matrix:
In[7]:=
predictors={"Clinical::Patient::person_neoplasm_cancer_status","Clinical::Patient::tumor_tissue_site"};
In[8]:=
designMatrix=
buildDesignMatrix
[dataStructure,predictors,outcomeProperty];
Wed 15 Feb 2023 12:29:33 1 column(s) dropped because contained only one value: Clinical::Patient::tumor_tissue_site
In[9]:=
designMatrix〚;;3〛//Dataset
Out[9]=
bcr_patient_barcode
bcr_patient_uuid
Clinical::Patient::person_neoplasm_cancer_status
Clinical::Patient::vital_status
TCGA-ZJ-AAXN
5f22e050-c172-4f2d-a1f9-6cb749b6ef98
{Missing[Unknown]}
{Alive}
TCGA-HM-A3JJ
f150d999-cdec-427e-b8b2-00731c648989
{WITH TUMOR}
{Dead}
TCGA-HG-A9SC
cf87812f-6db6-4592-9842-c62b3e4ff03f
{TUMOR FREE}
{Alive}
In[10]:=
dataStructure〚All,"Clinical","Patient",All,"tumor_tissue_site"〛//Union
Out[10]=
{{Cervical}}
Options  
(8)

SeeAlso
getPotentialPredictors
 
▪
inspectPotentialPredictors
 
▪
buildModel
TechNotes
▪
Property Standard Name
▪
Data Modeling
RelatedGuides
▪
TCGA Data Tool
""

© 2025 Wolfram. All rights reserved.

  • Legal & Privacy Policy
  • Contact Us
  • WolframAlpha.com
  • WolframCloud.com