Skip to content

First embedding file design#2328

Draft
i-am-leslie wants to merge 3 commits into
cBioPortal:masterfrom
i-am-leslie:feat/add-embedding-support
Draft

First embedding file design#2328
i-am-leslie wants to merge 3 commits into
cBioPortal:masterfrom
i-am-leslie:feat/add-embedding-support

Conversation

@i-am-leslie

Copy link
Copy Markdown

What?

N/A

Describe changes proposed in this pull request:

  • Added embedding dataset files for ingestion pipeline for msk_impact_50k_2026
  • Added metadata and definition files for embeddings for msk_impact_50k_2026

checks

For all pull requests:

  • Passes validation

For a new study (in addition to above):

  • Does study name and study ID follow our convention? e.g. Tumor_Type (Institue, Journal Year); brca_mskcc_2015
  • Is the study meta data complete? e.g. pmid, citation
  • Were all samples profiled with WES/WGS? If not, is gene panel file curated?
  • Are oncotree codes of all samples curated; Cancer Type and Cancer Type Detailed needs to be added in addition to Oncotree Code
  • Clinical sample and patient data with meta files.
  • Mutations data with meta file.
  • Is the study based on hg38? If so, is the reference_genome: hg38 option included in meta study.
  • CNA data with meta files
  • CNA segment data with meta files
  • Expression data including z-scores with meta files
  • Other genomic profiles with meta files
  • Case-lists for all profiles.
  • Perform sanity checks based on the items in the checklist
  • Manual checking (Niki or JJ): Triage or private Portal link here

@i-am-leslie i-am-leslie marked this pull request as draft June 1, 2026 14:50
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant