MassIVE MSV000082648

Partial Public

Deep learning using tumor HLA peptide mass spectrometry datasets improves neoantigen identification

Comment Reanalyze Spectra Add Reanalysis

Description

Neoantigens, which are expressed on tumor cells, are one of the main targets of an effective anti-tumor T-cell response. Cancer immunotherapies to target neoantigens are of growing interest, and are currently in early human trials, but methods to identify neoantigens either require invasive or difficult-to-obtain clinical specimens, the screening of hundreds to thousands of synthetic peptides or tandem minigenes or are only relevant to specific human leukocyte antigen (HLA) alleles. We apply deep learning to a large (N=74 patients) HLA peptide and genomic dataset from various human tumors to create a computational model of antigen presentation for neoantigen prediction. We show that our model, named EDGE, increases the positive predictive value of HLA antigen prediction by up to 9 fold. We apply EDGE to enable identification of neoantigens and neoantigen-reactive T cells using routine clinical specimens and small numbers of synthetic peptides for most common HLA alleles. EDGE could enable an improved ability to develop neoantigen-targeted immunotherapies for cancer patients. [dataset license: Custom User License]

Keywords: HLA, Machine Learning

Contact

Principal Investigators: (in alphabetical order)	Roman Yelensky, Gritstone Oncology, United States
Submitting User:	bbulik

Number of Files:
Total Size:
Spectra:
Subscribers:

	Owner	Reanalyses
Experimental Design
Conditions:
Biological Replicates:
Technical Replicates:

Identification Results
Proteins (Human, Remapped):
Proteins (Reported):
Peptides:
Variant Peptides:
PSMs:

Quantification Results
Differential Proteins:
Quantified Proteins:

Browse Dataset Files

FTP Download Link (click to copy):

- Dataset Reanalyses

+ Dataset History

Number of distinct technical replicates across all analyses (original submission and reanalyses) associated with this dataset.

The technical replicate count is defined as the maximum number of times any one distinct combination of condition and biological replicate was analyzed across all files submitted in the "Metadata" category. In the case of fractionated experiments, only the first fraction is considered.

"N/A" means no results of this type were submitted.

Number of distinct proteins found to be differentially abundant in at least one comparison across all analyses (original submission and reanalyses) associated with this dataset.

A protein is differentially abundant if its change in abundance across conditions is found to be statistically significant with an adjusted p-value <= 0.05 and lists no issues associated with statistical tests for differential abundance.

Distinct protein accessions are counted across all files submitted in the "Statistical Analysis of Quantified Analytes" category having a "Protein" column in this dataset.

"N/A" means no results of this type were submitted.

MassIVE MSV000082648

Deep learning using tumor HLA peptide mass spectrometry datasets improves neoantigen identification

Description

Contact

Species

Instrument

Modifications

- Dataset Reanalyses

+ Dataset History