MassIVE MSV000089225

Partial Public PXD033106

HLA-II immunopeptidome profiling and deep learning reveal features of antigenicity to inform antigen discovery

Description

Strazar M, Park J, Abelin JG, Taylor HB, Pedersen TK, Plichta DR, Brown EM, Eraslan B, Ortiz K, Clauser KR, Carr SA, Xavier RJ, Graham DB. 2022. T cell responses are exquisitely antigen-specific and directed against peptide epitopes displayed by human leukocyte antigen (HLA) on the surface of presenting cells. In particular, class II HLA (HLA-II) is remarkably polymorphic, which allows for presentation of diverse peptide antigens to T cells, but also forms the basis for genetic associations with diverse immunopathologies across the spectrum of infectious disease and autoimmunity. Here, we employ monoallelic immunopeptidomics to retrieve over 200,000 unique peptides presented by 41 HLA-II heterodimers covering major alleles across diverse ancestries. We leveraged this expansive dataset to develop computational models that predict peptide antigens based on HLA-II binding properties and infer informative features of the protein antigens from which these peptides derive. Combining both peptide and (contextual) protein features, we develop Context Aware Predictor of T cell Antigens (CAPTAn) to discover novel T cell epitopes from prokaryotes in the human microbiome and the viral pandemic pathogen SARS-CoV-2. [doi:10.25345/C5C24QR72] [dataset license: CC0 1.0 Universal (CC0 1.0)]

Keywords: immunopeptidomics ; HLA-II

Contact

Principal Investigators: (in alphabetical order)	Steven A. Carr, Broad Institute of MIT and Harvard, United States
Submitting User:	clauser

Number of Files:
Total Size:
Spectra:
Subscribers:

	Owner	Reanalyses
Experimental Design
Conditions:
Biological Replicates:
Technical Replicates:

Identification Results
Proteins (Human, Remapped):
Proteins (Reported):
Peptides:
Variant Peptides:
PSMs:

Quantification Results
Differential Proteins:
Quantified Proteins:

Browse Dataset Files

FTP Download Link (click to copy):

Species

Instrument

Orbitrap Exploris 480

Modifications

- Dataset Reanalyses

+ Dataset History

Number of distinct proteins found to be differentially abundant in at least one comparison across all analyses (original submission and reanalyses) associated with this dataset.

A protein is differentially abundant if its change in abundance across conditions is found to be statistically significant with an adjusted p-value <= 0.05 and lists no issues associated with statistical tests for differential abundance.

Distinct protein accessions are counted across all files submitted in the "Statistical Analysis of Quantified Analytes" category having a "Protein" column in this dataset.

"N/A" means no results of this type were submitted.