MassIVE MSV000098448

Partial Public PXD065896

AlphaDIA enables DIA Transfer Learning for Feature-Free Proteomics

Comment Reanalyze Spectra Add Reanalysis

Description

The scale of data generated for mass spectrometry-based proteomics as well as modern acquisition strategies pose a challenge to bioinformatic analysis. Search engines need to make optimal use of the data for biological discoveries while remaining statistically rigorous, transparent and performant. Here we present alphaDIA, a modular open-source search framework for data independent acquisition (DIA) proteomics. We developed a feature-free identification algorithm that performs machine learning directly on the raw signal and is particularly suited for detecting patterns in data produced by time-of-flight instruments. Benchmarking demonstrates competitive identification and quantification performance. While the method supports empirical spectral libraries, we propose a search strategy named DIA transfer learning that uses fully predicted libraries. This entails continuously optimizing a deep neural network for predicting machine- and experiment-specific properties, enabling the generic DIA analysis of any post-translational modification. AlphaDIA provides a high performance and accessible framework running locally or in the cloud, opening DIA analysis to the community. [doi:10.25345/C5GF0N84Q] [dataset license: CC0 1.0 Universal (CC0 1.0)]

Keywords: search engine ; DatasetType:Proteomics

Contact

Principal Investigators: (in alphabetical order)	Matthias Mann, Proteomics and Signal Transduction Max Planck Institute of Biochemistry Am Klopferspitz 18 D-82152 Martinsried, N/A
Submitting User:	wallmann

Number of Files:
Total Size:
Spectra:
Subscribers:

	Owner	Reanalyses
Experimental Design
Conditions:
Biological Replicates:
Technical Replicates:

Identification Results
Proteins (Human, Remapped):
Proteins (Reported):
Peptides:
Variant Peptides:
PSMs:

Quantification Results
Differential Proteins:
Quantified Proteins:

Browse Dataset Files

FTP Download Link (click to copy):

- Dataset Reanalyses

+ Dataset History

Number of distinct proteins found to be differentially abundant in at least one comparison across all analyses (original submission and reanalyses) associated with this dataset.

A protein is differentially abundant if its change in abundance across conditions is found to be statistically significant with an adjusted p-value <= 0.05 and lists no issues associated with statistical tests for differential abundance.

Distinct protein accessions are counted across all files submitted in the "Statistical Analysis of Quantified Analytes" category having a "Protein" column in this dataset.

"N/A" means no results of this type were submitted.

MassIVE MSV000098448

AlphaDIA enables DIA Transfer Learning for Feature-Free Proteomics

Description

Contact

Species

Instrument

Modifications

- Dataset Reanalyses

+ Dataset History