Electrospray ionization is a powerful and prevalent technique used to ionize analytes prior to measurement through mass spectrometry. The distribution of charges that an analyte receives (charge state distribution, CSD) is an important consideration for interpreting the mass spectra and for identifying analytes. However, due to an incomplete understanding of the ionization mechanism, the analyte properties that influence CSDs are not fully understood. Here, we employ a machine learning-based high-throughput approach and analyze CSDs of hundreds of thousands of peptides. We find that half of the peptides exhibit charges that differ from what one would naively expect (number of basic sites). Moreover, these peptides can be classified into two regimes---undercharging and overcharging---and that these two regimes display markedly different charging characteristics. Strikingly, peptides in the overcharging (but not undercharging) regime show minimal dependence on basic site count, and more generally, the two regimes exhibit distinct sequence determinants. These findings highlight the rich ionization behavior of peptides and the potential of CSDs for enhancing peptide identification.
[doi:10.25345/C56Q1SS76]
[dataset license: CC0 1.0 Universal (CC0 1.0)]
Keywords: proteomics, charge state distribution, voltage, gradient length, flow rate
Principal Investigators: (in alphabetical order) |
Marko Jovanovic, Columbia University, USA Oded Regev, New York University, USA |
Submitting User: | allynmuzhixu |
Number of Files: | |
Total Size: | |
Spectra: | |
Subscribers: | |
Owner | Reanalyses | |
---|---|---|
Experimental Design | ||
Conditions:
|
||
Biological Replicates:
|
||
Technical Replicates:
|
||
Identification Results | ||
Proteins (Human, Remapped):
|
||
Proteins (Reported):
|
||
Peptides:
|
||
Variant Peptides:
|
||
PSMs:
|
||
Quantification Results | ||
Differential Proteins:
|
||
Quantified Proteins:
|
||
Browse Dataset Files | |
FTP Download Link (click to copy):
|