MassIVE MSV000082283

Complete Public PXD009533

CDK12 regulates DNA damage response genes through suppression of premature cleavage and polyadenylation

Description

Cyclin-dependent kinases (CDKs) are key regulators of the cell cycle or transcription, and are emerging as promising therapeutic targets in cancer cells. The transcriptional kinase, CDK12, is especially intriguing, as it modulates transcription elongation by phosphorylating the carboxy-terminal domain (CTD) of RNA polymerase (Pol II) with subsequent effects on expression of DNA damage response (DDR) genes. Yet the exact mechanism by which CDK12 regulates this vital process remains unclear. Using the selective inhibitor of CDK12/13, THZ5317 and transient transcriptome sequencing (TT-seq), we sought to capture the dynamic transcriptional changes linked to CDK12 by analyzing the immediate, early effects on nascent RNA production after CDK12 depletion in neuroblastoma (NB) cells. CDK12-induced inhibition resulted in gene length-dependent defects in transcription elongation, leading to a disproportionate loss of 3' reads in long genes. Short transcripts, by contrast, including those of replication-dependent histone genes, underwent 3' UTR extension. The effect on long genes was accompanied by premature cleavage and polyadenylation (PCPA), followed by early termination and loss of gene expression. DDR gene transcripts were the most prominent among those terminated by PCPA. Phosphoproteomic analysis indicated that pre-mRNA processing factors, including those involved in CPA , are direct phospho-targets of CDK12 and that CDK depletion phenocopied their effects on transcription. Importantly, sensitivity to CDK12 inhibition was predicted by a lower proportion of U1 snRNP binding sites compared to polyadenylation sites (PAS) and hence a higher propensity to intronic poly(A) site selection - a characteristic observed in most DDR genes. These results support a model in which CDK12 regulates expression of DDR genes not only by regulating transcription elongation through its effects on Pol II CTD phosphorylation, but also through direct phosphorylation of pre-mRNA processing factors. These mechanistic insights may enable the development of new anti-cancer strategies targeting multiple vulnerable steps in CDK12-dependent regulation of DNA damage repair. [dataset license: CC0 1.0 Universal (CC0 1.0)]

Keywords: Cyclin-dependent kinase ; CDK ; CDK12 ; CDK13 ; DNA damage ; premature cleavage ; polyadenylation ; THZ5317 ; transcription ; phosphorylation ; SILAC ; phosphoproteomics

Contact

Principal Investigators:
(in alphabetical order)
Scott Gerber, The Geisel School of Medicine at Dartmouth, United States
Submitting User: madamo
Number of Files:
Total Size:
Spectra:
Subscribers:
 
Owner Reanalyses
Experimental Design
    Conditions:
    Biological Replicates:
    Technical Replicates:
 
Identification Results
    Proteins (Human, Remapped):
    Proteins (Reported):
    Peptides:
    Variant Peptides:
    PSMs:
 
Quantification Results
    Differential Proteins:
    Quantified Proteins:
 
Browse Dataset Files Browse Results
 
FTP Download Link (click to copy):

- Dataset Reanalyses


+ Dataset History


Click here to queue conversion of this dataset's submitted spectrum files to open formats (e.g. mzML). This process may take some time.

When complete, the converted files will be available in the "ccms_peak" subdirectory of the dataset's FTP space (accessible via the "FTP Download" link to the right).
Number of distinct conditions across all analyses (original submission and reanalyses) associated with this dataset.

Distinct condition labels are counted across all files submitted in the "Metadata" category having a "Condition" column in this dataset.

"N/A" means no results of this type were submitted.
Number of distinct biological replicates across all analyses (original submission and reanalyses) associated with this dataset.

Distinct replicate labels are counted across all files submitted in the "Metadata" category having a "BioReplicate" or "Replicate" column in this dataset.

"N/A" means no results of this type were submitted.
Number of distinct technical replicates across all analyses (original submission and reanalyses) associated with this dataset.

The technical replicate count is defined as the maximum number of times any one distinct combination of condition and biological replicate was analyzed across all files submitted in the "Metadata" category. In the case of fractionated experiments, only the first fraction is considered.

"N/A" means no results of this type were submitted.
Originally identified proteins that were automatically remapped by MassIVE to proteins in the SwissProt human reference database.

"N/A" means no results of this type were submitted.
Number of distinct protein accessions reported across all analyses (original submission and reanalyses) associated with this dataset.

"N/A" means no results of this type were submitted.
Number of distinct unmodified peptide sequences reported across all analyses (original submission and reanalyses) associated with this dataset.

"N/A" means no results of this type were submitted.
Number of distinct peptide sequences (including modified variants or peptidoforms) reported across all analyses (original submission and reanalyses) associated with this dataset.

"N/A" means no results of this type were submitted.
Total number of peptide-spectrum matches (i.e. spectrum identifications) reported across all analyses (original submission and reanalyses) associated with this dataset.

"N/A" means no results of this type were submitted.
Number of distinct proteins quantified across all analyses (original submission and reanalyses) associated with this dataset.

Distinct protein accessions are counted across all files submitted in the "Statistical Analysis of Quantified Analytes" category having a "Protein" column in this dataset.

"N/A" means no results of this type were submitted.
Number of distinct proteins found to be differentially abundant in at least one comparison across all analyses (original submission and reanalyses) associated with this dataset.

A protein is differentially abundant if its change in abundance across conditions is found to be statistically significant with an adjusted p-value <= 0.05 and lists no issues associated with statistical tests for differential abundance.

Distinct protein accessions are counted across all files submitted in the "Statistical Analysis of Quantified Analytes" category having a "Protein" column in this dataset.

"N/A" means no results of this type were submitted.
This dataset may not contain all raw spectra data as originally deposited in PRIDE. It has been imported to MassIVE for reanalysis purposes, so its spectra data here may consist solely of processed peak lists suitable for reanalysis with most software.