• Introduction
• Release notes (Version 1.0.0)
• Software and Hardware Requirements
• Installation
• Input Files and their Preparation
• Update the parameter file
• Run the JUMPsilactmt program
• Output files
• Maintainers
• Acknowledgments
• References
JUMPsilactmt integrated data inputs from the JUMP batch program and the mzXML files of individual samples within each batch. It encompassed a two-fold quantification approach involving MS1 level quantification/normalization (specifically SILAC quantification) and MS2 level quantification. The workflow started with extracting TMT reporter ion peaks and consolidating m/z shifts among these reporters. Furthermore, it utilized an additional round of TMT reporter ion peak extraction in the initial phases of MS2 level quantification. The process assigned the lowest intensity across the entire batch to the respective PSMs if the reporters lacked quantification.
Additionally, the process applied a TMT impurity correction, assigning a value of 1 to reporters with an intensity of <=0 and subsequently incorporating the relevant proteins and peptides into the PSMs. Notably, this TMT impurity correction deviated slightly from the standard impurity correction used in our JUMP-q pipeline (Niu et al., 2017). In this case, the approach shifted from the previous evaluation of the final PSM quantification based on a maximum of (original reporter intensity/2) and impurity-corrected intensity to utilizing the impurity-corrected reporter intensities. The subsequent stages of MS2-level quantification included the application of loading bias and TMT noise corrections. Following the approach of Niu et al. (2017), the loading bias correction aimed to eliminate systematic sample processing discrepancies by normalizing reporter intensities for individual Lys PSMs. It involved the division of these intensities by PSM-wise mean values and their subsequent conversion into log-scale values.
Consequently, the majority of PSM intensities centered around zero across the reporters. The trimmed mean intensity for each reporter was generated by excluding the top and bottom 10% of values, providing the necessary normalization factors that were then converted to raw-scale single Lys PSMs. The process excluded PSMs containing decoys, contaminants, or anomalies.
The TMT noise correction process commenced with determining the minimum intensity observed across the entire batch. It is initiated by identifying noise channels for both light PSMs (representing the fully labeled channel) and heavy PSMs (representing the fully unlabeled channel). Subsequently, it set the TMT noise level for each PSM based on the first and second lowest signals. After this determination, it subjected both heavy and light PSMs to noise correction at varying levels as dictated by the TMT noise. Each light and heavy PSM underwent normalization based on the respective total intensity, and heavy PSMs were aggregated at the peptide and protein levels. The normalization involved the MS1 intensity ratio. This method encompassed the calculation of heavy and light peptide isotopic distributions for all identified peptides, including extracting heavy and light peptide MS1 intensities from each scan, focusing on the most significant peaks. It computed heavy and light peptide MS1 precursor intensities ratios for each PSM. Subsequently, it summarized the MS1 intensity ratio at the peptide level and normalized TMT noise-corrected light PSMs using the corresponding MS1 L/H ratio. After summarizing TMT noise-corrected light and heavy PSMs into the related peptides, it calculated each peptide's light-Lys% (L/(L+H)) using heavy and light peptides. Finally, it identified and removed outliers using the extreme studentized deviation (ESD) and Dixson Q-test methods. The entire pipeline concluded with the ultimate protein summarization.
If each protein (
Total
Total
The ratio of the mixed to the heavy peptide
Thus,
We can use the above simple equations to derive the averaged
As the
1. We can quantify proteins from multiple batches of samples generated by the SILAC-TMT-LC/LC-MS/MS process.
2. We can also estimate free lysine decay from this data if experimental free lysine information is unavailable.
• It needs the JUMP module to be loaded.
https://drive.google.com/drive/folders/1VRKWWJVgPSKCH_HU1_tiYiVo_ZdGxiD6?usp=sharing
Figure 1
• idtxt; This is the main output from JUMP-batch-id program
• id_res_folder; This is an associated folder generated by JUMP-batch-id program
• output_folder_protein; The output folder where protein quantification results will be stored
• output_folder_lysine; The output folder where free lysine estimation results will be stored
• tmt_reporters_used; TMT reporter information
• Sample information in for each sets of pulse experiment; for example set1, set2, and set3 in the sample parameter files
• noise_removal_in_lightPSM; Noise correction by fully heavy reporters ; 1 = Yes; 0 = No
• noise_removal_in_heavyPSM; Noise correction by fully light reporters ; 1 = Yes; 0 = No
• nc_level; Noise correction level; may vary from 0.1 to 0.95; It prevents overcorrection
• normalization; 1 = Normalize with MS1 intensity ratio (default); 2 = Normalize each channel in every PSM obtained using protein amount obtained by non-Lys PSMs
• free_lysine_estimation; 1= yes; 0= no
• ms1_peak_extraction_method; 1 = strongest intensity; 2 = closest to expected peptide mass; only if multiple peaks detected within mass tolerance
• strong_isotopic_peaks; 1 = select strongest isotopic peaks within mass_tolerance instead of merging isotopic peaks based on a weighted average of intensity; = 0 otherwise
Step 1: Copy all the files to your working folder on HPC.
Step-2: Update the '.params' file with input file names and other parameters as necessary.
Step-3: bsub -R "rusage[mem=15000]" -q standard -P Proteomics -Is bash (Optional and it is system specific to assign more memory into your workspace)
Step-4: module load jump
Step-5: python JUMPsilactmt.py Batch1.params Batch2.params ... BatchN.params
• For example: If you have the sample parameter files, the command is: python JUMPsilactmt.py prot_quan_lysine_est_batch1.params prot_quan_lysine_est_batch2.params
Here, we include a sample program "Sample_combine_two_batches_prot_quan_data_with_qc.py" with its parameter file "Sample_prot_quan_two_batch_join_qc.params" to show how a user can join the protein quantification results from two batches and perform a quality control depending on free Lys outlier (proteins having faster degradation curve than free lysine) and degradation pattern outlier (proteins not showing continuous degradation pattern). To run this program, the user should execute the following command:
• python Sample_combine_two_batches_prot_quan_data_with_qc.py Sample_prot_quan_two_batch_join_qc.params
However, for multiple batch (>2) samples, the user should modify the program and its parameter file accordingly.
https://drive.google.com/drive/folders/1X-282W2PrTh51saSF509AuAz66hjpvjK?usp=sharing
To submit bug reports and feature suggestions, please contact: Surendhar Reddy Chepyala (surendharreddy.chepyala@stjude.org), Junmin Peng (junmin.peng@stjude.org), Abhijit Dasgupta (abhijit.dasgupta@stjude.org), Jay Yarbro (jay.yarbro@stjude.org), Xusheng Wang (xusheng.wang@stjude.org).
2. Wang, X., et al., JUMP: a tag-based database search tool for peptide identification with high sensitivity and accuracy. Molecular & Cellular Proteomics, 2014. 13(12): p. 3663-3673.
3. Wang, X., et al., JUMPm: A Tool for Large-Scale Identification of Metabolites in Untargeted Metabolomics. Metabolites, 2020. 10(5): p. 190.
4. Li, Y., et al., JUMPg: an integrative proteogenomics pipeline identifying unannotated proteins in human brain and cancer cells. Journal of proteome research, 2016. 15(7): p. 2309-2320.
5. Tan, H., et al., Integrative proteomics and phosphoproteomics profiling reveals dynamic signaling networks and bioenergetics pathways underlying T cell activation. Immunity, 2017. 46(3): p. 488-503.
6. Peng, J., et al., Evaluation of multidimensional chromatography coupled with tandem mass spectrometry (LC/LC-MS/MS) for large-scale protein analysis: the yeast proteome. Journal of proteome research, 2003. 2(1): p. 43-50.
7. Niu, M., et al., Extensive peptide fractionation and y 1 ion-based interference detection method for enabling accurate quantification by isobaric labeling and mass spectrometry. Analytical chemistry, 2017. 89(5): p. 2956-2963.
8. Xu, P., et al., Systematical optimization of reverse-phase chromatography for shotgun proteomics. Journal of proteome research, 2009. 8(8), p. 3944-3950.