/kinact

Toolbox for Kinase Activity Scoring based on phosphoproteomic data

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

kinact

Toolbox for Kinase Activity Scoring

This repository collects different computational methods to infer kinase activities from phosphoproteomics datasets. Each method will be accompanied by a protocol for explanation and in order to enable transfer to other datasets.

The package has been implemented by Jakob Wirbel in the Saez-Rodriguez Group at RWTH-Aachen.

Please be aware that the package is under on-going development.

Licence

Distributed under the GNU GPLv3 License. See accompanying file LICENSE.txt or copy at http://www.gnu.org/licenses/gpl-3.0.html

Installation

Mac/Linux Users

In order to use kinact, open a command line/terminal. You can clone the repository and install the package with

git clone https://github.com/saezlab/kinact.git
cd kinact
python setup.py install

Windows Users

working on it :)

Trouble?

Please use the GitHub Issue tracker if you run into any trouble or if you have feature requests. We will work on them as fast as possible.

Quick start

The package contains example data from a publication by de Graaf et al., in which the phosphoproteome of Jurkat T-cells after prostaglandin E2 stimulation was analysed. The data can be loaded with:

import kinact
data_fc, data_p_value = kinact.get_example_data()
print data_fc.head()

             "5min"     "10min"    "20min"    "30min"    "60min"
ID
A0AVK6_S710.3193060.4849600.7980820.8561030.928753 
A0FGR8_S7430.8566610.9819511.5004121.4418680.861470 
A0FGR8_S7581.4453862.3979152.6929942.7947621.553398
A0FGR8_S691   0.271458   0.264596   0.501685   0.461984   0.655501 
A0JLT2_S2260.080786   1.069710   0.519780   0.5208830.296040

Prior-knowledge information about kinase-substrate interactions can be loaded from the pypath package (see also the documentation or the github repository). Per default, interactions from PhosphoSitePlus and Signor are loaded, but other sources from pypath can be specified with the sources parameter of the function. For kinase-substrate interactions from other organisms, specify organim='yeast' or organism='mouse' in the function call.

adjacency_matrix = kinact.get_kinase_targets()

Finally, estimation of the kinase activities can be performed as follows for the example of the KSEA protocol:

scores, p_values = kinact.ksea.ksea_mean(data_fc=data_fc["5min"].dropna(),
                                         interactions=adjacency_matrix,
                                         mP=data_fc["5min"].values.mean(),
                                         delta=data_fc["5min"].values.std())

Methods included in kinact:

  • KSEA

    KSEA is a method for the inference of kinase activities from phosphoproteomics data based on kinase substrate sets, which are constructed from curated information about kinase-substrate interactions from public resources like PhosphoSitePlus. The values of the fold changes of the phospho-sites in the substrate set of a given kinase are used to compute a score for the activity of this kinase. The main scoring system for KSEA is the mean or the median of the fold changes in the substrate set. KSEA was first proposed in the publication from Casado et al..

  • to be extended