PaDEL ws descriptors engine

This web app implements a web server that computes PaDEL descriptors more efficiently, within the framework of eTOXlab and the eTOX project.

PaDEL descriptors web site

PaDEL descriptors paper

As of the latest version, 2.21 (21/07/2014), PaDEL calculates 1875 descriptors (1444 1D and 2D descriptors and 431 3D descriptors) and 12 types of fingerprints (16092 bits in total).

eTOXlab is a modelling framework to develop and deploy QSAR models in production environments. One of the major goals of eTOXlab is the robustness of the platform.

To accomplish this goal, the eTOXlab pipeline processes molecules individually. However, this carries a major performance penalty when computing PaDEL descriptors, because every invocation pays the startup delay of both the PaDEL command-line mode and the JVM.

So we developed the PaDEL ws descriptors engine, which keeps a PaDEL descriptor computation engine loaded in memory and ready to use.

By calling the web server, we can compute descriptors for the molecules we need without reloading the PaDEL engine for every individual molecule.

In the current implementation, the input molecules must be located in a local folder of the PaDEL ws descriptors engine. The output files containing the computed descriptors are also stored locally on the web server. A possible improvement is to provide a mechanism for uploading molecules.

We can measure the difference in computation time.

Using the original PaDEL command-line interface:

| Molecule | Computation method | Computation time |
|---|---|---|
| mol1 | PaDEL command-line | `time ./test_cl_single_molecule.sh`: real 0m2.487s, user 0m5.606s, sys 0m0.442s |
| Set of 141 molecules | PaDEL command-line | `time ./test_cl_folders.sh`: real 6m35.709s, user 15m28.519s, sys 1m1.071s |

Using the PaDEL ws descriptors engine:

| Molecule | Computation method | Computation time |
|---|---|---|
| mol1 | PaDEL ws descriptors engine | `time ./test_ws_single_molecule.sh`: real 0m0.260s, user 0m0.005s, sys 0m0.015s |
| Set of 141 molecules | PaDEL ws descriptors engine | `time ./test_ws_folders.sh`: real 0m52.643s, user 0m0.398s, sys 0m0.946s |

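On the 141-molecule test dataset, the wall-clock time drops from 6m35.709s to 0m52.643s, so the throughput of descriptor computation increases by roughly 7.5x. For the single molecule, the wall-clock time drops from 0m2.487s to 0m0.260s.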
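
The speed-up can be recomputed from the `real` values reported by `time`. The following is a minimal sketch, not part of this repository: the helper names `to_seconds`, `speedup` and `per_molecule` are hypothetical, and the durations are assumed to be in the `NmS.SSSs` form shown in the measurements.

```shell
#!/bin/sh
# Hypothetical helpers (not part of the repository) for comparing `time` results.

# Convert a duration printed by `time` (e.g. "6m35.709s") into seconds.
to_seconds() {
    printf '%s\n' "$1" | awk -F'm' '{ sub(/s$/, "", $2); printf "%.3f\n", $1 * 60 + $2 }'
}

# Speed-up factor: baseline duration divided by improved duration.
speedup() {
    awk -v a="$(to_seconds "$1")" -v b="$(to_seconds "$2")" \
        'BEGIN { printf "%.2f\n", a / b }'
}

# Average wall-clock seconds per molecule for a run over N molecules.
per_molecule() {
    awk -v t="$(to_seconds "$1")" -v n="$2" 'BEGIN { printf "%.3f\n", t / n }'
}

speedup 6m35.709s 0m52.643s   # set of 141 molecules: prints 7.52
speedup 0m2.487s 0m0.260s     # single molecule:      prints 9.57
per_molecule 6m35.709s 141    # command-line:         prints 2.806
per_molecule 0m52.643s 141    # web service:          prints 0.373
```

Only the `real` (wall-clock) values are compared here. The `user` and `sys` values of the web service runs mostly reflect the client script, since the descriptor computation itself runs inside the server process.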

For further information, please contact the author, Oriol López Massaguer (oriol.lopez@upf.edu).

Copyright 2017 Oriol Lopez

PaDEL ws descriptors engine is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation version 3.

PaDEL ws descriptors engine is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with PaDEL ws descriptors engine. If not, see http://www.gnu.org/licenses/.

The research leading to these results has received support from the Innovative Medicines Initiative (IMI) Joint Undertaking under grant agreement n° 115002 (eTOX), resources of which are composed of financial contribution from the European Union’s Seventh Framework Programme (FP7/2007-2013) and EFPIA companies’ in kind contributions.