kmansouri/OPERA

PaDEL needs updating to CDK 2.0

Opened this issue · 23 comments

egonw commented

We should see if we can find a few people to upgrade PaDEL to CDK 2.0.... cc @johnmay @rajarshi

Where are the PaDEL sources?

Technically not a project issue... :-)

Scrap that thought it was on CDK/issues 👍

Yes I'm working on integrating CDK2 in the next version. Thank you all!

I am working on using CDK2 in the next version of OPERA but I'm facing some problems...
@egonw @rajarshi @johnmay @tsufz

  1. Not all PaDEL descriptors (and fingerprints) are available in CDK2
    So I think the solution is to upgrade PaDEL to use CDK2 (not to replace it). Has anyone figured that out?

  2. PaDEL is using CDK 1.5 which is giving different values than CDK2 for certain descriptors...
    Does that mean those descriptors had issues in CDK 1.5?

  1. PaDEL may have descriptors that are not in the CDK
  2. There have been some bug fixes in descriptor implementations that lead to differences in computed values

Thanks, @rajarshi for the quick response!

I'm wondering if anyone is working on updating PaDEL to use CDK2?

That'd be a question for the PaDEL devs

That makes sense :)
Any plans to include the extra PaDEL descriptors in CDK?

Thanks a lot Tony! I'm looking into it :)

tsufz commented

Is PaDEL further developed? The webpage is available again, but quite outdated (last update: 2014). The link is: http://www.yapcwsoft.com/dd/padeldescriptor. The source code could be downloaded here:
http://www.yapcwsoft.com/dd/padeldescriptor/PaDEL-Descriptor_src.zip. However, there is no Github or other public repository.

I don't think that PaDEL will be maintained by its original developer anymore. But it'll be useful if someone else can...

tsufz commented

@egonw, maybe something to be integrated in CDK?

egonw commented

We integrated some of the PaDEL descriptors in the past... integrating more of them is welcome. Check the 5 patches in this branch: https://github.com/cdk/cdk/tree/dev-1.4.x-padel

Since

@egonw, maybe something to be integrated in CDK?

I'm thinking to do something of that, maybe integrating also some other open source descriptors, maybe starting from this list :http://www.scbdd.com/chemdes/list-descriptors/.

@kmansouri Are you still using cdk libraries 1.4.9 or you managed to update them?

@marcabrus a lot of those in that list appear to be duplicates. A few descriptor types that would be useful include

  • Grid based descriptors derived from MIF's
  • Conformational descriptors (e.g., 4D descriptors from Hopfinger and colleagues)
  • Task specific descriptors (e.g., https://t.co/Zheeo7tn61)

@marcabrus

@kmansouri Are you still using cdk libraries 1.4.9 or you managed to update them?

The new models from OPERA 2.0 (CERAPP, CoMPARA and CATMoS) are using CDK2. The older models from previous versions are still using CDK1.48.
The goal is to migrate to CDK2 for new models or if existing models are significantly updated (like adding new data).

@marcabrus a lot of those in that list appear to be duplicates. A few descriptor types that would be useful include

  • Grid based descriptors derived from MIF's
  • Conformational descriptors (e.g., 4D descriptors from Hopfinger and colleagues)
  • Task specific descriptors (e.g., https://t.co/Zheeo7tn61)

I'm aware that lot of that descriptors are duplicates, or even diffrent names with the same meaning.
For this reason i was referring to this map
2019-01-22

There is a significant overlap among the various free software, but also there is a large part of decriptors not implemented in CDK.
Furthermore I need to mention that several descriptors contained in Dragon can be easily implemented.

My current resources and skills , let me to write (or adapt) code only for 2d based descriptors, graph or smarts based, nonetheless they are found relevant to several QSAR models we implemented.

The descriptors you suggested need some experience in 3D optimization, and the work on structures with a clear knowledge of isomerism. The kind of data we often analyze doesn't let us to work in this way.

These are important limitations, but maybe in the future we can overcome them.

@egonw @rajarshi @marcabrus @tsufz anyone has updates about updating PaDEL to use CDK2?
@marcabrus you mentioned reimplementing DRAGON/PaDEL 2D descriptors, any updates about that?
Thanks!

tsufz commented

@kmansouri, No, I did not work on any update.

egonw commented

@egonw anyone has updates about updating PaDEL to use CDK2?

An update would be nice, but I don't have funding for it right now :(