THUNLP-MT/dyMEAN

The complex and Hard pipline

Opened this issue · 5 comments

Dear researcher
thanks for sharing this pipline, but you try to design Antibody with a lot of package, but I believe it can be easier than you made it pipeline,
You try to use for example IgFold and Rosseta for what ? while you can easily put antigen as a input for diffab ....
best wish

Hi, IgFold and Rosetta are optional. They are only included as pipeline-based baselines in our paper. Our end-to-end model can be used with an antigen and sequence of antibody frameworks. Please see our instructions here.

Thank you for your response. After generating multiple CDR samples, how do you choose the best model? Is there any scoring method for Antigen to CDR docking, such as using Prodigy?

Additionally, once the best model is selected, how do you coordinate the construction of the full atomic structure of the antibody (considering both human and mouse models)?

We didn't explore the scoring paradigm much in this paper. But I think common tools like rosetta and foldx might serve the purpose of ranking the samples, though the actuall effects remained unknown.

The full-atom structure is simultaneously generated in our end-to-end model (i.e. the sequence and the full-atom structure are jointly designed). There is no need to do further side-chain packing or something like that. However, openmm relax might still be needed for solve clashes and physically invalid local geometries (e.g. bond lengths), which we have also integrated in our inference API.

I’ve reviewed over 15 antibody construction packages, but each of them had issues that made them unusable. The package you constructed is much better than the others, but it requires a considerable amount of time to complete the full structure.
best wish

Thanks for the appreciation! I think the most time is spent on openmm relax. Current algorithms are still struggling on generating physically valid local geometries, though the approximate spread of the side chains might be correct, therefore openmm relax might be necessary. I'm also delving into this problem and upgrading generative algorithms. Wish that in the future the deep generative models can directly generate structures that do not have obvious problems at the first glance.

Best wishes