2022AI4SciChem01

Challenge #1: Collect and standardize chemical reaction data, including structures, reagents, and conditions. Standardizing chemical reaction data is challenging because organic chemistry covers a wide variety of reactions. For example, a reagent such as Pd/C may come in different specifications. A solvent may or may not participate in the reaction as a reactant. NLP may be required to extract data from written reaction procedures.

The following repositories are required:

https://github.com/sergsb/IUPAC2Struct.git

https://github.com/rxn4chemistry/rxnmapper.git

spiltrxn.py identifies the role of each chemical in a reaction.
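As an illustration of role assignment, here is a minimal sketch of one common heuristic. It is not the code in spiltrxn.py. It assumes the reaction is already an atom-mapped reaction SMILES in the `precursors>>products` form (for example, output from an atom-mapping tool), and it treats a precursor as a reactant only if it shares at least one atom-map number with the products. The function name `split_roles` and the example reaction are hypothetical.

```python
import re

# Matches the atom-map number inside a bracket atom, e.g. "[CH3:1]" -> "1".
ATOM_MAP = re.compile(r":(\d+)\]")


def split_roles(mapped_rxn: str) -> dict:
    """Split an atom-mapped 'precursors>>products' SMILES into roles.

    Assumption: a precursor that contributes mapped atoms to the products
    is a reactant; every other precursor is a reagent (catalyst, solvent...).
    Splitting on '.' also separates the ions of a salt, which a real
    pipeline would need to handle.
    """
    precursors, products = mapped_rxn.split(">>")
    product_maps = set(ATOM_MAP.findall(products))
    roles = {"reactants": [], "reagents": [], "products": products.split(".")}
    for smiles in precursors.split("."):
        maps = set(ATOM_MAP.findall(smiles))
        key = "reactants" if maps & product_maps else "reagents"
        roles[key].append(smiles)
    return roles


# Hypothetical example: acid-catalysed esterification of acetic acid.
example = (
    "[CH3:1][C:2](=[O:3])[OH:4].[CH3:5][OH:6].OS(O)(=O)=O"
    ">>[CH3:1][C:2](=[O:3])[O:6][CH3:5]"
)
print(split_roles(example))
```

The heuristic depends entirely on the quality of the atom mapping, so a solvent that really does take part in the reaction is only labelled a reactant if the mapper assigns it map numbers.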



ibmp.py converts an unstructured reaction procedure paragraph into a structured one.
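To show what "structured" can mean here, the sketch below pulls a temperature and a duration out of a procedure sentence with regular expressions. It is a deliberately simple stand-in under stated assumptions, not the approach used in ibmp.py. The `Conditions` class, the `extract_conditions` function, and the sample sentence are all hypothetical, and only temperatures written as "°C" and times in hours or minutes are recognized.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass
class Conditions:
    temperature_c: Optional[float]  # degrees Celsius, None if not found
    duration_h: Optional[float]     # hours, None if not found


# Assumed surface forms: "80 °C", "-78°C", "2 h", "3 hours", "30 min".
TEMP = re.compile(r"(-?\d+(?:\.\d+)?)\s*°\s*C")
TIME = re.compile(r"(\d+(?:\.\d+)?)\s*(h|hours?|min|minutes?)\b")


def extract_conditions(text: str) -> Conditions:
    """Return the first temperature and first duration mentioned in text."""
    temp_match = TEMP.search(text)
    time_match = TIME.search(text)
    duration = None
    if time_match:
        value = float(time_match.group(1))
        # Convert minutes to hours so the field has a single unit.
        is_minutes = time_match.group(2).startswith("min")
        duration = value / 60 if is_minutes else value
    temperature = float(temp_match.group(1)) if temp_match else None
    return Conditions(temperature, duration)


sentence = "The mixture was stirred at 80 °C for 2 h, then cooled."
print(extract_conditions(sentence))
```

Real procedures contain many more actions (additions, washes, extractions) and phrasing variants than patterns like these can cover, which is why the challenge statement notes that NLP may be required.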


iupac_smi.py converts IUPAC names to SMILES.
