A sample code for analysis and modeling of QM9 dataset, including preprocessing adjustment based on qml and machine learning models.
- Modeling molecular representation: Columb Matrix (CM), Bags of Bonds (BOB)
- Machine learning: Kernel Ridge Regression, Random Forest.
QM9 includes the spatial information and 12 quantum properties of 13385 small molecules for quantum chemistry research.
-
Download QM9 dataset
-
Install QML (Quantum machine learning) toolkit. Note that QML is built for analysis of QM7 (another dataset).
-
Adjust the data preprocessing process in QML. Replace the compound.py file in src of QML.
mv qml/qml/compound.py compound_qm7.py cp compound.py qml/qml/compound.py
-
Input the parameters in main.py. Choose Representation, Data size, Hyperparameter.
-
Run python main.py to perform validating and testing.
Code is generated by Github Copilot :) . What an amazing tool!