Embedding representation of polymer derived from PI1M dataset
- Python 3.x
- pandas
- numpy
- rdkit
- LightGBM
- Optuna
- scikit-learn
- gensim
- mol2vec
Use the following command to quickly train a QSPR model:
python quick_qspr.py -in your_data.csv -x smi -y tg -o your_model
-
-in
: The name of the input CSV file.- Example:
train_test.csv
- Example:
-
-x
: The name of the SMILES column.- Example:
smi
- Example:
-
-y
: The name of the target column.- Example:
tg
- Example:
-
-o
: The name of trained model will be saved.- Example:
your_model
- Example:
-
Machine learning discovery of high-temperature polymers
-
Mol2vec: Unsupervised Machine Learning Approach with Chemical Intuition