This repo provides code and notebooks for sompy project. You can find more detailed information from this publication.
These instructions will get you a copy of the project up and running on your local machine for development and testing purposes. See deployment for notes on how to deploy the project on a live system.
You need to download anaconda and install python packages in the requirement.txt file using command below:
pip install -r requirement.txt
tfprop_sompy is the fundamental framework we base on. In order to use that, simply follow instruction in the package
In the Data
folder, the compdata_classificatiion_des.csv
is the data file with molecular descriptors appened to it (More than 1800 descriptors are appended, and you can do feature selection based on them). The original data comes from Professor Saeki's paper: Computer-Aided Screening of Conjugated Polymers for Organic Solar Cell: Classification by Random Forest. You can find the original data there.
Molecular descriptors are calculated using a third-party library called Mordred Descriptor
.
Notebook can be run after installing Python, rdkit (I use Anaconda), mordred and some other libraries.
Please read CONTRIBUTING.md for details on our code of conduct, and the process for submitting pull requests to us.
We use SemVer for versioning. For the versions available, see the tags on this repository.
- Jingtian Zhang (https://github.com/Zhangjt9317)
- Luna Huang (https://mse.washington.edu/people/faculty/adjunct)
This project is licensed under the MIT License - see the LICENSE.md file for details
- University of Washington Chem E, MSE, Tohoku University MSE, Grad School of Engineering, etc.