ModelSmith serves as a toolkit for compressing machine learning models, designed to enhance their speed, reduce their size, and improve energy efficiency. This toolbox addresses the demands for smaller model sizes, minimal memory usage, quicker inference times, energy-saving operations, and scenarios conforming to bandwidth limitations.
ModelSmith makes it ideal for deploying on a diverse array of devices and platforms, ensuring satisfactory performance. Its main features encompass post-training quantization, machine unlearning and a range of pruning algorithms, essential for optimizing models.
ModelSmith is developed by researchers and engineers working at Cisco Research.
To get started with ModelSmith, follow these steps:
-
Clone the repository to your local machine:
git clone https://github.com/your-username/modelsmith.git
1. How do I configure ModelSmith to run locally? Read here.
2. How do I configure ModelSmith to run on VM? Read here.
-
Frontend Server: ModelSmith's frontend is built using Angular and is deployed as a production build to the 'frontend/server/modelsmith-build' folder. The frontend server, powered by Express.js, serves the web application in to users in optimized mode on port 4200. It provides a user interface for interacting with the application.
-
Backend Server: ModelSmith includes a backend server built using Node.js, which serves as the core of the application. The backend server handles various tasks such as model optimization, deployment, and management. It is accessible on port 3000. The backend offers APIs and services to support the frontend's functionality and communicates with a VM for running the python scripts.
-
ModelSmith Project - Python Files for Model Training: This feature includes a comprehensive collection of Python scripts designed for the training of machine learning models. These scripts are fine-tuned for high efficiency and can be executed on local machines with GPU support or on virtual machines (VMs), offering flexibility for users to train models using their own hardware or cloud resources.
-
Post Training Quantization
Visit the source code PTQ
-
Model Pruning
Visit the source code Pruning
-
Machine Unlearning
Visit the source code Unlearning
ModelSmith strives to maintain a more up-to-date fork and encourages contributions from all individuals.
Yuguang Yao
Ioan Pop (John)
Yuzhang Shang
Gaowen Liu
We have several TODO's spread out throughout our files! We will continue working in these directions to improve ModelSmith, adjusting as needed based on our user's feedback
We intend to add functionalities to the log analysis for unsupervised log analysis.
If you wish to contribute or suggest any additional funtionalities, please check out Contributing Guidelines