Pinned Repositories
awesome-deeplearning-resources
Deep learning and deep reinforcement learning research papers and some code
awesome-mlops
A curated list of references for MLOps
awesome-production-machine-learning
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
ColossalAI-Examples
Examples of training models with hybrid parallelism using ColossalAI
ColossalChat
ColossalChat is a project that implements an LLM with RLHF, powered by the Colossal-AI project.
LARS-ImageNet-PyTorch
Accuracy 77%. Large batch deep learning optimizer LARS for ImageNet with PyTorch and ResNet, using Horovod for distribution. Optional accumulated gradient and NVIDIA DALI dataloader.
oh-my-server
ColossalAI
Making large AI models cheaper, faster and more accessible
Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
binmakeswell's Repositories
binmakeswell/ColossalChat
ColossalChat is a project that implements an LLM with RLHF, powered by the Colossal-AI project.
binmakeswell/oh-my-server
binmakeswell/ColossalAI-Examples
Examples of training models with hybrid parallelism using ColossalAI
binmakeswell/LARS-ImageNet-PyTorch
Accuracy 77%. Large batch deep learning optimizer LARS for ImageNet with PyTorch and ResNet, using Horovod for distribution. Optional accumulated gradient and NVIDIA DALI dataloader.
binmakeswell/awesome-deeplearning-resources
Deep learning and deep reinforcement learning research papers and some code
binmakeswell/awesome-mlops
A curated list of references for MLOps
binmakeswell/awesome-production-machine-learning
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
binmakeswell/Awesome-System-for-Machine-Learning
A curated list of research in machine learning systems (MLSys). Paper notes are also provided.
binmakeswell/ColossalAI
Colossal-AI: A Unified Deep Learning System for Big Model Era
binmakeswell/ColossalAI-Benchmark
Performance benchmarking with ColossalAI
binmakeswell/FastFold
Optimizing Protein Structure Prediction Model Training and Inference on GPU Clusters
binmakeswell/metaseq
Repo for external large-scale work
binmakeswell/Open-Sora
Building your own video generation model like OpenAI's Sora
binmakeswell/OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
binmakeswell/PaLM-colossalai
Scalable PaLM implementation in PyTorch
binmakeswell/pytorch-lamb
PyTorch implementation of LAMB for ImageNet/ResNet-50 training
binmakeswell/pytorch-vit
binmakeswell/SkyComputing
Sky Computing: Accelerating Geo-distributed Computing in Federated Learning
binmakeswell/TensorNVMe
A Python library that transfers PyTorch tensors between CPU and NVMe
binmakeswell/toy-vit