Pinned Repositories
CachedEmbedding
A memory efficient DLRM training solution using ColossalAI
ColossalAI
Making large AI models cheaper, faster and more accessible
ColossalAI-Examples
Examples of training models with hybrid parallelism using ColossalAI
EnergonAI
Large-scale model inference.
FastFold
Optimizing AlphaFold Training and Inference on GPU Clusters
Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
PaLM-colossalai
Scalable PaLM implementation of PyTorch
SkyComputing
Sky Computing: Accelerating Geo-distributed Computing in Federated Learning
SwiftInfer
Efficient AI Inference & Serving
TensorNVMe
A Python library transfers PyTorch tensors between CPU and NVMe
HPC-AI Tech's Repositories
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
hpcaitech/EnergonAI
Large-scale model inference.
hpcaitech/FastFold
Optimizing AlphaFold Training and Inference on GPU Clusters
hpcaitech/SwiftInfer
Efficient AI Inference & Serving
hpcaitech/ColossalAI-Examples
Examples of training models with hybrid parallelism using ColossalAI
hpcaitech/PaLM-colossalai
Scalable PaLM implementation of PyTorch
hpcaitech/TensorNVMe
A Python library transfers PyTorch tensors between CPU and NVMe
hpcaitech/CachedEmbedding
A memory efficient DLRM training solution using ColossalAI
hpcaitech/SkyComputing
Sky Computing: Accelerating Geo-distributed Computing in Federated Learning
hpcaitech/ColossalAI-Benchmark
Performance benchmarking with ColossalAI
hpcaitech/Titans
A collection of models built with ColossalAI
hpcaitech/ColossalAI-Documentation
Documentation for Colossal-AI
hpcaitech/ColossalAI-Pytorch-lightning
hpcaitech/Oh-My-Dockerfile
A collection of dockerfiles for various tasks
hpcaitech/Elixir
Elixir: Train a Large Language Model on a Small GPU Cluster
hpcaitech/ColossalAI-Platform-CLI
CLI for ColossalAI Platform
hpcaitech/GPT-Demo
GPT Demo with hybrid distributed training
hpcaitech/public_assets
Storing publicly available assets such as images, animations and texts
hpcaitech/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
hpcaitech/OPT-Benchmark
hpcaitech/mmdetection-examples
Train mmdetection models with ColossalAI.
hpcaitech/CANN-Installer
This repository contains Huawei Ascend CANN files
hpcaitech/Cloud-Platform-Docs
Documentation for our cloud platform
hpcaitech/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
hpcaitech/LLaVA-NeXT
hpcaitech/Open-Sora-Demo
hpcaitech/pytest-testmon
Selects tests affected by changed files. Executes the right tests first. Continuous test runner when used with pytest-watch.
hpcaitech/torchrec
Pytorch domain library for recommendation systems