Pinned Repositories
llama.cpp
LLM inference in C/C++
InternEvo
InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.
InternLM
Official release of InternLM2.5 base and chat models. 1M context support
DI-engine
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
tensorpipe
A tensor-aware point-to-point communication primitive for machine learning
BaiduContest_paddlepadde
基于深度学习的交通情况预测的导航系统
DailyMLSysPaper
DailyMLSysPaper
neu_geometric
Graduation Project 2020
RDMA_middleware
华为项目RDMA智能网卡卸载验证项目
SimpleRDMA
A very simple and easy-to-use interface to the RDMA communication library that can be used for quick verification and experimentation.
SolenoidWGT's Repositories
SolenoidWGT/DailyMLSysPaper
DailyMLSysPaper
SolenoidWGT/DI-engine-docs
DI-engine docs (Chinese and English)
SolenoidWGT/LineKV
Chain-replicated distributed kv storage
SolenoidWGT/SimpleRDMA
A very simple and easy-to-use interface to the RDMA communication library that can be used for quick verification and experimentation.
SolenoidWGT/ai-chatbot
A full-featured, hackable Next.js AI chatbot built by Vercel
SolenoidWGT/ColossalAI-Examples
Examples of training models with hybrid parallelism using ColossalAI
SolenoidWGT/dear_pytorch
[ICDCS 2023] DeAR: Accelerating Distributed Deep Learning with Fine-Grained All-Reduce Pipelining
SolenoidWGT/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
SolenoidWGT/DI-engine
OpenDILab Decision AI Engine
SolenoidWGT/DI-star
An artificial intelligence platform for the StarCraft II with large-scale distributed training and grand-master agents.
SolenoidWGT/ds_comm_bench
SolenoidWGT/flash-attention
Fast and memory-efficient exact attention
SolenoidWGT/InternEvo
SolenoidWGT/InternLM
InternLM has open-sourced a 7 billion parameter base model, a chat model tailored for practical scenarios and the training system.
SolenoidWGT/libnvidia-container
NVIDIA container runtime library
SolenoidWGT/llama.cpp
Port of Facebook's LLaMA model in C/C++
SolenoidWGT/llama3
The official Meta Llama 3 GitHub site
SolenoidWGT/Megatron-LM
Ongoing research training transformer models at scale
SolenoidWGT/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
SolenoidWGT/nccl
Optimized primitives for collective multi-GPU communication
SolenoidWGT/nccl-tests
NCCL Tests
SolenoidWGT/nvidia-container-runtime
NVIDIA container runtime
SolenoidWGT/pj-kineto
A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
SolenoidWGT/PY2C2PY
SolenoidWGT/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
SolenoidWGT/ray
An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
SolenoidWGT/rdma-core
RDMA core userspace libraries and daemons
SolenoidWGT/RDMA-Tutorial
A tutorial on RDMA based programming using code examples
SolenoidWGT/so-vits-svc
SoftVC VITS Singing Voice Conversion
SolenoidWGT/tensorpipe
A tensor-aware point-to-point communication primitive for machine learning