wswcfan's Stars
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
togethercomputer/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
deepspeedai/DeepSpeedExamples
Example models using DeepSpeed
AntonOsika/gpt-engineer
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
yoheinakajima/babyagi
tensorchord/envd
🏕️ Reproducible development environment
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
Mooler0410/LLMsPracticalGuide
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
kubernetes-sigs/node-feature-discovery
Node feature discovery for Kubernetes
OpenNLPLab/TransnormerLLM
Official implementation of TransNormerLLM: A Faster and Better LLM
bigcode-project/starcoder
Home of StarCoder: fine-tuning & inference!
InternLM/InternLM
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
Facico/Chinese-Vicuna
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model, a low-resource Chinese LLaMA + LoRA approach with structure modeled on Alpaca
openlm-research/open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
roclark/NeMo-Megatron-Launcher
NeMo Megatron launcher and tools
NVIDIA/nccl
Optimized primitives for collective multi-GPU communication
kubeflow/trainer
Distributed ML Training and Fine-Tuning on Kubernetes
ggml-org/llama.cpp
LLM inference in C/C++
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
sylabs/wlm-operator
Singularity implementation of k8s operator for interacting with SLURM.
IBM/Bridge-Operator
Bridge operator repo
SchedMD/slurm
Slurm: A Highly Scalable Workload Manager
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
deepspeedai/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
alpa-projects/alpa
Training and serving large-scale neural networks with auto parallelization.
flyteorg/flyte
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
NVIDIA/DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
onnx/onnx
Open standard for machine learning interoperability