tigerlchen's Stars
NVIDIA/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
alibaba/Pai-Megatron-Patch
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
vivo-ai-lab/BlueLM
BlueLM(蓝心大模型): Open large language models developed by vivo AI Lab
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
HuaizhengZhang/Awesome-System-for-Machine-Learning
A curated list of research in machine learning systems (MLSys). Paper notes are also provided.
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
microsoft/DeepSpeedExamples
Example models using DeepSpeed
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
bytedance/lightseq
LightSeq: A High Performance Library for Sequence Processing and Generation
openmlsys/openmlsys-zh
《Machine Learning Systems: Design and Implementation》- Chinese Version
XueFuzhao/awesome-mixture-of-experts
A collection of AWESOME things about mixture-of-experts
codecaution/Awesome-Mixture-of-Experts-Papers
A curated reading list of research in Mixture-of-Experts(MoE).
google-deepmind/deepmind-research
This repository contains implementations and illustrative code to accompany DeepMind publications
microsoft/AI-System
System for AI Education Resource.
IDEA-CCNL/Fengshenbang-LM
Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。
milvus-io/pymilvus
Python SDK for Milvus.
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Oneflow-Inc/oneflow
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Morizeyao/GPT2-Chinese
Chinese version of GPT2 training code, using BERT tokenizer.
doocs/advanced-java
😮 Core Interview Questions & Answers For Experienced Java(Backend) Developers | 互联网 Java 工程师进阶知识完全扫盲:涵盖高并发、分布式、高可用、微服务、海量数据处理等领域知识
ChanYiLin/tf-operator-Dragon
Extends Kubeflow/tf-operator to enable gang-scheduling and auto-scaling.
sql-machine-learning/elasticdl
Kubernetes-native Deep Learning Framework
alibaba/cloud-kernel
Cloud Kernel - an open-source Linux kernel originated by Alibaba Operating System Team