listenlink's Stars
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
taichi-dev/taichi
Productive, portable, and performant GPU programming in Python.
invoke-ai/InvokeAI
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry-leading WebUI and serves as the foundation for multiple commercial products.
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
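A minimal usage sketch, assuming a CUDA GPU and the flash-attn package; the shapes and causal flag below are illustrative, following the library's flash_attn_func interface.

```python
# Minimal sketch of calling the fused FlashAttention kernel directly.
# Assumes a CUDA device and fp16/bf16 inputs shaped (batch, seq_len, heads, head_dim).
import torch
from flash_attn import flash_attn_func

q = torch.randn(2, 1024, 16, 64, dtype=torch.float16, device="cuda")
k = torch.randn(2, 1024, 16, 64, dtype=torch.float16, device="cuda")
v = torch.randn(2, 1024, 16, 64, dtype=torch.float16, device="cuda")

# Exact attention computed tile-by-tile, without materializing the full
# seq_len x seq_len score matrix in GPU memory.
out = flash_attn_func(q, k, v, causal=True)  # -> (2, 1024, 16, 64)
```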
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
bmaltais/kohya_ss
mistralai/mistral-src
Reference implementation of the Mistral AI 7B v0.1 model.
Syllo/nvtop
GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm
XavierXiao/Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
pytorch-labs/gpt-fast
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
rinongal/textual_inversion
NVIDIA/GenerativeAIExamples
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
KohakuBlueleaf/LyCORIS
Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion.
NVIDIA/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
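A minimal sketch of the FP8 path described above, assuming a Hopper/Ada-class GPU; the DelayedScaling settings are illustrative, not tuned recommendations.

```python
# Minimal sketch: run a Transformer Engine layer under FP8 autocast.
# Assumes a GPU with FP8 support (Hopper/Ada); recipe values are illustrative.
import torch
import transformer_engine.pytorch as te
from transformer_engine.common import recipe

layer = te.Linear(1024, 1024, bias=True).cuda()
x = torch.randn(16, 1024, device="cuda")

fp8_recipe = recipe.DelayedScaling(margin=0, fp8_format=recipe.Format.E4M3)
with te.fp8_autocast(enabled=True, fp8_recipe=fp8_recipe):
    y = layer(x)  # GEMM runs in FP8 where supported; outputs stay in higher precision
```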
ray-project/ray-llm
RayLLM - LLMs on Ray
Lightning-AI/lightning-thunder
Make PyTorch models up to 40% faster! Thunder is a source-to-source compiler for PyTorch. It enables using different hardware executors at once, across one or thousands of GPUs.
CHIANGEL/Awesome-LLM-for-RecSys
Survey: a collection of awesome papers and resources on large language model (LLM)-related recommender system topics.
devcontainers/templates
Repository for Dev Container Templates that are managed by Dev Container spec maintainers. See https://github.com/devcontainers/template-starter to create your own!
kennethleungty/Llama-2-Open-Source-LLM-CPU-Inference
Running Llama 2 and other open-source LLMs locally on CPU for document Q&A
alibaba/Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
NVIDIA/nvcomp
Repository for nvCOMP docs and examples. nvCOMP is a library for fast lossless compression/decompression on the GPU that can be downloaded from https://developer.nvidia.com/nvcomp.
lichao-sun/SoraReview
The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".
johnmarktaylor91/torchlens
Package for extracting and mapping the results of every single tensor operation in a PyTorch model in one line of code.
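A rough sketch of that one-line usage via log_forward_pass, with torchvision's ResNet-18 as a stand-in model; argument names may differ across torchlens versions.

```python
# Sketch: log every tensor operation in one forward pass with torchlens.
# The model and input are placeholders; vis_opt="none" skips graph rendering.
import torch
import torchvision
import torchlens as tl

model = torchvision.models.resnet18(weights=None)
x = torch.randn(1, 3, 224, 224)

model_history = tl.log_forward_pass(model, x, vis_opt="none")
print(model_history)  # summary of every logged tensor operation in the pass
```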
feifeibear/long-context-attention
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long-Context Transformer Model Training and Inference
microsoft/mscclpp
MSCCL++: A GPU-driven communication stack for scalable AI applications
Abonia1/CheatSheet-LLM
Cheat sheet for LLMs