inocsin

Deep Learning @NVIDIA

NVIDIA CorporationShanghai

inocsin's Stars

f/awesome-chatgpt-prompts
This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.
Language:HTML116k 1.4k 015.8k
CompVis/stable-diffusion
A latent text-to-image diffusion model
Language:Jupyter Notebook69.1k 563 71710.2k
twitter/the-algorithm
Source code for Twitter's Recommendation Algorithm
Language:Scala62.7k 342 97912.2k
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language:Python39.7k 449 3155.1k
fxsjy/jieba
结巴中文分词
Language:Python33.6k 1.3k 8536.7k
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python33.3k 271 5.8k5.1k
llvm/llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
Language:LLVM30k 581 79.2k12.4k
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Language:Python27k 211 4.4k5.5k
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Language:Python15.2k 112 1.1k1.2k
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language:Python15k 123 1.2k1.4k
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Language:Python11.1k 70 108698
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++9.1k 97 2.1k1k
facebookresearch/llama-recipes
Scripts for fine-tuning Llama2 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization & question answering. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment.Demo apps to showcase Llama2 for WhatsApp & Messenger
Language:Jupyter Notebook7.8k 68 2271.1k
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
Language:Jupyter Notebook7.8k 108 290490
jackhawks/rectg
本项目汇集5000+优质的Telegram群组、频道和机器人，为您提供高质量的学习和技术资源。内容涵盖热门群组、实用频道和各类机器人，助您快速找到感兴趣的资源，轻松提升技能。欢迎加入，一起探索丰富的Telegram资源库！
Language:Python6.6k 54 34358
kingoflolz/mesh-transformer-jax
Model parallel transformers in JAX and Haiku
Language:Python6.3k 112 206893
facebookresearch/ConvNeXt
Code release for ConvNeXt model
Language:Python5.8k 32 130702
NVIDIA/thrust
[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl
Language:C++4.9k 207 778757
huggingface/notebooks
Notebooks using the Hugging Face libraries 🤗
Language:Jupyter Notebook3.8k 74 1721.6k
fundamentalvision/BEVFormer
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
Language:Python3.5k 72 274562
mit-han-lab/smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Language:Python1.3k 21 92152
SkyworkAI/Skywork
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation methods, etc. 天工系列模型在3.2TB高质量多语言和代码数据上进行预训练。我们开源了模型参数，训练数据，评估数据，评估方法。
Language:Python1.2k 24 63110
NVIDIA/nccl-tests
NCCL Tests
Language:Cuda956 26 238251
j2kun/mlir-tutorial
MLIR For Beginners tutorial
Language:C++869 18 1773
BabitMF/bmf
Cross-platform, customizable multimedia/video processing framework. With strong GPU acceleration, heterogeneous design, multi-language support, easy to use, multi-framework compatible and high performance, the framework is ideal for transcoding, AI inference, algorithm integration, live video streaming, and more.
Language:C++845 24 7771
NVIDIA/cudnn-frontend
cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it
Language:C++483 15 6992
GPT-Fathom/GPT-Fathom
GPT-Fathom is an open-source and reproducible LLM evaluation suite, benchmarking 10+ leading open-source and closed-source LLMs as well as OpenAI's earlier models on 20+ curated benchmarks under aligned settings.
Language:Python349 1 623
bytedance/primus
Language:Java199 5 726
itsliupeng/torchnvjpeg
Decode JPEG image on GPU using PyTorch
Language:C++84 5 1010
thrust/cub
THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.
Language:Cuda10 2 03

inocsin

inocsin's Stars

f/awesome-chatgpt-prompts

CompVis/stable-diffusion

twitter/the-algorithm

Stability-AI/stablediffusion

fxsjy/jieba

vllm-project/vllm

llvm/llvm-project

huggingface/diffusers

QwenLM/Qwen

Dao-AILab/flash-attention

microsoft/LoRA

NVIDIA/TensorRT-LLM

facebookresearch/llama-recipes

01-ai/Yi

jackhawks/rectg

kingoflolz/mesh-transformer-jax

facebookresearch/ConvNeXt

NVIDIA/thrust

huggingface/notebooks

fundamentalvision/BEVFormer

mit-han-lab/smoothquant

SkyworkAI/Skywork

NVIDIA/nccl-tests

j2kun/mlir-tutorial

BabitMF/bmf

NVIDIA/cudnn-frontend

GPT-Fathom/GPT-Fathom

bytedance/primus

itsliupeng/torchnvjpeg

thrust/cub