BoxiangW's Stars
ohmyzsh/ohmyzsh
🙃 A delightful community-driven (with 2,400+ contributors) framework for managing your zsh configuration. Includes 300+ optional plugins (rails, git, macOS, hub, docker, homebrew, node, php, python, etc), 140+ themes to spice up your morning, and an auto-update tool that makes it easy to keep up with the latest updates from the community.
kubernetes/kubernetes
Production-Grade Container Scheduling and Management
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
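A minimal sketch of the Gradio workflow: wrap a plain Python function in gr.Interface and launch it as a web app. The greet function here is just a stand-in for any model inference call.

```python
import gradio as gr

# Any Python function can back the UI; this one stands in for model inference.
def greet(name):
    return f"Hello, {name}!"

# Interface maps the function's inputs/outputs to auto-generated web components.
demo = gr.Interface(fn=greet, inputs="text", outputs="text")
demo.launch()  # serves the app locally; share=True creates a temporary public link
```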
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
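A minimal offline-inference sketch with vLLM; the model id is only an example and any Hugging Face causal LM should work.

```python
from vllm import LLM, SamplingParams

prompts = ["Explain continuous batching in one sentence."]
params = SamplingParams(temperature=0.8, max_tokens=64)

# LLM loads the weights and pre-allocates the paged KV cache on the GPU.
llm = LLM(model="facebook/opt-125m")
for output in llm.generate(prompts, params):
    print(output.outputs[0].text)
```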
mbadolato/iTerm2-Color-Schemes
Over 325 terminal color schemes/themes for iTerm/iTerm2. Includes ports to Terminal, Konsole, PuTTY, Xresources, XRDB, Remmina, Termite, XFCE, Tilda, FreeBSD VT, Terminator, Kitty, MobaXterm, LXTerminal, Microsoft's Windows Terminal, Visual Studio, Alacritty, and many more
openai/gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
THUDM/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | Open-source bilingual dialogue language model
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
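A hedged sketch of calling the fused FlashAttention kernel directly; it assumes fp16/bf16 tensors on a CUDA device, laid out as (batch, seqlen, heads, head_dim).

```python
import torch
from flash_attn import flash_attn_func

# (batch, seqlen, num_heads, head_dim), half precision on GPU.
q = torch.randn(2, 1024, 8, 64, dtype=torch.float16, device="cuda")
k = torch.randn_like(q)
v = torch.randn_like(q)

# Exact attention computed tile by tile, never materializing the full score matrix.
out = flash_attn_func(q, k, v, causal=True)
print(out.shape)  # (2, 1024, 8, 64)
```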
triton-lang/triton
Development repository for the Triton language and compiler
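A small vector-add kernel in the style of the Triton tutorials, to show what writing a GPU kernel in Python-like syntax looks like.

```python
import torch
import triton
import triton.language as tl

@triton.jit
def add_kernel(x_ptr, y_ptr, out_ptr, n_elements, BLOCK_SIZE: tl.constexpr):
    pid = tl.program_id(axis=0)
    offsets = pid * BLOCK_SIZE + tl.arange(0, BLOCK_SIZE)
    mask = offsets < n_elements                 # guard the last partial block
    x = tl.load(x_ptr + offsets, mask=mask)
    y = tl.load(y_ptr + offsets, mask=mask)
    tl.store(out_ptr + offsets, x + y, mask=mask)

x = torch.rand(4096, device="cuda")
y = torch.rand(4096, device="cuda")
out = torch.empty_like(x)
grid = (triton.cdiv(x.numel(), 1024),)          # one program instance per block
add_kernel[grid](x, y, out, x.numel(), BLOCK_SIZE=1024)
```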
state-spaces/mamba
Mamba SSM architecture
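A minimal sketch of instantiating a single Mamba block from the mamba-ssm package; the dimensions are illustrative, not values taken from this list.

```python
import torch
from mamba_ssm import Mamba

# One selective state-space block, used in place of an attention layer.
layer = Mamba(d_model=256, d_state=16, d_conv=4, expand=2).to("cuda")

x = torch.randn(2, 128, 256, device="cuda")  # (batch, seq_len, d_model)
y = layer(x)                                 # output keeps the input shape
```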
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
NVIDIA/TensorRT
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
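A hedged sketch of xFormers' memory-efficient attention op; inputs are assumed to be half-precision CUDA tensors laid out as (batch, seq_len, heads, head_dim).

```python
import torch
from xformers.ops import memory_efficient_attention

q = torch.randn(2, 1024, 8, 64, dtype=torch.float16, device="cuda")
k = torch.randn_like(q)
v = torch.randn_like(q)

# Exact attention without materializing the full (seq_len x seq_len) matrix.
out = memory_efficient_attention(q, k, v)
print(out.shape)  # (2, 1024, 8, 64)
```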
ohmybash/oh-my-bash
A delightful community-driven framework for managing your bash configuration, and an auto-update tool that makes it easy to keep up with the latest updates from the community.
NVIDIA/FasterTransformer
Transformer-related optimization, including BERT, GPT
nelhage/reptyr
Reparent a running program to a new terminal
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
togethercomputer/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
attardi/wikiextractor
A tool for extracting plain text from Wikipedia dumps
NVIDIA/nccl
Optimized primitives for collective multi-GPU communication
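NCCL itself is a C library; a common way to exercise its collectives from Python is through PyTorch's "nccl" distributed backend, sketched below under the assumption that the script is launched with e.g. `torchrun --nproc_per_node=2`.

```python
import torch
import torch.distributed as dist

# torchrun sets the rank/world-size environment variables read here.
dist.init_process_group(backend="nccl")
rank = dist.get_rank()
torch.cuda.set_device(rank)

t = torch.ones(4, device="cuda") * (rank + 1)
dist.all_reduce(t, op=dist.ReduceOp.SUM)   # NCCL all-reduce across all GPUs
print(f"rank {rank}: {t.tolist()}")

dist.destroy_process_group()
```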
NVIDIA/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
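A hedged sketch of FP8 compute with Transformer Engine's PyTorch API; it assumes an FP8-capable GPU (Hopper/Ada) and uses the delayed-scaling recipe.

```python
import torch
import transformer_engine.pytorch as te
from transformer_engine.common.recipe import DelayedScaling

# te.Linear is a drop-in replacement for torch.nn.Linear.
layer = te.Linear(1024, 1024, bias=True).to("cuda")
x = torch.randn(8, 1024, device="cuda")

# Inside fp8_autocast, matmuls run in FP8 with delayed scaling factors.
with te.fp8_autocast(enabled=True, fp8_recipe=DelayedScaling()):
    y = layer(x)
y.sum().backward()
```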
binance/binance-public-data
Details on how to get Binance public data
r2d4/react-llm
Easy-to-use headless React Hooks to run LLMs in the browser with WebGPU. Just useLLM().
BlackSamorez/tensor_parallel
Automatically split your PyTorch models across multiple GPUs for training & inference
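A minimal sketch following the package's documented pattern: wrap an existing Hugging Face model so its weights are sharded across the listed devices. The model id and device list are assumptions for illustration.

```python
import transformers
import tensor_parallel as tp

model = transformers.AutoModelForCausalLM.from_pretrained("facebook/opt-1.3b")

# Shards linear/embedding weights across the given GPUs; the wrapped model
# is then used like the original for forward, generate, and training.
model = tp.tensor_parallel(model, ["cuda:0", "cuda:1"])
```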
forhaoliu/ringattention
Transformers with Arbitrarily Large Context
NVIDIA/NeMo-Framework-Launcher
Provides end-to-end model development pipelines for LLMs and multimodal models that can be launched on-prem or on cloud-native infrastructure.
NVIDIA/NVTX
The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resources in your applications.
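The SDK itself is C, but the official Python bindings (pip install nvtx) expose the same ranges; a hedged sketch of annotating code so the ranges appear in Nsight Systems timelines.

```python
import time
import nvtx  # Python bindings over the C NVTX API

@nvtx.annotate("preprocess", color="blue")       # decorator form: one range per call
def preprocess():
    time.sleep(0.01)

with nvtx.annotate("main_loop", color="green"):  # context-manager form
    for _ in range(3):
        preprocess()
```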
sail-sg/zero-bubble-pipeline-parallelism
Zero Bubble Pipeline Parallelism