xvyaward's Stars
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
xvyaward/owq
Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Models".
eureka-research/Eureka
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
DefTruth/Awesome-LLM-Inference
๐A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
HuangOwen/Awesome-LLM-Compression
Awesome LLM compression research papers and tools.
yangshun/tech-interview-handbook
๐ฏ Curated coding interview preparation materials for busy software engineers
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
lixin4ever/Conference-Acceptance-Rate
Acceptance rates for the major AI conferences
encord-team/encord-active
The toolkit to test, validate, and evaluate your models and surface, curate, and prioritize the most valuable data for labeling.
voxel51/fiftyone
The open-source tool for building high-quality datasets and computer vision models
vitrun/FasterTransformer
Transformer related optimization, including BERT, GPT
oobabooga/text-generation-webui
A Gradio web UI for Large Language Models.
AutoGPTQ/AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
TransformerOptimus/SuperAGI
<โก๏ธ> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
lonelywing/POSTECH_thesis_template_latex
twitter/the-algorithm
Source code for Twitter's Recommendation Algorithm
memoiry/Awesome-model-compression-and-acceleration
HyeminNoh/Tech-Stack
๐ ์ ์ ๊ฐ๋ฐ์๋ก์ ์ฑ์ฅ์ ์ํ ์ ๊ณต ์ง์์ ์ ๋ฆฌํฉ๋๋ค ๐
CodeTest-StudyGroup/Code-Test-Study
์ฝ๋ฉ ํ ์คํธ ๊ด๋ จ ๊ธฐ์ถ๋ฌธํญ์ ํ์ด๋ณด๊ณ ์์ค์ฝ๋ ๋ฐ ์ค๋ช ์ ์ ๋ก๋ํฉ๋๋ค.
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
roboticcam/machine-learning-notes
My continuously updated Machine Learning, Probabilistic Models and Deep Learning notes and demos (2000+ slides) ๆไธ้ดๆญๆดๆฐ็ๆบๅจๅญฆไน ๏ผๆฆ็ๆจกๅๅๆทฑๅบฆๅญฆไน ็่ฎฒไน(2000+้กต)ๅ่ง้ข้พๆฅ
dkozlov/awesome-knowledge-distillation
Awesome Knowledge Distillation
NVIDIA/retinanet-examples
Fast and accurate object detection with end-to-end GPU optimization
Hyungjun-K1m/scientific_color_palette
Color palettes which are also distinguishable when printed in grayscale
rickiepark/python-machine-learning-book-2nd-edition
<๋จธ์ ๋ฌ๋ ๊ต๊ณผ์ with ํ์ด์ฌ, ์ฌ์ดํท๋ฐ, ํ ์ํ๋ก>์ ์ฝ๋ ์ ์ฅ์
bjpublic/PyTorch
๋ฅ๋ฌ๋์ ๋ชฉ๋ง๋ฅธ ์ฌ๋๋ค์ ์ํ PyTorch
MingSun-Tse/Efficient-Deep-Learning
Collection of recent methods on (deep) neural network compression and acceleration.
nanamake/r22sdf
Pipeline FFT Implementation in Verilog HDL
xvyaward/maze-solver-cuda
C++ based maze solver, accelerated with Nvidia CUDA
ytdl-org/youtube-dl
Command-line program to download videos from YouTube.com and other video sites