GindaChen's Stars
Stirling-Tools/Stirling-PDF
#1 Locally hosted web application that allows you to perform various operations on PDF files
astral-sh/uv
An extremely fast Python package and project manager, written in Rust.
mem0ai/mem0
The Memory layer for your AI apps
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
meta-llama/llama-stack
Composable building blocks to build Llama Apps
pytorch/torchchat
Run PyTorch LLMs locally on servers, desktop and mobile
lululxvi/deepxde
A library for scientific machine learning and physics-informed learning
yhzhang0128/egos-2000
Envision a future where every student can read all the code of a teaching operating system.
zou-group/textgrad
TextGrad: Automatic "Differentiation" via Text -- using large language models to backpropagate textual gradients.
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
microsoft/MInference
[NeurIPS'24 Spotlight] To speed up inference for long-context LLMs, approximates attention with dynamic sparse computation, reducing pre-filling latency by up to 10x on an A100 while maintaining accuracy.
lucidrains/transfusion-pytorch
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
parrt/tensor-sensor
The goal of this library is to generate more helpful exception messages for matrix algebra expressions in numpy, pytorch, jax, tensorflow, keras, and fastai.
lucidrains/ring-attention-pytorch
Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch
FloridSleeves/LLMDebugger
LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step
nyu-systems/Grendel-GS
Ongoing research on training Gaussian splatting at scale with a distributed system
spcl/QuaRot
Code for the NeurIPS'24 paper QuaRot: end-to-end 4-bit inference of large language models.
jy-yuan/KIVI
[ICML 2024] KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache
AlibabaPAI/llumnix
Efficient and easy multi-instance LLM serving
Saibo-creator/Awesome-LLM-Constrained-Decoding
A curated list of papers related to constrained decoding of LLMs, along with relevant code and resources.
QingruZhang/PASTA
PASTA: Post-hoc Attention Steering for LLMs
snu-comparch/InfiniGen
InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)
Mutinifni/splitwise-sim
LLM serving cluster simulator
microsoft/llguidance
Low-level Guidance Parser
barabanshek/sabre
yunjiazhang/ReAcTable
The code base for paper: "ReAcTable: Enhancing ReAct for Table Question Answering"
dpaleka/stealing-part-lm-supplementary
Some code for "Stealing Part of a Production Language Model"
TuftsNATLab/PCS
sramshetty/stealing-part-of-an-LM
An unofficial implementation of "Stealing Part of a Production Language Model"