sidjha1

UC BerkeleyBerkeley, United States

Pinned Repositories

alpa
Training and serving large-scale neural networks with auto parallelization.
Language:Python00
bairblog.github.io
Language:JavaScript00
BigLittleDecoder
[NeurIPS'23] Speculative Decoding with Big Little Decoder
Language:Python00
ColossalAI
Making big AI models cheaper, easier, and more scalable
Language:Python00
DiskANN
Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search
Language:C++00
dspy
DSPy: The framework for programming—not prompting—foundation models
Language:Python00
Megatron-LM-Benchmarks
Benchmarks of NVIDIA's Megatron-LM
20
TinyAgent
TinyAgent: Function Calling at the Edge!
Language:Python00
lotus
LOTUS: The semantic query engine - process data with LMs as easily as writing pandas code
Language:Python14210
TAG-Bench
TAG: Table-Augmented Generation
Language:Python353 5 025

sidjha1's Repositories

sidjha1/Megatron-LM-Benchmarks
Benchmarks of NVIDIA's Megatron-LM
20
sidjha1/alpa
Training and serving large-scale neural networks with auto parallelization.
Language:Python00
sidjha1/bairblog.github.io
Language:JavaScript00
sidjha1/BigLittleDecoder
[NeurIPS'23] Speculative Decoding with Big Little Decoder
Language:Python00
sidjha1/ColossalAI
Making big AI models cheaper, easier, and more scalable
Language:Python00
sidjha1/DiskANN
Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search
Language:C++00
sidjha1/dspy
DSPy: The framework for programming—not prompting—foundation models
Language:Python00
sidjha1/lmql
A programming language for large language models.
Language:Python00
sidjha1/TinyAgent
TinyAgent: Function Calling at the Edge!
Language:Python00
sidjha1/NeuralDB
Database Reasoning Over Text project for ACL paper
sidjha1/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
sidjha1/SqueezeLLM
SqueezeLLM: Dense-and-Sparse Quantization
sidjha1/SqueezeLLM-gradients
Language:Python0 0
sidjha1/tensorflow-alpa
Language:C++0 0
sidjha1/zero_scrolls
Running inference on the ZeroSCROLLS benchmark