raghunanj's Stars
ggerganov/llama.cpp
LLM inference in C/C++
microsoft/generative-ai-for-beginners
21 Lessons, Get Started Building with Generative AI đź”— https://microsoft.github.io/generative-ai-for-beginners/
charlax/professional-programming
A collection of learning resources for curious software engineers
roboflow/supervision
We write your reusable computer vision tools. đź’ś
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—foundation models
ItzCrazyKns/Perplexica
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
cpacker/MemGPT
Letta (fka MemGPT) is a framework for creating stateful LLM services.
livekit/livekit
End-to-end stack for WebRTC. SFU media server and SDKs.
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
google/gemma.cpp
lightweight, standalone C++ inference engine for Google's Gemma models.
NVIDIA/NeMo-Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
Rust-GPU/Rust-CUDA
Ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust.
mistralai/mistral-finetune
xjdr-alt/entropix
Entropy Based Sampling and Parallel CoT Decoding
kyegomez/swarms
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework Join our Community: https://discord.com/servers/agora-999382051935506503
wilicc/gpu-burn
Multi-GPU CUDA stress test
Dicklesworthstone/swiss_army_llama
A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.
XiongjieDai/GPU-Benchmarks-on-LLM-Inference
Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?
NVlabs/curobo
CUDA Accelerated Robot Library
wasiahmad/Awesome-LLM-Synthetic-Data
A reading list on LLM based Synthetic Data Generation 🔥
arpitingle/gpu-alpha
High Quality Resources on GPU Programming/Architecture
JohnCrickett/SystemDesign
Useful resources on distributed system design.
gpu-mode/awesomeMLSys
An ML Systems Onboarding list
jimmc414/1filellm
Specify a github or local repo, github pull request, arXiv or Sci-Hub paper, Youtube transcript or documentation URL on the web and scrape into a text file and clipboard for easier LLM ingestion
NVIDIA/modulus-makani
Massively parallel training of machine-learning based weather and climate models
Genentech/gReLU
gReLU is a python library to train, interpret, and apply deep learning models to DNA sequences.
SpectacularAI/sdk-examples
Spectacular AI SDK examples
NVlabs/HALP
tpn/cuda-by-example
Code for NVIDIA's CUDA By Example Book.