riksanyal's Stars
google-research/google-research
Google Research
vdumoulin/conv_arithmetic
A technical report on convolution arithmetic in the context of deep learning
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
GoogleCloudPlatform/microservices-demo
Sample cloud-first application with 10 microservices showcasing Kubernetes, Istio, and gRPC.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
karpathy/llm.c
LLM training in simple, raw C/CUDA