Pinned Repositories
AI-ML-Research-Insights
A public repository of AI/ML research insights. Contributions are welcome.
model_server
A scalable inference server for models optimized with OpenVINO™
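A minimal sketch of how such a server is typically queried once a model is loaded, assuming the TensorFlow-Serving-compatible REST API that OpenVINO Model Server exposes; the port (8000), model name (my_model), and input shape are placeholders for illustration:

```python
# Sketch: query a running OpenVINO Model Server instance over its
# TensorFlow-Serving-style REST API. Port, model name, and input shape
# are assumptions for illustration, not values from this repository.
import numpy as np
import requests

dummy_input = np.random.rand(1, 3, 224, 224).tolist()  # hypothetical image-shaped tensor
payload = {"instances": dummy_input}

resp = requests.post(
    "http://localhost:8000/v1/models/my_model:predict",
    json=payload,
    timeout=30,
)
print(resp.json())  # output format depends on the served model
```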
llama.cpp
LLM inference in C/C++
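A minimal sketch of loading a quantized model and generating text, using the separate llama-cpp-python bindings rather than the C/C++ binaries themselves; the GGUF path is a hypothetical placeholder:

```python
# Sketch: text generation through llama-cpp-python, a Python wrapper
# around llama.cpp. The model path below is a hypothetical placeholder;
# any local GGUF file can be substituted.
from llama_cpp import Llama

llm = Llama(model_path="./models/llama-3-8b-instruct.Q4_K_M.gguf")
out = llm("Q: What is the capital of France? A:", max_tokens=32, stop=["Q:"])
print(out["choices"][0]["text"].strip())
```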
text-generation-inference
Large Language Model Text Generation Inference
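A minimal sketch of calling a locally running TGI server over its /generate REST endpoint; the port and generation parameters are assumptions chosen for illustration:

```python
# Sketch: request a completion from a text-generation-inference server.
# The port (8080) and parameter values are assumptions; consult the TGI
# docs for the full request schema.
import requests

payload = {
    "inputs": "Explain semantic caching in one sentence.",
    "parameters": {"max_new_tokens": 64, "temperature": 0.7},
}
resp = requests.post("http://localhost:8080/generate", json=payload, timeout=60)
print(resp.json()["generated_text"])
```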
airllm
AirLLM: 70B model inference with a single 4GB GPU
ollama
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
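A minimal sketch of requesting a completion from a local Ollama server, assuming it is running on the default port 11434 and the llama3.3 model has already been pulled:

```python
# Sketch: non-streaming completion via Ollama's /api/generate endpoint.
# Assumes a local Ollama server on the default port and a pulled model.
import requests

payload = {
    "model": "llama3.3",
    "prompt": "Why is the sky blue?",
    "stream": False,  # return one JSON object instead of a token stream
}
resp = requests.post("http://localhost:11434/api/generate", json=payload, timeout=120)
print(resp.json()["response"])
```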
tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
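A minimal sketch of tinygrad's PyTorch-like autograd surface, assuming a recent release that exports Tensor from the top-level package; shapes and values are arbitrary:

```python
# Sketch: forward pass and backpropagation with tinygrad Tensors.
# Values and shapes are arbitrary illustration data.
from tinygrad import Tensor

x = Tensor([[1.0, 2.0], [3.0, 4.0]], requires_grad=True)
w = Tensor([[0.5], [-0.5]], requires_grad=True)

loss = x.matmul(w).relu().sum()  # tiny forward pass to a scalar loss
loss.backward()                  # populates .grad on x and w

print(w.grad.numpy())
```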
udlbook
Understanding Deep Learning - Simon J.D. Prince
GPTCache
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
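A minimal sketch of the quickstart pattern: initialize the cache and route OpenAI chat calls through GPTCache's adapter so repeated or semantically similar prompts can be served from the cache; assumes an OPENAI_API_KEY in the environment, and exact module paths may vary between GPTCache versions:

```python
# Sketch: cache OpenAI chat completions with GPTCache. Assumes
# OPENAI_API_KEY is set; module paths follow the project's quickstart
# and may differ across versions.
from gptcache import cache
from gptcache.adapter import openai  # drop-in wrapper around the OpenAI client

cache.init()            # default exact-match cache; semantic similarity is configurable
cache.set_openai_key()  # reads OPENAI_API_KEY from the environment

response = openai.ChatCompletion.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "What is semantic caching?"}],
)
print(response["choices"][0]["message"]["content"])
```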
dhandhalyabhavik's Repositories
dhandhalyabhavik/AI-ML-Research-Insights
A public repository of AI/ML research insights. Contributions are welcome.
dhandhalyabhavik/model_server
A scalable inference server for models optimized with OpenVINO™