AstroSayan
I'm a student of Electronics and Instrumentation Engineering. Currently working on developing Python based advanced robotic control systems and data science.
RCC Institute of Information TechnologyPune
AstroSayan's Stars
ollama/ollama
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models.
ggerganov/llama.cpp
LLM inference in C/C++
All-Hands-AI/OpenHands
🙌 OpenHands: Code Less, Make More
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
unclecode/crawl4ai
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/mEkkMXFG
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
stitionai/devika
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI. [⚠️ DEVIKA DOES NOT HAVE AN OFFICIAL WEBSITE ⚠️]
karpathy/llama2.c
Inference Llama 2 in one file of pure C
state-spaces/mamba
Mamba SSM architecture
HKUDS/LightRAG
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
bentoml/OpenLLM
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
NirDiamant/GenAI_Agents
This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive AI systems.
abetlen/llama-cpp-python
Python bindings for llama.cpp
onnx/models
A collection of pre-trained, state-of-the-art models in the ONNX format
meta-llama/llama-stack
Composable building blocks to build Llama Apps
rustformers/llm
[Unmaintained, see README] An ecosystem of Rust libraries for working with large language models
EricLBuehler/mistral.rs
Blazingly fast LLM inference.
awslabs/multi-agent-orchestrator
Flexible and powerful framework for managing multiple AI agents and handling complex conversations
ModelTC/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
microsoft/PromptWizard
Task-Aware Agent-driven Prompt Optimization Framework
bRAGAI/bRAG-langchain
Everything you need to know to build your own RAG application
getzep/graphiti
Build and query dynamic, temporally-aware Knowledge Graphs
hesamsheikh/ml-retreat
Machine Learning Journal for Intermediate to Advanced Topics.
KruxAI/ragbuilder
A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data
python-microservices/pyms
Library of utils to create REST Python Microservices
willccbb/mlx_parallm
Fast parallel LLM inference for MLX
zorse-project/COBOLEval
Evaluate LLM-generated COBOL
IBM/zDLC
IBM Z Deep Learning Compiler Documentation and Usage examples