raihan0824's Stars
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
slatedocs/slate
Beautiful static documentation for your API
janhq/jan
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer
mem0ai/mem0
The Memory layer for AI Agents
Shubhamsaboo/awesome-llm-apps
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
pedroslopez/whatsapp-web.js
A WhatsApp client library for NodeJS that connects through the WhatsApp Web browser app
kvcache-ai/ktransformers
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
llmware-ai/llmware
Unified framework for building enterprise RAG pipelines with small, specialized models
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Tencent/HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
LargeWorldModel/LWM
Large World Model -- Modeling Text and Video with Millions Context
farizdotid/DAFTAR-API-LOKAL-INDONESIA
Berisi API tentang Data Indonesia ataupun API yang dibuat oleh developer Indonesia <3
NVIDIA/NeMo-Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
fixie-ai/ultravox
A fast multimodal LLM for real-time voice
predibase/lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
deepspeedai/DeepSpeed-MII
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
qhjqhj00/MemoRAG
Empowering RAG with a memory-based data interface for all-purpose applications!
huggingface/optimum-nvidia
vllm-project/production-stack
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
Thytu/Agentarium
open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for designing complex, interactive environments where agents can act, learn, and evolve.
triton-inference-server/tensorrtllm_backend
The Triton TensorRT-LLM Backend
NVIDIA/NeMo-Aligner
Scalable toolkit for efficient model alignment
triton-inference-server/python_backend
Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.
TencentARC/LLaMA-Pro
[ACL 2024] Progressive LLaMA with Block Expansion.
databridge-org/databridge-core
Scalable Multi-modal RAG platform
leobeeson/llm_benchmarks
A collection of benchmarks and datasets for evaluating LLM.
deeptrust-ai/terifai-open
Terrify people
nicksavarese/allora-ios
An iOS Keyboard Extension that allows for interacting with LLMs directly from any text input field. The LLM response is placed into the text field. Includes options to send clipboard contents with the request to help instruct/guide the response.
fangyuan-ksgk/Evolutionary-Model-Merge
Unofficial Implementation of Evolutionary Model Merging