RealNLP's Stars
reworkd/AgentGPT
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
VikParuchuri/marker
Convert PDF to markdown quickly with high accuracy
NVIDIA/DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
VikParuchuri/surya
OCR, layout analysis, reading order, line detection in 90+ languages
NielsRogge/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
interpretml/interpret
Fit interpretable models. Explain blackbox machine learning.
andrewyng/translation-agent
allenai/OLMo
Modeling, training, eval, and inference code for OLMo
pytorch/serve
Serve, optimize and scale PyTorch models in production
DataTalksClub/llm-zoomcamp
LLM Zoomcamp - a free online course about building a Q&A system
deepseek-ai/DeepSeek-Coder-V2
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
GeostatsGuy/PythonNumericalDemos
Well-documented Python demonstrations for spatial data analytics, geostatistics, and machine learning to support my courses.
allenai/open-instruct
VikParuchuri/apartment-finder
A Slack bot that helps you find an apartment.
allenai/dolma
Data and tools for generating and inspecting OLMo pre-training data.
VikParuchuri/texify
Math OCR model that outputs LaTeX and markdown
triton-inference-server/pytriton
PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.
triton-inference-server/tensorrtllm_backend
The Triton TensorRT-LLM Backend
VikParuchuri/textbook_quality
Generate textbook-quality synthetic LLM pretraining data
shaikhsajid1111/social-media-profile-scrapers
Fetch a user's data across social media
VikParuchuri/pdftext
Extract structured text from PDFs quickly
allenai/wimbd
What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets
microsoft/otdd
Optimal Transport Dataset Distance
VikParuchuri/scribe
Simple speech recognition using your microphone.
VikParuchuri/classified
Score LLM pretraining data with classifiers
VikParuchuri/scan
Score essays automatically with an easy web interface.
VikParuchuri/simpsons-scripts
Find out how much the Simpsons characters like each other with text and audio analysis.
alexeygrigorev/minsearch
Minimalistic text search engine that uses sklearn and pandas