mtfranzen's Stars
ddangelov/Top2Vec
Top2Vec learns jointly embedded topic, document and word vectors.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.
user1342/Tomato
LLM steganography with minimum-entropy coupling - Hiding encrypted messages in natural language.
mlfoundations/dclm
DataComp for Language Models
microsoft/promptflow
Build high-quality LLM apps - from prototyping and testing to production deployment and monitoring.
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
NVlabs/MambaVision
Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
lm-sys/RouteLLM
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!
aishwaryanr/awesome-generative-ai-guide
A one-stop repository for generative AI research updates, interview resources, notebooks and much more!
langchain-ai/langchain-extract
🦜⛏️ Did you say you like data?
flyteorg/flyte
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
NVIDIA/warp
A Python framework for high performance GPU simulation and graphics
deepdoctection/deepdoctection
A Repo For Document AI
run-llama/llama_parse
Parse files for optimal RAG
mendableai/firecrawl
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
milvus-io/milvus
A cloud-native vector database, storage for next generation AI applications
qdrant/qdrant
Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
kermitt2/grobid
Machine learning software for extracting information from scholarly documents
facebookresearch/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
nlmatics/llmsherpa
Developer APIs to Accelerate LLM Projects
Archilyse/ArchilyseAuto
ArchilyseAuto - Automatic Floor Plan Recognition
databricks/megablocks
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
axolotl-ai-cloud/axolotl
Go ahead and axolotl questions
ShishirPatil/gorilla
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
BatsResearch/bonito
A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.
GoogleCloudPlatform/terraformer
CLI tool to generate terraform files from existing infrastructure (reverse Terraform). Infrastructure to Code
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.