Nick17t's Stars
OpenBMB/ChatDev
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
Cinnamon/kotaemon
An open-source RAG-based tool for chatting with your documents.
ulixee/hero
The web browser built for scraping
aurelio-labs/semantic-router
Superfast AI decision making and intelligent processing of multi-modal data.
infinilabs/pizza-website
🍕 Home page of INFINI Pizza.
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
qhjqhj00/MemoRAG
Empowering RAG with a memory-based data interface for all-purpose applications!
jina-ai/reader
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
mendableai/firecrawl
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
jzhang38/EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
leobeeson/llm_benchmarks
A collection of benchmarks and datasets for evaluating LLMs.
supermemoryai/supermemory
Build your own second brain with supermemory. It's a ChatGPT for your bookmarks. Import tweets or save websites and content using the Chrome extension.
pgvector/pgvector
Open-source vector similarity search for Postgres
wizardAEI/Gomoon
Gomoon: a desktop productivity tool based on large models
lancedb/lancedb
Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
intel/AI-Playground
AI PC starter app for doing AI image creation, image stylizing, and chatbot on a PC powered by an Intel® Arc™ GPU.
NVIDIA/workbench-example-hybrid-rag
An NVIDIA AI Workbench example project for Retrieval Augmented Generation (RAG)
kinfey/Microsoft-Phi-3-NvidiaNIMWorkshop
This is Microsoft-Phi-3-NvidiaNIMWorkshop
ihower/zh-tw-embedding-model-benchmark
An embedding model benchmark built on Traditional Chinese datasets
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
aurelio-labs/semantic-chunkers
michaelfeil/infinity
Infinity is a high-throughput, low-latency serving engine for text embeddings, reranking models, CLIP, CLAP, and ColPali
Tyrrrz/DiscordChatExporter
Exports Discord chat logs to a file
tegal1337/YOMEN
YouTube bot for automatic commenting
zRich/translation
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
run-llama/llama_parse
Parse files for optimal RAG
promptfoo/promptfoo
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.
deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
reorproject/reor
Private & local AI personal knowledge management app for high entropy people.