mwbyeon's Stars
yt-dlp/yt-dlp
A feature-rich command-line audio/video downloader
open-webui/open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
meilisearch/meilisearch
A lightning-fast search API that fits effortlessly into your apps, websites, and workflow
jgm/pandoc
Universal markup converter
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
pola-rs/polars
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
dragonflydb/dragonfly
A modern replacement for Redis and Memcached
aristocratos/btop
A monitor of resources
danny-avila/LibreChat
Enhanced ChatGPT Clone: Features Agents, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active project.
pallets/click
Python composable command line interface toolkit
Kanaries/pygwalker
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
huggingface/text-generation-inference
Large Language Model Text Generation Inference
Unstructured-IO/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
py-pdf/pypdf
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
allegroai/clearml
ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution
Layout-Parser/layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
XuehaiPan/nvitop
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
apify/crawlee-python
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
lancedb/lance
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..
hynek/structlog
Simple, powerful, and fast logging for Python.
modelscope/data-juicer
Making data higher-quality, juicier, and more digestible for foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!
DefTruth/Awesome-LLM-Inference
📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉
AI-Hypercomputer/maxtext
A simple, performant and scalable Jax LLM!
chrismattmann/tika-python
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
quiltdata/quilt
Quilt is a data mesh for connecting people with actionable data
wustho/epy
CLI Ebook (epub2, epub3, fb2, mobi) Reader
allenai/papermage
library supporting NLP and CV research on scientific papers
mlfoundations/datacomp
DataComp: In search of the next generation of multimodal datasets
rwitten/HighPerfLLMs2024
yosoyjay/cyclecloud-llm