coltoreg's Stars
EbookFoundation/free-programming-books
:books: Freely available programming books
Genesis-Embodied-AI/Genesis
A generative world for general-purpose robotics & embodied AI learning.
D4Vinci/Scrapling
Undetectable, Lightning-Fast, and Adaptive Web Scraping for Python
HKUDS/LightRAG
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
airbytehq/airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
ultralytics/ultralytics
Ultralytics YOLO11 🚀
JimmyLv/BibiGPT-v1
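BibiGPT v1 · One-click AI summary for audio/video & chat with learning content: Bilibili | YouTube | Tweet | TikTok | Dropbox | Google Drive | Local files | Websites | Podcasts | Meetings | Lectures, etc. One-click AI summary & chat for audio/video content: Bilibili | YouTube | Twitter | Xiaohongshu | Douyin | Kuaishou | Baidu Netdisk | Aliyun Drive | Web pages | Podcasts | Meetings | Local files, etc. (formerly BiliGPT, the "time-saver" & "AI class representative")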
Kevin-free/chatgpt-prompt-engineering-for-developers
Chinese-English edition of Andrew Ng's course "ChatGPT Prompt Engineering for Developers"
Netflix/maestro
Maestro: Netflix’s Workflow Orchestrator
unclecode/crawl4ai
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper
pola-rs/polars
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
psf/black
The uncompromising Python code formatter
heiko-hotz/automated-prompt-engineering-from-scratch
A repo with an automated prompt engineering workflow from scratch. It leverages the OPRO technique.
jwlin/ptt-web-crawler
Crawler for the web version of PTT
huangwl18/ReKep
ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation
BruceDone/awesome-crawler
A collection of awesome web crawlers and spiders in different languages
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
instructor-ai/instructor
Structured outputs for LLMs
vespa-engine/vespa
AI + Data, online. https://vespa.ai
landing-ai/vision-agent
Vision agent
mengjian-github/copilot-analysis
unslothai/unsloth
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
google/styleguide
Style guides for Google-originated open-source projects
langchain-ai/langgraph-studio
Desktop app for prototyping and debugging LangGraph applications locally.
reczoo/FuxiCTR
A configurable, tunable, and reproducible library for CTR prediction https://fuxictr.github.io
THUwangcy/ReChorus
“Chorus” of recommendation models: a light and flexible PyTorch framework for Top-K recommendation.
salmon1802/xDCN
xDCN: Combining Exponential and Linear Cross Network for Click-Through Rate Prediction
mli/paper-reading
Paragraph-by-paragraph close reading of classic and new deep learning papers
darshilparmar/twitter-airflow-data-engineering-project
YouTube tutorial project
airscholar/e2e-data-engineering
An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.