dongpf's Stars
modelscope/data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!
CASIA-LM/ChineseWebText
google/langfun
OO for LLMs
FlagOpen/FlagData
e2b-dev/awesome-ai-agents
A list of AI autonomous agents
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
eosphoros-ai/DB-GPT
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
Instruction-Tuning-with-GPT-4/GPT-4-LLM
Instruction Tuning with GPT-4
microsoft/TaskWeaver
A code-first agent framework for seamlessly planning and executing data analytics tasks.
geekan/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
bentoml/OpenLLM
Run any open-source LLMs, such as Llama 3.1, Gemma, as OpenAI compatible API endpoint in the cloud.
eugeneyan/open-llms
📋 A list of open LLMs available for commercial use.
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验。
langgptai/LangGPT
LangGPT: Empowering everyone to become a prompt expert!🚀 Structured Prompt,Language of GPT, 结构化提示词,结构化Prompt
thunlp/PLMpapers
Must-read Papers on pre-trained language models.
InteractiveAdvertisingBureau/Taxonomies
Easy access to IAB Tech Lab taxonomies, including Content, Audience and Ad Product
alibaba-edu/mpc4j
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
OpenMined/PySyft
Perform data science on data that remains in someone else's server
IABTechLab/uid2docs
Documentation Repository for Unified ID 2.0
alibaba/Elastic-Federated-Learning-Solution
familyld/Awesome-Best-Papers
Collect awesome best papers from top AI conferences.
uber/causalml
Uplift modeling and causal inference with machine learning algorithms
jd-ads-data/jd-mta
Multi Touch Attribution: Simulation Code
TheAlgorithms/Python
All Algorithms implemented in Python
sameeragarwal/blinkdb
BlinkDB: Sub-Second Approximate Queries on Very Large Data.
htorun/dbtableprinter
Database Table Printer - a Java utility class to print a pretty table to standard out.
mozafari/cliffguard
WorkloadMiner + CliffGuard (Robust Physical Designer for Databases)