fzp0424's Stars
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
crewAIInc/crewAI
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment
BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
liguodongiot/llm-action
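This project aims to share the technical principles and hands-on experience behind large models (large-model engineering and real-world application deployment)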
aiwaves-cn/agents
An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents
andrewyng/translation-agent
OpenBMB/AgentVerse
🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation
microsoft/LMOps
General technology for enabling AI capabilities w/ LLMs and MLLMs
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
maitrix-org/llm-reasoners
A library for advanced large language model reasoning
jiaweizzhao/GaLore
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
lucidrains/self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
uclaml/SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
prometheus-eval/prometheus-eval
Evaluate your LLM's response with Prometheus and GPT4 💯
kengz/awesome-deep-rl
A curated list of awesome Deep Reinforcement Learning resources.
zhentingqi/rStar
FloridSleeves/LLMDebugger
LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step
fe1ixxu/ALMA
State-of-the-art LLM-based translation models.
RLHFlow/Online-RLHF
A recipe for online RLHF and online iterative DPO.
vincen-github/mlimpl
This repository collects code implementing commonly used machine learning algorithms, mostly based on NumPy, pandas, or Torch. You can use it to deepen your understanding of the related models and algorithms, or adapt it into your own customized code.
allwefantasy/byzer-llm
Easy, fast, and cheap pretraining, fine-tuning, and serving for everyone
thunlp/ChatEval
Codes for our paper "ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate"
jiangsongtao/Med-MoE
[EMNLP'24] Code and data for paper "Med-MoE: Mixture of Domain-Specific Experts for Lightweight Medical Vision-Language Models"
fzp0424/self_correct_mt
TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement"
primeqa/clapnq
fzp0424/MT-Ladder
[EMNLP'24] Code and data for paper "Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to the Next Level"
fzp0424/EC-Guide-KDDUP-2024
The solution and dataset of Team ZJU_AI4H in Amazon KDDCUP 2024 (Track 2 Top 2; Track 5 Top 5)
YutongWang1216/ReflectionLLMMT
Code and data releases for the paper "TasTe: Teaching Large Language Models to Translate through Self-Reflection"
HAITrans-lab/instruction-tuned-medical-LLM