Gosicfly's Stars
lvze92/DMR
Deep Match to Rank Model for Personalized Click-Through Rate Prediction
quantumiracle/Popular-RL-Algorithms
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet, and more.
princeton-nlp/SWE-agent
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.
jeinlee1991/chinese-llm-benchmark
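Chinese LLM capability benchmark leaderboard: covers Baidu ERNIE Bot (文心一言), ChatGPT, Alibaba Tongyi Qianwen (通义千问), iFlytek Spark (讯飞星火), and open-source models such as BELLE and ChatGLM-6B, evaluated across multiple capability dimensions. It provides not only a ranked leaderboard of capability scores but also the raw outputs of every model.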
microsoft/DeepSpeedExamples
Example models using DeepSpeed
allenai/OLMo
Modeling, training, eval, and inference code for OLMo
lukasschwab/arxiv.py
Python wrapper for the arXiv API
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
ChatGPTNextWeb/ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). Get your own cross-platform ChatGPT/Gemini app in one click.
SqueezeAILab/LLM2LLM
[ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
HumanAIGC/AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
ZhuiyiTechnology/simbert
A BERT model for retrieval and generation
bojone/CoSENT
A sentence-embedding approach that is more effective than Sentence-BERT
sweetice/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3, and more.
huggingface/chat-ui
Open source codebase powering the HuggingChat app
nikhilbarhate99/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
SmirkCao/Lihang
Statistical Learning Methods (统计学习方法), 2nd edition, by Li Hang (李航): notes, code, notebooks, references, errata
Lizhi-sjtu/DRL-code-pytorch
Concise PyTorch implementations of DRL algorithms, including REINFORCE, A2C, DQN, PPO (discrete and continuous), DDPG, TD3, and SAC.
apple/ml-ferret
pjlab-sys4nlp/llama-moe
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training
ray-project/llm-numbers
Numbers every LLM developer should know
MisterBooo/LeetCodeAnimation
Demonstrates the reasoning behind solutions to LeetCode problems in the form of animations.
zhulei227/ML_Notes
Formula derivations and NumPy implementations of machine learning algorithms
fengdu78/lihang-code
Code implementations for the book Statistical Learning Methods (《统计学习方法》)
UKPLab/sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Zoeyyao27/CoT-Igniting-Agent
This repository contains the paper list for the paper: Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
km1994/LLMs_interview_notes
This repository mainly collects interview questions for large language model (LLM) algorithm engineers