Gosicfly

Fudan University - NLP Lab

Shanghai

Gosicfly's Stars

lvze92/DMR
Deep Match to Rank Model for Personalized Click-Through Rate Prediction
Language:Python23258
quantumiracle/Popular-RL-Algorithms
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
Language:Jupyter Notebook1k119
princeton-nlp/SWE-agent
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.
Language:Python11.9k1.2k
jeinlee1991/chinese-llm-benchmark
中文大模型能力评测榜单：覆盖百度文心一言、chatgpt、阿里通义千问、讯飞星火、belle / chatglm6b 等开源大模型，多维度能力评测。不仅提供能力评分排行榜，也提供所有模型的原始输出结果！
1.8k91
microsoft/DeepSpeedExamples
Example models using DeepSpeed
Language:Python5.8k987
allenai/OLMo
Modeling, training, eval, and inference code for OLMo
Language:Python4.2k390
lukasschwab/arxiv.py
Python wrapper for the arXiv API
Language:Python1k115
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Language:Python12.4k1k
ChatGPTNextWeb/ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。
Language:TypeScript72.6k57.6k
SqueezeAILab/LLM2LLM
[ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
Language:Python1288
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
Language:Python33.2k4.6k
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Language:Python1.8k142
HumanAIGC/AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
14.1k936
ZhuiyiTechnology/simbert
a bert for retrieval and generation
Language:Python827152
bojone/CoSENT
比Sentence-BERT更有效的句向量方案
Language:Python34824
sweetice/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Language:Python3.7k834
huggingface/chat-ui
Open source codebase powering the HuggingChat app
Language:TypeScript6.7k946
nikhilbarhate99/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Language:Python1.6k331
SmirkCao/Lihang
Statistical learning methods, 统计学习方法(第2版)[李航] [笔记, 代码, notebook, 参考文献, Errata, lihang]
Language:Python5.9k1.6k
Lizhi-sjtu/DRL-code-pytorch
Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.
Language:Python947162
apple/ml-ferret
Language:Python8.2k475
pjlab-sys4nlp/llama-moe
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training
Language:Python78841
ray-project/llm-numbers
Numbers every LLM developer should know
4k137
MisterBooo/LeetCodeAnimation
Demonstrate all the questions on LeetCode in the form of animation.（用动画的形式呈现解LeetCode题目的思路）
Language:Java75.1k14k
zhulei227/ML_Notes
机器学习算法的公式推导以及numpy实现
Language:Jupyter Notebook2k471
fengdu78/lihang-code
《统计学习方法》的代码实现
Language:Jupyter Notebook18.6k6.3k
UKPLab/sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
Language:Python14.3k2.4k
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Language:Python8k564
Zoeyyao27/CoT-Igniting-Agent
This repository contains the paper list for the paper: Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
31524
km1994/LLMs_interview_notes
该仓库主要记录大模型（LLMs）算法工程师相关的面试题
1.1k89

Gosicfly

Gosicfly's Stars

lvze92/DMR

quantumiracle/Popular-RL-Algorithms

princeton-nlp/SWE-agent

jeinlee1991/chinese-llm-benchmark

microsoft/DeepSpeedExamples

allenai/OLMo

lukasschwab/arxiv.py

QwenLM/Qwen

ChatGPTNextWeb/ChatGPT-Next-Web

SqueezeAILab/LLM2LLM

run-llama/llama_index

eric-mitchell/direct-preference-optimization

HumanAIGC/AnimateAnyone

ZhuiyiTechnology/simbert

bojone/CoSENT

sweetice/Deep-reinforcement-learning-with-pytorch

huggingface/chat-ui

nikhilbarhate99/PPO-PyTorch

SmirkCao/Lihang

Lizhi-sjtu/DRL-code-pytorch

apple/ml-ferret

pjlab-sys4nlp/llama-moe

ray-project/llm-numbers

MisterBooo/LeetCodeAnimation

zhulei227/ML_Notes

fengdu78/lihang-code

UKPLab/sentence-transformers

facebookresearch/xformers

Zoeyyao27/CoT-Igniting-Agent

km1994/LLMs_interview_notes