Foo1szz

I am Yang Hanjie，a normal graduate student of Dalian University of Technology.

DLUTDalian, Chinese

Foo1szz's Stars

f/awesome-chatgpt-prompts
This repo includes ChatGPT prompt curation to use ChatGPT better.
Language:HTML115k 1.5k 015.7k
binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
Language:Python66.6k 282 1.7k8.2k
scutan90/DeepLearning-500-questions
深度学习500问，以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述，以帮助自己及有需要的读者。全书分为18个章节，50余万字。由于水平有限，书中不妥之处恳请广大读者批评指正。未完待续............ 如有意合作，联系scutjy2015@163.com 版权所有，违权必究 Tan 2018.06
Language:JavaScript55.1k 2.2k 18915.9k
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Language:Python29.7k 344 2694.1k
microsoft/JARVIS
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Language:Python23.8k 381 1812k
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language:Python20.5k 307 1.4k2.6k
kaixindelele/ChatPaper
Use ChatGPT to summarize the arXiv papers. 全流程加速科研，利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
Language:Python18.6k 95 2181.9k
dair-ai/ml-visuals
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
14k 116 501.4k
datawhalechina/easy-rl
强化学习中文教程（蘑菇书🍄），在线阅读地址：https://datawhalechina.github.io/easy-rl/
Language:Jupyter Notebook9.8k 80 1481.9k
thu-ml/tianshou
An elegant PyTorch deep reinforcement learning library.
Language:Python8.1k 95 7481.1k
fmzquant/strategies
quantitative trading with Javascript, Python, C++, PineScript, Blockly, MyLanguage(麦语言)
4.2k 312 51.4k
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language:Python3.7k 66 229829
kimiyoung/transformer-xl
Language:Python3.6k 84 133762
higgsfield-ai/higgsfield
Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters
Language:Jupyter Notebook3.3k 76 1554
hyunwoongko/transformer
Transformer: PyTorch Implementation of "Attention Is All You Need"
Language:Python3.2k 10 22456
datawhalechina/daily-interview
Datawhale成员整理的面经，内容包括机器学习，CV，NLP，推荐，开发等，欢迎大家star
Language:HTML2.7k 51 11434
quantumiracle/Popular-RL-Algorithms
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
Language:Jupyter Notebook1.2k 16 21131
booydar/recurrent-memory-transformer
[NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.
Language:Jupyter Notebook760 10 061
TianhongDai/reinforcement-learning-algorithms
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)
Language:Python671 15 10109
lucidrains/FLASH-pytorch
Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"
Language:Python354 9 1324
google-research/meliad
Language:Python252 10 530
jerrodparker20/adaptive-transformers-in-rl
Adaptive Attention Span for Reinforcement Learning
Language:Python132 7 814
YYCAAA/V-MPO_Lunarlander
Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238
Language:Python44 2 26
RodkinIvan/Transformer-RL
Transformers (GTrXL & CoBERL) applied to RL tasks
Language:Python28 1 02
acyclics/MPO
Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments
Language:Python26 3 24
jsikyoon/V-MPO_torch
V-MPO torch version with DMLab30 and GTrXL
Language:Python12 3 31
bjoluc/gymwipe
OpenAI Gym Environments for the Application of Reinforcement Learning in the Simulation of Wireless Networked Feedback Control Loops
Language:Python11 2 03
Giseung-Park/BlockSeq
Code for 'Blockwise Sequential Model Learning for Partially Observable Reinforcement Learning' (AAAI 2022, Oral presentation)
Language:Python7 1 01
vidits-kth/gym-radio-scheduler
Language:Python4 1 03
cyj407/RL-MPO-DMC
Language:Python1 1 00

Foo1szz

Foo1szz's Stars

f/awesome-chatgpt-prompts

binary-husky/gpt_academic

scutan90/DeepLearning-500-questions

tatsu-lab/stanford_alpaca

microsoft/JARVIS

microsoft/unilm

kaixindelele/ChatPaper

dair-ai/ml-visuals

datawhalechina/easy-rl

thu-ml/tianshou

fmzquant/strategies

ikostrikov/pytorch-a2c-ppo-acktr-gail

kimiyoung/transformer-xl

higgsfield-ai/higgsfield

hyunwoongko/transformer

datawhalechina/daily-interview

quantumiracle/Popular-RL-Algorithms

booydar/recurrent-memory-transformer

TianhongDai/reinforcement-learning-algorithms

lucidrains/FLASH-pytorch

google-research/meliad

jerrodparker20/adaptive-transformers-in-rl

YYCAAA/V-MPO_Lunarlander

RodkinIvan/Transformer-RL

acyclics/MPO

jsikyoon/V-MPO_torch

bjoluc/gymwipe

Giseung-Park/BlockSeq

vidits-kth/gym-radio-scheduler

cyj407/RL-MPO-DMC