wangyy161

Pinned Repositories

A3C_CNN_LSTM
A TensorFlow implementation of the Asynchronous Advantage Actor-Critic (A3C) algorithm with a CNN-LSTM function approximator
Language:Python
act-tensorflow
Adaptive Computation Time algorithm in TensorFlow
Language:Python
adaptive-transformers-in-rl
Adaptive Attention Span for Reinforcement Learning
Language:Python
ai_lib
Language:JavaScript
AtomGPT
A pretrained Chinese-English large language model that aims to match the level of ChatGPT
Language:Python
attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Language:Python
attention_is_all_you_need_pytorch
Fork of jadore801120/attention-is-all-you-need-pytorch; fixes the problem of the en and de models failing to download
auto-sklearn
Automated Machine Learning with scikit-learn
Language:Python
DDPG_CNN_Pendulum_practice
practice
Language:Python
PySnooper
Never use print for debugging again
Language:Python

wangyy161's Repositories

wangyy161/caffe_ocr
An experimental project for researching mainstream OCR algorithms; currently implements the CNN+BLSTM+CTC architecture
wangyy161/ray
A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
wangyy161/adaptive-transformers-in-rl
Adaptive Attention Span for Reinforcement Learning
wangyy161/Recommendation_system_using_RL_RecSim
Explore the potential of recommendation system using reinforcement learning
wangyy161/Relational_DRL
Implementation of Relational Deep Reinforcement Learning
wangyy161/act-tensorflow
Adaptive Computation Time algorithm in TensorFlow
Language:Python
wangyy161/multi-agent-emergence-environments
Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"
wangyy161/tensorflow_practice
Hands-on TensorFlow practice, including reinforcement learning, recommendation systems, NLP, and more
wangyy161/auto-sklearn
Automated Machine Learning with scikit-learn
wangyy161/NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
wangyy161/leetcode_wyy
LeetCode practice problems
wangyy161/open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
Language:C++
wangyy161/DRL-algorithms
Application of deep reinforcement learning algorithms
Language:Python
wangyy161/test_learning
Mainly code used for everyday testing
Language:Python
wangyy161/OnlinePythonTutor
Visualize Python, Java, JavaScript, TypeScript, Ruby, C, and C++ code execution in your Web browser
Language:C
wangyy161/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Language:Python
wangyy161/machinelearning
My blogs and code for machine learning. http://cnblogs.com/pinard
wangyy161/models
Models and examples built with TensorFlow
Language:Python
wangyy161/mlsh
Code for the paper "Meta-Learning Shared Hierarchies"
Language:Python
wangyy161/drl-rec
Deep reinforcement learning for recommendation system
wangyy161/reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
Language:Python
wangyy161/pytorch-soft-actor-critic
PyTorch implementation of soft actor critic
Language:Python
wangyy161/pysot
SenseTime Research platform for single object tracking, implementing algorithms like SiamRPN and SiamMask.
Language:Python
wangyy161/CPS-OCR-Engine
An awesome OCR engine developed by SYSU DeepDriving Lab
wangyy161/PySnooper
Never use print for debugging again
Language:Python
wangyy161/DeepRL
Modularized Implementation of Deep RL Algorithms in PyTorch
Language:Python
wangyy161/Relational_Deep_Reinforcement_Learning
wangyy161/BiCNet
Bidirectionally-Coordinated Net (BiCNet) implemented with PyTorch 1.0
wangyy161/nndl.github.io
《神经网络与深度学习》 Neural Network and Deep Learning
Language:HTML
wangyy161/Faster-RCNN-TensorFlow-Python3.5
TensorFlow Faster R-CNN for Windows and Python 3.5
Language:Jupyter Notebook