Pinned Repositories
10_days_of_deep_learning
10 days 10 different practical applications of Deep Learning (primarily NLP) using Tensorflow and Keras
Algorithm
algorithms-stanford
This repo holds my solutions (in python) for the programming assignments of the Coursera course Algorithms - Design and Analysis (Stanford)
alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
asst1
Stanford CS149 -- Assignment 1
blocksparse
Efficient GPU kernels for block-sparse matrix multiplication and convolution
decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
headlines
Automatically generate headlines to short articles
transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
Victordongy.github.io
Personal Blog
Victordongy's Repositories
Victordongy/Victordongy.github.io
Personal Blog
Victordongy/alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Victordongy/asst1
Stanford CS149 -- Assignment 1
Victordongy/blocksparse
Efficient GPU kernels for block-sparse matrix multiplication and convolution
Victordongy/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
Victordongy/transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
Victordongy/asst1-1
Stanford CS149 -- Assignment 1
Victordongy/asst2
Stanford CS149 -- Assignment 2
Victordongy/covid-vaccine-simulation
A simulation project for Covid vaccine distribution
Victordongy/deeplift
Public facing deeplift repo
Victordongy/degen
Official Repository for "The Curious Case of Neural Text Degeneration"
Victordongy/Dynamic_Contract
Victordongy/gpt-2-output-dataset
Dataset of GPT-2 outputs for research in detection, biases, and more
Victordongy/HUSE
Official Github repo for the paper "Unifying Human and Statistical Evaluation for Natural Language Generation"
Victordongy/kaldi
This is now the official location of the Kaldi project.
Victordongy/large-scale-curiosity
Code for the paper "Large-Scale Study of Curiosity-Driven Learning"
Victordongy/models
Models and examples built with TensorFlow
Victordongy/mopo
Code for MOPO: Model-based Offline Policy Optimization
Victordongy/noreward-rl
[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning
Victordongy/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Victordongy/PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Victordongy/RankGan-NIPS2017
Tensorflow implementation of RankGan (Adversarial Ranking for Language Generation)
Victordongy/softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
Victordongy/sparse_attention
Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"
Victordongy/tensorflow
An Open Source Machine Learning Framework for Everyone
Victordongy/TextGAIL
Victordongy/tianshou
An elegant PyTorch deep reinforcement learning platform.
Victordongy/trajectory-transformer
Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"
Victordongy/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Victordongy/weak-to-strong