Pinned Repositories
CNN-for-CQA
DRNN
dynamic recursive neural network
megablocks
MoA
Mixture of Attention Heads
Ordered-Memory
This repository contains the code used for Ordered Memory paper
Ordered-Neurons
Code for the paper "Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks"
PRPN
Parsing Reading Predict Network
rouge
A full Python Implementation of the ROUGE Metric (not a wrapper)
self-attentive-parser
Constituency Parsing with a Self-Attentive Encoder (ACL 2018)
UDGN
Code for Unsupervised Dependency Graph Network paper
yikangshen's Repositories
yikangshen/Ordered-Neurons
Code for the paper "Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks"
yikangshen/PRPN
Parsing Reading Predict Network
yikangshen/MoA
Mixture of Attention Heads
yikangshen/Ordered-Memory
This repository contains the code used for Ordered Memory paper
yikangshen/megablocks
yikangshen/UDGN
Code for Unsupervised Dependency Graph Network paper
yikangshen/DRNN
dynamic recursive neural network
yikangshen/CNN-for-CQA
yikangshen/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
yikangshen/rouge
A full Python Implementation of the ROUGE Metric (not a wrapper)
yikangshen/self-attentive-parser
Constituency Parsing with a Self-Attentive Encoder (ACL 2018)
yikangshen/awd-lstm-lm
yikangshen/DCNN_SemEval_Yikang
DCNN base SemEval
yikangshen/dolomite-engine
yikangshen/examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
yikangshen/IAMhwr
Online handwriting recognition on IAM-ON database with TDNN and RNN
yikangshen/Inpainting
IFT6266 Deep Learning course project by Yikang Shen
yikangshen/LM_syneval_OrderedNeurons
Code for replicating the work in "Targeted Syntactic Evaluation of Language Models." EMNLP 2018.
yikangshen/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
yikangshen/mos
yikangshen/PRPN-Analysis
This repo contains the analysis results reported in the paper "Grammar Induction with Neural Language Models: An Unusual Replication"
yikangshen/smartdispatch
An easy to use job launcher for supercomputers with PBS compatible job manager.
yikangshen/Theano
Theano is a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It can use GPUs and perform efficient symbolic differentiation.
yikangshen/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
yikangshen/tutel
Tutel MoE: An Optimized Mixture-of-Experts Implementation
yikangshen/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs