What's in this fork?

This is a fork of this repository. It fixes some minor typos and python notebook issues. It also introduces a docker file, fixes python requirements and provides python implementation of example train code originally implemented in python notebook. The goal is to test the training speed.

Bare metal

virtualenv -p python3 ./.python3
source ./.python3/bin/activate
pip install -r ./requirements.txt
# For quick test
CUDA_VISIBLE_DEVICES=0 python ./test_trnn.py  --training_steps 100 --display_step 25

Docker containers

The base docker image for deepts/tensorflow-1.13-1 is nvidia/cuda:10.0-cudnn7-runtime-ubuntu16.04 because I had this image on my dev box.

docker build -f ./docker/Dockerfile-gpu -t deepts/tensorflow:gpu-1.13-1 .
nvidia-docker run --rm -ti -v $(pwd):/workspace deepts/tensorflow:gpu-1.13-1
cd /workspace
CUDA_VISIBLE_DEVICES=0 python ./test_trnn.py  --training_steps 100 --display_step 25

Tensor Train Recurrent Neural Network

Clean code repo for tensor train recurrent neural network, implemented in Tensorflow. See details in our paper Long-Term Forecasting with Tensor Train RNNs

Getting Started

install prerequisites

tensorflow >= r1.6
Python >=3.0
Jupyter >=4.1.1

import module

from trnn import TensorLSTMCell

from trnn_imply import tensor_rnn_with_feed_prev

Classes

TensorLSTMCell(num_units, num_lags, rank_vals) – creates a TensorTrainLSTM object with num_units hidden nodes, num_lags time lags, with rank_vals is the list of values for tensor train decomposition rank

Methods

tensor_rnn_with_feed_prev – forward pass for a single TensorTrainLSTM cell, returns an output and a hidden state.

Running the test

Run the Jupyter notebook

jupyter notebook test_trnn.pynb

A simple example of using TensorTrainLSTM by

loading a set of sim sequences
building a tensor train Seq2Seq model
making long-term predictions

Citation

If you think the repo is useful, we kindly ask you to cite our work at

@article{yu2017long,
  title={Long-term forecasting using tensor-train RNNs},
  author={Yu, Rose and Zheng, Stephan and Anandkumar, Anima and Yue, Yisong},
  journal={arXiv preprint arXiv:1711.00073},
  year={2017}
}

sergey-serebryakov/tensor_train_RNN