Real-time video streaming is now one of the main applications in all network environments. Due to the fluctuation of throughput under various network conditions, how to choose a proper bitrate adaptively has become an upcoming and interesting issue. To tackle this problem, most proposed rate control methods work for providing high video bitrates instead of video qualities. Nevertheless, we notice that there exists a trade-off between sending bitrate and video quality, which motivates us to focus on how to reach a balance between them. We then propose QARC (video Quality Aware Rate Control), a rate control algorithm that aims to obtain a higher perceptual video quality with possible lower sending rate and transmission latency. In detail, QARC uses deep reinforcement learning(DRL) algorithm to train a neural network to select future bitrates based on previously observed network status and past video frames. To overcome the ``state explosion problem'', we design a neural network to predict future perceptual video quality as a vector for taking the place of the raw picture in the DRL's inputs.
The current codebase runs on Python 2.7. This work requires tensorflow, tflearn, scipy, matplotlib, opencv2 and tensorboard. Please install these package before using QARC.
pip install tensorflow tflearn scipy==1.2.2 matplotlib opencv-python tensorboard
In 'videodatasets/', we describe how to generate video datasets.
- Clone the vmaf git repository and checkout version 1.3.9 (available on the
v1.3.9
tag). for computing vmaf score: git clone https://github.com/Netflix/vmaf, then install it. - Copy the 'videodatasets' folder to the 'vmaf' folder.
- Download several video clips into 'videodatasets/mov' folders, and the video MUST be encoded as h.264 format.
- Run 'trans.py' for transcoding 'mp4' video format to 'flv'.
- Run 'main.py' for generating video datasets including logs and video frames. If you encounter the following error,
Must install ffmpeg and set FFMPEG_PATH in vmaf/python/src/vmaf/externals.py, e.g. add a line like
FFMPEG_PATH = "[path to exec]/ffmpeg"
run which ffmpeg
from the commmand line to find your path to ffmpeg
and add it to the externals.py
file as noted.
- Run
process-vmaf.py
to generate a h5py file for training. Note this script expects frame image files from the previous step to be in a subdirectory namedimg/<original_video_filename>/
. E.g., if the original file was1.mp4
, then it expects frame image files to be underimg/1.mp4/
.
The traditional QARC method is composed of two modules: VQPN and VQRL.
To run and evaluate VQPN, you should follow these steps.
-
Copy the '*.h5' training dataset and test dataset from 'videodatasets/' to 'QARC/VQPN'
-
There are two nerual network architecture candidates, GRU model (as shown in the paper) and CNN model (also reaches a high performance but a little bit lower than the GRU model performs), in different folders. Type 'cd gru/ & python main.py' for training GRU model or 'cd cnn/ & python main.py' for training CNN model. We use early stopping method that automaticlly tear down the training process if overfitting occurs.
-
We provide several baselines for evaluating the performance of VQPN in 'baseline/' folder.
To run and test VQRL, one should follow these steps:
-
Copy the pretrain VQPN model to 'QARC/VQRL/model/', we recommend GRU model for higher performances.
-
Collect network traces and copy the traces to 'QARC/VQRL/train_sim_traces/', the network traces' format is described like this: 'timestamp\tbandwidth\n', and you can also use cooked_traces from Pensieve as network trace dataset.
-
Run the training process via 'python main.py'
The useage of advanced-QARC is quite same as QARC.