/DeepTTE

Primary LanguagePython

This project is the code of AAAI 2018 paper When Will You Arrive? Estimating Travel Time Based on Deep Neural Networks.

We provide the complete version of code and part of sample data in Chengdu. You can replace the sample data with your own data easily. See the samples in data/ for more details. The complete data can be downloaded at https://duke.box.com/s/ni5ca8iktneq828fk5cul8afwkvszkdr , which is provided by the following competion http://www.dcjingsai.com/common/cmpt/%E4%BA%A4%E9%80%9A%E7%BA%BF%E8%B7%AF%E9%80%9A%E8%BE%BE%E6%97%B6%E9%97%B4%E9%A2%84%E6%B5%8B_%E8%B5%9B%E4%BD%93%E4%B8%8E%E6%95%B0%E6%8D%AE.html.

Usage:

Model Training

python train.py

Parameters:

  • task: train/test
  • batch_size: the batch_size to train, default 400
  • epochs: the epoch to train, default 100
  • kernel_size: the kernel size of Geo-Conv, only used when the model contains the Geo-conv part
  • pooling_method: attention/mean
  • alpha: the weight of combination in multi-task learning
  • log_file: the path of log file
  • result_file: the path to save the predict result. By default, this switch is off during the training

Example:

python main.py --task train  --batch_size 10  --result_file ./result/deeptte.res --pooling_method attention --kernel_size 3 --alpha 0.1 --log_file run_log

Model Evaluation

Parameters:

  • weight_file: the path of model weight
  • result_file: the path to save the result

Example:

python main.py --task test --weight_file ./saved_weights/weight --batch_size 10  --result_file ./result/deeptte.res --pooling_method attention --kernel_size 3 --alpha 0.1

How to User Your Own Data

In the data folder we provide some sample data. You can use your own data with the corresponding format as in the data samples. The sampled data contains 1800 trajectories. To make the model performance close to our proposed result, make sure your dataset contains more than 5M trajectories.

Format Instructions

Each sample is a json string. The key contains:

  • driverID
  • dateID: the date in a month, from 0 to 30
  • weekID: the day of week, from 0 to 6 (Mon to Sun)
  • timeID: the ID of the start time (in minute), from 0 to 1439
  • dist: total distance of the path (KM)
  • time: total travel time (min), i.e., the ground truth. You can set it as any value during the test phase
  • lngs: the sequence of longitutes of all sampled GPS points
  • lats: the sequence of latitudes of all sampled GPS points
  • states: the sequence of taxi states (available/unavaible). You can remove this attributes if it is not available in your dataset. See models/base/Attr.py for details.
  • time_gap: the same length as lngs. Each value indicates the time gap from current point to the firt point (set it as arbitrary values during the test)
  • dist_gap: the same as time_gap

The GPS points in a path should be resampled with nearly equal distance.

Furthermore, repalce the config file according to your own data, including the dist_mean, time_mean, lngs_mean, etc.