DLCV Final Project ( Talking to me ) | Poster

├──student_data
    ├──test
    ├──train
    ├──videos
    ├──Frame
        ├──{video_hashcode}_{frame_id}.jpg
    ├──Audio
        ├──{video_hashcode}.wav

Environment

# install the dependency
pip install -r requirements.txt

Before reproducing

# Download the model by executing
bash download.sh

Data Pre-processing

# Download the model by executing
bash prerocess.sh <Path to videos folder> <Path to seg folder> <Path to bbox folder>
# i.e. bash preprocess.sh './student_data/videos' './student_data/train/seg' './student_data/train/bbox' && bash preprocess.sh './student_data/videos' './student_data/test/seg' './student_data/test/bbox'

Reproduce the inference

# to reproduce test file
bash inference.sh <Path to videos folder> <Path to seg folder> <Path to bbox folder>  <Path to output csv file>
# i.e. bash inference.sh './student_data/videos' './student_data/test/seg' './student_data/test/bbox' 'output.csv'

Training

# to run the training
bash train.sh <Path to videos folder> <Path to seg folder> <Path to bbox folder>
# i.e. bash train.sh './student_data/videos' './student_data/train/seg' './student_data/train/bbox'

Contributors

薛博文, Dobby-the-elf
李尚宴, ShangYenLee
許雅晴, Shelley1214
黃秉茂, maomao0819

Shelley1214/DLCVFinalProject_TalkingToMe

DLCV Final Project ( Talking to me ) | Poster

Environment

Before reproducing

Data Pre-processing

Reproduce the inference

Training

Contributors