/img-captioning-in-vietnamese

An attempt to solve image captioning (in Vietnamese language) regarding ball sports contexts.

Primary LanguagePython

img-captioning-in-vietnamese

Project description

Pipeline

image

Applications

  • Speech synthesis to help the blind people
  • Automatically generate image description on e-commerce platforms
  • Video description. Used for searching video

The notebooks

The notebooks for preprocessing data, training, evaluating the experiments are in strategy-1 branch

More details

Read our thesis here

The demo

Idea

Simulate a sport event where the crowd is cheering, and the commentator is delivering his speech based on the situation of the match.

Watch it

Demo video

The model

Reproduce the demo

Create anaconda environment

conda env create -f environment.yml

Run

cd cv-nlp-end-term
streamlit run test.py