/fp-ctqa

fp-ctqa created by GitHub Classroom

Primary LanguageJupyter NotebookBSD 3-Clause "New" or "Revised" LicenseBSD-3-Clause

CMU Interactive Data Science Final Project

CTQA stands for Covid19 Twitter Question Answering

Work distribution

Yuanxin: Sentence encoder NLI model R&D

Aditya: NER model R&D

Yuanyuan: UI & Data Pipeline

Jefferey: UI & Results Analysis

A commentary on the project process: the project begins in brainstorming ideas with sufficient depth and appropriate scope. Then dataset exploration, dataset comparision, data cleaning, data sampling, and data preprocessing operations are performned parallelly with the development of the two models. All team members contributes to the bringing up ideas on different strategies on offline training to reduce workload for the streamlit server. Also, the team tries to make informed decision at every section of the project especially in selecting hyperparamters for users to interact with.

Deliverables

Proposal

  • The URL at the top of this readme needs to point to your application online. It should also list the names of the team members.
  • A completed proposal. The contact should submit it as a PDF on Canvas.

Design review

  • Develop a prototype of your project.
  • Create a 5 minute video to demonstrate your project and lists any question you have for the course staff. The contact should submit the video on Canvas.

Final deliverables

  • All code for the project should be in the repo.
  • A 5 minute video demonstration.
  • Update Readme according to Canvas instructions.
  • A detailed project report. The contact should submit the video and report as a PDF on Canvas.