/kaggle-qa-challenge-starter

The getting started notebook for the DTC Zoomcamp Q&A challenge

Primary LanguageJupyter Notebook

The DTC Zoomcamp Q&A Challenge Starter

A collection of starter notebooks for DTC Zoomcamp Q&A challenge on Kaggle on Kaggle

BERT Starter

In this notebook, we show how to:

  • Download the data from Kaggle and unzip it
  • Read the data
  • Use BERT to create embeddings
  • Calculate the cosine similarity between questions and answers
  • Select the most similar answer as prediction
  • Submit the results

See the notebook here: bert-starter.ipynb

You can run this notebook in SaturnCloud:

Run in Saturn Cloud

To use the recipe:

  • Download the credential file from Kaggle
  • Put the content of the file to SaturnCloud secrets, save this secret as "kaggle"
  • Click on the button above to create a resource in SaturnCloud
  • Verify that the kaggle secret is linked in the "secrets" tab
  • Run the code and submit your predictions
  • Improve the score

You can also see it as a video:

(The video was originally created for another competition, but it works for this one too)

XGBoost starter

TBA