JohnSnowLabs/spark-nlp-workshop

Dead URLs for example input files

dkaufman-rc opened this issue · 0 comments

Description

The notebook https://github.com/JohnSnowLabs/spark-nlp-workshop/blob/master/open-source-nlp/12.0.NLU_Crashcourse_Every_SparkNLP_Model_in_One_Line.ipynb contains wget commands that fail because the URLs they download from are no longer live.

Steps to Reproduce

Run the cells containing either of

  • !wget http://ckl-it.de/wp-content/uploads/2020/12/small_btc.csv
  • !wget http://ckl-it.de/wp-content/uploads/2021/02/french_tweets.csv
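
To confirm the failure outside the notebook, the two URLs can be probed directly. The following is only a minimal sketch using the Python standard library; the helper name check_url, the URLS list, and the injectable opener parameter are my own illustrative choices and not part of the notebook or of Spark NLP. It sends a HEAD request, which some servers reject, so a non-OK result is a hint rather than proof.

```python
import urllib.error
import urllib.request

# The two download URLs reported in this issue.
URLS = [
    "http://ckl-it.de/wp-content/uploads/2020/12/small_btc.csv",
    "http://ckl-it.de/wp-content/uploads/2021/02/french_tweets.csv",
]


def check_url(url, timeout=10, opener=urllib.request.urlopen):
    """Return a short status string for `url` instead of raising.

    `opener` is a hypothetical injection point (defaults to urlopen)
    so the function can also be exercised without network access.
    """
    # HEAD avoids downloading the file body; assumes the server allows it.
    request = urllib.request.Request(url, method="HEAD")
    try:
        with opener(request, timeout=timeout) as response:
            return f"OK ({response.status})"
    except urllib.error.HTTPError as exc:
        # The server answered, but with an error status such as 404.
        return f"HTTP error ({exc.code})"
    except (urllib.error.URLError, OSError) as exc:
        # DNS failure, refused connection, timeout, and similar.
        return f"unreachable ({exc})"
```

In a notebook cell, `for url in URLS: print(url, check_url(url))` would print one status line per URL.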

Your Environment

  • Spark-NLP version: 5.0.2
  • Apache Spark version: 3.1.2
  • Operating System and version: Databricks runtime 9.1
  • Deployment (Docker, Jupyter, Scala, pip, conda, etc.): Databricks notebook