JohnSnowLabs/spark-nlp-workshop

dl-ner.ipynb incorrect start and download with `pipeline_fast_dl`

Chertushkin opened this issue · 1 comment

PipelineModel with stages does not load

Steps to Reproduce

  1. Pull and run the Docker image
  2. Run notebook https://github.com/JohnSnowLabs/spark-nlp-workshop/blob/master/jupyter/annotation/english/model-downloader/dl-ner.ipynb
  3. Run cell 3. It fails at `spark.createDataFrame` with an exception saying that `spark` is not available, because cell 2 calls `sparknlp.start()` without keeping the session it returns. Quick fix: in cell 2, change `sparknlp.start()` to `spark = sparknlp.start()`. After that, cell 3 runs and you can proceed.
  4. Run cell 4. It raises an exception saying that the resource failed to download, so `pipeline_fast_dl` is never initialized.
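
The quick fix from step 3 can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual cells: the helper name `build_sample_frame`, the sample sentence, and the `text` column name are assumptions made for the example. The only point it demonstrates is that the session returned by `sparknlp.start()` has to be bound to a name before `spark.createDataFrame` can be called.

```python
# Sketch of the cell 2 / cell 3 fix. Assumes the spark-nlp package and a
# working Java/Spark setup are available (as in the workshop Docker image).
try:
    import sparknlp
except ImportError:  # lets the snippet be read or imported without spark-nlp
    sparknlp = None


def build_sample_frame(spark):
    """Build a one-row DataFrame with a single 'text' column.

    The sentence and the column name are hypothetical placeholders,
    not the data used in dl-ner.ipynb.
    """
    return spark.createDataFrame([["Peter is a good person."]]).toDF("text")


if sparknlp is not None:
    # Before the fix, cell 2 only called sparknlp.start(), so the name
    # `spark` was never defined. Binding the return value fixes cell 3.
    spark = sparknlp.start()
    data = build_sample_frame(spark)
    data.show()
```

This does not address step 4: the download failure for `pipeline_fast_dl` is independent of how the session is started.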

Your Environment

  • Spark-NLP version: 2.0.3
  • Apache Spark version: 2.4.1
  • Operating System and version: the latest Docker image
  • Deployment (Docker, Jupyter, Scala, pip, conda, etc.): I have pulled the latest docker as described on the main page.

This has been resolved in the latest Docker image. Both notebooks in the model-downloader directory now work.