/spark-nlp-project

Micro project on big data technologies using Spark

Primary language: Jupyter Notebook

Russian language processing via Spark(NLP) 🔥

Go to Colab

Contents:

  1. Colab-Spark setup

  2. Data loading

  3. EDA & Preprocessing

  4. Pipelines & Experiments

  5. Text preprocessing

  6. Text classification

    • BoW models + LogReg
    • Transfer Learning (at least an attempt 😀)
  7. Entity Recognition & Entity Linking

Tech stack:

...and much more 🤘