sparknlp

There are 19 repositories under sparknlp topic.

  • medicare-risk-adjustment

    databricks-industry-solutions/medicare-risk-adjustment

    Databricks and John Snow Labs Solution Accelerator for Medicare Risk Adjustment automates the extraction of undiagnosed member conditions from unstructured clinical notes with NLP models, improving downstream reimbursements.

    Language:Python12527
  • Dirkster99/PyNotes

    My notebook on using Python with Jupyter Notebook, PySpark etc

    Language:Jupyter Notebook10207
  • databricks-industry-solutions/ocr-phi-masking

    Our joint Solution Accelerator with John Snow Labs automates the detection of sensitive information contained within unstructured data using NLP models for healthcare. Extracted data is stored within the Lakehouse, where teams can use the pre-trained models to easily remove, obfuscate or mask data for downstream analytics at massive scale.

    Language:Python7313
  • databricks-industry-solutions/adverse-drug-events

    To ensure ongoing drug safety, pharma companies need to monitor and report adverse drug events post-market launch. This accelerator extracts, processes and analyzes adverse drug events from real-world text data using NLP

    Language:Python5316
  • databricks-industry-solutions/oncology

    Generate oncology insights from real-world data using NLP. Once extracted, oncology data is enriched with useful information like ICD-10 codes and used to build powerful visualizations

    Language:Python5315
  • databricks-industry-solutions/toxicity-detection-in-gaming

    Build a lakehouse for all your gamer data and use natural language processing techniques to flag questionable comments for moderation.

    Language:Python550
  • VirtualRoyalty/spark-nlp-project

    Micro project on big data technologies via spark

    Language:Jupyter Notebook4200
  • databricks-industry-solutions/jsl-medical-risk-factors

    Automated Extraction of Medical Risk Factors For Life Insurance Underwriting

    Language:Python3202
  • databricks-industry-solutions/jsl-financial-nlp

    Drawing a Company Ecosystem Graph

    Language:Python1301
  • sdarjunwadkar/Political-Idealogies-Prediction-in-News-Articles

    Media diversity shapes perspectives, yet biased news distorts reality, fostering misinformation. 'Political Ideologies Prediction in News Articles' aims to forecast bias using PySpark, NLP, and ML for adaptable, swift inference. Integrated with NYT API, it predicts bias in top political articles, fostering better understanding of subjective content

    Language:Jupyter Notebook1
  • AjaySurya-018/Emotion_Detection_in-text

    Web application to detect emotion in text

    Language:Jupyter Notebook0100
  • chuyu-c/NLP-with-Reddit-Comment

    This project focuses on the use of big data platforms, specifically Spark (PySpark, SparkML, Spark NLP). We will use the comment text a user posted, categorize the sentiment and predict scores of each comment. Our objective is to understand the dynamics of the Reddit online community and how the way people communicate online leads to different reactions from the community.

    Language:Jupyter Notebook0100
  • doshiharmish/Politica-ldeologies-Predictionin-News-Articles

    Media diversity shapes perspectives, yet biased news distorts reality, fostering misinformation. 'Political Ideologies Prediction in News Articles' aims to forecast bias using PySpark, NLP, and ML for adaptable, swift inference. Integrated with NYT API, it predicts bias in top political articles, fostering better understanding of subjective content

    Language:Jupyter Notebook0100
  • prabhupavitra/Text-Summarization-PySpark

    Text summarization algorithms using PySpark

    Language:Jupyter Notebook0200
  • saadkh1/Bert_Spark_Example

    This repository provides examples of using pre-trained BERT models from SparkNLP with PySpark for Natural Language Processing task.

    Language:Jupyter Notebook0100
  • uche-madu/deb-application

    This repository contains application code for the Wizeline Data Engineering Bootcamp (DEB) 2023. It is one of two repositories for the DEB. The other houses the infrastructure code.

    Language:Python0101
  • wongkhoon/Coursera

    Completed professional certificates and specializations via Coursera

    Language:Jupyter Notebook0101
  • cmsptcp/tsmp

    Twitter based stock market prediction using Pyspark, project for Big Data PW 2020L

    Language:Jupyter Notebook
  • gympohnpimol/Spark

    Language:Python10