kavgan
Author of The Business Case For AI. AI Advisor, Strategist & Consultant. Ph.D. in CS. Join 4500+ newsletter readers. https://ai-integrated-newsletter.beehiiv
@opinosis-analytics
Salt Lake City
Pinned Repositories
clinical-concepts
Discovering Related Clinical Concepts using Large Amounts of Clinical Notes. An unsupervised graphical approach to mine related concepts by leveraging the volume within large amounts of clinical notes.
nlp-in-practice
Starter code to solve real-world text data problems. Includes: Gensim Word2Vec, phrase embeddings, text classification with logistic regression, word count with PySpark, simple text preprocessing, pre-trained embeddings, and more.
opinosis-summarization
This repo contains the code and dataset for the Opinosis Summarization Framework.
OpinRank
OpinRank Dataset. Contains user reviews of cars and hotels. Full reviews from Tripadvisor (~259,000 reviews) and Edmunds (~42,230 reviews).
phrase-at-scale
Detect common phrases in large amounts of text using a data-driven approach. Discovered phrases can be of arbitrary length. Can be used in languages other than English.
ROUGE-2.0
ROUGE automatic summarization evaluation toolkit. Supports ROUGE-[N, L, S, SU], stemming and stop words in different languages, Unicode text evaluation, and CSV output.
spark-examples
Examples of code in Spark
stop-words
Stop word lists
text-mining-and-nlp-apis
APIs for clustering sentences, extracting topics, counting words and n-grams, extracting text from HTML or a URL, computing similarity between texts, and more.
word_cloud
Python word cloud library for use within Jupyter notebook and Python apps.
kavgan's Repositories
kavgan/nlp-in-practice
Starter code to solve real-world text data problems. Includes: Gensim Word2Vec, phrase embeddings, text classification with logistic regression, word count with PySpark, simple text preprocessing, pre-trained embeddings, and more.
kavgan/ROUGE-2.0
ROUGE automatic summarization evaluation toolkit. Supports ROUGE-[N, L, S, SU], stemming and stop words in different languages, Unicode text evaluation, and CSV output.
kavgan/phrase-at-scale
Detect common phrases in large amounts of text using a data-driven approach. Discovered phrases can be of arbitrary length. Can be used in languages other than English.
kavgan/opinosis-summarization
This repo contains the code and dataset for the Opinosis Summarization Framework.
kavgan/word_cloud
Python word cloud library for use within Jupyter notebook and Python apps.
kavgan/OpinRank
OpinRank Dataset. Contains user reviews of cars and hotels. Full reviews from Tripadvisor (~259,000 reviews) and Edmunds (~42,230 reviews).
kavgan/clinical-concepts
Discovering Related Clinical Concepts using Large Amounts of Clinical Notes. An unsupervised graphical approach to mine related concepts by leveraging the volume within large amounts of clinical notes.
kavgan/spark-examples
Examples of code in Spark
kavgan/stop-words
Stop word lists
kavgan/text-mining-and-nlp-apis
APIs for clustering sentences, extracting topics, counting words and n-grams, extracting text from HTML or a URL, computing similarity between texts, and more.
kavgan/hashtags_test
Test hashtags
kavgan/Micropinion-Generation-Dataset
Dataset for Micropinion Generation, based on user reviews from CNET. The reviews cover products from various categories such as TVs, cell phones, and GPS devices.
kavgan/bootstrap
The most popular HTML, CSS, and JavaScript framework for developing responsive, mobile first projects on the web.
kavgan/data-science-blogs
A curated list of data science blogs
kavgan/pyrxnlp
Super simple NLP tools. Cluster sentences, compute multiple text similarity measures (including cosine, Jaccard, and Dice), generate topics, extract text from HTML, and more.
kavgan/python-examples
Working examples in Python
kavgan/CoreNLP
Stanford CoreNLP: A Java suite of core NLP tools.
kavgan/electron
Build cross platform desktop apps with JavaScript, HTML, and CSS
kavgan/GeoSpark
A Cluster Computing System for Processing Large-Scale Spatial Data
kavgan/images
website images
kavgan/rails
Ruby on Rails
kavgan/resources
Curated List of Blog Posts From Opinosis Analytics
kavgan/ROUGE-Utility
Utility tools to prepare and evaluate ROUGE scores. Includes a Perl script to convert the output of the Perl ROUGE implementation to CSV.
kavgan/SIF_mini_demo
Minimal example of sentence embedding using the Smooth Inverse Frequency (SIF) weighting scheme
kavgan/spark
Mirror of Apache Spark
kavgan/spark-lucenerdd
Spark RDD with Lucene's query capabilities
kavgan/spectron
Test Electron apps using ChromeDriver
kavgan/stanza
Stanford NLP group's shared Python tools.
kavgan/test-repo
Test repo
kavgan/vuln_test_repo_public_ruby_gemfile_cve-2016-6317