jlmartin100

Data analyst, Arabic translator.

Chicago IL

jlmartin100's Stars

tqdm/tqdm
:zap: A Fast, Extensible Progress Bar for Python and CLI
Language:Python28.5k 206 9961.4k
twintproject/twint
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.
Language:Python15.8k 328 1.2k2.7k
jacobeisenstein/gt-nlp-class
Course materials for Georgia Tech CS 4650 and 7650, "Natural Language"
Language:TeX4.9k 320 281.1k
adashofdata/nlp-in-python-tutorial
comparing stand up comedians using natural language processing
Language:Jupyter Notebook1.7k 91 91.4k
awslabs/open-data-registry
A registry of publicly available datasets on AWS
Language:Python1.4k 70 100900
gregversteeg/corex_topic
Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx
Language:Python626 29 41119
aub-mind/arabert
Pre-trained Transformers for Arabic Language Understanding and Generation (Arabic BERT, Arabic GPT2, Arabic ELECTRA)
Language:Python623 28 59138
brandomr/document_cluster
A guide to document clustering in Python
Language:Jupyter Notebook507 25 17339
mohataher/arabic-stop-words
Largest list of Arabic stop words on Github. أكبر قائمة لمستبعدات الفهرسة العربية على جيت هاب
306 12 2149
multilingual-dh/nlp-resources
Natural language processing resources for multiple languages, with an eye towards use for digital humanities.
124 12 012
aub-mind/Arabic-Empathetic-Chatbot
Seq2Seq-based open domain empathetic conversational model for Arabic: Dataset & Model
Language:Jupyter Notebook56 4 412
prinshul/Text-Scraping-Document-Clustering-Topic-modeling
The objective of this project is to scrape a corpus of news articles from a set of web pages, pre-process the corpus, and then to apply unsupervised clustering algorithms to explore and summarise the contents of the corpus. Part 1. Text Data Scraping This part of the project should be implemented as a Python script 1. Identify the URLs for all news articles listed on the website: http://mlg.ucd.ie/modules/COMP41680/news/index.html 2. Retrieve all web pages corresponding to these article URLs. 3. From the web pages, extract the main body text containing the content of each news article. Save the body of each article as plain text. Part 2. Corpus Exploration Tasks to be completed in your IPython notebook: 1. Load the text corpus generated in Part 1. Apply any appropriate pre-processing steps and construct a document-term matrix representation of the corpus. 2. Summarise the overall corpus by identifying the most characteristic terms and phrases in the corpus. 3. Apply two alternative clustering algorithms of your choice to the document-term matrix to produce clusters of related documents. This might require applying each algorithm several times with different parameter values. 4. For each clustering generated in Step 3, summarise the contents of the clusters. Based on your summary, suggest a topic/theme for each cluster.
Language:Jupyter Notebook50 2 113
aub-mind/hULMonA
hULMonA (حلمنا): tHe first Universal Language MOdel iN Arabic
Language:Jupyter Notebook46 5 010
EmnamoR/Arabic-named-entity-recognition
Arabic named entity recognition using AnerCorp corpus (location , organisation, person, Miscellaneous Word)
Language:Jupyter Notebook37 3 17
pksohn/tweet-clustering
Clustering analysis of one million tweets using scikit-learn, including basic benchmarking of various clustering algorithms
Language:Jupyter Notebook36 4 113
ObeidaElJundi/Arabic-Image-Captioning
Generate Arabic captions for images using Deep Learning
Language:Jupyter Notebook28 2 113
SarahAlqurashi/COVID-19-Arabic-Tweets-Dataset
The repository contains a collection of Arabic tweets IDs associated with the novel coronavirus COVID-19. The dataset contains Tweets' ids from 2020-01-01 to 2020-04-30. The Twitter search API was used to gather real-time tweets that contained specific keywords in the Arabic language. The dataset contains almost four millions and half Arabic tweets.
Language:Jupyter Notebook27 4 013
abursuc/dldiy-practicals
Slides, Jupyter Notebooks and scripts for the Deep Learning: Do-It-Yourself! lectures at ENS
Language:Jupyter Notebook21 9 212
aub-mind/Arabic-Image-Captioning
Generate Arabic captions for images using Deep Learning
Language:Jupyter Notebook16 1 02
CAMeL-Lab/WIDH_2020_Arabic_Text_Analysis
Material for the Text Analysis of Arabic course taught at the NYU Abu Dhabi Winter Institute in Digital Humanities 2020.
Language:Jupyter Notebook12 2 05
junhua/EPIC
EPIC: a large collection of over 30 million epidemic-related tweets
12 3 00
ObeidaElJundi/hULMonA
hULMonA (حلمنا): tHe first Universal Language MOdel iN Arabic
Language:Jupyter Notebook8 2 112
BeirutAI/IntroNLP
Language:Jupyter Notebook4 5 02
keryums/topic_modelling_demo
A workflow for CorEx-based topic modeling
Language:Jupyter Notebook42
dahouabdelghani/arabic_word_embeddings_CNN
Word Embeddings and Convolutional Neural Network for Arabic Sentiment Classification (Coling 2016)
Language:Python3 2 01
edubu2/metis-project4
Investigating the impact of Twitter Bots on the 2020 U.S. Presidential Election's Twitter Discourse
Language:Jupyter Notebook2 1 00
andreasorcinelli/Topic-Modeling-of-Tweets-Related-to-NFL-and-National-Anthem
My fourth project that I completed at Metis uses topic modeling to detect structure in tweets related to the nfl and national anthem.
Language:Jupyter Notebook1
mattranalletta/04_biden_election_tweets_NLP
METIS PROJECT 4: NATURAL LANGUAGE PROCESSING & UNSUPERVISED LEARNING // Skills: NLTK, Sci-kit Learn NLP libraries (TF-IDF vectorizer, K-means clustering, PCA, t-SNE), Wordcloud library
Language:Jupyter Notebook10
ValentinaPenaV/Twitter_NLP
Metis NLP project on Twitter Customer Service data
Language:Jupyter Notebook1
wandabwa2004/twitter-protest-analysis
Analysis of 10 million tweets
Language:Python10

jlmartin100

jlmartin100's Stars

tqdm/tqdm

twintproject/twint

jacobeisenstein/gt-nlp-class

adashofdata/nlp-in-python-tutorial

awslabs/open-data-registry

gregversteeg/corex_topic

aub-mind/arabert

brandomr/document_cluster

mohataher/arabic-stop-words

multilingual-dh/nlp-resources

aub-mind/Arabic-Empathetic-Chatbot

prinshul/Text-Scraping-Document-Clustering-Topic-modeling

aub-mind/hULMonA

EmnamoR/Arabic-named-entity-recognition

pksohn/tweet-clustering

ObeidaElJundi/Arabic-Image-Captioning

SarahAlqurashi/COVID-19-Arabic-Tweets-Dataset

abursuc/dldiy-practicals

aub-mind/Arabic-Image-Captioning

CAMeL-Lab/WIDH_2020_Arabic_Text_Analysis

junhua/EPIC

ObeidaElJundi/hULMonA

BeirutAI/IntroNLP

keryums/topic_modelling_demo

dahouabdelghani/arabic_word_embeddings_CNN

edubu2/metis-project4

andreasorcinelli/Topic-Modeling-of-Tweets-Related-to-NFL-and-National-Anthem

mattranalletta/04_biden_election_tweets_NLP

ValentinaPenaV/Twitter_NLP

wandabwa2004/twitter-protest-analysis