maybemkl
Your friendly neighbourhood webcrawler. Spatial data science and natural language processing.
McGill UniversityMontreal
Pinned Repositories
poi-osm
Scripts for getting POIs from OSM data
airbnb-place-ner
Data and models to extract toponyms and spatio-temporal entities from text data.
CSS_Programming
Python scripts written during the course "Programming in Social Science" at the University of Helsinki.
DigitalMethods_presentation2017
Presentation in the working group on 'The Digitalization of Societies & Methods' at the Finnish Sociology Days. Scraping & mapping data from Twitter and Facebook with R, Python, Open Refine & CartoDB.
GIS_tweets
This is my final project for a GIS class at Columbia, exploring the relationship between tweeting and gentrification.
maybemkl
minä minä minä
maybemkl.github.io
GitHub Pages
wmdecompose
A Python implementation of Word Mover's Distance that decomposes document level WMD into word level WMD for interpretable sociocultural NLP.
montreal-finance-2022
Code for 'High rises and housing stress: A spatial big-data analysis of rental housing financialization'
maybemkl's Repositories
maybemkl/wmdecompose
A Python implementation of Word Mover's Distance that decomposes document level WMD into word level WMD for interpretable sociocultural NLP.
maybemkl/airbnb-place-ner
Data and models to extract toponyms and spatio-temporal entities from text data.
maybemkl/DigitalMethods_presentation2017
Presentation in the working group on 'The Digitalization of Societies & Methods' at the Finnish Sociology Days. Scraping & mapping data from Twitter and Facebook with R, Python, Open Refine & CartoDB.
maybemkl/CSS_Programming
Python scripts written during the course "Programming in Social Science" at the University of Helsinki.
maybemkl/GIS_tweets
This is my final project for a GIS class at Columbia, exploring the relationship between tweeting and gentrification.
maybemkl/maybemkl
minä minä minä
maybemkl/maybemkl.github.io
GitHub Pages
maybemkl/MTA-Uber-Analytics
Analytics web app in Shiny using data from Uber and MTA.
maybemkl/muxViz
Analysis and Visualization of Interconnected Multilayer Networks
maybemkl/NLP-bits
Information theory in NLP for people without a Math background
maybemkl/NLP-Keras-Text-Summarizer
Text summariser written using Keras and TensorFlow for the NLP class of Professor Kathy McKeown at Columbia University.
maybemkl/NLP-Sentiment-Analysis-Keras
A very rudimentary homework assignment in using Keras for Sentiment Analysis. Part of the NLP class at the Columbia DSI.
maybemkl/NLP-Tweet-Classifier
Some Python scripts for classifying tweets as either Democratic or Republican. Homework assignment in the NLP class by Kathy Mckeown at Columbia Uni.
maybemkl/QMSS-Data-Viz
Homework assignments for the data visualization class at the Columbia University QMSS program.
maybemkl/QMSS-G5072-Modern-Data-Structures
course material
maybemkl/QMSS-Social-Nets
Lab assignments for the Social Network Analysis class at the Columbia QMSS program in the spring of 2018.
maybemkl/rs-mtl-atc
maybemkl/transformers_ner
Add CRF or LSTM+CRF for huggingface transformers bert to perform better on NER task. It is very simple to use and very convenient to customize
maybemkl/Tsoha-Bootstrap
Tietokantasovellus-kurssin aloituspaketti
maybemkl/TweetsTopicModelsGIS
This data mining project for Columbia uses LDA topic modelling to map tweets in New York.
maybemkl/WebPalvelinohjelmointi2016
maybemkl/wmd-relax
Calculates Word Mover's Distance Insanely Fast