Pinned Repositories
ForecastGA
A Python tool to forecast Google Analytics data using several popular time series models.
ghost-material
Materialize Theme For Ghost.js
glove-to-word2vec
Converting GloVe vectors into word2vec format for easy usage with Gensim
gsc-logger
Google Search Console Logger for Google App Engine
iCodeSEO
Repo for Content for iCodeSEO.dev
NodeRank
Content Extraction using the PageRank algorithm to find the element containing the best content.
querycat
A Sample repo using the Apriori and FP Growth algorithms to produce categories for queries, and BERT for PoP change visualization.
screaming-frog-shingling
Uses Screaming Frog Internal HTML with text extraction along with a shingling algorithm to compare content duplication across the pages of a crawled site.
tech-seo-crawler
Build a small, 3 domain internet using Github pages and Wikipedia and construct a crawler to crawl, render, and index.
jroakes's Repositories
jroakes/gsc-logger
Google Search Console Logger for Google App Engine
jroakes/screaming-frog-shingling
Uses Screaming Frog Internal HTML with text extraction along with a shingling algorithm to compare content duplication across the pages of a crawled site.
jroakes/glove-to-word2vec
Converting GloVe vectors into word2vec format for easy usage with Gensim
jroakes/NodeRank
Content Extraction using the PageRank algorithm to find the element containing the best content.
jroakes/CrUX-Queries
jroakes/Unsupervised-Sentence-Summarization
Unsupervised sentence summarization by contextual matching
jroakes/OpenNMT
Open-Source Neural Machine Translation in Torch
jroakes/page-analytics-to-csv
Download Google Page Analytics to CSV
jroakes/search-engine-ranking
📊 Repository for the study on 11.8 Million Google Search Results
jroakes/stat-python-beam-dataflow-cron
Python Apache Beam pipeline for Stat running in Google DataFlow using CRON scheduler on Google App Engine
jroakes/content-codeseo
Content CodeSEO
jroakes/ecs-fargate-taskqueue
Uses AWS Lambda and Fargate for exposing an API for long running tasks.
jroakes/fortune500
Fortune 500 company lists since 1955 in CSV format, mostly parsed using Beautiful Soup
jroakes/text
Data loaders and abstractions for text and NLP
jroakes/cosr-back
Backend of Common Search. Analyses webpages and sends them to the index.
jroakes/Humour.ai-Language-model-that-can-crack-Jokes
Language Model that makes you Laugh .
jroakes/jekyll-TeXt-theme
💎 🐳 A super customizable Jekyll theme for personal site, team site, blog, project, documentation, etc.
jroakes/LICENSE.md
Markdown formatted software licenses.
jroakes/MC-BERT
jroakes/nlg
jroakes/outlier-detect
Code that implements the novel outlier detection algorithms from my Ph.D. dissertation.
jroakes/pegasus
jroakes/preclick-preload
jroakes/pywren-ibm-cloud
PyWren for IBM Cloud Functions and IBM Cloud Object Storage
jroakes/reinvent_bot
jroakes/serverless-chrome
Run headless Chrome/Chromium on AWS Lambda (maybe Azure, & GCP later)
jroakes/StatAPIDataFlow
jroakes/Summarization-Lab
jroakes/tensorflow
Computation using data flow graphs for scalable machine learning
jroakes/twaudit
Twitter auditing