cathalgarvey
I have archived and moved all my non-professional work to Gitlab: https://gitlab.com/cathalgarvey
@scrapinghubIreland
Pinned Repositories
deadlock
Python implementation of minilock.io, an encryption utility for sharing files privately. (MOVED to Gitlab)
fmtless
A toolkit for replacing fmt's output funcs, plus fmt-free stdlib replacements (MOVED to Gitlab)
go-minilock
The minilock file encryption system, ported to pure Golang. Includes CLI utilities.
go-termux
Termux-API layer ported to a Go library; write pseudo-apps for Android in pure Go with Termux/API/Widget!
listless
A monolithic, lua-scripted discussion list engine over IMAP/SMTP (MOVED to Gitlab)
pqgrams
The PQ-Gram algorithm for approximating tree edit distance, in Rust, with generic interfaces.
pyqgrams
PQ-Grams in Python, with the heavy lifting in Rust (still WIP)
PySplicer
Evidence-based Gene Optimisation (MOVED to Gitlab)
sqrape
Simple Query Scraping with CSS and Go Reflection (MOVED to Gitlab)
tinystatus
A peer to peer microstatus system written in 30 lines of pure python. (MOVED to Gitlab)
cathalgarvey's Repositories
cathalgarvey/whatlang-py
Simple bindings to the whatlang Rust package
cathalgarvey/pyqgrams
PQ-Grams in Python, with the heavy lifting in Rust (still WIP)
cathalgarvey/pqgrams
The PQ-Gram algorithm for approximating tree edit distance, in Rust, with generic interfaces.
cathalgarvey/req2vec
Data collection and SKLearn pipeline transformers for Scrapy projects
cathalgarvey/gzlines
A small Go helper-library for iterating lines from one or more Gzipped files
cathalgarvey/page_clustering
A simple algorithm for clustering web pages, suitable for crawlers
cathalgarvey/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
cathalgarvey/blackburn-mod
My tracker-free modification of the Blackburn theme for Hugo
cathalgarvey/docker.alpine.scipy
A docker image for data science based in alpine base image
cathalgarvey/extruct
Extract embedded metadata from HTML markup
cathalgarvey/mdr
A python library detect and extract listing data from HTML page.
cathalgarvey/scraperscript
A bookmarklet that helps you find unique selectors for page elements.
cathalgarvey/vcardgen
A simple vcard generation system for Go.
cathalgarvey/aoc-2017-elixir
Advent of Code 2017 in Elixir, for fun
cathalgarvey/bazel
Dockerfile for google bazel build system
cathalgarvey/borntyping
cathalgarvey/databrewer
The missing datasets manager.
cathalgarvey/elefren
It's like Mastodon.py, but for Rust (fork of https://github.com/Aaronepower/mammut)
cathalgarvey/greenglas
Machine Intelligence Preprocessing Framework
cathalgarvey/marco-polo-mb
Marco-Polo Game for the BBC Micro:bit
cathalgarvey/ocl
OpenCL for Rust
cathalgarvey/offlineimap
Read/sync your IMAP mailboxes (python2)
cathalgarvey/PyGram
An efficient approximation for tree edit-distance.
cathalgarvey/python-gsoc.github.io
Website and ideas page for Python's Google Summer of Code efforts
cathalgarvey/scantailor-advanced
ScanTailor Advanced is the version that merges the features of the ScanTailor Featured and ScanTailor Enhanced versions, brings new ones and fixes.
cathalgarvey/scrapinghub-autoextract
Python clients for Scrapinghub AutoExtract API
cathalgarvey/shub
Scrapinghub Command Line Client
cathalgarvey/sickle
Sickle: OAI-PMH for Humans
cathalgarvey/udpe506e
eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee over UDP
cathalgarvey/zigbee
Database of Zigbee devices compatible with third party gateways: ZHA, deCONZ, Zigbee2MQTT, Tasmota, ZiGate, ioBroker,