Pinned Repositories
are-the-bots-really-fighting
A research project exploring revert patterns between bots on Wikipedia.
deep_merge
deltas
A library for generating deltas of the difference between two sequences of tokens.
multiquery
Runs a single SQL query against a set of databases concatenates the results together.
python-mwapi
Simple Python Wrapper around MediaWiki API
python-mwxml
A set of utilities for processing MediaWiki XML dump data.
editquality
Github mirror - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing)
ores
Github mirror - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing)
revscoring
A generic, machine learning-based revision scoring system for MediaWiki
halfak's Repositories
halfak/are-the-bots-really-fighting
A research project exploring revert patterns between bots on Wikipedia.
halfak/deltas
A library for generating deltas of the difference between two sequences of tokens.
halfak/deep_merge
halfak/ores-paper
A paper about the ORES ML container system and the intervention it provides in Wikipedia
halfak/python-jsonable
Provides an abstract base class and utilities for defining trivially JSONable python objects
halfak/snuggle
halfak/yamlconf
This library provides a means to read yaml configuration files and propagate default values in reasonable ways. Nothing complicated.
halfak/ores-demos
A collection of demos for demonstrating the use of ORES
halfak/python-para
a set utilities that ake advantage of python's `multiprocessing` module to distribute CPU-intensive tasks
halfak/associate-engineer-2019
An engineering task for applicants to the WMF associate engineer position.
halfak/bayes-seg
Java code from the 2008 EMNLP paper "Bayesian Unsupervised Topic Segmentation" by Eisenstein and Barzilay
halfak/damaging-goodfaith-overlap
A study exploring the overlap between damaging and goodfaith predictions for ORES
halfak/demo_shared_memory
halfak/engineering-task-sp2019
halfak/enwikivoyage-creations
halfak/flask-mwoauth
Flask blueprint to connect to a MediaWiki OAuth server
halfak/flesch_complexity
halfak/gadgets-ArticleQuality
A mediawiki gadget for displaying ORES article quality and item quality information.
halfak/halfak.github.io
Website for halfaker.info (WIP)
halfak/HHVM-newcomer-engagement-experiment
We hypothesize that the type of performance improvement that HHVM provides will result in improved engagement of editors in Wikipedia. In this study we'll specifically focus on new editor engagement.
halfak/keilana-effect
A paper describing an analysis of content coverage in Wikipedia
halfak/langchain
⚡ Building applications with LLMs through composability ⚡
halfak/literacy-and-community-paper
halfak/module-storage-research
halfak/nlwiki_articlequality
halfak/npp-analysis
Analysis of the new page patrolling backlog for English Wikipedia
halfak/taxonomy_examples
halfak/tensorflow-opencl
OpenCL support for TensorFlow
halfak/Wiki_Semantic_Intention
halfak/wikidata_usage_tracking
Code for Wikidata usage tracking