You can see the app deployed on Heroku. For example:
- https://wikidossier.herokuapp.com/user/Riceissa
- https://wikidossier.herokuapp.com/usercompare?usernames=Vipul|Riceissa|Ethanbas&ns=0
sizediff.py
: This script uses the MediaWiki API to obtain revisions by username, and stores it as a TSV. The columns are "username", "ns" (namespace number), "timestamp", and "sizediff" (difference in bytes of the page before and after the edit).revisions.py
: This script uses the MediaWiki API to obtain revisions by page name and stores it as a TSV. The columns are "pagename", "timestamp", and "size".afd.py
: This script gathers some information from AfDs, most notably the voting information (a list of tuples (v, u), where v is the vote – keep, delete, merge, whatever – and u is the user that cast the vote).plot.py
: This script takes the data collected using the other scripts and plots them.wikidossier.py
: The main Flask app
To filter out "strange" page titles, I used
/[^\d9\d32-\d126“”‘’–—§éñãí¡áóâ°üčō×½êàëżıå]
:g//d
in Vim, to find characters not in a whitelist, and then deleting all lines with characters outside of that list. We can work out something better later.
CC0; see LICENSE
for the full text.