/Audiences-External_automatic_translation

Code of the external automatic translation services notebook: https://analytics.wikimedia.org/datasets/external-automatic-translation/impact%20of%20external%20automatic%20translation%20services.html

Primary LanguageJupyter Notebook

Measure the impact of external automatic translation services

This is the repository of the notebook "Measure the impact of external automatic translation services ". This notebook is currently updating daily at 2AM UTC on notebook1004 -- cron job:

0 2 * * * /home/<username>/external-automatic-translation/update_publish_notebook.sh

Please remember to add http proxy variables before running the notebook. And check the /home/<username>/external-automatic-translation/notebook_update.log file if you run into bugs.

The toledo_pageviews oozie job folder contains code used to update the toledo_pageviews table on HDFS. It filters and aggregates data from wmf.webrequest on a daily basis to get the pageviews from external automatic translation services. Please refer to wikitech about how to deploy an oozie job.

If changing ownership is needed, please see T228195 for more instructions.

If you have any comments or questions, please leave your feedback in the ticket: https://phabricator.wikimedia.org/T212414