/Guttenberg

A bot, searching for plagiarism on Stack Overflow.

Primary LanguageJavaGNU General Public License v3.0GPL-3.0

Actions Status

What is Guttenberg?

Guttenberg is a bot that searches for plagiarism or duplicated answers on Stack Overflow. It's currently running in SOBotics under the user Guttenberg.

Implementation

Every 60 seconds, Guttenberg fetches the most recent answers (the "targets") on Stack Overflow. For each of these answers, possibly related posts (for example answers to related questions) are collected. All those posts will be checked for different characteristics (such as the Jaro-Winkler distance of the posts). If at least one of the characteristics meets the requirements, a message like this will be posted in chat:

sample chat message

What to do with the reports

In this file, we collected information on how we react to Guttenberg's reports such as comments we leave or if and how we flag.

Accuracy

We are already collecting data with CopyPastor to provide statistics, but since there are not that many posts to report, it will take a while until we have enough data.