jgehrcke/github-repo-stats

Possible discrepancy between actual unique cloners and report data

litalmason opened this issue · 1 comment

Hello and thank you for your work!
I followed your tutorial and ran the Action for the first time.
Then I compared the report data with my repo's Traffic page.
The number of unique cloners in the report is higher than the number shown on my repo's Traffic page:

image

Any idea why?
Thanks.

I believe this comes from how the GitHub API reports the data: it returns a number of unique cloners per day, but a cloner who is unique on one day is not necessarily unique across more than one day. The same cloner can be counted again on each day they clone.
So if you sum the per-day "unique cloner" counts that the API returns, the result is usually higher than the cumulative count GitHub displays on the "Traffic" page. I suspect that GitHub stores cloner IDs and can tell, over the course of 14 days, which cloners are actually unique.
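
To illustrate the difference, here is a minimal Python sketch. The dates and cloner names are made up for the example (as far as I know, GitHub does not expose cloner identities), and this is not how github-repo-stats or GitHub computes anything. It only shows why a sum of per-day unique counts can exceed the unique count over the whole window.

```python
# Hypothetical clone log: day -> cloner identifiers seen on that day.
# These identifiers and dates are invented for illustration only.
clones_by_day = {
    "2021-03-01": ["alice", "bob", "alice"],
    "2021-03-02": ["alice", "carol"],
    "2021-03-03": ["bob"],
}

# Unique cloners per day, comparable to a per-day "uniques" value.
daily_uniques = {day: len(set(ids)) for day, ids in clones_by_day.items()}

# Summing the per-day values counts a returning cloner once per day.
summed_uniques = sum(daily_uniques.values())

# Deduplicating across the whole window counts each cloner exactly once.
window_uniques = len(set().union(*clones_by_day.values()))

print("per-day uniques:", daily_uniques)
print("sum of per-day uniques:", summed_uniques)
print("uniques across the window:", window_uniques)
```

In this made-up data, "alice" and "bob" each clone on two different days, so they contribute twice to the summed value but only once to the window-wide value.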