unitedstates/inspectors-general

Wrong dates for some reports?

Closed this issue · 1 comments

Scout is showing a bunch of IG reports dated in the future:
https://scout.sunlightfoundation.com/search/documents/*?documents%5Bdocument_type%5D=ig_report

Any ideas what's going on?
screen shot 2015-01-15 at 3 42 33 pm
screen shot 2015-01-15 at 3 42 26 pm
screen shot 2015-01-15 at 3 42 17 pm

It looks like this is a result of the code here

Basically, if we can't find when the report was published, we set it to November 1st of the year mentioned in the title. This is obviously suboptimal, but it wasn't clear that there was a better option. If you can find a better way to get a more accurate published date (in some of the later scrapers I think we checked pdf metadata), it would certainly be welcome.