impresso/impresso-middle-layer

add timelines on /newspaper page

Closed this issue · 1 comments

Per year per newspaper:

contentItems

from SOLR:

  • count content items (solr)
  • count contentItems having OCR < 0.5 (solr range facet)

issues

from mysqlDB:

  • count issues
  • count corrupted archive (where issues.is_damaged = 1)

pages

from mysqlDB:

  • count pages
  • count empty pages (where pages.n_tokens = 0)
  • count corrupted pages (where pages.has_corrupted_json = 1)

service to get timeline of issues for the newspaper: http://localhost:3030/issues-timelines/stats?newspaper_uid=GDL