/gazettes

Primary LanguagePythonBSD 2-Clause "Simplified" LicenseBSD-2-Clause

#parsing gov gazettes: crawler + parser note: pilot version for Nat. Reg.

TO DO

  • more specific dict for outline
  • Issn sometimes is undetected (image?)- adjust for that
  • rewrite using classes and objects: class per gazette type
  • switch to PostgreSQL