ncss-tech/SoilKnowledgeBase

Extending OSD parsing tools

brownag opened this issue · 2 comments

Strategic TODOs:

  • refactor code that relies on plyr and possibly some of the other dependencies (stringi, purrr, furrr, R.utils)
  • soil color prediction modeling, and model objects: where is the best place to do it / house the results
  • move soil color modeling evaluation to soil-color repository
  • get SC via soilDB::get_soilseries_from_NASIS() or saved copy in soilweb-data repo.
  • adapt fulltext input data to get_OSD source data, eventually move to SoilWeb or SoilWeb-data

Technical TODOs:

  • adapt TYPICAL PEDON parsing code to use JSON representation of OSDs

JSON format:

  • Return series names in title case? ncss-tech/soilDB#95keep as-is
  • Trim off section headers from content (they are already in section)keep as-is until collapsing and reordering sections into groups is removed; the only way to reliably deparse combined sections is if their headers are included

parseOSD Issues:

  • extract additional data section and save ncss-tech/parse-osd#13 opened on May 13, 2020 by dylanbeaudette
  • old-style O horizons not parsed correcly ncss-tech/parse-osd#12 opened on Feb 10, 2020 by dylanbeaudette
  • add testing suite ncss-tech/parse-osd#11 opened on May 29, 2019 by dylanbeaudette
  • texture, pH, drainage class parsing is too greedy ncss-tech/parse-osd#10 opened on May 29, 2019 by dylanbeaudette
  • combination horizons described with 'and' ncss-tech/parse-osd#9 opened on Dec 20, 2018 by dylanbeaudette'
  • fill missing colors based on nearby horizons ncss-tech/parse-osd#8 opened on Mar 14, 2018 by dylanbeaudette
  • figure out how to find OSDs edited since last parse date ncss-tech/parse-osd#5 opened on Sep 15, 2017 by dylanbeaudette
  • some brief narratives are missing (93) ncss-tech/parse-osd#4 opened on Sep 2, 2017 by dylanbeaudette

Related Changes in soilDB

ncss-tech/soilDB#186

Updates:

  • I'm almost done adapting all of the old code related to fulltext searching. Once that is done, I'll move the SoilWeb prep code / results over to the SoilWeb-data repository. This includes everything related to fetchOSD and the fulltext search.
  • I'm planning to move the soil color stuff to a different repository soon

Quick update, all of the code used to prep SoilWeb related data has been moved over to SoilWeb-data. Relying on OSDregistry and SKB to do all of the parsing work.