Benzhaomin/bdiefinder

Parse hardwareluxx.de forums to update SKUs

Closed this issue · 0 comments

Building the list wasn't the funniest thing to do already but updating it manually based on a forum post with no diff over time is even worse.

https://www.hardwareluxx.de/community/f13/die-ultimative-hardwareluxx-samsung-8gb-b-die-liste-alle-hersteller-02-03-19-a-1161530.html

  1. parse the page whole page -> find the first post -> find tables -> select rows with 'ja' in the third column -> output them as a list of json objects {'sku', 'specs', 'b-die', 'screenshot', 'comment'}.

  2. diff that list against the current one in data.js

  3. add missing SKUs and make sure the parsers from parsers.js work on them