The main goal of this project is to generate ePubs or PDFs given an URL, by scraping its content. A secondary goal is to get familiar with the Scala language and ecosystem.
- Find suitable dependencies
- Define simple CLI interface.
- Define app skeleton.
- Define simple scraper functionality.
- Async scraping
- Abstract into configurable scrapers: the idea is to be able to define different scrapers for different (types of) website.
- Generate a PDF/ePub.
Get dependencies from Scala Library Index or Maven Central repository (Java libraries).
Copypaste dependency string into build.sbt
. Example:
libraryDependencies += "org.jsoup" % "jsoup" % "1.11.2"
Caveat: Don't forget to add trailing commas after adding a new dependency.