This repository is an implementation of "DOM Based Content Extraction via Text Density". It has been tested on Korean web pages.
Primary language: Go. License: MIT.