/cx-extractor-python

基于行块分布函数的通用网页正文抽取算法的Python版本实现,添加了英文支持/ Web page content extraction algorithm, support both Chinese and English

Primary LanguageHTMLMIT LicenseMIT

Stargazers