html-parsing
There are 115 repositories under the html-parsing topic.
PuerkitoBio/goquery
A little like that j-thing, only in Go.
inikulin/parse5
HTML parsing/serialization toolset for Node.js. WHATWG HTML Living Standard (aka HTML5)-compliant.
milesj/interweave
🌀 React library to safely render HTML, filter attributes, autowrap text with matchers, render emoji characters, and much more.
cezheng/Fuzi
A fast & lightweight XML & HTML parser in Swift with XPath & CSS support
ruippeixotog/scala-scraper
A Scala library for scraping content from HTML pages
miso-belica/jusText
Heuristic-based boilerplate removal tool
bookieio/breadability
Reworked https://www.readability.com/ parsing library (https://mercury.postlight.com/ is now the living alternative)
adbar/htmldate
Fast and robust date extraction from web pages, with Python or on the command-line
themm1/procyclingstats
procyclingstats scraper
ange007/HTMLp
Delphi DOM HTML parser and converter. Fork (not from the original author): https://sourceforge.net/projects/htmlp/
petdance/htmlparsing
htmlparsing.com, a website devoted to helping people parse HTML correctly
digitalfondue/jfiveparse
A Java HTML5-compliant parser
liuderchi/ide-html
Atom-IDE for HTML, Go Template, Mustache and other Templates
MauriceConrad/XML-Parser
A Node.js XML DOM, Parser & Stringifier.
whimtrip/jwht-scrapper
Fully featured Java scraping framework, highly pluggable and customizable
julleboi/fast-wasm-scraper
Faster HTML scraper with WebAssembly
shabanali-faghani/IUST-HTMLCharDet
A Java tool for detecting the charset encoding of HTML web pages
fefit/rphtml
An HTML parser written in Rust that parses HTML into node trees.
whimtrip/jwht-htmltopojo
Fully featured, highly pluggable and customizable Java HTML-to-POJO converter.
ktodorov/go-summarizer
Summarizes text and websites and optionally saves the data to a local file
mohaxspb/ScpFoundationRu
Source code for the SCP Foundation app: https://play.google.com/store/apps/details?id=ru.dante.scpfoundation
peterhil/slurp
BeautifulSoup4 packaged into a command line tool
siongui/go-facebook-post-parser
Web-scrape a Facebook post and extract data
patmull/disaster-warning-system-scripts
Scripts for an open-source early-warning system for natural disasters: CAP (Common Alerting Protocol) XML alert parsing, HTML parsing, inserting new alerts into a database, and notifications via OneSignal (possible Android and iOS push notifications), Twitter, Facebook, and MailChimp (e-mail notifications).
raymccrae/swift-htmlsaxparser
Swift wrapper around the libxml2 HTML parser that provides SAX-style HTML parsing
bradmontgomery/django-janitor
django-janitor allows you to use bleach to clean HTML stored in a Model's field.
emmanuelroecker/php-simply-html
Add, delete, modify, and get HTML tags, text, and links using CSS selectors
hrbrmstr/drill-html-tools
Apache Drill UDFs for retrieving and working with HTML text
imingyu/forgiving-xml-parser
An XML/HTML parser and serializer for JavaScript.
ubbeg2000/pars
A simple package for parsing HTML files into DOM trees
brianary/SelectHtml
A PowerShell module for extracting data from HTML using XPath
decal/cgiaudit
📦 General-purpose, "black box" CGI auditing tool (ARCHIVE)
kan01234/ur-web-spider
Web spider that scans for available UR rooms and outputs the results as CSV
LylaCoding/Website-Subpage-Scraper
This Python script scrapes internal links on a webpage. It prompts for a URL, sends a GET request to retrieve the HTML, and uses BeautifulSoup to parse and filter the links. It then prompts the user for an output mode (terminal or file) to either print or write the links. It installs the required modules (requests and beautifulsoup4) if they are not found.
rsharifnasab/telegram_export_analyzer
This script analyzes the number of Telegram messages by time
sidward35/splunk-messenger
Get insights into your Facebook Messenger activity with Splunk