html-parser

There are 428 repositories under the html-parser topic.

  • fb55/htmlparser2

    The fast & forgiving HTML and XML parser

    Language:TypeScript4.5k51303382
  • oblac/jodd

    Jodd! Lightweight. Java. Zero dependencies. Use what you like.

    Language:Java4.1k262515723
  • posthtml/posthtml

    PostHTML is a tool to transform HTML/XML with JS plugins

    Language:JavaScript2.9k50137116
  • jsdf/react-native-htmlview

    A React Native component which renders HTML content as native views

    Language:JavaScript2.7k46238464
  • zzzprojects/html-agility-pack

    Html Agility Pack (HAP) is a free and open-source HTML parser written in C# to read/write DOM and supports plain XPATH or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.

    Language:C#2.7k81493381
  • tid-kijyun/Kanna

    Kanna(鉋) is an XML/HTML parser for Swift.

    Language:Swift2.4k45168221
  • Imangazaliev/DiDOM

    Simple and fast HTML and XML parser

    Language:PHP2.2k85176203
  • philss/floki

    Floki is a simple HTML parser that enables search for nodes using CSS selectors.

    Language:Elixir2.1k26177155
  • Sub6Resources/flutter_html

    A Flutter widget for rendering static html as Flutter widgets (Will render over 80 different html tags!)

    Language:Dart1.8k301.1k889
  • lexborisov/myhtml

    Fast C/C++ HTML 5 Parser. Using threads.

    Language:C1.7k90136148
  • psharanda/Atributika

    Convert text with HTML tags, links, hashtags, mentions into NSAttributedString. Make them clickable with UILabel drop-in replacement.

    Language:Swift1.5k24141156
  • yorickpeterse/oga

    Oga is an XML/HTML parser written in Ruby.

    Language:Ruby1.2k3415139
  • cezheng/Fuzi

    A fast & lightweight XML & HTML parser in Swift with XPath & CSS support

    Language:Swift1.1k3479162
  • voku/simple_html_dom

    📜 Modern Simple HTML DOM Parser for PHP

    Language:PHP8762065117
  • skrapeit/skrape.it

    A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.

    Language:Kotlin8221416164
  • antchfx/htmlquery

    htmlquery is a Go XPath package for querying HTML.

    Language:Go751115975
  • miso-belica/jusText

    Heuristic based boilerplate removal tool

    Language:Python742212982
  • lexborisov/Modest

    Modest is a fast HTML renderer implemented as a pure C99 library with no outside dependencies.

    Language:C738404265
  • clj-commons/hickory

    HTML as data

    Language:Clojure645174752
  • rajatomar788/pywebcopy

    Locally saves webpages to your hard disk with images, css, js & links as is.

    Language:Python564888111
  • bupt1987/html-parser

    PHP HTML parser, similar to PHP Simple HTML DOM Parser but several times faster

    Language:PHP5253521153
  • zhegexiaohuozi/JsoupXpath

    An HTML parser implemented in pure Java that supports the W3C XPath 1.0 standard syntax, based on Jsoup and Antlr4. Maybe it is the best in Java. Just try it.

    Language:HTML4542161155
  • b-fuze/deno-dom

    Browser DOM & HTML parser in Deno

    Language:HTML432911845
  • MohamedRejeb/Ksoup

    Ksoup is a lightweight Kotlin Multiplatform library for parsing HTML, extracting HTML tags, attributes, and text, and encoding and decoding HTML entities.

    Language:Kotlin39592310
  • duzun/hQuery.php

    An extremely fast web scraper that parses megabytes of invalid HTML in a blink of an eye. PHP5.3+, no dependencies.

    Language:PHP361246774
  • ZhgChgLi/ZMarkupParser

    ZMarkupParser is a pure-Swift library that helps you convert HTML strings into NSAttributedString with customized styles and tags.

    Language:Swift32383027
  • Prettyhtml/prettyhtml

    💅 The formatter for the modern web https://prettyhtml.netlify.com/

    Language:JavaScript28439521
  • csonchen/wxParse

    Rich text parsing for WeChat Mini Programs

    Language:JavaScript27873741
  • olamedia/nokogiri

    HTML parser for PHP

    Language:PHP229262264
  • ispras/dedoc

    Dedoc is a library (service) for automated document parsing and conversion to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser

    Language:Python201122222
  • Hexilee/unhtml.rs

    A magic html parser

    Language:Rust194586
  • acrazing/html5parser

    A super tiny and fast html5 AST parser.

    Language:TypeScript18141426
  • alphanome-ai/sec-parser

    Parse SEC EDGAR HTML documents into a tree of elements that correspond to the visual (semantic) structure of the document.

    Language:Python17484251
  • justinwilaby/sax-wasm

    The first streamable, fixed memory XML, HTML, and JSX parser for WebAssembly.

    Language:TypeScript1694327
  • Swaagie/minimize

    Minimize HTML

    Language:JavaScript16287318
  • AutoCSer/AutoCSer

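    AutoCSer is a high-performance RPC framework. It is an integrated development framework oriented toward high efficiency. It mainly comprises a series of seamlessly integrated high-performance components, including a TCP interface service framework, a TCP function service framework, a remote expression chain component, an integrated front-end/back-end web view framework, an ORM in-memory index cache framework, a log-stream in-memory database cache component, a message queue component, and binary/JSON/XML data serialization.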

    Language:C#15715055