xpath

There are 792 repositories under xpath topic.

influxdata/telegraf
Agent for collecting, processing, aggregating, and writing metrics, logs, and other arbitrary data.
Language:Go16.5k 298 8.4k5.7k
jhy/jsoup
jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety.
Language:Java11.3k 387 1.6k2.3k
ssssssss-team/spider-flow
新一代爬虫平台，以图形化方式定义爬虫流程，不写代码即可完成爬虫。
Language:Java11.1k 98 442.1k
D4Vinci/Scrapling
🕷️ An undetectable, powerful, flexible, high-performance Python library to make Web Scraping Easy and Effortless as it should be!
Language:Python8.1k 45 35463
zeux/pugixml
Light-weight, simple and fast XML parser for C++ with XPath support
Language:C++4.4k 148 384781
zzzprojects/html-agility-pack
Html Agility Pack (HAP) is a free and open-source HTML parser written in C# to read/write DOM and supports plain XPATH or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.
Language:C#2.8k 79 509397
mattt/Ono
A sensible way to deal with XML & HTML for iOS & macOS
Language:Objective-C2.6k 53 38197
Imangazaliev/DiDOM
Simple and fast HTML and XML parser
Language:PHP2.2k 86 180202
beevik/etree
Parse, query and modify XML easily in go
Language:Go1.6k 22 96183
scrapy/parsel
Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors
Language:Python1.3k 33 125155
seveniruby/AppCrawler
基于appium的app自动遍历工具
Language:Scala1.2k 81 40475
cezheng/Fuzi
A fast & lightweight XML & HTML parser in Swift with XPath & CSS support
Language:Swift1.1k 33 80166
xingag/spider_python
python爬虫
Language:Python1.1k 33 8458
sibprogrammer/xq
Command-line XML and HTML beautifier and content extractor
Language:Go1k 9 5933
hbi99/defiant.js
http://defiantjs.com
Language:JavaScript914 33 10489
benibela/xidel
Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
Language:Pascal817 27 11946
lb2281075105/Python-Spider
豆瓣电影top250、斗鱼爬取json数据以及爬取美女图片、淘宝、有缘、CrawlSpider爬取红娘网相亲人的部分基本信息以及红娘网分布式爬取和存储redis、爬虫小demo、Selenium、爬取多点、django开发接口、爬取有缘网信息、模拟知乎登录、模拟github登录、模拟图虫网登录、爬取多点商城整站数据、爬取微信公众号历史文章、爬取微信群或者微信好友分享的文章、itchat监听指定微信公众号分享的文章
Language:Python785 53 0275
antchfx/htmlquery
htmlquery is golang XPath package for HTML query.
Language:Go771 10 6079
BaseXdb/basex
BaseX Main Repository.
Language:Java733 57 1.8k273
antchfx/xpath
XPath package for golang, supports HTML, XML, JSON document query and more
Language:Go728 11 8194
h2non/jsonpath-ng
Finally, a JSONPath implementation for Python that aims to be standard compliant. That's all. Enjoy!
Language:Python704 11 121103
tuananh/camaro
camaro is a Node.js library that transform XML to JSON, using Node.js binding to native XML parser pugixml, one of the fastest XML parser around.
Language:JavaScript567 7 6833
antchfx/xmlquery
xmlquery is Golang XPath package for XML query.
Language:Go479 3 7796
zhegexiaohuozi/JsoupXpath
纯Java实现的支持W3C Xpath 1.0标准语法的HTML解析器。A html parser with xpath base on Jsoup and Antlr4. Maybe it is the best in java.Just try it.
Language:HTML460 21 61153
ozanmakes/wring
Extract content from webpages using CSS Selectors, XPath, and JS expressions
Language:PureScript457 19 215
eXist-db/exist
eXist Native XML Database and Application Platform
Language:Java456 54 1.7k189
FearlessPeople/xianyu_spider
闲鱼APP数据爬虫（废弃项目）
Language:Python453 5 395
roniemartinez/dude
dude uncomplicated data extraction: A simple framework for writing web scrapers using Python decorators
Language:Python428 8 4619
fivefilters/ftr-site-config
Site-specific article extraction rules to aid content extractors, feed readers, and 'read later' applications.
414 13 79279
smuyyh/CrawlerForReader
Android 本地网络小说爬虫，基于jsoup及xpath
Language:Java403 21 3137
kbrw/sweet_xml
Language:Elixir373 11 5261
serpapi/nokolexbor
High-performance HTML5 parser for Ruby based on Lexbor, with support for both CSS selectors and XPath.
Language:C348 14 146
ThomasWeinert/FluentDOM
A fluent api for working with XML in PHP
Language:PHP338 14 8620
mischov/meeseeks
An Elixir library for parsing and extracting data from HTML and XML with CSS or XPath selectors.
Language:Elixir323 8 6026
jpjacobpadilla/Stealth-Requests
Undetected web-scraping & seamless HTML parsing in Python!
Language:Python311 4 417
antchfx/jsonquery
JSON xpath query for Go. Golang XPath query for JSON query.
Language:Go274 7 1231

xpath

influxdata/telegraf

jhy/jsoup

ssssssss-team/spider-flow

D4Vinci/Scrapling

zeux/pugixml

zzzprojects/html-agility-pack

mattt/Ono

Imangazaliev/DiDOM

beevik/etree

scrapy/parsel

seveniruby/AppCrawler

cezheng/Fuzi

xingag/spider_python

sibprogrammer/xq

hbi99/defiant.js

benibela/xidel

lb2281075105/Python-Spider

antchfx/htmlquery

BaseXdb/basex

antchfx/xpath

h2non/jsonpath-ng

tuananh/camaro

antchfx/xmlquery

zhegexiaohuozi/JsoupXpath

ozanmakes/wring

eXist-db/exist

FearlessPeople/xianyu_spider

roniemartinez/dude

fivefilters/ftr-site-config

smuyyh/CrawlerForReader

kbrw/sweet_xml

serpapi/nokolexbor

ThomasWeinert/FluentDOM

mischov/meeseeks

jpjacobpadilla/Stealth-Requests

antchfx/jsonquery