xpath

There are 792 repositories under xpath topic.

  • influxdata/telegraf

    Agent for collecting, processing, aggregating, and writing metrics, logs, and other arbitrary data.

    Language:Go16.5k2988.4k5.7k
  • jsoup

    jhy/jsoup

    jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety.

    Language:Java11.3k3871.6k2.3k
  • ssssssss-team/spider-flow

    新一代爬虫平台,以图形化方式定义爬虫流程,不写代码即可完成爬虫。

    Language:Java11.1k98442.1k
  • Scrapling

    D4Vinci/Scrapling

    🕷️ An undetectable, powerful, flexible, high-performance Python library to make Web Scraping Easy and Effortless as it should be!

    Language:Python8.1k4535463
  • zeux/pugixml

    Light-weight, simple and fast XML parser for C++ with XPath support

    Language:C++4.4k148384781
  • zzzprojects/html-agility-pack

    Html Agility Pack (HAP) is a free and open-source HTML parser written in C# to read/write DOM and supports plain XPATH or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.

    Language:C#2.8k79509397
  • mattt/Ono

    A sensible way to deal with XML & HTML for iOS & macOS

    Language:Objective-C2.6k5338197
  • Imangazaliev/DiDOM

    Simple and fast HTML and XML parser

    Language:PHP2.2k86180202
  • beevik/etree

    Parse, query and modify XML easily in go

    Language:Go1.6k2296183
  • scrapy/parsel

    Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors

    Language:Python1.3k33125155
  • seveniruby/AppCrawler

    基于appium的app自动遍历工具

    Language:Scala1.2k8140475
  • cezheng/Fuzi

    A fast & lightweight XML & HTML parser in Swift with XPath & CSS support

    Language:Swift1.1k3380166
  • xingag/spider_python

    python爬虫

    Language:Python1.1k338458
  • sibprogrammer/xq

    Command-line XML and HTML beautifier and content extractor

    Language:Go1k95933
  • hbi99/defiant.js

    http://defiantjs.com

    Language:JavaScript9143310489
  • benibela/xidel

    Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.

    Language:Pascal8172711946
  • lb2281075105/Python-Spider

    豆瓣电影top250、斗鱼爬取json数据以及爬取美女图片、淘宝、有缘、CrawlSpider爬取红娘网相亲人的部分基本信息以及红娘网分布式爬取和存储redis、爬虫小demo、Selenium、爬取多点、django开发接口、爬取有缘网信息、模拟知乎登录、模拟github登录、模拟图虫网登录、爬取多点商城整站数据、爬取微信公众号历史文章、爬取微信群或者微信好友分享的文章、itchat监听指定微信公众号分享的文章

    Language:Python785530275
  • antchfx/htmlquery

    htmlquery is golang XPath package for HTML query.

    Language:Go771106079
  • BaseXdb/basex

    BaseX Main Repository.

    Language:Java733571.8k273
  • antchfx/xpath

    XPath package for golang, supports HTML, XML, JSON document query and more

    Language:Go728118194
  • h2non/jsonpath-ng

    Finally, a JSONPath implementation for Python that aims to be standard compliant. That's all. Enjoy!

    Language:Python70411121103
  • tuananh/camaro

    camaro is a Node.js library that transform XML to JSON, using Node.js binding to native XML parser pugixml, one of the fastest XML parser around.

    Language:JavaScript56776833
  • antchfx/xmlquery

    xmlquery is Golang XPath package for XML query.

    Language:Go47937796
  • zhegexiaohuozi/JsoupXpath

    纯Java实现的支持W3C Xpath 1.0标准语法的HTML解析器。A html parser with xpath base on Jsoup and Antlr4. Maybe it is the best in java.Just try it.

    Language:HTML4602161153
  • ozanmakes/wring

    Extract content from webpages using CSS Selectors, XPath, and JS expressions

    Language:PureScript45719215
  • eXist-db/exist

    eXist Native XML Database and Application Platform

    Language:Java456541.7k189
  • FearlessPeople/xianyu_spider

    闲鱼APP数据爬虫(废弃项目)

    Language:Python4535395
  • roniemartinez/dude

    dude uncomplicated data extraction: A simple framework for writing web scrapers using Python decorators

    Language:Python42884619
  • fivefilters/ftr-site-config

    Site-specific article extraction rules to aid content extractors, feed readers, and 'read later' applications.

  • smuyyh/CrawlerForReader

    Android 本地网络小说爬虫,基于jsoup及xpath

    Language:Java403213137
  • kbrw/sweet_xml

    Language:Elixir373115261
  • nokolexbor

    serpapi/nokolexbor

    High-performance HTML5 parser for Ruby based on Lexbor, with support for both CSS selectors and XPath.

    Language:C34814146
  • ThomasWeinert/FluentDOM

    A fluent api for working with XML in PHP

    Language:PHP338148620
  • mischov/meeseeks

    An Elixir library for parsing and extracting data from HTML and XML with CSS or XPath selectors.

    Language:Elixir32386026
  • jpjacobpadilla/Stealth-Requests

    Undetected web-scraping & seamless HTML parsing in Python!

    Language:Python3114417
  • antchfx/jsonquery

    JSON xpath query for Go. Golang XPath query for JSON query.

    Language:Go27471231