Top 81 xpath open source projects

Meeseeks
An Elixir library for parsing and extracting data from HTML and XML with CSS or XPath selectors.
Ono
A sensible way to deal with XML & HTML for iOS & macOS
Ftr Site Config
Site-specific article extraction rules to aid content extractors, feed readers, and 'read later' applications.
✭ 231
xpath
Pugixml
Light-weight, simple and fast XML parser for C++ with XPath support
Nokogiri
HTML parser for PHP - Парсер HTML
Xembly
Assembly for XML: imperative language to modify XML documents
✭ 212
javaxpath
Xmlquery
xmlquery is Golang XPath package for XML query.
Zson
专为测试人员打造的JSON解析器
Jquery Xpath
jQuery XPath plugin (with full XPath 2.0 language support)
Astpath
A command-line search utility for Python ASTs using XPath syntax.
Xquery
Extract data or evaluate value from HTML/XML documents using XPath
Html Agility Pack
Html Agility Pack (HAP) is a free and open-source HTML parser written in C# to read/write DOM and supports plain XPATH or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.
Goxpath
An XPath 1.0 implementation written in the Go programming language.
Xsltdev.ru
Справочник web-разработчика с примерами
Cssplus
CSSplus is a collection of CSS Reprocessor plugins that dynamically update CSS variables
Harser
Easy way for HTML parsing and building XPath
Jsonquery
jsonq package for Go. Golang XPath query for JSON query.
Docs
《数据采集从入门到放弃》源码。内容简介:爬虫介绍、就业情况、爬虫工程师面试题 ;HTTP协议介绍; Requests使用 ;解析器Xpath介绍; MongoDB与MySQL; 多线程爬虫; Scrapy介绍 ;Scrapy-redis介绍; 使用docker部署; 使用nomad管理docker集群; 使用EFK查询docker日志
Graphquery
GraphQuery is a query language and execution engine tied to any backend service.
Pythonstudy
Python related technologies used in work: crawler, data analysis, timing tasks, RPC, page parsing, decorator, built-in functions, Python objects, multi-threading, multi-process, asynchronous, redis, mongodb, mysql, openstack, etc.
Markup
A Swift package for working with HTML, XML, and other markup languages, based on libxml2.
Domquery
PHP library for easy 'jQuery like' DOM traversing and manipulation.
Internettools
XPath/XQuery 3.1 interpreter for Pascal with compatibility modes for XPath 2.0/XQuery 1.0/3.0, custom and JSONiq extensions, XML/HTML parsers and classes for HTTP/S requests
Xqerl
Erlang XQuery 3.1 Processor
Xom
XOM™ is a new XML object model. It is an open source (LGPL), tree-based API for processing XML with Java that strives for correctness, simplicity, and performance, in that order.
Xml
XML without worries
Appcrawler
基于appium的app自动遍历工具
Amazon Mobile Sentiment Analysis
Opinion mining of Mobile reviews on Amazon platform
Fuzi
A fast & lightweight XML & HTML parser in Swift with XPath & CSS support
Webhere
HTML scraping for Objective-C.
Sirix
SirixDB is a temporal, evolutionary database system, which uses an accumulate only approach. It keeps the full history of each resource. Every commit stores a space-efficient snapshot through structural sharing. It is log-structured and never overwrites data. SirixDB uses a novel page-level versioning approach called sliding snapshot.
Parsel
Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors
Python Spider
豆瓣电影top250、斗鱼爬取json数据以及爬取美女图片、淘宝、有缘、CrawlSpider爬取红娘网相亲人的部分基本信息以及红娘网分布式爬取和存储redis、爬虫小demo、Selenium、爬取多点、django开发接口、爬取有缘网信息、模拟知乎登录、模拟github登录、模拟图虫网登录、爬取多点商城整站数据、爬取微信公众号历史文章、爬取微信群或者微信好友分享的文章、itchat监听指定微信公众号分享的文章
Basex
BaseX Main Repository.
✭ 515
javaxmlxpath
Wring
Extract content from webpages using CSS Selectors, XPath, and JS expressions
Camaro
camaro is an utility to transform XML to JSON, using Node.js binding to native XML parser pugixml, one of the fastest XML parser around.
Xpath
XPath package for Golang, supports HTML, XML, JSON document query.
Spider Flow
新一代爬虫平台,以图形化方式定义爬虫流程,不写代码即可完成爬虫。
Xidel
Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
Htmlquery
htmlquery is golang XPath package for HTML query.
Jsoupxpath
纯Java实现的支持W3C Xpath 1.0标准语法的HTML解析器。A html parser with xpath base on Jsoup and Antlr4. Maybe it is the best in java,ha ha.Just try it.
Fluentdom
A fluent api for working with XML in PHP
✭ 327
xmldomxpath
Crawlerforreader
Android 本地网络小说爬虫,基于jsoup及xpath
Exist
eXist Native XML Database and Application Platform
Didom
Simple and fast HTML and XML parser
Jsoup
jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety.
spparser
an async ETL tool written in Python.
ElementFinder
Fetch data from HTML and XML via xpath/css and prepare it with regexp
XPathTools
A Visual Studio Extension which can run any XPath and XPath function; navigates through results at the click of a button. Can show and copy any XPath incl. XML namespaces, avoiding XML namespace induced headaches. Keeps track of the current XPath via the statusbar.
Python-notes
Python related technologies used in work: crawler, data analysis, timing tasks, RPC, page parsing, decorator, built-in functions, Python objects, multi-threading, multi-process, asynchronous, redis, mongodb, mysql, openstack, etc.
crawler CIA CREST
R-crawler for CIA website (CREST)
XPath2.Net
Lightweight XPath2 for .NET
web-data-extractor
Extracting and parsing structured data with jQuery Selector, XPath or JsonPath from common web format like HTML, XML and JSON.
codechef-rank-comparator
Web application hosted on Heroku cloud platform based on web scraping in python using lxml library (XML Path Language).
DAM
Temario y ejercicios de Desarrollo de Aplicaciones Multiplataforma (DAM)
DouBanReptile
豆瓣租房小组多线程爬虫。爬取后自动按时间排序生成markdown文件。
1-60 of 81 xpath projects