WringExtract content from webpages using CSS Selectors, XPath, and JS expressions
Stars: ✭ 462 (+926.67%)
DomqueryPHP library for easy 'jQuery like' DOM traversing and manipulation.
Stars: ✭ 84 (+86.67%)
ParselParsel lets you extract data from XML/HTML documents using XPath or CSS selectors
Stars: ✭ 628 (+1295.56%)
Docs《数据采集从入门到放弃》源码。内容简介:爬虫介绍、就业情况、爬虫工程师面试题 ;HTTP协议介绍; Requests使用 ;解析器Xpath介绍; MongoDB与MySQL; 多线程爬虫; Scrapy介绍 ;Scrapy-redis介绍; 使用docker部署; 使用nomad管理docker集群; 使用EFK查询docker日志
Stars: ✭ 118 (+162.22%)
XidelCommand line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
Stars: ✭ 335 (+644.44%)
AstpathA command-line search utility for Python ASTs using XPath syntax.
Stars: ✭ 167 (+271.11%)
spparseran async ETL tool written in Python.
Stars: ✭ 34 (-24.44%)
XmlXML without worries
Stars: ✭ 35 (-22.22%)
WebhereHTML scraping for Objective-C.
Stars: ✭ 16 (-64.44%)
Z-Spider一些爬虫开发的技巧和案例
Stars: ✭ 33 (-26.67%)
HarserEasy way for HTML parsing and building XPath
Stars: ✭ 135 (+200%)
Zson专为测试人员打造的JSON解析器
Stars: ✭ 181 (+302.22%)
XpathXPath package for Golang, supports HTML, XML, JSON document query.
Stars: ✭ 376 (+735.56%)
PythonstudyPython related technologies used in work: crawler, data analysis, timing tasks, RPC, page parsing, decorator, built-in functions, Python objects, multi-threading, multi-process, asynchronous, redis, mongodb, mysql, openstack, etc.
Stars: ✭ 103 (+128.89%)
Jsoupxpath纯Java实现的支持W3C Xpath 1.0标准语法的HTML解析器。A html parser with xpath base on Jsoup and Antlr4. Maybe it is the best in java,ha ha.Just try it.
Stars: ✭ 331 (+635.56%)
PugixmlLight-weight, simple and fast XML parser for C++ with XPath support
Stars: ✭ 2,809 (+6142.22%)
DidomSimple and fast HTML and XML parser
Stars: ✭ 1,939 (+4208.89%)
XqerlErlang XQuery 3.1 Processor
Stars: ✭ 44 (-2.22%)
XPathToolsA Visual Studio Extension which can run any XPath and XPath function; navigates through results at the click of a button. Can show and copy any XPath incl. XML namespaces, avoiding XML namespace induced headaches. Keeps track of the current XPath via the statusbar.
Stars: ✭ 40 (-11.11%)
Html Agility PackHtml Agility Pack (HAP) is a free and open-source HTML parser written in C# to read/write DOM and supports plain XPATH or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.
Stars: ✭ 2,014 (+4375.56%)
web-data-extractorExtracting and parsing structured data with jQuery Selector, XPath or JsonPath from common web format like HTML, XML and JSON.
Stars: ✭ 52 (+15.56%)
Appcrawler基于appium的app自动遍历工具
Stars: ✭ 925 (+1955.56%)
FuziA fast & lightweight XML & HTML parser in Swift with XPath & CSS support
Stars: ✭ 894 (+1886.67%)
DAMTemario y ejercicios de Desarrollo de Aplicaciones Multiplataforma (DAM)
Stars: ✭ 96 (+113.33%)
CssplusCSSplus is a collection of CSS Reprocessor plugins that dynamically update CSS variables
Stars: ✭ 141 (+213.33%)
SirixSirixDB is a temporal, evolutionary database system, which uses an accumulate only approach. It keeps the full history of each resource. Every commit stores a space-efficient snapshot through structural sharing. It is log-structured and never overwrites data. SirixDB uses a novel page-level versioning approach called sliding snapshot.
Stars: ✭ 638 (+1317.78%)
Xmlqueryxmlquery is Golang XPath package for XML query.
Stars: ✭ 209 (+364.44%)
Python Spider豆瓣电影top250、斗鱼爬取json数据以及爬取美女图片、淘宝、有缘、CrawlSpider爬取红娘网相亲人的部分基本信息以及红娘网分布式爬取和存储redis、爬虫小demo、Selenium、爬取多点、django开发接口、爬取有缘网信息、模拟知乎登录、模拟github登录、模拟图虫网登录、爬取多点商城整站数据、爬取微信公众号历史文章、爬取微信群或者微信好友分享的文章、itchat监听指定微信公众号分享的文章
Stars: ✭ 615 (+1266.67%)
Jsonqueryjsonq package for Go. Golang XPath query for JSON query.
Stars: ✭ 134 (+197.78%)
BasexBaseX Main Repository.
Stars: ✭ 515 (+1044.44%)
Ftr Site ConfigSite-specific article extraction rules to aid content extractors, feed readers, and 'read later' applications.
Stars: ✭ 231 (+413.33%)
Camarocamaro is an utility to transform XML to JSON, using Node.js binding to native XML parser pugixml, one of the fastest XML parser around.
Stars: ✭ 438 (+873.33%)
GraphqueryGraphQuery is a query language and execution engine tied to any backend service.
Stars: ✭ 112 (+148.89%)
Spider Flow新一代爬虫平台,以图形化方式定义爬虫流程,不写代码即可完成爬虫。
Stars: ✭ 365 (+711.11%)
Jquery XpathjQuery XPath plugin (with full XPath 2.0 language support)
Stars: ✭ 173 (+284.44%)
Htmlqueryhtmlquery is golang XPath package for HTML query.
Stars: ✭ 338 (+651.11%)
MarkupA Swift package for working with HTML, XML, and other markup languages, based on libxml2.
Stars: ✭ 93 (+106.67%)
FluentdomA fluent api for working with XML in PHP
Stars: ✭ 327 (+626.67%)
MeeseeksAn Elixir library for parsing and extracting data from HTML and XML with CSS or XPath selectors.
Stars: ✭ 252 (+460%)
ExisteXist Native XML Database and Application Platform
Stars: ✭ 294 (+553.33%)
InternettoolsXPath/XQuery 3.1 interpreter for Pascal with compatibility modes for XPath 2.0/XQuery 1.0/3.0, custom and JSONiq extensions, XML/HTML parsers and classes for HTTP/S requests
Stars: ✭ 82 (+82.22%)
Jsoupjsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety.
Stars: ✭ 9,184 (+20308.89%)
XqueryExtract data or evaluate value from HTML/XML documents using XPath
Stars: ✭ 155 (+244.44%)
ElementFinderFetch data from HTML and XML via xpath/css and prepare it with regexp
Stars: ✭ 29 (-35.56%)
XomXOM™ is a new XML object model. It is an open source (LGPL), tree-based API for processing XML with Java that strives for correctness, simplicity, and performance, in that order.
Stars: ✭ 38 (-15.56%)
Python-notesPython related technologies used in work: crawler, data analysis, timing tasks, RPC, page parsing, decorator, built-in functions, Python objects, multi-threading, multi-process, asynchronous, redis, mongodb, mysql, openstack, etc.
Stars: ✭ 104 (+131.11%)
NokogiriHTML parser for PHP - Парсер HTML
Stars: ✭ 214 (+375.56%)
XPath2.NetLightweight XPath2 for .NET
Stars: ✭ 26 (-42.22%)
codechef-rank-comparatorWeb application hosted on Heroku cloud platform based on web scraping in python using lxml library (XML Path Language).
Stars: ✭ 23 (-48.89%)
GoxpathAn XPath 1.0 implementation written in the Go programming language.
Stars: ✭ 148 (+228.89%)
Defiant.jshttp://defiantjs.com
Stars: ✭ 907 (+1915.56%)
react-native-macosFork of https://github.com/ptmt/react-native-macos with more features
Stars: ✭ 22 (-51.11%)
OnoA sensible way to deal with XML & HTML for iOS & macOS
Stars: ✭ 2,599 (+5675.56%)
XemblyAssembly for XML: imperative language to modify XML documents
Stars: ✭ 212 (+371.11%)
Xsltdev.ruСправочник web-разработчика с примерами
Stars: ✭ 148 (+228.89%)