Htmlquery: htmlquery is a Go XPath package for querying HTML.
Stars: ✭ 338 (+59.43%)
Python-notes: Python technologies used at work: crawlers, data analysis, scheduled tasks, RPC, page parsing, decorators, built-in functions, Python objects, multithreading, multiprocessing, async, Redis, MongoDB, MySQL, OpenStack, etc.
Stars: ✭ 104 (-50.94%)
Camaro: camaro is a utility to transform XML to JSON, using a Node.js binding to the native XML parser pugixml, one of the fastest XML parsers around.
Stars: ✭ 438 (+106.6%)
go-xmldom: XML DOM processing for Go; supports XPath queries.
Stars: ✭ 38 (-82.08%)
Jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety.
Stars: ✭ 9,184 (+4232.08%)
Exist: eXist Native XML Database and Application Platform.
Stars: ✭ 294 (+38.68%)
Harser: An easy way to parse HTML and build XPath expressions.
Stars: ✭ 135 (-36.32%)
codechef-rank-comparator: Web application hosted on the Heroku cloud platform, based on web scraping in Python using the lxml library with XPath (XML Path Language).
Stars: ✭ 23 (-89.15%)
Fuzi: A fast and lightweight XML and HTML parser in Swift with XPath and CSS support.
Stars: ✭ 894 (+321.7%)
Basex: BaseX Main Repository.
Stars: ✭ 515 (+142.92%)
Domquery: PHP library for easy jQuery-like DOM traversal and manipulation.
Stars: ✭ 84 (-60.38%)
Spider Flow: A next-generation crawler platform that defines crawler workflows graphically, so you can build a crawler without writing code.
Stars: ✭ 365 (+72.17%)
Xsltdev.ru: A web developer's reference with examples.
Stars: ✭ 148 (-30.19%)
Fluentdom: A fluent API for working with XML in PHP.
Stars: ✭ 327 (+54.25%)
Xom: XOM™ is a new XML object model. It is an open source (LGPL), tree-based API for processing XML with Java that strives for correctness, simplicity, and performance, in that order.
Stars: ✭ 38 (-82.08%)
ElementFinder: Fetch data from HTML and XML via XPath/CSS and prepare it with regular expressions.
Stars: ✭ 29 (-86.32%)
Xquery: Extract data or evaluate values from HTML/XML documents using XPath.
Stars: ✭ 155 (-26.89%)
XPath2.Net: Lightweight XPath2 for .NET.
Stars: ✭ 26 (-87.74%)
Defiant.js: http://defiantjs.com
Stars: ✭ 907 (+327.83%)
DAM: Syllabus and exercises for Desarrollo de Aplicaciones Multiplataforma (DAM, Multi-platform Application Development).
Stars: ✭ 96 (-54.72%)
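Docs: Source code for "Data Collection: From Getting Started to Giving Up" (《数据采集从入门到放弃》). Contents: introduction to crawlers, the job market, and interview questions for crawler engineers; introduction to the HTTP protocol; using Requests; introduction to XPath as a parser; MongoDB and MySQL; multithreaded crawlers; introduction to Scrapy; introduction to Scrapy-redis; deploying with Docker; managing a Docker cluster with Nomad; querying Docker logs with EFK.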
Stars: ✭ 118 (-44.34%)
dotnet-security-unit-tests: A web application that contains several unit tests for .NET security.
Stars: ✭ 25 (-88.21%)
Sirix: SirixDB is a temporal, evolutionary database system that uses an accumulate-only approach. It keeps the full history of each resource. Every commit stores a space-efficient snapshot through structural sharing. It is log-structured and never overwrites data. SirixDB uses a novel page-level versioning approach called sliding snapshot.
Stars: ✭ 638 (+200.94%)
Markup: A Swift package for working with HTML, XML, and other markup languages, based on libxml2.
Stars: ✭ 93 (-56.13%)
Wring: Extract content from webpages using CSS selectors, XPath, and JS expressions.
Stars: ✭ 462 (+117.92%)
Goxpath: An XPath 1.0 implementation written in the Go programming language.
Stars: ✭ 148 (-30.19%)
Xpath: XPath package for Go; supports querying HTML, XML, and JSON documents.
Stars: ✭ 376 (+77.36%)
Internettools: XPath/XQuery 3.1 interpreter for Pascal with compatibility modes for XPath 2.0/XQuery 1.0/3.0, custom and JSONiq extensions, XML/HTML parsers, and classes for HTTP/S requests.
Stars: ✭ 82 (-61.32%)
Xidel: Command-line tool to download and extract data from HTML/XML pages or JSON APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq, or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
Stars: ✭ 335 (+58.02%)
Astpath: A command-line search utility for Python ASTs using XPath syntax.
Stars: ✭ 167 (-21.23%)
Jsoupxpath: A pure-Java HTML parser supporting the W3C XPath 1.0 standard syntax, based on Jsoup and Antlr4. Maybe it is the best in Java, ha ha. Just try it.
Stars: ✭ 331 (+56.13%)
Xqerl: Erlang XQuery 3.1 Processor
Stars: ✭ 44 (-79.25%)
Cssplus: CSSplus is a collection of CSS Reprocessor plugins that dynamically update CSS variables.
Stars: ✭ 141 (-33.49%)
spparser: An async ETL tool written in Python.
Stars: ✭ 34 (-83.96%)
Xml: XML without worries
Stars: ✭ 35 (-83.49%)
XPathTools: A Visual Studio extension that can run any XPath expression or XPath function and navigates through the results at the click of a button. It can show and copy any XPath, including XML namespaces, avoiding namespace-induced headaches, and it keeps track of the current XPath in the status bar.
Stars: ✭ 40 (-81.13%)
Zson: A JSON parser built specifically for testers.
Stars: ✭ 181 (-14.62%)
Appcrawler: An Appium-based tool for automatically traversing apps.
Stars: ✭ 925 (+336.32%)
web-data-extractor: Extract and parse structured data with jQuery selectors, XPath, or JsonPath from common web formats such as HTML, XML, and JSON.
Stars: ✭ 52 (-75.47%)
Jsonquery: jsonq package for Go. XPath queries for JSON documents in Go.
Stars: ✭ 134 (-36.79%)
Z-Spider: Some tips and case studies for crawler development.
Stars: ✭ 33 (-84.43%)
DouBanReptile: A multithreaded crawler for Douban rental groups. After crawling, it automatically generates a Markdown file sorted by time.
Stars: ✭ 31 (-85.38%)
Didom: Simple and fast HTML and XML parser.
Stars: ✭ 1,939 (+814.62%)
OpenScraper: An open source web app for scraping: towards a public service for web scraping.
Stars: ✭ 80 (-62.26%)
Webhere: HTML scraping for Objective-C.
Stars: ✭ 16 (-92.45%)
fontoxpath: A minimalistic XPath 3.1 implementation in pure JavaScript.
Stars: ✭ 97 (-54.25%)
Graphquery: GraphQuery is a query language and execution engine tied to any backend service.
Stars: ✭ 112 (-47.17%)
Parsel: Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors.
Stars: ✭ 628 (+196.23%)
Xmlquery: xmlquery is a Go XPath package for querying XML.
Stars: ✭ 209 (-1.42%)
Jquery Xpath: jQuery XPath plugin (with full XPath 2.0 language support)
Stars: ✭ 173 (-18.4%)
Html Agility Pack: Html Agility Pack (HAP) is a free and open-source HTML parser written in C# that reads and writes the DOM and supports plain XPath or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.
Stars: ✭ 2,014 (+850%)
Pythonstudy: Python technologies used at work: crawlers, data analysis, scheduled tasks, RPC, page parsing, decorators, built-in functions, Python objects, multithreading, multiprocessing, async, Redis, MongoDB, MySQL, OpenStack, etc.
Stars: ✭ 103 (-51.42%)
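Python Spider: Douban Movie Top 250; crawling JSON data and photos of women from Douyu; Taobao; Youyuan; using CrawlSpider to crawl some basic profile information from the Hongniang matchmaking site, plus distributed crawling of Hongniang with storage in Redis; small crawler demos; Selenium; crawling Duodian (多点); building API endpoints with Django; crawling Youyuan profile information; simulating login to Zhihu, GitHub, and Tuchong; crawling the entire Duodian store site; crawling the article history of WeChat official accounts; crawling articles shared in WeChat groups or by WeChat friends; using itchat to monitor articles shared by specified WeChat official accounts.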
Stars: ✭ 615 (+190.09%)