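DouBanReptileMulti-threaded crawler for Douban rental groups. After crawling, it automatically sorts the results by time and generates a Markdown file.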
Stars: ✭ 31 (-93.29%)
pydermanInstall Selenium-compatible Chrome/Firefox/Opera/PhantomJS/Edge webdrivers automatically.
Stars: ✭ 24 (-94.81%)
java-phantomjs-wrapperA Java wrapper around the PhantomJS binaries including a packaged HTML to PDF render script
Stars: ✭ 54 (-88.31%)
selectorlibA library to read a YML file with Xpath or CSS Selectors and extract data from HTML pages using them
Stars: ✭ 53 (-88.53%)
Responsive mockupsTakes screenshots of a webpage in different resolutions and automatically applies them to mockup templates.
Stars: ✭ 274 (-40.69%)
dotnet-security-unit-testsA web application that contains several unit tests for the purpose of .NET security
Stars: ✭ 25 (-94.59%)
JsoupxpathA pure-Java HTML parser supporting the W3C XPath 1.0 standard syntax, based on Jsoup and Antlr4. Maybe it is the best in Java, ha ha. Just try it.
Stars: ✭ 331 (-28.35%)
XPathToolsA Visual Studio Extension which can run any XPath and XPath function; navigates through results at the click of a button. Can show and copy any XPath incl. XML namespaces, avoiding XML namespace induced headaches. Keeps track of the current XPath via the statusbar.
Stars: ✭ 40 (-91.34%)
web-data-extractorExtracting and parsing structured data with jQuery Selector, XPath or JsonPath from common web formats like HTML, XML and JSON.
Stars: ✭ 52 (-88.74%)
jazeee-meteor-spiderableFork of Meteor Spiderable with longer timeout, caching, better server handling
Stars: ✭ 33 (-92.86%)
SlimerjsA scriptable browser like PhantomJS, based on Firefox
Stars: ✭ 2,984 (+545.89%)
Z-SpiderTips and case studies for crawler development
Stars: ✭ 33 (-92.86%)
XidelCommand line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
Stars: ✭ 335 (-27.49%)
go-xmldomXML DOM processing for Golang, supports xpath query
Stars: ✭ 38 (-91.77%)
spparserAn async ETL tool written in Python.
Stars: ✭ 34 (-92.64%)
phantom-lordHandy API for Headless Chromium
Stars: ✭ 24 (-94.81%)
Grunt Mocha[MOVED] Grunt task for running mocha specs in a headless browser (PhantomJS)
Stars: ✭ 371 (-19.7%)
ElementFinderFetch data from HTML and XML via xpath/css and prepare it with regexp
Stars: ✭ 29 (-93.72%)
exmlThe simplest Elixir wrapper for xmerl xpath
Stars: ✭ 23 (-95.02%)
qtspecsQT4 specifications
Stars: ✭ 22 (-95.24%)
intransient capybaraA set of improvements to Minitest/Capybara/Poltergeist/PhantomJS test stack that reduces the occurrence of transient failures.
Stars: ✭ 25 (-94.59%)
teleniumAutomation for Kivy Application
Stars: ✭ 56 (-87.88%)
ExisteXist Native XML Database and Application Platform
Stars: ✭ 294 (-36.36%)
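pyCreeperAn information-gathering (crawler) framework for quickly extracting web page content, with support for dynamically loading and controlling web pages.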
Stars: ✭ 25 (-94.59%)
Phantomjs NodePhantomJS integration module for NodeJS
Stars: ✭ 3,544 (+667.1%)
codechef-rank-comparatorWeb application hosted on Heroku cloud platform based on web scraping in python using lxml library (XML Path Language).
Stars: ✭ 23 (-95.02%)
PhantomjsGo client for PhantomJS.
Stars: ✭ 278 (-39.83%)
DAMSyllabus and exercises for Desarrollo de Aplicaciones Multiplataforma (DAM, Multiplatform Application Development)
Stars: ✭ 96 (-79.22%)
XpathXPath package for Golang, supports HTML, XML, JSON document query.
Stars: ✭ 376 (-18.61%)
crawlkitA crawler based on Phantom. Allows discovery of dynamic content and supports custom scrapers.
Stars: ✭ 23 (-95.02%)
node-qunit-phantomjsRun QUnit unit tests in a headless PhantomJS instance without using Grunt
Stars: ✭ 36 (-92.21%)
OpenScraperAn open source webapp for scraping: towards a public service for webscraping
Stars: ✭ 80 (-82.68%)
Htmlqueryhtmlquery is a Golang XPath package for HTML queries.
Stars: ✭ 338 (-26.84%)
siteshooter📷 Automate full website screenshots and PDF generation with multiple viewport support.
Stars: ✭ 63 (-86.36%)
wdm4jAutomatic Selenium WebDriver binaries management for java
Stars: ✭ 16 (-96.54%)
fontoxpathA minimalistic XPath 3.1 implementation in pure JavaScript
Stars: ✭ 97 (-79%)
NightmareA high-level browser automation library.
Stars: ✭ 19,067 (+4027.06%)
chromateAutomate Headless Chrome.
Stars: ✭ 36 (-92.21%)
FluentdomA fluent api for working with XML in PHP
Stars: ✭ 327 (-29.22%)
gosquitogosquito ("go" + "mosquito") is a pluggable tool for data gathering, data processing and data transmitting to various destinations.
Stars: ✭ 25 (-94.59%)
img-cliAn interactive command-line interface built in NodeJS for downloading one or more images to disk from a URL
Stars: ✭ 15 (-96.75%)
brackitQuery processor with proven optimizations, ready to use for your document store to query semi-structured data with a JSONiq like extension of XQuery. Can also be used as an ad-hoc in-memory query processor.
Stars: ✭ 28 (-93.94%)
Spider FlowA next-generation crawler platform that defines crawler workflows graphically, so crawlers can be built without writing code.
Stars: ✭ 365 (-21%)
Python-notesPython related technologies used in work: crawler, data analysis, timing tasks, RPC, page parsing, decorator, built-in functions, Python objects, multi-threading, multi-process, asynchronous, redis, mongodb, mysql, openstack, etc.
Stars: ✭ 104 (-77.49%)
reapr🕸→ℹ️ Reap Information from Websites
Stars: ✭ 14 (-96.97%)
Node Html Pdf📄 Html to pdf converter in nodejs. It spawns a phantomjs process and passes the pdf as buffer or as filename.
Stars: ✭ 3,364 (+628.14%)
XPath2.NetLightweight XPath2 for .NET
Stars: ✭ 26 (-94.37%)
Camarocamaro is a utility to transform XML to JSON, using Node.js bindings to the native XML parser pugixml, one of the fastest XML parsers around.
Stars: ✭ 438 (-5.19%)
Browser RunThe easiest way of running code in a browser environment
Stars: ✭ 378 (-18.18%)
Comic DlComic-dl is a command line tool to download manga and comics from various comic and manga sites. Supported sites : readcomiconline.to, mangafox.me, comic naver and many more.
Stars: ✭ 365 (-21%)
BrowsershotConvert HTML to an image, PDF or string
Stars: ✭ 3,526 (+663.2%)
TqExtensionTest your Drupal 7 (D8 in progress) sites more easily with TqExtension for Behat.
Stars: ✭ 13 (-97.19%)