Thepiratebay: 💀 The Pirate Bay node.js client
Stars: ✭ 191 (-70.75%)
Hquery.php: An extremely fast web scraper that parses megabytes of invalid HTML in the blink of an eye. PHP 5.3+, no dependencies.
Stars: ✭ 295 (-54.82%)
Cheerio: Fast, flexible, and lean implementation of core jQuery designed specifically for the server.
Stars: ✭ 24,616 (+3669.68%)
Imdbpy: IMDbPY is a Python package for retrieving and managing data from the IMDb movie database about movies, people, characters, and companies.
Stars: ✭ 792 (+21.29%)
Goose Parser: Universal scraping tool that allows you to extract data using multiple environments
Stars: ✭ 211 (-67.69%)
Xidel: Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
Stars: ✭ 335 (-48.7%)
Nom: Rust parser combinator framework
Stars: ✭ 5,987 (+816.85%)
Awesome Crawler: A collection of awesome web crawlers and spiders in different languages
Stars: ✭ 4,793 (+634%)
Ferret: Declarative web scraping
Stars: ✭ 4,837 (+640.74%)
Pigeon: Command pigeon generates parsers in Go from a PEG grammar.
Stars: ✭ 603 (-7.66%)
Formula Parser: JavaScript library for parsing Excel formulas and more
Stars: ✭ 544 (-16.69%)
Rsql Parser: Parser for RSQL / FIQL – query language for RESTful APIs
Stars: ✭ 463 (-29.1%)
Html Parser: PHP HTML parser, similar to PHP Simple HTML DOM Parser but several times faster
Stars: ✭ 510 (-21.9%)
Finviz: Unofficial API for finviz.com
Stars: ✭ 493 (-24.5%)
Instagram4j: 📷 Instagram private API in Java
Stars: ✭ 629 (-3.68%)
Kong: Kong is a command-line parser for Go
Stars: ✭ 481 (-26.34%)
Valveresourceformat: 🔬 Valve's Source 2 resource file format parser and decompiler
Stars: ✭ 638 (-2.3%)
Scrapple: A framework for creating semi-automatic web content extractors
Stars: ✭ 464 (-28.94%)
Instagram Scraper: Scrapes an Instagram user's photos and videos
Stars: ✭ 5,664 (+767.38%)
Compiler: The Hoa\Compiler library.
Stars: ✭ 458 (-29.86%)
Minigo: minigo 🐥 is a small Go compiler made from scratch. It can compile itself.
Stars: ✭ 456 (-30.17%)
Remarkable: Markdown parser, done right. Commonmark support, extensions, syntax plugins, high speed - all in one. Gulp and metalsmith plugins available. Used by Facebook, Docusaurus and many others! Use https://github.com/breakdance/breakdance for HTML-to-markdown conversion. Use https://github.com/jonschlinkert/markdown-toc to generate a table of contents.
Stars: ✭ 5,252 (+704.29%)
Jikan: Unofficial MyAnimeList PHP+REST API that provides functions beyond those of the official API
Stars: ✭ 531 (-18.68%)
Scrapedin: LinkedIn scraper (working as of 2020)
Stars: ✭ 453 (-30.63%)
Swiftcsv: CSV parser for Swift
Stars: ✭ 511 (-21.75%)
Lol Html: Low output latency streaming HTML parser/rewriter with CSS selector-based API
Stars: ✭ 566 (-13.32%)
Operative Framework: Operative Framework is an OSINT investigation framework. You can interact with multiple targets, execute multiple modules, create links with a target, export reports to PDF, add notes to targets or results, interact with a RESTful API, and write your own modules.
Stars: ✭ 511 (-21.75%)
Imagescraper: ✂️ High-performance, multi-threaded image scraper
Stars: ✭ 630 (-3.52%)
Textx: Domain-specific languages and parsers in Python made easy (http://textx.github.io/textX/)
Stars: ✭ 496 (-24.04%)
Pimpmylog: 🍭 Log viewer for your web server
Stars: ✭ 564 (-13.63%)
Globalize: A JavaScript library for internationalization and localization that leverages the official Unicode CLDR JSON data
Stars: ✭ 4,612 (+606.28%)
Dicom: ⚡ High-performance DICOM medical image parser in Go.
Stars: ✭ 643 (-1.53%)
Tenko: A 100% spec-compliant ES2021 JavaScript parser written in JS
Stars: ✭ 490 (-24.96%)
Sweet Core: Sweeten your JavaScript.
Stars: ✭ 4,501 (+589.28%)
Sqlparser Rs: Extensible SQL lexer and parser for Rust
Stars: ✭ 607 (-7.04%)
Corexlsx: Excel spreadsheet (XLSX) format parser written in pure Swift
Stars: ✭ 481 (-26.34%)
Deta parser: Fast Chinese word segmentation and analysis
Stars: ✭ 476 (-27.11%)
Instagram Crawler: Get Instagram post, profile, and hashtag data without using the Instagram API
Stars: ✭ 643 (-1.53%)
Stream Json: A micro-library of Node.js stream components for creating custom JSON processing pipelines with a minimal memory footprint. It can parse JSON files far exceeding available memory, streaming individual primitives using a SAX-inspired API.
Stars: ✭ 462 (-29.25%)
Fbcrawl: A Facebook crawler
Stars: ✭ 536 (-17.92%)
Modest: Modest is a fast HTML renderer implemented as a pure C99 library with no outside dependencies.
Stars: ✭ 572 (-12.4%)
Dataflowkit: Extract structured data from websites. Website scraping.
Stars: ✭ 456 (-30.17%)
Json Schema Ref Parser: Parse, resolve, and dereference JSON Schema $ref pointers in Node and browsers
Stars: ✭ 532 (-18.53%)
Form: 🚂 Decodes url.Values into Go value(s) and encodes Go value(s) into url.Values. Dual array and full map support.
Stars: ✭ 454 (-30.47%)
Scrapyrt: HTTP API for Scrapy spiders
Stars: ✭ 637 (-2.45%)
Tinyrb: A tiny subset of Ruby with a Lua-esque VM
Stars: ✭ 452 (-30.78%)
Googledictionaryapi: Google does not provide a Google Dictionary API, so I created one.
Stars: ✭ 528 (-19.14%)
Exifr: 📷 The fastest and most versatile JS EXIF reading library.
Stars: ✭ 448 (-31.39%)
Fasthan: fastHan is a Chinese natural language processing tool built on fastNLP and PyTorch, as convenient to call as spaCy.
Stars: ✭ 449 (-31.24%)
Body Parser: Node.js body parsing middleware
Stars: ✭ 4,962 (+659.88%)
Crawly: Crawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (-32.62%)
Redditdownloader: Scrapes Reddit to download media of your choice.
Stars: ✭ 521 (-20.21%)