XidelCommand line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
Stars: ✭ 335 (+1761.11%)
GoogledictionaryapiGoogle does not provide Google Dictionary API so I created one.
Stars: ✭ 528 (+2833.33%)
Linkedin scraperA library that scrapes Linkedin for user data
Stars: ✭ 413 (+2194.44%)
Java Spider一个基于webmagic框架二次开发的java爬虫框架实战,已实现能爬取腾讯,搜狐,今日头条(单独集成功能)等资讯内容,配合elasticsearch框架用法,实现了自动爬虫,已投入线上生产使用。
Stars: ✭ 276 (+1433.33%)
CheerioFast, flexible, and lean implementation of core jQuery designed specifically for the server.
Stars: ✭ 24,616 (+136655.56%)
ScrapersA list of scrapers from around the web.
Stars: ✭ 366 (+1933.33%)
SpidrA versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Stars: ✭ 656 (+3544.44%)
CryptocmdCryptocurrency historical price data library in Python. Data from https://coinmarketcap.com.
Stars: ✭ 299 (+1561.11%)
Awesome CrawlerA collection of awesome web crawler,spider in different languages
Stars: ✭ 4,793 (+26527.78%)
CrawlyCrawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (+2344.44%)
fb-scraperScrape a Facebook profile and turn it into a JSON file
Stars: ✭ 18 (+0%)
Instagram4j📷 Instagram private API in Java
Stars: ✭ 629 (+3394.44%)
Php GooseReadability / Html Content / Article Extractor & Web Scrapping library written in PHP
Stars: ✭ 392 (+2077.78%)
InformerA Telegram Mass Surveillance Bot in Python
Stars: ✭ 745 (+4038.89%)
Scrape It🔮 A Node.js scraper for humans.
Stars: ✭ 3,773 (+20861.11%)
Instagram ScraperScrapes an instagram user's photos and videos
Stars: ✭ 5,664 (+31366.67%)
Freshonions TorscraperFresh Onions is an open source TOR spider / hidden service onion crawler hosted at zlal32teyptf4tvi.onion
Stars: ✭ 348 (+1833.33%)
DuckduckgoAn unofficial DuckDuckGo search API.
Stars: ✭ 6 (-66.67%)
AutoscraperA Smart, Automatic, Fast and Lightweight Web Scraper for Python
Stars: ✭ 4,077 (+22550%)
Operative Frameworkoperative framework is a OSINT investigation framework, you can interact with multiple targets, execute multiple modules, create links with target, export rapport to PDF file, add note to target or results, interact with RESTFul API, write your own modules.
Stars: ✭ 511 (+2738.89%)
Socialmanagertools Gui🤖 👻 Desktop application for Instagram Bot, Twitter Bot and Facebook Bot
Stars: ✭ 293 (+1527.78%)
Instagram CrawlerGet Instagram posts/profile/hashtag data without using Instagram API
Stars: ✭ 643 (+3472.22%)
Weibo terminator workflowUpdate Version of weibo_terminator, This is Workflow Version aim at Get Job Done!
Stars: ✭ 259 (+1338.89%)
BookcorpusCrawl BookCorpus
Stars: ✭ 443 (+2361.11%)
ProxiesA Simple Proxy Scraper
Stars: ✭ 29 (+61.11%)
Imagescraper✂️ High performance, multi-threaded image scraper
Stars: ✭ 630 (+3400%)
SnscrapeA social networking service scraper in Python
Stars: ✭ 433 (+2305.56%)
CrawlerA high performance web crawler in Elixir.
Stars: ✭ 781 (+4238.89%)
GosintOSINT Swiss Army Knife
Stars: ✭ 401 (+2127.78%)
Finance Go📊 Financial markets data library implemented in go.
Stars: ✭ 392 (+2077.78%)
ReginaFetch new releases from http://www.juno.co.uk/.
Stars: ✭ 6 (-66.67%)
Osi.igInformation Gathering Instagram.
Stars: ✭ 377 (+1994.44%)
FbcrawlA Facebook crawler
Stars: ✭ 536 (+2877.78%)
Micro Open GraphA tiny Node.js microservice to scrape open graph data with joy.
Stars: ✭ 371 (+1961.11%)
OnlyfansScrape all the media from an OnlyFans account - Updated regularly
Stars: ✭ 731 (+3961.11%)
KatanaA Python Tool For google Hacking
Stars: ✭ 355 (+1872.22%)
JikanUnofficial MyAnimeList PHP+REST API which provides functions other than the official API
Stars: ✭ 531 (+2850%)
Xcrawler快速、简洁且强大的PHP爬虫框架
Stars: ✭ 344 (+1811.11%)
Flight Prices ScraperAutomated Script to scrape flight prices from any website into a csv format
Stars: ✭ 17 (-5.56%)
JavgoJavGo是一个集合影片管理,影片刮削,视频处理,资源搜索等综合一体的全功能影音软件,支持爬取javbus,jav321,javdb,javlibrary进行刮削,支持db,bus的磁力搜索,支持获取library的影片评论。
Stars: ✭ 338 (+1777.78%)
RedditdownloaderScrapes Reddit to download media of your choice.
Stars: ✭ 521 (+2794.44%)
LinkedinLinkedin Scraper using Selenium Web Driver, Chromium headless, Docker and Scrapy
Stars: ✭ 309 (+1616.67%)
SurgeonDeclarative DOM extraction expression evaluator. 👨⚕️
Stars: ✭ 653 (+3527.78%)
Hquery.phpAn extremely fast web scraper that parses megabytes of invalid HTML in a blink of an eye. PHP5.3+, no dependencies.
Stars: ✭ 295 (+1538.89%)
FinvizUnofficial API for finviz.com
Stars: ✭ 493 (+2638.89%)
WebinspectorRuby gem to inspect completely a web page. It scrapes a given URL, and returns you its meta, links, images more.
Stars: ✭ 288 (+1500%)
ImdbpyIMDbPY is a Python package useful to retrieve and manage the data of the IMDb movie database about movies, people, characters and companies
Stars: ✭ 792 (+4300%)
RcrawlerAn R web crawler and scraper
Stars: ✭ 274 (+1422.22%)
FerretDeclarative web scraping
Stars: ✭ 4,837 (+26772.22%)
lightnovel epub🍭 epub generator for (light)novels (轻) 小说 epub 生成器,支持站点:轻之国度、轻小说文库
Stars: ✭ 89 (+394.44%)
ScrapyrtHTTP API for Scrapy spiders
Stars: ✭ 637 (+3438.89%)
DataflowkitExtract structured data from web sites. Web sites scraping.
Stars: ✭ 456 (+2433.33%)
Indonesia News ScraperA news scraper for nodejs that help to scrap news from Indonesian news portal.
Stars: ✭ 18 (+0%)
Gifhub📈 Create GIFs from user's GitHub activity graph
Stars: ✭ 17 (-5.56%)
Lulu[Unmaintained] A simple and clean video/music/image downloader 👾
Stars: ✭ 789 (+4283.33%)
Scala ScraperA Scala library for scraping content from HTML pages
Stars: ✭ 631 (+3405.56%)
ScrapedinLinkedIn Scraper (currently working 2020)
Stars: ✭ 453 (+2416.67%)