papercutPapercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.
Stars: ✭ 15 (-93.3%)
Rendoradynamic server-side rendering using headless Chrome to effortlessly solve the SEO problem for modern javascript websites
Stars: ✭ 1,853 (+727.23%)
Goose ParserUniversal scrapping tool, which allows you to extract data using multiple environments
Stars: ✭ 211 (-5.8%)
GeziyorGeziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.
Stars: ✭ 1,246 (+456.25%)
ScrapyScrapy, a fast high-level web crawling & scraping framework for Python.
Stars: ✭ 42,343 (+18803.13%)
Awesome Python Primer自学入门 Python 优质中文资源索引,包含 书籍 / 文档 / 视频,适用于 爬虫 / Web / 数据分析 / 机器学习 方向
Stars: ✭ 57 (-74.55%)
D4n155OWASP D4N155 - Intelligent and dynamic wordlist using OSINT
Stars: ✭ 105 (-53.12%)
ScrappleA framework for creating semi-automatic web content extractors
Stars: ✭ 464 (+107.14%)
Lulu[Unmaintained] A simple and clean video/music/image downloader 👾
Stars: ✭ 789 (+252.23%)
NgmetaDynamic meta tags in your AngularJS single page application
Stars: ✭ 152 (-32.14%)
CrawlyCrawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (+96.43%)
WebmagicA scalable web crawler framework for Java.
Stars: ✭ 10,186 (+4447.32%)
AntchAntch, a fast, powerful and extensible web crawling & scraping framework for Go
Stars: ✭ 198 (-11.61%)
CollyElegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+6835.27%)
Gopa[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (+23.66%)
Sasila一个灵活、友好的爬虫框架
Stars: ✭ 286 (+27.68%)
AutoscraperA Smart, Automatic, Fast and Lightweight Web Scraper for Python
Stars: ✭ 4,077 (+1720.09%)
NewcrawlerFree Web Scraping Tool with Java
Stars: ✭ 589 (+162.95%)
FerretDeclarative web scraping
Stars: ✭ 4,837 (+2059.38%)
GooglescraperA Python module to scrape several search engines (like Google, Yandex, Bing, Duckduckgo, ...). Including asynchronous networking support.
Stars: ✭ 2,363 (+954.91%)
bots-zooNo description or website provided.
Stars: ✭ 59 (-73.66%)
spiderable-middleware🤖 Prerendering for JavaScript powered websites. Great solution for PWAs (Progressive Web Apps), SPAs (Single Page Applications), and other websites based on top of front-end JavaScript frameworks
Stars: ✭ 29 (-87.05%)
DotnetcrawlerDotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c
Stars: ✭ 100 (-55.36%)
SerpscrapSEO python scraper to extract data from major searchengine result pages. Extract data like url, title, snippet, richsnippet and the type from searchresults for given keywords. Detect Ads or make automated screenshots. You can also fetch text content of urls provided in searchresults or by your own. It's usefull for SEO and business related research tasks.
Stars: ✭ 153 (-31.7%)
Linkedin Profile Scraper🕵️♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (-23.66%)
Jikan RestThe REST API for Jikan
Stars: ✭ 200 (-10.71%)
TumblthreeA Tumblr Backup Application
Stars: ✭ 211 (-5.8%)
IdtImage Dataset Tool (idt) is a cli tool designed to make the otherwise repetitive and slow task of creating image datasets into a fast and intuitive process.
Stars: ✭ 202 (-9.82%)
Statamic PeakStatamic Peak is an opinionated starter kit for all your Statamic sites.
Stars: ✭ 212 (-5.36%)
Videoserver以Node.js基于express以及爬虫实现的视频资源后端
Stars: ✭ 200 (-10.71%)
Laosjgolang light-weight image crawler
Stars: ✭ 199 (-11.16%)
SeoSEO utilities including a unique field type, sitemap & redirect manager
Stars: ✭ 210 (-6.25%)
Jsonframe Cheeriosimple multi-level scraper json input/output for Cheerio
Stars: ✭ 196 (-12.5%)
PychromelessPython Lambda Chrome Automation (naming pending)
Stars: ✭ 219 (-2.23%)
Algoliasearch NetlifyOfficial Algolia Plugin for Netlify. Index your website to Algolia when deploying your project to Netlify with the Algolia Crawler
Stars: ✭ 208 (-7.14%)
Fooproxy稳健高效的评分制-针对性- IP代理池 + API服务,可以自己插入采集器进行代理IP的爬取,针对你的爬虫的一个或多个目标网站分别生成有效的IP代理数据库,支持MongoDB 4.0 使用 Python3.7(Scored IP proxy pool ,customise proxy data crawler can be added anytime)
Stars: ✭ 195 (-12.95%)
JvppeteerHeadless Chrome For Java (Java 爬虫)
Stars: ✭ 193 (-13.84%)
Media ScraperScrapes all photos and videos in a web page / Instagram / Twitter / Tumblr / Reddit / pixiv / TikTok
Stars: ✭ 206 (-8.04%)
JuriscraperAn API to scrape American court websites for metadata.
Stars: ✭ 194 (-13.39%)
Seo ManagerSeo Manager Package for Laravel ( with Localization )
Stars: ✭ 192 (-14.29%)
Web Launch Checklist📋 A simple website launch checklist to keep track of the most important enrichment possibilities for a website.
Stars: ✭ 214 (-4.46%)
Tianyanchapip安装的天眼查爬虫API,指定的单个/多个企业工商信息一键保存为Excel/JSON格式。A Battery-included Scraper API of Tianyancha, the best Chinese business data and investigation platform.
Stars: ✭ 206 (-8.04%)
Google Group CrawlerGet (almost) original messages from google group archives. Your data is yours.
Stars: ✭ 190 (-15.18%)
Anime DlAnime-dl is a command-line program to download anime from CrunchyRoll and Funimation.
Stars: ✭ 190 (-15.18%)
GeccoEasy to use lightweight web crawler(易用的轻量化网络爬虫)
Stars: ✭ 2,310 (+931.25%)
Ruiji.netcrawler framework, distributed crawler extractor
Stars: ✭ 220 (-1.79%)
Search Engine ParserLightweight package to query popular search engines and scrape for result titles, links and descriptions
Stars: ✭ 216 (-3.57%)
Jd mask robot京东口罩库存监控爬虫(非selenium),扫码登录、查价、加购、下单、秒杀
Stars: ✭ 216 (-3.57%)
ThalGetting started with Puppeteer and Chrome Headless for Web Scraping
Stars: ✭ 2,345 (+946.88%)
Goribot[Crawler/Scraper for Golang]🕷A lightweight distributed friendly Golang crawler framework.一个轻量的分布式友好的 Golang 爬虫框架。
Stars: ✭ 190 (-15.18%)
SeotoolsSEO Tools for Laravel
Stars: ✭ 2,406 (+974.11%)