All Projects → Arachnid → Similar Projects or Alternatives

824 Open source projects that are alternatives of or similar to Arachnid

Sitemap Generator Cli
Creates an XML-Sitemap by crawling a given site.
Stars: ✭ 214 (-4.46%)
Mutual labels:  crawler, seo
papercut
Papercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.
Stars: ✭ 15 (-93.3%)
Mutual labels:  crawler, scraping
Rendora
dynamic server-side rendering using headless Chrome to effortlessly solve the SEO problem for modern javascript websites
Stars: ✭ 1,853 (+727.23%)
Mutual labels:  crawler, seo
Goose Parser
Universal scrapping tool, which allows you to extract data using multiple environments
Stars: ✭ 211 (-5.8%)
Mutual labels:  crawler, scraping
Geziyor
Geziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.
Stars: ✭ 1,246 (+456.25%)
Mutual labels:  crawler, scraping
Scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Stars: ✭ 42,343 (+18803.13%)
Mutual labels:  crawler, scraping
Awesome Python Primer
自学入门 Python 优质中文资源索引,包含 书籍 / 文档 / 视频,适用于 爬虫 / Web / 数据分析 / 机器学习 方向
Stars: ✭ 57 (-74.55%)
Mutual labels:  crawler, scraping
D4n155
OWASP D4N155 - Intelligent and dynamic wordlist using OSINT
Stars: ✭ 105 (-53.12%)
Mutual labels:  crawler, scraping
Scrapple
A framework for creating semi-automatic web content extractors
Stars: ✭ 464 (+107.14%)
Mutual labels:  crawler, scraping
Lulu
[Unmaintained] A simple and clean video/music/image downloader 👾
Stars: ✭ 789 (+252.23%)
Mutual labels:  crawler, scraping
Headless Chrome Crawler
Distributed crawler powered by Headless Chrome
Stars: ✭ 5,129 (+2189.73%)
Mutual labels:  crawler, scraping
Ngmeta
Dynamic meta tags in your AngularJS single page application
Stars: ✭ 152 (-32.14%)
Mutual labels:  crawler, seo
Crawly
Crawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (+96.43%)
Mutual labels:  crawler, scraping
Webmagic
A scalable web crawler framework for Java.
Stars: ✭ 10,186 (+4447.32%)
Mutual labels:  crawler, scraping
Antch
Antch, a fast, powerful and extensible web crawling & scraping framework for Go
Stars: ✭ 198 (-11.61%)
Mutual labels:  crawler, scraping
Sitemap Generator
Easily create XML sitemaps for your website.
Stars: ✭ 273 (+21.88%)
Mutual labels:  crawler, seo
Easy Scraping Tutorial
Simple but useful Python web scraping tutorial code.
Stars: ✭ 583 (+160.27%)
Mutual labels:  crawler, scraping
Colly
Elegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+6835.27%)
Mutual labels:  crawler, scraping
Gopa
[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (+23.66%)
Mutual labels:  crawler, scraping
Sasila
一个灵活、友好的爬虫框架
Stars: ✭ 286 (+27.68%)
Mutual labels:  crawler, scraping
Autoscraper
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
Stars: ✭ 4,077 (+1720.09%)
Mutual labels:  crawler, scraping
Newcrawler
Free Web Scraping Tool with Java
Stars: ✭ 589 (+162.95%)
Mutual labels:  crawler, scraping
Ferret
Declarative web scraping
Stars: ✭ 4,837 (+2059.38%)
Mutual labels:  crawler, scraping
Googlescraper
A Python module to scrape several search engines (like Google, Yandex, Bing, Duckduckgo, ...). Including asynchronous networking support.
Stars: ✭ 2,363 (+954.91%)
Mutual labels:  crawler, scraping
Prerender Java
java framework for prerender
Stars: ✭ 115 (-48.66%)
Mutual labels:  crawler, seo
bots-zoo
No description or website provided.
Stars: ✭ 59 (-73.66%)
Mutual labels:  crawler, scraping
Sitemap Generator Crawler
Script that generates a sitemap by crawling a given URL
Stars: ✭ 169 (-24.55%)
Mutual labels:  crawler, seo
Scrapy Crawlera
Crawlera middleware for Scrapy
Stars: ✭ 281 (+25.45%)
Mutual labels:  crawler, scraping
spiderable-middleware
🤖 Prerendering for JavaScript powered websites. Great solution for PWAs (Progressive Web Apps), SPAs (Single Page Applications), and other websites based on top of front-end JavaScript frameworks
Stars: ✭ 29 (-87.05%)
Mutual labels:  crawler, seo
Dotnetcrawler
DotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c
Stars: ✭ 100 (-55.36%)
Mutual labels:  crawler, scraping
Serpscrap
SEO python scraper to extract data from major searchengine result pages. Extract data like url, title, snippet, richsnippet and the type from searchresults for given keywords. Detect Ads or make automated screenshots. You can also fetch text content of urls provided in searchresults or by your own. It's usefull for SEO and business related research tasks.
Stars: ✭ 153 (-31.7%)
Mutual labels:  scraping, seo
Linkedin Profile Scraper
🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (-23.66%)
Mutual labels:  crawler, scraping
Jikan Rest
The REST API for Jikan
Stars: ✭ 200 (-10.71%)
Mutual labels:  scraping
Tumblthree
A Tumblr Backup Application
Stars: ✭ 211 (-5.8%)
Mutual labels:  crawler
Idt
Image Dataset Tool (idt) is a cli tool designed to make the otherwise repetitive and slow task of creating image datasets into a fast and intuitive process.
Stars: ✭ 202 (-9.82%)
Mutual labels:  scraping
Statamic Peak
Statamic Peak is an opinionated starter kit for all your Statamic sites.
Stars: ✭ 212 (-5.36%)
Mutual labels:  seo
Videoserver
以Node.js基于express以及爬虫实现的视频资源后端
Stars: ✭ 200 (-10.71%)
Mutual labels:  crawler
Laosj
golang light-weight image crawler
Stars: ✭ 199 (-11.16%)
Mutual labels:  crawler
Seo
SEO utilities including a unique field type, sitemap & redirect manager
Stars: ✭ 210 (-6.25%)
Mutual labels:  seo
Jsonframe Cheerio
simple multi-level scraper json input/output for Cheerio
Stars: ✭ 196 (-12.5%)
Mutual labels:  scraping
Pychromeless
Python Lambda Chrome Automation (naming pending)
Stars: ✭ 219 (-2.23%)
Mutual labels:  crawler
Ok ip proxy pool
🍿爬虫代理IP池(proxy pool) python🍟一个还ok的IP代理池
Stars: ✭ 196 (-12.5%)
Mutual labels:  crawler
Algoliasearch Netlify
Official Algolia Plugin for Netlify. Index your website to Algolia when deploying your project to Netlify with the Algolia Crawler
Stars: ✭ 208 (-7.14%)
Mutual labels:  crawler
Fooproxy
稳健高效的评分制-针对性- IP代理池 + API服务,可以自己插入采集器进行代理IP的爬取,针对你的爬虫的一个或多个目标网站分别生成有效的IP代理数据库,支持MongoDB 4.0 使用 Python3.7(Scored IP proxy pool ,customise proxy data crawler can be added anytime)
Stars: ✭ 195 (-12.95%)
Mutual labels:  crawler
Jvppeteer
Headless Chrome For Java (Java 爬虫)
Stars: ✭ 193 (-13.84%)
Mutual labels:  crawler
Media Scraper
Scrapes all photos and videos in a web page / Instagram / Twitter / Tumblr / Reddit / pixiv / TikTok
Stars: ✭ 206 (-8.04%)
Mutual labels:  crawler
Juriscraper
An API to scrape American court websites for metadata.
Stars: ✭ 194 (-13.39%)
Mutual labels:  scraping
Seo Manager
Seo Manager Package for Laravel ( with Localization )
Stars: ✭ 192 (-14.29%)
Mutual labels:  seo
Web Launch Checklist
📋 A simple website launch checklist to keep track of the most important enrichment possibilities for a website.
Stars: ✭ 214 (-4.46%)
Mutual labels:  seo
Tianyancha
pip安装的天眼查爬虫API,指定的单个/多个企业工商信息一键保存为Excel/JSON格式。A Battery-included Scraper API of Tianyancha, the best Chinese business data and investigation platform.
Stars: ✭ 206 (-8.04%)
Mutual labels:  crawler
Google Group Crawler
Get (almost) original messages from google group archives. Your data is yours.
Stars: ✭ 190 (-15.18%)
Mutual labels:  crawler
Anime Dl
Anime-dl is a command-line program to download anime from CrunchyRoll and Funimation.
Stars: ✭ 190 (-15.18%)
Mutual labels:  scraping
Github Spider
Github 仓库及用户分析爬虫
Stars: ✭ 190 (-15.18%)
Mutual labels:  crawler
Gecco
Easy to use lightweight web crawler(易用的轻量化网络爬虫)
Stars: ✭ 2,310 (+931.25%)
Mutual labels:  crawler
Ruiji.net
crawler framework, distributed crawler extractor
Stars: ✭ 220 (-1.79%)
Mutual labels:  crawler
Search Engine Parser
Lightweight package to query popular search engines and scrape for result titles, links and descriptions
Stars: ✭ 216 (-3.57%)
Mutual labels:  scraping
Jd mask robot
京东口罩库存监控爬虫(非selenium),扫码登录、查价、加购、下单、秒杀
Stars: ✭ 216 (-3.57%)
Mutual labels:  crawler
Thal
Getting started with Puppeteer and Chrome Headless for Web Scraping
Stars: ✭ 2,345 (+946.88%)
Mutual labels:  scraping
Goribot
[Crawler/Scraper for Golang]🕷A lightweight distributed friendly Golang crawler framework.一个轻量的分布式友好的 Golang 爬虫框架。
Stars: ✭ 190 (-15.18%)
Mutual labels:  crawler
Seotools
SEO Tools for Laravel
Stars: ✭ 2,406 (+974.11%)
Mutual labels:  seo
1-60 of 824 similar projects