All Projects → Scrapple → Similar Projects or Alternatives

1958 Open source projects that are alternatives of or similar to Scrapple

Hive
lots of spider (很多爬虫)
Stars: ✭ 110 (-76.29%)
Mutual labels:  scrapy, beautifulsoup
Scrapy Cluster
This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.
Stars: ✭ 921 (+98.49%)
Mutual labels:  scrapy, scraping
Netflix Clone
Netflix like full-stack application with SPA client and backend implemented in service oriented architecture
Stars: ✭ 156 (-66.38%)
Mutual labels:  scrapy, web-scraping
Juno crawler
Scrapy crawler to collect data on the back catalog of songs listed for sale.
Stars: ✭ 150 (-67.67%)
Mutual labels:  scrapy, web-scraping
Ferret
Declarative web scraping
Stars: ✭ 4,837 (+942.46%)
Mutual labels:  crawler, scraping
Headless Chrome Crawler
Distributed crawler powered by Headless Chrome
Stars: ✭ 5,129 (+1005.39%)
Mutual labels:  crawler, scraping
Fbcrawl
A Facebook crawler
Stars: ✭ 536 (+15.52%)
Mutual labels:  crawler, scrapy
Scrapy Redis
Redis-based components for Scrapy.
Stars: ✭ 4,998 (+977.16%)
Mutual labels:  crawler, scrapy
Scraper-Projects
🕸 List of mini projects that involve web scraping 🕸
Stars: ✭ 25 (-94.61%)
Mutual labels:  scraping, beautifulsoup
Lulu
[Unmaintained] A simple and clean video/music/image downloader 👾
Stars: ✭ 789 (+70.04%)
Mutual labels:  crawler, scraping
Linkedin-Client
Web scraper for grabing data from Linkedin profiles or company pages (personal project)
Stars: ✭ 42 (-90.95%)
Mutual labels:  web-scraper, web-scraping
proxi
Proxy pool. Finds and checks proxies with rest api for querying results. Can find over 25k proxies in under 5 minutes.
Stars: ✭ 32 (-93.1%)
Mutual labels:  scraping, scrapy
scrapy facebooker
Collection of scrapy spiders which can scrape posts, images, and so on from public Facebook Pages.
Stars: ✭ 22 (-95.26%)
Mutual labels:  scraping, scrapy
scrapy-zyte-smartproxy
Zyte Smart Proxy Manager (formerly Crawlera) middleware for Scrapy
Stars: ✭ 317 (-31.68%)
Mutual labels:  scraping, scrapy
Universityrecruitment Ssurvey
用严肃的数据来回答“什么样的企业会到什么样的大学招聘”?
Stars: ✭ 30 (-93.53%)
Mutual labels:  crawler, beautifulsoup
Scrapy Fake Useragent
Random User-Agent middleware based on fake-useragent
Stars: ✭ 520 (+12.07%)
Mutual labels:  scrapy, web-scraping
Taiwan News Crawlers
Scrapy-based Crawlers for news of Taiwan
Stars: ✭ 83 (-82.11%)
Mutual labels:  crawler, scrapy
Scrapy Examples
Some scrapy and web.py exmaples
Stars: ✭ 71 (-84.7%)
Mutual labels:  crawler, scrapy
Scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Stars: ✭ 42,343 (+9025.65%)
Mutual labels:  crawler, scraping
Webmagic
A scalable web crawler framework for Java.
Stars: ✭ 10,186 (+2095.26%)
Mutual labels:  crawler, scraping
Qqmusicspider
基于Scrapy的QQ音乐爬虫(QQ Music Spider),爬取歌曲信息、歌词、精彩评论等,并且分享了QQ音乐中排名前6400名的内地和港台歌手的49万+的音乐语料
Stars: ✭ 120 (-74.14%)
Mutual labels:  crawler, scrapy
Terpene Profile Parser For Cannabis Strains
Parser and database to index the terpene profile of different strains of Cannabis from online databases
Stars: ✭ 63 (-86.42%)
Mutual labels:  crawler, scrapy
Marmot
💐Marmot | Web Crawler/HTTP protocol Download Package 🐭
Stars: ✭ 186 (-59.91%)
Mutual labels:  crawler, scrapy
Linkedin Profile Scraper
🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (-63.15%)
Mutual labels:  crawler, scraping
Antch
Antch, a fast, powerful and extensible web crawling & scraping framework for Go
Stars: ✭ 198 (-57.33%)
Mutual labels:  crawler, scraping
Scrapingoutsourcing
ScrapingOutsourcing专注分享爬虫代码 尽量每周更新一个
Stars: ✭ 164 (-64.66%)
Mutual labels:  crawler, scrapy
Goose Parser
Universal scrapping tool, which allows you to extract data using multiple environments
Stars: ✭ 211 (-54.53%)
Mutual labels:  crawler, scraping
Colly
Elegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+3248.06%)
Mutual labels:  crawler, scraping
Filesensor
Dynamic file detection tool based on crawler 基于爬虫的动态敏感文件探测工具
Stars: ✭ 227 (-51.08%)
Mutual labels:  crawler, scrapy
Iclr2019 Openreviewdata
Script that crawls meta data from ICLR OpenReview webpage. Tutorials on installing and using Selenium and ChromeDriver on Ubuntu.
Stars: ✭ 376 (-18.97%)
Mutual labels:  crawler, tutorial
linkedin-scraper
Tool to scrape linkedin
Stars: ✭ 74 (-84.05%)
Mutual labels:  scraping, beautifulsoup
BookingScraper
🌎 🏨 Scrape Booking.com 🏨 🌎
Stars: ✭ 68 (-85.34%)
Mutual labels:  web-scraping, beautifulsoup
chopper
Chopper is a tool to extract elements from HTML by preserving ancestors and CSS rules
Stars: ✭ 22 (-95.26%)
Mutual labels:  scraping, beautifulsoup
ioweb
Web Scraping Framework
Stars: ✭ 31 (-93.32%)
Mutual labels:  scraping, web-scraping
scrapy-fieldstats
A Scrapy extension to log items coverage when the spider shuts down
Stars: ✭ 17 (-96.34%)
Mutual labels:  scraping, scrapy
Data-Wrangling-with-Python
Simplify your ETL processes with these hands-on data sanitation tips, tricks, and best practices
Stars: ✭ 90 (-80.6%)
Mutual labels:  web-scraping, beautifulsoup
Euro2016 TerminalApp
⚽ Instantly find 🏆EURO 2016 live-streams & highlights, now a Web App!
Stars: ✭ 54 (-88.36%)
Mutual labels:  scraping, beautifulsoup
browser-pool
A Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppeteer, Playwright, or SecretAgent.
Stars: ✭ 71 (-84.7%)
Mutual labels:  scraping, web-scraping
lostark-wait-notifier
🐤️ Lost Ark wait notifier
Stars: ✭ 38 (-91.81%)
Mutual labels:  crawler, beautifulsoup
scrapy-distributed
A series of distributed components for Scrapy. Including RabbitMQ-based components, Kafka-based components, and RedisBloom-based components for Scrapy.
Stars: ✭ 38 (-91.81%)
Mutual labels:  scraping, scrapy
InstaBot
Simple and friendly Bot for Instagram, using Selenium and Scrapy with Python.
Stars: ✭ 32 (-93.1%)
Mutual labels:  scraping, scrapy
restaurant-finder-featureReviews
Build a Flask web application to help users retrieve key restaurant information and feature-based reviews (generated by applying market-basket model – Apriori algorithm and NLP on user reviews).
Stars: ✭ 21 (-95.47%)
Mutual labels:  web-scraping, scrapy
memes-api
API for scrapping common meme sites
Stars: ✭ 17 (-96.34%)
Mutual labels:  scraping, scrapy
policy-data-analyzer
Building a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US and India. Bringing NLP to the world of policy analysis through an extensible framework that includes scraping, preprocessing, active learning and text analysis pipelines.
Stars: ✭ 22 (-95.26%)
Mutual labels:  scraping, scrapy
Post Tuto Deployment
Build and deploy a machine learning app from scratch 🚀
Stars: ✭ 368 (-20.69%)
Mutual labels:  scrapy, scraping
PythonScrapyBasicSetup
Basic setup with random user agents and IP addresses for Python Scrapy Framework.
Stars: ✭ 57 (-87.72%)
Mutual labels:  scraping, web-scraping
ptt-web-crawler
PTT 網路版爬蟲
Stars: ✭ 20 (-95.69%)
Mutual labels:  crawler, scrapy
TorScrapper
A Scraper made 100% in Python using BeautifulSoup and Tor. It can be used to scrape both normal and onion links. Happy Scraping :)
Stars: ✭ 24 (-94.83%)
Mutual labels:  scraping, beautifulsoup
raspagem-de-dados-fatec
📓 Minicurso de raspagem de dados web com Python ministrado na Semana de Tecnologia da FATEC Jundiaí
Stars: ✭ 22 (-95.26%)
Mutual labels:  scraping, web-scraping
MediumScraper
Scraping articles of medium and providing audio versions 📑 to 🔊 using django
Stars: ✭ 12 (-97.41%)
Mutual labels:  web-scraper, beautifulsoup
Php Curl Class
PHP Curl Class makes it easy to send HTTP requests and integrate with web APIs
Stars: ✭ 2,903 (+525.65%)
Mutual labels:  web-scraping, web-scraper
ARGUS
ARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of different websites. On the websites, ARGUS is able to perform tasks like scraping texts or collecting hyperlinks between websites. See: https://link.springer.com/article/10.1007/s11192-020-03726-9
Stars: ✭ 68 (-85.34%)
Mutual labels:  scraping, scrapy
Apify Js
Apify SDK — The scalable web scraping and crawling library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.
Stars: ✭ 3,154 (+579.74%)
Mutual labels:  scraping, web-scraping
bots-zoo
No description or website provided.
Stars: ✭ 59 (-87.28%)
Mutual labels:  crawler, scraping
Line Bot Tutorial
line-bot-tutorial use python flask
Stars: ✭ 267 (-42.46%)
Mutual labels:  crawler, tutorial
Basketball reference web scraper
NBA Stats API via Basketball Reference
Stars: ✭ 279 (-39.87%)
Mutual labels:  web-scraping, web-scraper
Sasila
一个灵活、友好的爬虫框架
Stars: ✭ 286 (-38.36%)
Mutual labels:  crawler, scraping
Crawlertutorial
爬蟲極簡教學(fetch, parse, search, multiprocessing, API)- PTT 為例
Stars: ✭ 282 (-39.22%)
Mutual labels:  crawler, tutorial
Python Automation Scripts
Simple yet powerful automation stuffs.
Stars: ✭ 292 (-37.07%)
Mutual labels:  crawler, beautifulsoup
Requests Html
Pythonic HTML Parsing for Humans™
Stars: ✭ 12,268 (+2543.97%)
Mutual labels:  scraping, beautifulsoup
61-120 of 1958 similar projects