restaurant-finder-featureReviewsBuild a Flask web application that helps users retrieve key restaurant information and feature-based reviews (generated by applying a market-basket model, the Apriori algorithm, and NLP to user reviews).
Stars: ✭ 21 (-73.42%)
Scrapyd Cluster On HerokuSet up a free and scalable Scrapyd cluster for distributed web crawling with just a few clicks.
Stars: ✭ 106 (+34.18%)
Scrapy CraigslistWeb Scraping Craigslist's Engineering Jobs in NY with Scrapy
Stars: ✭ 54 (-31.65%)
OLX Scraper📻 An OLX scraper using Scrapy + MongoDB. It scrapes recent ads posted for a requested product and dumps them to MongoDB (NoSQL).
Stars: ✭ 15 (-81.01%)
ScrappleA framework for creating semi-automatic web content extractors
Stars: ✭ 464 (+487.34%)
IMDB-ScraperScrapy project for scraping data from IMDB, with a movie dataset covering 58,623 movies.
Stars: ✭ 37 (-53.16%)
City ScrapersScrape, standardize and share public meetings from local government websites
Stars: ✭ 220 (+178.48%)
Juno crawlerScrapy crawler to collect data on the back catalog of songs listed for sale.
Stars: ✭ 150 (+89.87%)
Netflix CloneNetflix-like full-stack application with an SPA client and a backend implemented in a service-oriented architecture
Stars: ✭ 156 (+97.47%)
scrapy-wayback-machineA Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.
Stars: ✭ 92 (+16.46%)
NScrapyNScrapy is a .NET Core cross-platform distributed spider framework which provides an easy way to write your own spider
Stars: ✭ 88 (+11.39%)
scrapy-cookiesA cookie persistence middleware for Scrapy
Stars: ✭ 19 (-75.95%)
AutohomeUsing Scrapy to crawl Autohome and store the results in MongoDB; simple analysis and NLP coming soon
Stars: ✭ 23 (-70.89%)
factoryDocker microservice & crawler built with Scrapy
Stars: ✭ 56 (-29.11%)
ScrapyProjectScrapy project (MySQL + MongoDB, Douban Top 250 movies)
Stars: ✭ 18 (-77.22%)
scrapy spiderNo description or website provided.
Stars: ✭ 58 (-26.58%)
torchestratorSpin up Tor containers and then proxy HTTP requests via these Tor instances
Stars: ✭ 32 (-59.49%)
selectorlibA library to read a YAML file with XPath or CSS selectors and extract data from HTML pages using them
Stars: ✭ 53 (-32.91%)
JD Spider👍 JD.com crawler (heavily commented, extremely friendly to crawler beginners)
Stars: ✭ 56 (-29.11%)
bgmtoolsSmall tools for Bangumi
Stars: ✭ 66 (-16.46%)
browser-poolA Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppeteer, Playwright, or SecretAgent.
Stars: ✭ 71 (-10.13%)
www job comCrawls job listings from Lagou, BOSS Zhipin, Zhaopin, 51job, Ganji, 58.com, and other recruitment sites
Stars: ✭ 47 (-40.51%)
htmlunit🕸🧰☕️Tools to Scrape Dynamic Web Content via the 'HtmlUnit' Java Library
Stars: ✭ 39 (-50.63%)
leetcode-compensationCompensation analysis on the posts scraped from leetcode.com/discuss/compensation. At present, the reports have been generated only for Indian cities.
Stars: ✭ 83 (+5.06%)
grailerweb scraping tool for grailed.com
Stars: ✭ 30 (-62.03%)
Data-Wrangling-with-PythonSimplify your ETL processes with these hands-on data sanitation tips, tricks, and best practices
Stars: ✭ 90 (+13.92%)
163Music163music spider by scrapy.
Stars: ✭ 60 (-24.05%)
cl-torrentsSearching torrents on popular trackers - CLI, readline, GUI, web client. Tutorial and binaries (issue tracker on https://gitlab.com/vindarel/cl-torrents/)
Stars: ✭ 83 (+5.06%)
rymscraperPython API to extract data from rateyourmusic.com.
Stars: ✭ 63 (-20.25%)
WaWebSessionHandler(DISCONTINUED) Save WhatsApp Web Sessions as files and open them everywhere!
Stars: ✭ 27 (-65.82%)
scrapy.dartScrapy, a fast high-level web crawling & scraping framework for Dart and Flutter
Stars: ✭ 50 (-36.71%)
aioScrapyAn asynchronous coroutine crawler framework based on asyncio and aiohttp. Stars welcome.
Stars: ✭ 34 (-56.96%)
rreddit𝐫⟋ Get Reddit data
Stars: ✭ 49 (-37.97%)
animecenterThe source code for animecenter
Stars: ✭ 16 (-79.75%)
web full stack applicationShow full-stack technology applications: Scrapy + web service (RESTful) + WebSocket + VueJS + MongoDB
Stars: ✭ 16 (-79.75%)
automation-scriptsSimple scripts that I'm using to automate the boring things.
Stars: ✭ 14 (-82.28%)
extractnetA Dragnet that also extracts author, headline, date, and keywords from context
Stars: ✭ 52 (-34.18%)
faexportThe API for Furaffinity you wish existed
Stars: ✭ 61 (-22.78%)
PythonCovers Python basic to advanced topics, practice questions, logical problems in Python, and web development using HTML, CSS, Bootstrap, jQuery, DOM, and Django 🚀🚀. 💥 🌈
Stars: ✭ 29 (-63.29%)
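TikTokDownloader PyWebIO🚀 "Douyin_TikTok_Download_API" is an out-of-the-box, high-performance asynchronous Douyin/TikTok data scraping tool that supports API calls and online batch parsing and downloading.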
Stars: ✭ 919 (+1063.29%)
OpenScraperAn open source webapp for scraping: towards a public service for webscraping
Stars: ✭ 80 (+1.27%)
invana-botA web crawler that scrapes using YAML and Python code.
Stars: ✭ 30 (-62.03%)
reapr🕸→ℹ️ Reap Information from Websites
Stars: ✭ 14 (-82.28%)
iwwAI based web-wrapper for web-content-extraction
Stars: ✭ 61 (-22.78%)
iowebWeb Scraping Framework
Stars: ✭ 31 (-60.76%)