OLX Scraper: 📻 An OLX scraper using Scrapy + MongoDB. It scrapes recent ads posted for a requested product and dumps them to MongoDB (NoSQL).
Stars: ✭ 15 (-76.19%)
scrapy facebooker: Collection of Scrapy spiders that can scrape posts, images, and so on from public Facebook Pages.
Stars: ✭ 22 (-65.08%)
XMQ-BackUp: Backup tool for Xiaomiquan (小密圈) groups, topics, images, and files.
Stars: ✭ 22 (-65.08%)
163Music: A 163music spider built with Scrapy.
Stars: ✭ 60 (-4.76%)
restaurant-finder-featureReviews: A Flask web application that helps users retrieve key restaurant information and feature-based reviews, generated by applying a market-basket model (the Apriori algorithm) and NLP to user reviews.
Stars: ✭ 21 (-66.67%)
NScrapy: NScrapy is a cross-platform distributed spider framework for .NET Core that provides an easy way to write your own spider.
Stars: ✭ 88 (+39.68%)
pythonSpider: 🕷️ Some Python spiders built with BeautifulSoup or Scrapy.
Stars: ✭ 28 (-55.56%)
scraping-ebay: Scraping eBay's products using the Scrapy web crawling framework.
Stars: ✭ 79 (+25.4%)
memes-api: API for scraping common meme sites.
Stars: ✭ 17 (-73.02%)
IMDB-Scraper: Scrapy project for scraping data from IMDb, with a movie dataset covering 58,623 movies.
Stars: ✭ 37 (-41.27%)
GPlayCrawler: No description or website provided.
Stars: ✭ 47 (-25.4%)
factory: Docker microservice and crawler built with Scrapy.
Stars: ✭ 56 (-11.11%)
ARGUS: ARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of different websites. On the websites, ARGUS is able to perform tasks like scraping texts or collecting hyperlinks between websites. See: https://link.springer.com/article/10.1007/s11192-020-03726-9
Stars: ✭ 68 (+7.94%)
elves: 🎊 Design and implementation of a lightweight crawler framework.
Stars: ✭ 322 (+411.11%)
torchestrator: Spin up Tor containers and then proxy HTTP requests via these Tor instances.
Stars: ✭ 32 (-49.21%)
InstaBot: Simple and friendly bot for Instagram, using Selenium and Scrapy with Python.
Stars: ✭ 32 (-49.21%)
BOC FER Spider: Uses Scrapy to crawl foreign exchange rates from BOC (Bank of China).
Stars: ✭ 18 (-71.43%)
allitebooks.com: Download all the ebooks from "allitebooks.com", along with an indexed CSV.
Stars: ✭ 24 (-61.9%)
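python-spider: Small Python crawler projects [continuously updated] [Biquge (笔趣阁) novel download, Tweet data scraping, weather lookup, NetEase Cloud Music reverse engineering, Tiantian Fund (天天基金网) lookup, Weibo data scraping (cookie generation), Youdao Translate reverse engineering, Qichacha login-free crawler, Dianping SVG encryption cracking, Bilibili user crawler, Lagou login-free crawler, Ziroom rental font encryption, Zhihu Q&A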
Stars: ✭ 45 (-28.57%)
policy-data-analyzer: Building a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US and India. Bringing NLP to the world of policy analysis through an extensible framework that includes scraping, preprocessing, active learning and text analysis pipelines.
Stars: ✭ 22 (-65.08%)
proxi: Proxy pool. Finds and checks proxies, with a REST API for querying results. Can find over 25k proxies in under 5 minutes.
Stars: ✭ 32 (-49.21%)
scrapy-zyte-smartproxy: Zyte Smart Proxy Manager (formerly Crawlera) middleware for Scrapy.
Stars: ✭ 317 (+403.17%)
logparser: A tool for parsing Scrapy log files periodically and incrementally, extending the HTTP JSON API of Scrapyd.
Stars: ✭ 70 (+11.11%)
PttImageSpider: PTT image downloader (grabs the images from an entire board and uses article titles as folder names) (built with Scrapy).
Stars: ✭ 16 (-74.6%)
scrapy-distributed: A series of distributed components for Scrapy, including RabbitMQ-based, Kafka-based, and RedisBloom-based components.
Stars: ✭ 38 (-39.68%)
OpenScraper: An open source webapp for scraping: towards a public service for web scraping.
Stars: ✭ 80 (+26.98%)
scrapy-admin: A Django admin site for Scrapy.
Stars: ✭ 44 (-30.16%)
scrapy.dart: Scrapy, a fast high-level web crawling and scraping framework for Dart and Flutter.
Stars: ✭ 50 (-20.63%)
ip proxy pool: Dynamically generates spiders with Scrapy to crawl and check free proxy IPs on the internet.
Stars: ✭ 39 (-38.1%)
hk0weather: Web scraper project to collect useful Hong Kong weather data from the HKO website.
Stars: ✭ 49 (-22.22%)
scrapy-cookies: A cookie persistence middleware for Scrapy.
Stars: ✭ 19 (-69.84%)
ImageGrabber: A Scrapy demo that downloads all images from a site.
Stars: ✭ 33 (-47.62%)
animecenter: The source code for animecenter.
Stars: ✭ 16 (-74.6%)
scrapyr: A simple and tiny Scrapy clustering solution, considered a drop-in replacement for scrapyd.
Stars: ✭ 50 (-20.63%)
invana-bot: A web crawler that scrapes using YAML and Python code.
Stars: ✭ 30 (-52.38%)
devsearch: A web search engine built with Python which uses TF-IDF and PageRank to sort search results.
Stars: ✭ 52 (-17.46%)
Douban Crawler: A crawler for https://douban.com
Stars: ✭ 13 (-79.37%)
toutiao: Crawler for the Toutiao (今日头条) tech news API.
Stars: ✭ 17 (-73.02%)
JustDownlink: A distributed movie search built on Scrapy + Elasticsearch + Django.
Stars: ✭ 28 (-55.56%)