OLX Scraper📻 An OLX Scraper using Scrapy + MongoDB. It Scrapes recent ads posted regarding requested product and dumps to NOSQL MONGODB.
Stars: ✭ 15 (-82.14%)
itemadapterCommon interface for data container classes
Stars: ✭ 47 (-44.05%)
lgcrawlpython+scrapy+splash 爬取拉勾全站职位信息
Stars: ✭ 22 (-73.81%)
InventusInventus is a spider designed to find subdomains of a specific domain by crawling it and any subdomains it discovers.
Stars: ✭ 80 (-4.76%)
nginx-moreDevelopment repository for nginx-more package
Stars: ✭ 96 (+14.29%)
ddnsNo description or website provided.
Stars: ✭ 26 (-69.05%)
estate-crawlerScraping the real estate agencies for up-to-date house listings as soon as they arrive!
Stars: ✭ 20 (-76.19%)
scrapy-LBCAraignée LeBonCoin avec Scrapy et ElasticSearch
Stars: ✭ 14 (-83.33%)
asyncpy使用asyncio和aiohttp开发的轻量级异步协程web爬虫框架
Stars: ✭ 86 (+2.38%)
mit-ocw-dlDownload all video lectures from a MIT-OCW course with a single command.
Stars: ✭ 91 (+8.33%)
DNS-over-DiscordA 1.1.1.1 DNS resolver built for Discord
Stars: ✭ 228 (+171.43%)
scrapy-html-storageScrapy downloader middleware that stores response HTMLs to disk.
Stars: ✭ 17 (-79.76%)
archeAnalyze scraped data
Stars: ✭ 49 (-41.67%)
pupflareA webpage proxy that request through Chromium (puppeteer) - can be used to bypass Cloudflare anti bot / anti ddos on any application (like curl)
Stars: ✭ 183 (+117.86%)
ArticleSpiderCrawling zhihu, jobbole, lagou by Scrapy, and using Elasticsearch+Django to build a Search Engine website --- README_zh.md (including: implementation roadmap, distributed-crawler and coping with anti-crawling strategies).
Stars: ✭ 34 (-59.52%)
worker-template-postgresReference demo and modified PostgreSQL driver to connect Cloudflare Workers to a relational database.
Stars: ✭ 75 (-10.71%)
systemd-cfddnsBash-powered DDNS client for Cloudflare-managed domain
Stars: ✭ 29 (-65.48%)
Actual Domain PricesThe real cost of each TLD (top-level-domain). Find out how much your registrar marks up your domain prices.
Stars: ✭ 247 (+194.05%)
Bugs-feedBug's feed is a local hosted portal where you can search for the latest news, videos, CVEs, vulnerabilities...
Stars: ✭ 90 (+7.14%)
RARBG-scraperWith Selenium headless browsing and CAPTCHA solving
Stars: ✭ 38 (-54.76%)
vietnam-ecommerce-crawlerCrawling the data from lazada, websosanh, compare.vn, cdiscount and cungmua with flexible configs
Stars: ✭ 28 (-66.67%)
crawlerpython爬虫项目集合
Stars: ✭ 29 (-65.48%)
akamai-toolkitA set of tools to work on Akamai v1 anti-bot solution. Current supported version: 1.70
Stars: ✭ 215 (+155.95%)
Web-IotaIota is a web scraper which can find all of the images and links/suburls on a webpage
Stars: ✭ 60 (-28.57%)
IPFS PHOTO SHARE💰用甚嚒服务器,ServerLess搭建一个图片分享站点!| 基于CloudFlareWorker无服务器函数和IPFS去中心化存储的图片分享网站
Stars: ✭ 76 (-9.52%)
scrapy helperDynamic configurable crawl (动态可配置化爬虫)
Stars: ✭ 84 (+0%)
cloudflare-ddnsA script to update your Cloudflare DNS records at a glance.
Stars: ✭ 152 (+80.95%)
dominDomain Name Search untuk mencari ketersedian nama domain.
Stars: ✭ 17 (-79.76%)
fernando-pessoaClassificador de poemas do Fernando Pessoa de acordo com os seus heterônimos
Stars: ✭ 31 (-63.1%)
pagserPagser is a simple, extensible, configurable parse and deserialize html page to struct based on goquery and struct tags for golang crawler
Stars: ✭ 82 (-2.38%)
scrapy-wayback-machineA Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.
Stars: ✭ 92 (+9.52%)
domainsWorld’s single largest Internet domains dataset
Stars: ✭ 461 (+448.81%)
workers-unsplash-apiServerless API for requesting images from Unsplash's API, designed for use with a React frontend
Stars: ✭ 20 (-76.19%)
cloudflare-worker-routerA super lightweight router (1.3K) with middleware support and ZERO dependencies for CloudFlare Workers.
Stars: ✭ 144 (+71.43%)
ceilHelmut Hoffer von Ankershoffen experimenting with auto-provisioned RPi cluster running K8S on bare-metal
Stars: ✭ 42 (-50%)
scrapy-fieldstatsA Scrapy extension to log items coverage when the spider shuts down
Stars: ✭ 17 (-79.76%)
warp-upAutomatically generated referrer bonuses for Cloudflare WARP (https://1.1.1.1)
Stars: ✭ 24 (-71.43%)
html2dataLibrary and cli for extracting data from HTML via CSS selectors
Stars: ✭ 62 (-26.19%)
FlaresolverrProxy server to bypass Cloudflare protection
Stars: ✭ 241 (+186.9%)
double-agentA test suite of common scraper detection techniques. See how detectable your scraper stack is.
Stars: ✭ 123 (+46.43%)
cloudflaredCloudflare Tunnel Instructions and Template for Unraid
Stars: ✭ 129 (+53.57%)
easypoi简单、免费、高效的百度地图poi采集和分析工具。
Stars: ✭ 87 (+3.57%)
scrapy-kafka-redisDistributed crawling/scraping, Kafka And Redis based components for Scrapy
Stars: ✭ 45 (-46.43%)