pysoundcloudScraping the Un–scrapable™
Stars: ✭ 63 (-81.69%)
bing-ip2hostsbingip2hosts is a Bing.com web scraper that discovers websites by IP address
Stars: ✭ 99 (-71.22%)
SearchScraperAPIAiohttp web server API, which scrapes Google and returns scrape results as response. Supports proxies, multiple geos and number of results.
Stars: ✭ 31 (-90.99%)
XidelCommand line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
Stars: ✭ 335 (-2.62%)
Federal-Parliament-ScraperA scraper for obtaining information on the workings of the Belgian Federal Parliament.
Stars: ✭ 18 (-94.77%)
feaplat爬虫管理系统,支持集群,弹性伸缩。支持运行feapder、scrapy、selenium、playwright等各种框架及脚本
Stars: ✭ 42 (-87.79%)
fb-page-chat-downloadPython script to download messages from a Facebook page to a CSV file
Stars: ✭ 51 (-85.17%)
facebook-discussion-tkA collection of tools to (semi-)automatically collect and analyze data from online discussions on Facebook groups and pages.
Stars: ✭ 33 (-90.41%)
zhihu搜索你的知乎收藏:可以直观地浏览你的所有收藏夹的内容,并进行全文搜索
Stars: ✭ 39 (-88.66%)
DeadPool该项目是一个使用celery作为主体框架的爬虫应用,能够灵活的添加爬虫任务,并且同时运行多站点的爬虫工作,所有组件都能够原生支持规模并发和分布式,加上celery原生的分布式调用,实现大规模并发。
Stars: ✭ 38 (-88.95%)
dorkscoutDorkScout - Golang tool to automate google dork scan against the entiere internet or specific targets
Stars: ✭ 189 (-45.06%)
scrapy-adminA django admin site for scrapy
Stars: ✭ 44 (-87.21%)
LinkedIn-ScraperA LinkedIn Scraper to scrape up to 10k LinkedIn profiles from company profile links and save their e-mail addresses if available!
Stars: ✭ 62 (-81.98%)
DotnetspiderDotnetSpider, a .NET standard web crawling library. It is lightweight, efficient and fast high-level web crawling & scraping framework
Stars: ✭ 3,233 (+839.83%)
4scannerContinuously search imageboards threads for images/webms and download them
Stars: ✭ 103 (-70.06%)
crawlerdetectGolang module to detect bots and crawlers via the user agent
Stars: ✭ 22 (-93.6%)
trainline-pythonNon-official Python wrapper and CLI tool for Trainline
Stars: ✭ 41 (-88.08%)
evineInteractive CLI Web Crawler
Stars: ✭ 140 (-59.3%)
sedeText-to-SQL in the Wild: A Naturally-Occurring Dataset Based on Stack Exchange Data
Stars: ✭ 83 (-75.87%)
SupercrawlerA web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and concurrency limits.
Stars: ✭ 306 (-11.05%)
TwEaterA Python Bot for Scraping Conversations from Twitter
Stars: ✭ 16 (-95.35%)
covid-19Current and historical coronavirus covid-19 confirmed, recovered, deaths and active case counts segmented by country and region. Includes csv, json and sqlite data along with an interactive website explorer.
Stars: ✭ 15 (-95.64%)
spiderpython 爬虫(amazon, confluence ...)
Stars: ✭ 21 (-93.9%)
PttImageSpiderPTT 圖片下載器 (抓取整個看板的圖片,並用文章標題作為資料夾的名稱 ) (使用Scrapy)
Stars: ✭ 16 (-95.35%)
AlltheplacesA set of spiders and scrapers to extract location information from places that post their location on the internet.
Stars: ✭ 277 (-19.48%)
html-queryA fluent and functional approach to querying HTML
Stars: ✭ 48 (-86.05%)
pyitauUnofficial client to access your Itaú bank data
Stars: ✭ 28 (-91.86%)
Scraper-Projects🕸 List of mini projects that involve web scraping 🕸
Stars: ✭ 25 (-92.73%)
wishlistRead an Amazon wishlist programmatically with Python
Stars: ✭ 44 (-87.21%)
metacritic apiPHP Metacritic API - Mirrored by my GitLab
Stars: ✭ 31 (-90.99%)
snapcrawlCrawl a website and take screenshots
Stars: ✭ 37 (-89.24%)
glyphhangerYour web font utility belt. It can subset web fonts. It can find unicode-ranges for you automatically. It makes julienne fries.
Stars: ✭ 422 (+22.67%)
araneid一个基于Glang语言开发的站群系统(蜘蛛池系统)
Stars: ✭ 25 (-92.73%)
trawlerscraper for facebook, gab, google and tiktok
Stars: ✭ 20 (-94.19%)
CoinstaA Python package for acquiring both historical and current data of cryptocurrencies
Stars: ✭ 47 (-86.34%)
CryptocmdCryptocurrency historical price data library in Python. Data from https://coinmarketcap.com.
Stars: ✭ 299 (-13.08%)
TumblTwoTumblTwo, an Improved Fork of TumblOne, a Tumblr Downloader.
Stars: ✭ 57 (-83.43%)
imdb-scraper🎬 An attempt at the most complete IMDb API
Stars: ✭ 24 (-93.02%)
Z-Spider一些爬虫开发的技巧和案例
Stars: ✭ 33 (-90.41%)
douyin-api抖音接口、抖音API、抖音数据爬虫、抖音直播数据、抖音直播Api、抖音视频Api、抖音爬虫、抖音去水印、抖音视频下载、抖音视频解析、抖音直播监控、抖音数据采集
Stars: ✭ 41 (-88.08%)
AzurLaneWikiScrapersA console application that can scrape the Azur Lane wiki and export the data to Json files
Stars: ✭ 12 (-96.51%)