Python3 SpiderPython爬虫实战 - 模拟登陆各大网站 包含但不限于:滑块验证、拼多多、美团、百度、bilibili、大众点评、淘宝,如果喜欢请start ❤️
Stars: ✭ 2,129 (+1800.89%)
Instagram-Scraper-2021Scrape Instagram content and stories anonymously, using a new technique based on the har file (No Token + No public API).
Stars: ✭ 57 (-49.11%)
allitebooks.comDownload all the ebooks with indexed csv of "allitebooks.com"
Stars: ✭ 24 (-78.57%)
chesfCHeSF is the Chrome Headless Scraping Framework, a very very alpha code to scrape javascript intensive web pages
Stars: ✭ 18 (-83.93%)
Sneakers ProjectUsing Selenium, Neha scraped data about 35 top selling sneakers of Nike and Adidas from stockx.com. She used this data to draw insights about sneaker resales.
Stars: ✭ 32 (-71.43%)
InstaBotSimple and friendly Bot for Instagram, using Selenium and Scrapy with Python.
Stars: ✭ 32 (-71.43%)
non-api-fb-scraperScrape public FaceBook posts from any group or user into a .csv file without needing to register for any API access
Stars: ✭ 40 (-64.29%)
hk0weatherWeb scraper project to collect the useful Hong Kong weather data from HKO website
Stars: ✭ 49 (-56.25%)
Alipayspider ScrapyAlipaySpider on Scrapy(use chrome driver); 支付宝爬虫(基于Scrapy)
Stars: ✭ 70 (-37.5%)
DotnetcrawlerDotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c
Stars: ✭ 100 (-10.71%)
Scrapy SeleniumScrapy middleware to handle javascript pages using selenium
Stars: ✭ 550 (+391.07%)
SeleniumcrawlerAn example using Selenium webdrivers for python and Scrapy framework to create a web scraper to crawl an ASP site
Stars: ✭ 117 (+4.46%)
fBrowserHelpful Selenium functions to make web-scraping easier and faster
Stars: ✭ 16 (-85.71%)
XMQ-BackUp小密圈备份,圈子/话题/图片/文件。
Stars: ✭ 22 (-80.36%)
RARBG-scraperWith Selenium headless browsing and CAPTCHA solving
Stars: ✭ 38 (-66.07%)
image-crawlerAn image scraper that scraps images from unsplash.com
Stars: ✭ 12 (-89.29%)
ARGUSARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of different websites. On the websites, ARGUS is able to perform tasks like scraping texts or collecting hyperlinks between websites. See: https://link.springer.com/article/10.1007/s11192-020-03726-9
Stars: ✭ 68 (-39.29%)
Python Spider豆瓣电影top250、斗鱼爬取json数据以及爬取美女图片、淘宝、有缘、CrawlSpider爬取红娘网相亲人的部分基本信息以及红娘网分布式爬取和存储redis、爬虫小demo、Selenium、爬取多点、django开发接口、爬取有缘网信息、模拟知乎登录、模拟github登录、模拟图虫网登录、爬取多点商城整站数据、爬取微信公众号历史文章、爬取微信群或者微信好友分享的文章、itchat监听指定微信公众号分享的文章
Stars: ✭ 615 (+449.11%)
Wdio Screenshot A WebdriverIO plugin. Additional commands for taking screenshots with WebdriverIO.
Stars: ✭ 101 (-9.82%)
Udemy botAn automation bot for free Udemy courses
Stars: ✭ 91 (-18.75%)
Adidas Multi Session(Python) Program to simulate multiple sessions on adidas queue pages.
Stars: ✭ 90 (-19.64%)
WebdriverextensionsMake your WebDriver based Selenium tests more readable, reusability and maintainable by using WebDriver Extensions!
Stars: ✭ 89 (-20.54%)
Scrapyd Cluster On HerokuSet up free and scalable Scrapyd cluster for distributed web-crawling with just a few clicks. DEMO 👉
Stars: ✭ 106 (-5.36%)
PulsarTurn large Web sites into tables and charts using simple SQLs.
Stars: ✭ 100 (-10.71%)
GlastoseleniumA bot for booking Glastonbury tickets using selenium
Stars: ✭ 89 (-20.54%)
Cli Boot.camp💻 command-line bootcamp adventure in your browser
Stars: ✭ 88 (-21.43%)
Clock可视化任务调度系统,精简到一个二进制文件 (Web visual task scheduler system , yes ! just one binary solve all the problems !)
Stars: ✭ 86 (-23.21%)
AetAET - a system that detects visual changes on web sites and performs basic page health checks
Stars: ✭ 100 (-10.71%)
InstaloctrackAn Instagram OSINT tool to collect all the geotagged locations available on an Instagram profile in order to plot them on a map, and dump them in a JSON.
Stars: ✭ 85 (-24.11%)
InstantwpInstantWP is a complete standalone, portable WordPress development environment.
Stars: ✭ 83 (-25.89%)
ProtractorE2E test framework for Angular apps
Stars: ✭ 8,792 (+7750%)
TaurusAutomation-friendly framework for Continuous Testing by
Stars: ✭ 1,566 (+1298.21%)
SillyniumAutomate the creation of Python Selenium Scripts by drawing coloured boxes on webpage elements
Stars: ✭ 100 (-10.71%)
Echo360Commandline tool for automated downloads of echo360 videos hosted by university
Stars: ✭ 81 (-27.68%)
Hsac Fitnesse FixturesAn environment to define and run integration tests. It contains Fitnesse fixture (base) classes and a baseline FitNesse installation.
Stars: ✭ 99 (-11.61%)
Email ExtractorThe main functionality is to extract all the emails from one or several URLs - La funcionalidad principal es extraer todos los correos electrónicos de una o varias Url
Stars: ✭ 81 (-27.68%)
Spam Bot 3000Social media research and promotion, semi-autonomous CLI bot
Stars: ✭ 79 (-29.46%)
Hivelots of spider (很多爬虫)
Stars: ✭ 110 (-1.79%)
Frameworkium CoreFramework for writing maintainable Selenium and REST API tests in Java.
Stars: ✭ 107 (-4.46%)
ImghashPerceptual image hashing for Node.js
Stars: ✭ 98 (-12.5%)
Covid 19 jhu data web scrap and cleaningThis repository contains data and code used to get and clean data from https://github.com/CSSEGISandData/COVID-19 and https://www.worldometers.info/coronavirus/
Stars: ✭ 80 (-28.57%)
CfseleniumA native Selenium WebDriver binding for ColdFusion
Stars: ✭ 77 (-31.25%)
OlxscraperOLX Scraper in Python Scrapy
Stars: ✭ 76 (-32.14%)
Capturercapture pictures from website like sina, lofter, huaban and so on
Stars: ✭ 76 (-32.14%)