testdrivenio / selenium-grid-docker-swarm

License: MIT
Web scraping in parallel with Selenium Grid and Docker

Languages: Python, shell

Concurrent Web Scraping with Selenium Grid and Docker Swarm

Want to learn how to build this project?

Check out the blog post.

Want to use this project?

  1. Fork/Clone

  2. Create and activate a virtual environment

  3. Install the requirements

  4. Sign up for DigitalOcean and generate an access token

  5. Add the token to your environment:

    (env)$ export DIGITAL_OCEAN_ACCESS_TOKEN=[your_token]

  6. Spin up four droplets and deploy Docker Swarm:

    (env)$ sh project/create.sh

  7. Run the scraper:

    (env)$ docker-machine env node-1
    (env)$ eval $(docker-machine env node-1)
    (env)$ NODE=$(docker service ps --format "{{.Node}}" selenium_hub)
    (env)$ for i in {1..8}; do {
             python project/script.py ${i} $(docker-machine ip $NODE) &
           };
           done

  8. Bring down the resources:

    (env)$ sh project/destroy.sh
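
The shell loop in step 7 starts eight copies of `project/script.py` in the background, passing each one a number from 1 to 8 and the IP address of the Selenium hub. A rough Python equivalent is sketched below for readers who prefer to launch and wait on the workers from one script. The helper names `build_commands` and `run_all` are hypothetical and not part of this project; the sketch only assumes the argument order shown in step 7 (number first, hub IP second).

```python
import subprocess
import sys


def build_commands(hub_ip, first=1, last=8, script="project/script.py"):
    """Return one argv list per scraper process, mirroring the shell loop.

    Assumption: the script takes a number and the hub IP, in that order,
    as in `python project/script.py ${i} $(docker-machine ip $NODE)`.
    """
    return [
        [sys.executable, script, str(i), hub_ip]
        for i in range(first, last + 1)
    ]


def run_all(commands):
    """Start every command without blocking, then wait for all of them.

    Returns the exit codes in the same order as the commands.
    """
    procs = [subprocess.Popen(cmd) for cmd in commands]
    return [proc.wait() for proc in procs]


# Example (hypothetical hub address; use the output of
# `docker-machine ip $NODE` from step 7 instead):
#   exit_codes = run_all(build_commands("203.0.113.10"))
```

Unlike the shell loop, `run_all` blocks until every worker has exited, so the exit codes can be checked before the resources are brought down in step 8.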