
bharatsingh430 / web-image-crawler

Licence: other
Code to download web-images

Programming Languages

python

Projects that are alternatives of or similar to web-image-crawler

Mastering Junit5
A comprehensive collection of test examples created with JUnit 5
Stars: ✭ 223 (+1386.67%)
Mutual labels:  selenium
Alsatian
TypeScript testing framework with test cases
Stars: ✭ 244 (+1526.67%)
Mutual labels:  selenium
SmartImage
Reverse image search tool (SauceNao, ImgOps, trace.moe, and more)
Stars: ✭ 346 (+2206.67%)
Mutual labels:  image-search
Cdp4j
cdp4j - Chrome DevTools Protocol for Java
Stars: ✭ 232 (+1446.67%)
Mutual labels:  selenium
Nightwatch Cucumber
[DEPRECATED] Cucumber.js plugin for Nightwatch.js.
Stars: ✭ 243 (+1520%)
Mutual labels:  selenium
Pytest Selenium
Plugin for running Selenium with pytest
Stars: ✭ 246 (+1540%)
Mutual labels:  selenium
Pychromeless
Python Lambda Chrome Automation (naming pending)
Stars: ✭ 219 (+1360%)
Mutual labels:  selenium
google-meet-bot
Bot for scheduling and entering google meet sessions automatically
Stars: ✭ 33 (+120%)
Mutual labels:  selenium
Example Airflow Dags
Example DAGs using hooks and operators from Airflow Plugins
Stars: ✭ 243 (+1520%)
Mutual labels:  selenium
pqlite
⚡ A fast embedded library for approximate nearest neighbor search
Stars: ✭ 141 (+840%)
Mutual labels:  image-search
Dockselpy
Dockerized Selenium and Python with support for Chrome, Firefox and PhantomJS
Stars: ✭ 237 (+1480%)
Mutual labels:  selenium
Scrape Linkedin Selenium
`scrape_linkedin` is a python package that allows you to scrape personal LinkedIn profiles & company pages - turning the data into structured json.
Stars: ✭ 239 (+1493.33%)
Mutual labels:  selenium
Selion
Enabling Test Automation in Java
Stars: ✭ 252 (+1580%)
Mutual labels:  selenium
Awesome Java Crawler
This repository collects and organizes crawler-related resources, with Java as the main development language
Stars: ✭ 228 (+1420%)
Mutual labels:  selenium
natural-language-joint-query-search
Search photos on Unsplash based on OpenAI's CLIP model, support search with joint image+text queries and attention visualization.
Stars: ✭ 143 (+853.33%)
Mutual labels:  image-search
Recheck Web
recheck for web apps – change comparison tool with local Golden Masters, Git-like ignore syntax and "Unbreakable Selenium" tests.
Stars: ✭ 224 (+1393.33%)
Mutual labels:  selenium
Lambdium
headless chrome + selenium webdriver in AWS Lambda using the serverless application model
Stars: ✭ 246 (+1540%)
Mutual labels:  selenium
selenium-cheatsheet-java
A comprehensive list of selenium commands in Java
Stars: ✭ 20 (+33.33%)
Mutual labels:  selenium
jest-selenium
This project shows how to drive your selenium tests with Jest.
Stars: ✭ 22 (+46.67%)
Mutual labels:  selenium
Fergun
A utility Discord bot written in C# using Discord.Net
Stars: ✭ 26 (+73.33%)
Mutual labels:  image-search

web-image-crawler

This repo includes code to download images from Google web image search results. You can modify the JavaScript embedded in the Python code so that it also works for Bing, Yandex, Flickr, etc. Their result pages follow more or less the same DOM structure.
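
As a rough illustration of the "JavaScript embedded in Python" idea, the sketch below runs a small script in the page through Selenium's `execute_script` and returns the image URLs it finds. It is not the code from this repo: the `SELECTORS` table, its CSS selectors, and the `collect_image_urls` helper are all hypothetical, and real result pages need selectors found by inspecting the live DOM.

```python
# JavaScript executed inside the page. Selenium passes extra arguments to
# execute_script as `arguments[0]`, `arguments[1]`, ...
COLLECT_IMAGE_URLS_JS = """
const nodes = document.querySelectorAll(arguments[0]);
return Array.from(nodes).map(n => n.src).filter(Boolean);
"""

# Hypothetical per-engine CSS selectors (placeholders, not taken from the repo).
# Adapting the crawler to another engine would mostly mean changing these.
SELECTORS = {
    "google": "img",
    "bing": "img",
    "yandex": "img",
}


def collect_image_urls(driver, engine="google"):
    """Return the unique http(s) image URLs currently present in the page.

    `driver` is any object with a Selenium-style `execute_script` method.
    """
    selector = SELECTORS[engine]
    raw_urls = driver.execute_script(COLLECT_IMAGE_URLS_JS, selector) or []

    seen = set()
    urls = []
    for url in raw_urls:
        # Skip inline thumbnails (data: URIs) and anything already collected.
        if url.startswith(("http://", "https://")) and url not in seen:
            seen.add(url)
            urls.append(url)
    return urls
```

With a real Selenium driver, one would first load the search results page with `driver.get(...)` and then call `collect_image_urls(driver, "google")`. Only the selector is engine-specific here, which is one way to read the remark that the engines share a similar DOM.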

interactive.py lets you download images interactively. api.py takes a text file (such as sample.txt) that lists the queries to download.
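
The batch workflow can be pictured with a minimal sketch of reading a query file. The format of sample.txt is not documented here, so this assumes one query per line with blank lines ignored; `load_queries` is a hypothetical helper, not a function from api.py.

```python
from pathlib import Path


def load_queries(path):
    """Read search queries from a text file.

    Assumption: one query per line; surrounding whitespace and blank
    lines are ignored. The real sample.txt format may differ.
    """
    queries = []
    for line in Path(path).read_text(encoding="utf-8").splitlines():
        line = line.strip()
        if line:
            queries.append(line)
    return queries
```

Each returned query could then be passed to whatever routine performs the search and download for a single term.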

Since everything is written in Python, the dependencies are easy to install. You can change the driver from Firefox to Chrome, but note that the driver preferences must be changed to match if you switch browsers.
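
To show why the preferences have to change with the browser, here is a hedged sketch of a driver factory using Selenium 4. The `make_driver` function and the choice of a download-directory preference are assumptions for illustration; the preferences this repo actually sets may be different.

```python
def make_driver(browser="firefox", download_dir="/tmp/images", headless=True):
    """Create a Selenium driver for Firefox or Chrome (illustrative sketch)."""
    browser = browser.lower()
    if browser not in ("firefox", "chrome"):
        raise ValueError(f"unsupported browser: {browser!r}")

    # Imported lazily so the argument check above works without Selenium.
    from selenium import webdriver

    if browser == "firefox":
        options = webdriver.FirefoxOptions()
        if headless:
            options.add_argument("-headless")
        # Firefox preferences are set one key at a time.
        options.set_preference("browser.download.folderList", 2)  # 2 = custom dir
        options.set_preference("browser.download.dir", download_dir)
        return webdriver.Firefox(options=options)

    options = webdriver.ChromeOptions()
    if headless:
        options.add_argument("--headless")
    # Chrome takes its preferences as a single "prefs" dictionary,
    # and the key names differ from Firefox's.
    options.add_experimental_option(
        "prefs", {"download.default_directory": download_dir}
    )
    return webdriver.Chrome(options=options)
```

The two branches set the same behaviour through different option classes and preference names, which is the part that needs rewriting when switching browsers.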

Since I got a few e-mails asking how I downloaded images from the internet for a couple of my papers (mentioned below), I decided to make the code public. However, I would recommend that you do not use it for commercial purposes and instead use the APIs which the search engines provide. This code is made available so that correct web data is collected for research purposes. I observed that the results from the Google/Bing/Flickr APIs are different from what is shown in the browser. This code gets you the data that a user will actually see in a browser.

@inproceedings{singh2015selecting,
  title={Selecting relevant web trained concepts for automated event retrieval},
  author={Singh, Bharat and Han, Xintong and Wu, Zhe and Morariu, Vlad I and Davis, Larry S},
  booktitle={Proceedings of the IEEE International Conference on Computer Vision},
  pages={4561--4569},
  year={2015}
}

@article{han2017vrfp,
  title={VRFP: On-the-fly video retrieval using web images and fast fisher vector products},
  author={Han, Xintong and Singh, Bharat and Morariu, Vlad and Davis, Larry S},
  journal={IEEE Transactions on Multimedia},
  year={2017},
  publisher={IEEE}
}