All Projects → DannyBen → snapcrawl

DannyBen / snapcrawl

Licence: MIT License
Crawl a website and take screenshots

Programming Languages

ruby
36898 projects - #4 most used programming language

Projects that are alternatives of or similar to snapcrawl

Mac2imgur
⬆ A simple Mac app designed to make uploading images and screenshots to Imgur quick and effortless.
Stars: ✭ 914 (+2370.27%)
Mutual labels:  screenshot, capture
Cocrawler
CoCrawler is a versatile web crawler built using modern tools and concurrency.
Stars: ✭ 148 (+300%)
Mutual labels:  screenshot, crawler
Imgursniper
📷 A quick and easy Image, Screenshot and Screen recording sharing tool
Stars: ✭ 69 (+86.49%)
Mutual labels:  screenshot, capture
1click Webpage Screenshot
Entire page Screenshot extension for Google Chrome. I'm developing open source extension for Google Chrome. All extension are free for use. Let's make Chrome great again!
Stars: ✭ 406 (+997.3%)
Mutual labels:  screenshot, capture
VideoScreenRecorder
Record video of your screen and save the file locally 🎥
Stars: ✭ 36 (-2.7%)
Mutual labels:  screenshot, capture
Quickshot
Capture images of any View, SurfaceView or Bitmap from your Android app in: .jpg .png or .nomedia with simple oneliner codes.
Stars: ✭ 663 (+1691.89%)
Mutual labels:  screenshot, capture
Screenshots
A screenshot plugin for electron
Stars: ✭ 130 (+251.35%)
Mutual labels:  screenshot, capture
Capture Frame
Capture video screenshot from a `<video>` tag (at the current time)
Stars: ✭ 109 (+194.59%)
Mutual labels:  screenshot, capture
Sharex
ShareX is a free and open source program that lets you capture or record any area of your screen and share it with a single press of a key. It also allows uploading images, text or other types of files to many supported destinations you can choose from.
Stars: ✭ 18,143 (+48935.14%)
Mutual labels:  screenshot, capture
Screenshot Stream
Capture screenshot of a website and return it as a stream
Stars: ✭ 228 (+516.22%)
Mutual labels:  screenshot, capture
Menyoki
Screen{shot,cast} and perform ImageOps on the command line 🌱 🏞️
Stars: ✭ 255 (+589.19%)
Mutual labels:  screenshot, capture
video-snapshot
Get snapshots from a video file in the browser 🎥 🌅
Stars: ✭ 63 (+70.27%)
Mutual labels:  screenshot, capture
KeyPlexer
Capstone: Keylogger Trojan
Stars: ✭ 32 (-13.51%)
Mutual labels:  screenshot, capture
X11 Recorder
xrec helps you capture any area of your screen either as a screenshot or record a gif file.
Stars: ✭ 17 (-54.05%)
Mutual labels:  screenshot, capture
Instagram Crawler
Crawl instagram photos, posts and videos for download.
Stars: ✭ 178 (+381.08%)
Mutual labels:  crawler, gem
Flameshot
Powerful yet simple to use screenshot software 🖥️ 📸
Stars: ✭ 15,429 (+41600%)
Mutual labels:  screenshot, capture
HighlightTranslator
Highlight Translator can help you to translate the words quickly and accurately. By only highlighting, copying, or screenshoting the content you want to translate anywhere on your computer (ex. PDF, PPT, WORD etc.), the translated results will then be automatically displayed before you.
Stars: ✭ 54 (+45.95%)
Mutual labels:  screenshot, capture
1click-webpage-screenshot
Entire page Screenshot extension for Google Chrome. I'm developing open source extension for Google Chrome. All extension are free for use. Let's make Chrome great again!
Stars: ✭ 432 (+1067.57%)
Mutual labels:  screenshot, capture
log-symbols
A ruby 💎gem💎 for generating log symbols
Stars: ✭ 14 (-62.16%)
Mutual labels:  gem
slime
🍰 一个可视化的爬虫平台
Stars: ✭ 27 (-27.03%)
Mutual labels:  crawler

Snapcrawl - crawl a website and take screenshots

Gem Version Build Status Code Climate


Snapcrawl is a command line utility for crawling a website and saving screenshots.

Features

  • Crawls a website to any given depth and saves screenshots
  • Can capture the full length of the page
  • Can use a specific resolution for screenshots
  • Skips capturing if the screenshot was already saved recently
  • Uses local caching to avoid expensive crawl operations if not needed
  • Reports broken links

Install

Using Docker

You can run Snapcrawl by using this docker image (which contains all the necessary prerequisites):

$ alias snapcrawl='docker run --rm -it --network host --volume "$PWD:/app" dannyben/snapcrawl'

For more information on the Docker image, refer to the docker-snapcrawl repository.

Using Ruby

$ gem install snapcrawl

Note that Snapcrawl requires PhantomJS and ImageMagick.

Usage

Snapcrawl can be configured either through a configuration file (YAML), or by specifying options in the command line.

$ snapcrawl
Usage:
  snapcrawl URL [--config FILE] [SETTINGS...]
  snapcrawl -h | --help
  snapcrawl -v | --version

The default configuration filename is snapcrawl.yml.

Using the --config flag will create a template configuration file if it is not present:

$ snapcrawl example.com --config snapcrawl

Specifying options in the command line

All configuration options can be specified in the command line as key=value pairs:

$ snapcrawl example.com log_level=0 depth=2 width=1024

Sample configuration file

# All values below are the default values

# log level (0-4) 0=DEBUG 1=INFO 2=WARN 3=ERROR 4=FATAL
log_level: 1

# log_color (yes, no, auto)
# yes  = always show log color
# no   = never use colors
# auto = only use colors when running in an interactive terminal
log_color: auto

# number of levels to crawl, 0 means capture only the root URL
depth: 1

# screenshot width in pixels
width: 1280

# screenshot height in pixels, 0 means the entire height
height: 0

# number of seconds to consider the page cache and its screenshot fresh
cache_life: 86400

# where to store the HTML page cache
cache_dir: cache

# where to store screenshots
snaps_dir: snaps

# screenshot filename template, where '%{url}' will be replaced with a 
# slug version of the URL (no need to include the .png extension)
name_template: '%{url}'

# urls not matching this regular expression will be ignored
url_whitelist: 

# urls matching this regular expression will be ignored
url_blacklist: 

# take a screenshot of this CSS selector only
css_selector: 

# when true, ignore SSL related errors
skip_ssl_verification: false

# set to any number of seconds to wait for the page to load before taking
# a screenshot, leave empty to not wait at all (only needed for pages with
# animations or other post-load events).
screenshot_delay: 

Contributing / Support

If you experience any issue, have a question or a suggestion, or if you wish to contribute, feel free to open an issue.


Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].