All Projects → egoist → recrawler

egoist / recrawler

Licence: MIT License
Remote web content crawler done right.

Programming Languages

javascript
184084 projects - #8 most used programming language

recrawler NPM version NPM downloads Circle CI

Remote web content crawler done right.

Motivation

Sometimes I want to grab some nice images from a url like http://bbs.005.tv/thread-492392-1-1.html, so I made this little program to combine node-fetch and cheerio to make my attempt fulfilled.

Install

$ npm install --save recrawler

For Single Page Apps please head to recrawler-spa

Usage

const recrawler = require('recrawler')

recrawler('http://some-url.com/a/b/c')
	.then($ => {
		$('img.nice-images').each(function () {
			const url = $(this).attr('src')
			console.log(url)
		})
	})

API

recrawler(url, opts)

opts

cheerio

cheerio options. Except decodeEntities is false by default here.

License

MIT © EGOIST

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].