mowlc / Real Estate Scraper

Licence: MIT
Web scraper that makes it easier to find real estate in Slovenia.

Programming Languages

javascript
184084 projects - #8 most used programming language

Projects that are alternatives of or similar to Real Estate Scraper

Find Cheapest Flights
Use Google Flights API and scrape Expedia to find the cheapest/shortest flights!
Stars: ✭ 18 (-41.94%)
Mutual labels:  scraper
Twitter Get Old Tweets Scraper
A data scraper for retrieving old tweets in Twitter using Python3.
Stars: ✭ 27 (-12.9%)
Mutual labels:  scraper
Node Website Scraper
Download website to local directory (including all css, images, js, etc.)
Stars: ✭ 912 (+2841.94%)
Mutual labels:  scraper
Mailinglistscraper
A python web scraper for public email lists.
Stars: ✭ 19 (-38.71%)
Mutual labels:  scraper
Gisaid Scrapper
Scrapping tool for GISAID data regarding SARS-CoV-2
Stars: ✭ 25 (-19.35%)
Mutual labels:  scraper
Emby.plugins.javscraper
A Japanese movie scraper plugin for Emby/Jellyfin that can fetch film information from certain websites.
Stars: ✭ 864 (+2687.1%)
Mutual labels:  scraper
Flight Prices Scraper
Automated Script to scrape flight prices from any website into a csv format
Stars: ✭ 17 (-45.16%)
Mutual labels:  scraper
Anitop
Anitop is an unofficial simple API from https://anitrendz.net/ site
Stars: ✭ 30 (-3.23%)
Mutual labels:  scraper
Scrapit
Scraping scripts for various websites.
Stars: ✭ 25 (-19.35%)
Mutual labels:  scraper
Pypergrabber
Fetches PubMed article IDs (PMIDs) from email inbox, then crawls PubMed, Google Scholar and Sci-Hub for respective PDF files.
Stars: ✭ 14 (-54.84%)
Mutual labels:  scraper
Brainly Scraper
package to scrape www.brainly.co.id
Stars: ✭ 22 (-29.03%)
Mutual labels:  scraper
Imagenetscraper
👁 Bulk-download all thumbnails from an ImageNet synset, with optional rescaling
Stars: ✭ 24 (-22.58%)
Mutual labels:  scraper
Scanless
online port scan scraper
Stars: ✭ 875 (+2722.58%)
Mutual labels:  scraper
Yts Scraper
Download .torrent files from YTS YIFY
Stars: ✭ 19 (-38.71%)
Mutual labels:  scraper
Huginn
Create agents that monitor and act on your behalf. Your agents are standing by!
Stars: ✭ 33,694 (+108590.32%)
Mutual labels:  scraper
Indonesia News Scraper
A news scraper for nodejs that help to scrap news from Indonesian news portal.
Stars: ✭ 18 (-41.94%)
Mutual labels:  scraper
Voyages Sncf Api
A scrapy spider that scraps times and prices from Voyages Sncf. It uses scrapyrt to provide an API interface.
Stars: ✭ 7 (-77.42%)
Mutual labels:  scraper
Pypatent
Search for and retrieve US Patent and Trademark Office Patent Data
Stars: ✭ 31 (+0%)
Mutual labels:  scraper
Ratatouille
A Node.js wrapper for scraping allrecipes.com
Stars: ✭ 26 (-16.13%)
Mutual labels:  scraper
Pycyanide
Stars: ✭ 13 (-58.06%)
Mutual labels:  scraper

real-estate-scraper

Web scraper that makes it easier to find real estate in Slovenia. After the scraper finishes, you get an email with updates, so you don't have to check the web pages all the time. The email contains all the information you need: title with location, short description, price, and a link to the original listing.
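As a rough illustration of the email contents described above, the sketch below turns a list of listings into a plain-text email body. The listing shape (`title`, `location`, `description`, `price`, `url`) and the function names are assumptions made for this example; they are not taken from scraper.js, and the real email layout may differ.

```javascript
// Hypothetical listing shape: { title, location, description, price, url }.
// These field names are assumptions for illustration only.
function formatListing(listing) {
  return [
    `${listing.title} (${listing.location})`, // title with location
    listing.description,                      // short description
    `Price: ${listing.price}`,
    `Link: ${listing.url}`,                   // link to the original listing
  ].join('\n');
}

// Join all listings into one plain-text body, separated by blank lines.
function formatEmailBody(listings) {
  return listings.map(formatListing).join('\n\n');
}

// Example with made-up data.
const exampleBody = formatEmailBody([
  {
    title: 'Two-bedroom apartment',
    location: 'Ljubljana',
    description: '65 m2, second floor, renovated.',
    price: '180.000 EUR',
    url: 'https://example.com/listing/1',
  },
]);
console.log(exampleBody);
```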

Email example:

[Image: email example]

Currently supports two major real estate websites: Bolha.com and Nepremicnine.net.

Download

In order to run the script you have to install [Node.js](https://nodejs.org/en/) and do the following:

  • Clone repo
git clone https://github.com/mowlc/real-estate-scraper.git
  • Install additional libraries
npm install --save tinyreq
npm install --save cheerio 
npm install --save node-json-db 
npm install --save nodemailer


Run

First, you need to get OAuth credentials. A very good tutorial on how to get them can be found here: https://stackoverflow.com/questions/24098461/nodemailer-gmail-what-exactly-is-a-refresh-token-and-how-do-i-get-one

Once you have your clientID, clientSecret, refreshToken and accessToken, copy config.json.example and rename the copy to config.json. Then fill in the required data:

sender_email - Email address from which emails will be sent (must be Gmail)
clientID - Client ID for OAuth
clientSecret - Client secret for OAuth
refreshToken - Refresh token for OAuth
accessToken - Initial access token for OAuth; can also be empty because a new one is generated upon registration
interval - Interval, in minutes, at which the script executes (for example 15; between 15 and 30 minutes is optimal)
receiver_email - List of email addresses to send the email to
url_bolha - List of search URLs on bolha.com
url_nepremicnine - List of search URLs on nepremicnine.net
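The fields above can be checked before the script starts. The following is a minimal sketch, not the project's own code: `validateConfig` and `buildTransportOptions` are hypothetical helpers, and it assumes the three list fields are stored as JSON arrays. The option names inside `auth` (`type`, `user`, `clientId`, `clientSecret`, `refreshToken`, `accessToken`) are the ones nodemailer uses for OAuth2; note that nodemailer spells it `clientId`, while the config key is `clientID`.

```javascript
// Keys from the list above. accessToken is left out because it may be empty.
const REQUIRED_KEYS = [
  'sender_email', 'clientID', 'clientSecret', 'refreshToken',
  'interval', 'receiver_email', 'url_bolha', 'url_nepremicnine',
];
// Assumption: these fields are JSON arrays in config.json.
const LIST_KEYS = ['receiver_email', 'url_bolha', 'url_nepremicnine'];

// Returns a list of problems; an empty list means the config looks usable.
function validateConfig(config) {
  const problems = [];
  for (const key of REQUIRED_KEYS) {
    if (config[key] === undefined || config[key] === '') {
      problems.push(`missing "${key}"`);
    }
  }
  for (const key of LIST_KEYS) {
    if (config[key] !== undefined && config[key] !== '' && !Array.isArray(config[key])) {
      problems.push(`"${key}" should be a list`);
    }
  }
  const interval = config.interval;
  if (interval !== undefined && interval !== '' && !(Number(interval) > 0)) {
    problems.push('"interval" should be a positive number of minutes');
  }
  return problems;
}

// Maps the config keys onto nodemailer's Gmail OAuth2 transport options.
// The result would be passed to nodemailer.createTransport(...).
function buildTransportOptions(config) {
  return {
    service: 'gmail',
    auth: {
      type: 'OAuth2',
      user: config.sender_email,
      clientId: config.clientID,
      clientSecret: config.clientSecret,
      refreshToken: config.refreshToken,
      accessToken: config.accessToken || undefined, // empty token is omitted
    },
  };
}

// Example with placeholder values.
const exampleConfig = {
  sender_email: 'sender@gmail.com',
  clientID: 'your-client-id',
  clientSecret: 'your-client-secret',
  refreshToken: 'your-refresh-token',
  accessToken: '',
  interval: 15,
  receiver_email: ['receiver@example.com'],
  url_bolha: ['https://www.bolha.com/'],
  url_nepremicnine: ['https://www.nepremicnine.net/'],
};
console.log(validateConfig(exampleConfig)); // prints an empty list of problems
```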

You can get the desired URLs from the chosen site (bolha.com, nepremicnine.net) by configuring the search parameters on the site and then copying the resulting URL into the configuration file.

Run the script by executing the following command:

node scraper.js
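Once started, the script repeats at the configured interval. A minimal sketch of such a loop is shown below, assuming the interval is given in minutes; `intervalToMs` and `runPeriodically` are hypothetical names, and whether scraper.js also runs once immediately on start is an assumption of this example.

```javascript
// Converts the `interval` config value (minutes) to milliseconds.
function intervalToMs(minutes) {
  return Number(minutes) * 60 * 1000;
}

// Runs `task` once immediately and then every `minutes` minutes.
// `setTimer` defaults to setInterval; it is a parameter so the loop
// can be exercised with a fake timer instead of really waiting.
function runPeriodically(task, minutes, setTimer = setInterval) {
  task();
  return setTimer(task, intervalToMs(minutes));
}

// Example (hypothetical `scrapeOnce` function that scrapes and sends the email):
// runPeriodically(scrapeOnce, config.interval);
```

Keeping the timer injectable is only a convenience for testing; calling `runPeriodically(task, 15)` uses the regular `setInterval`.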

Troubleshooting

Email not sent

The problem could be in your Gmail account settings, as Google blocks sign-in attempts from apps that do not use modern security standards. To fix that, go to Google's less secure apps settings and turn Access for less secure apps ON.
