
sammyfung / hk0weather

Licence: other
Web scraper project that collects useful Hong Kong weather data from the HKO website

Programming Languages

python
shell

Projects that are alternatives of or similar to hk0weather

Mailinglistscraper
A python web scraper for public email lists.
Stars: ✭ 19 (-61.22%)
Mutual labels:  scrapy, webscraping
allitebooks.com
Download all the ebooks with indexed csv of "allitebooks.com"
Stars: ✭ 24 (-51.02%)
Mutual labels:  scrapy, webscraping
Wswp
Code for the second edition Web Scraping with Python book by Packt Publications
Stars: ✭ 112 (+128.57%)
Mutual labels:  scrapy, webscraping
ARGUS
ARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of different websites. On the websites, ARGUS is able to perform tasks like scraping texts or collecting hyperlinks between websites. See: https://link.springer.com/article/10.1007/s11192-020-03726-9
Stars: ✭ 68 (+38.78%)
Mutual labels:  scrapy, webscraping
Django Dynamic Scraper
Creating Scrapy scrapers via the Django admin interface
Stars: ✭ 1,024 (+1989.8%)
Mutual labels:  scrapy, webscraping
Dotnetcrawler
DotnetCrawler is a straightforward, lightweight web crawling/scraping library for Entity Framework Core output based on dotnet core. The library is designed like other strong crawler libraries such as WebMagic and Scrapy, but is extendable to fit your custom requirements. Medium link: https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c
Stars: ✭ 100 (+104.08%)
Mutual labels:  scrapy, webscraping
amelia 2.0
An Artificial Intelligence Chat Bot and Service Provider written in Python and AIML.
Stars: ✭ 19 (-61.22%)
Mutual labels:  weather, webscraping
newsemble
API for fetching data from news websites.
Stars: ✭ 42 (-14.29%)
Mutual labels:  webscraping
scrapism
a work-in-progress guide to web scraping as an artistic and critical practice
Stars: ✭ 43 (-12.24%)
Mutual labels:  webscraping
owmr
An R Interface to OpenWeatherMap
Stars: ✭ 24 (-51.02%)
Mutual labels:  weather
scrapy plus
A toolkit of commonly used essentials for web crawling with Scrapy
Stars: ✭ 18 (-63.27%)
Mutual labels:  scrapy
JustDownlink
A distributed movie search engine built on Scrapy + Elasticsearch + Django
Stars: ✭ 28 (-42.86%)
Mutual labels:  scrapy
open-data-smn
An API to serve publicly available data from the SwissMetNet in 10min intervals
Stars: ✭ 23 (-53.06%)
Mutual labels:  weather
repository.colossus
Colossus Repository for Kodi Addons - Kodi is a registered trademark of the XBMC Foundation. We are not connected to or in any other way affiliated with Kodi - DMCA: [email protected]
Stars: ✭ 13 (-73.47%)
Mutual labels:  webscraping
TrackPurchase
Scrape payment history from various shopping platforms with just a few lines of code!
Stars: ✭ 19 (-61.22%)
Mutual labels:  webscraping
doto
Let's try something different
Stars: ✭ 78 (+59.18%)
Mutual labels:  weather
V2EX Spider
V2EX crawler
Stars: ✭ 21 (-57.14%)
Mutual labels:  scrapy
metacritic api
PHP Metacritic API - Mirrored by my GitLab
Stars: ✭ 31 (-36.73%)
Mutual labels:  webscraping
image-crawler
An image scraper that scrapes images from unsplash.com
Stars: ✭ 12 (-75.51%)
Mutual labels:  webscraping
kindle-weather-stand-alone
Kindle Weather Stand Project - standalone version (no server-end required)
Stars: ✭ 29 (-40.82%)
Mutual labels:  weather


hk0weather

hk0weather is an open source web scraper project that uses Scrapy to collect useful weather data from the Hong Kong Observatory (HKO) website.

Scrapy can output the collected weather data in machine-readable formats (e.g. CSV, JSON, XML).

Optionally, this project supports a Django app, 'openweather', which stores the collected weather data through the Django web framework so that the data can be viewed on the web through the Django admin UI.

Available Spiders

  1. regional: Hong Kong regional weather data from HKO, updated every 10 minutes.
  2. rainfall: Hong Kong rainfall data from HKO, updated hourly.
  3. hkoforecast: Hong Kong weather forecast report for the next 24 hours, from HKO Open Data.
  4. hko9dayforecast: Hong Kong 9-day weather report from HKO Open Data.

Installation Example

  1. Clone and set up hk0weather in a Python 3 virtual environment

    git clone https://github.com/sammyfung/hk0weather.git  
    virtualenv hk0weatherenv  
    source hk0weatherenv/bin/activate  
    cd hk0weather   
    pip install -r requirements.txt    
    
  2. Optional: Set up hk0weather to use openweather

    pip install -r requirements-django.txt    
    cd ..   
    django-admin startproject yourweatherproject   
    cd yourweatherproject   
    git clone https://github.com/sammyfung/openweather.git   
    

    Add 'openweather' to INSTALLED_APPS in the Django project's yourweatherproject/settings.py.

    ./manage.py makemigrations    
    ./manage.py migrate   
    ./manage.py createsuperuser   
    ./manage.py runserver &    
    cd ../hk0weather     
    

    The Django development server is now running in the background, and its web admin UI can be accessed at http://localhost:8000/admin.

    export PYTHONPATH=/your-full-path-to/yourweatherproject    
    export DJANGO_SETTINGS_MODULE=yourweatherproject.settings   
    

    Export PYTHONPATH and DJANGO_SETTINGS_MODULE again each time you activate the Python 3 virtual environment.
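
As a sketch of the settings change in step 2, the edited INSTALLED_APPS might look like the fragment below. It assumes the default app list that `django-admin startproject` generates; only the last entry is specific to this project, and your own settings file may list other apps.

```python
# yourweatherproject/settings.py (fragment, not the full file)
# Assumption: the first six entries are the defaults generated by
# `django-admin startproject`; keep whatever your project already lists.
INSTALLED_APPS = [
    "django.contrib.admin",
    "django.contrib.auth",
    "django.contrib.contenttypes",
    "django.contrib.sessions",
    "django.contrib.messages",
    "django.contrib.staticfiles",
    "openweather",  # the app cloned into the project directory in step 2
]
```

After this change, the makemigrations and migrate commands in step 2 should pick up the app's models.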

Run a Scrapy spider

Activate the Python 3 virtual environment once before running the web spiders for the first time.

source hk0weatherenv/bin/activate  

Optionally, if Django is in use, export PYTHONPATH and DJANGO_SETTINGS_MODULE.

export PYTHONPATH=/your-full-path-to/yourweatherproject    
export DJANGO_SETTINGS_MODULE=yourweatherproject.settings   
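
If the spiders are started from a Python script rather than from the shell, the two exports can be mirrored inside the process. The following is a minimal sketch, not part of hk0weather: the helper name `configure_django_env` is hypothetical, and the path is the same placeholder used in the commands above.

```python
import os
import sys


def configure_django_env(project_path, settings_module):
    """Mirror the two `export` commands for the current Python process only."""
    # Counterpart of adding the project directory to PYTHONPATH.
    if project_path not in sys.path:
        sys.path.insert(0, project_path)
    # Counterpart of `export DJANGO_SETTINGS_MODULE=...`.
    os.environ["DJANGO_SETTINGS_MODULE"] = settings_module


# Placeholder values copied from the shell commands above.
configure_django_env(
    "/your-full-path-to/yourweatherproject",
    "yourweatherproject.settings",
)
```

Unlike the shell exports, the `sys.path` change is not inherited by child processes, so this only helps code running in the same interpreter.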

Optionally, list all available spiders.

scrapy list 

Run a specific spider (e.g. regional) in Scrapy

scrapy crawl regional   

Optionally, use -t (file format) and -o (filename) to write the data to a JSON file.

scrapy crawl regional -t json -o test.json
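
The same crawl can be driven from Python and its output read back. This is a sketch under stated assumptions: the helper names are hypothetical, the flags are copied from the command above, and the script is assumed to run from the hk0weather directory with the virtual environment active. No item field names are assumed, since they depend on the spider.

```python
import json
import subprocess


def build_crawl_command(spider, output_path, file_format="json"):
    """Return the argument list for the `scrapy crawl` call shown above."""
    return ["scrapy", "crawl", spider, "-t", file_format, "-o", output_path]


def load_items(output_path):
    """Read a JSON feed written by Scrapy (a JSON array of item objects)."""
    with open(output_path, encoding="utf-8") as feed:
        return json.load(feed)


def crawl_and_load(spider="regional", output_path="test.json"):
    """Run one spider, wait for it to finish, then load what it wrote."""
    # check=True raises CalledProcessError if scrapy exits with an error.
    subprocess.run(build_crawl_command(spider, output_path), check=True)
    return load_items(output_path)
```

Calling `crawl_and_load()` returns a list of dictionaries, one per scraped item. Because -o appends to an existing file, it is safest to remove an old test.json first so the result stays valid JSON.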

Sponsors

Calvin Tsang.

Thanks to my sponsors. Please consider sponsoring my work.
