Project Timber
Timbr is a web service/application that turns an arbitrary web page into structure JSON data, it provides a friendly interface allow users without programming experiences to let our application know which part of data you want with a few clicks.
Motiviation/ Who can use
There two challenges toward web crawler/scrapers:
- No Unique web crawler/scrapers for every websites due to different DOM structure and design of different websites.
- Writing Web Crawler/scrapers need programming experience/skills.
These two reasons are main reasons Project Timbr come out! Ideally, we want to create a Unique web application (combining crawlering and scraping tasks) for every websites. We provide a friendly interface letting every people (with none or few programming skills) click their mouse few times and our application can know how to deal with all the tasks!
V 1.0
This version is an early but stable version. Currently version is under re-design.
Installation
- You need a NodeJS environment: NodeJS
- Clone the git repo and do following commands on your terminal!
git clone https://github.com/lvyachao/Timbr_V1.git
cd timber
./install.sh
./watch.sh
- our shell scripts install Meteor (An early version)
Run Website
- Change directory to project root
meteor
License
The MIT License (MIT)