All Projects → adipasquale → Voyages Sncf Api

adipasquale / Voyages Sncf Api

A scrapy spider that scraps times and prices from Voyages Sncf. It uses scrapyrt to provide an API interface.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Voyages Sncf Api

OpenScraper
An open source webapp for scraping: towards a public service for webscraping
Stars: ✭ 80 (+1042.86%)
Mutual labels:  scraper, scrapy
Advanced Web Scraping Tutorial
The Zipru scraper developed in the Advanced Web Scraping Tutorial.
Stars: ✭ 384 (+5385.71%)
Mutual labels:  scraper, scrapy
scrapy facebooker
Collection of scrapy spiders which can scrape posts, images, and so on from public Facebook Pages.
Stars: ✭ 22 (+214.29%)
Mutual labels:  scraper, scrapy
Bandcamp Scraper
A scraper for https://bandcamp.com
Stars: ✭ 137 (+1857.14%)
Mutual labels:  api, scraper
Fbcrawl
A Facebook crawler
Stars: ✭ 536 (+7557.14%)
Mutual labels:  scraper, scrapy
scrapy-LBC
Araignée LeBonCoin avec Scrapy et ElasticSearch
Stars: ✭ 14 (+100%)
Mutual labels:  scraper, scrapy
Post Tuto Deployment
Build and deploy a machine learning app from scratch 🚀
Stars: ✭ 368 (+5157.14%)
Mutual labels:  api, scrapy
Anitop
Anitop is an unofficial simple API from https://anitrendz.net/ site
Stars: ✭ 30 (+328.57%)
Mutual labels:  api, scraper
Jikan
Unofficial MyAnimeList PHP+REST API which provides functions other than the official API
Stars: ✭ 531 (+7485.71%)
Mutual labels:  api, scraper
Googledictionaryapi
Google does not provide Google Dictionary API so I created one.
Stars: ✭ 528 (+7442.86%)
Mutual labels:  api, scraper
Google Play Scraper
Node.js scraper to get data from Google Play
Stars: ✭ 1,606 (+22842.86%)
Mutual labels:  api, scraper
Duckduckgo
An unofficial DuckDuckGo search API.
Stars: ✭ 6 (-14.29%)
Mutual labels:  api, scraper
Pymarketcap
Python3 API wrapper and web scraper for https://coinmarketcap.com
Stars: ✭ 73 (+942.86%)
Mutual labels:  api, scraper
OLX Scraper
📻 An OLX Scraper using Scrapy + MongoDB. It Scrapes recent ads posted regarding requested product and dumps to NOSQL MONGODB.
Stars: ✭ 15 (+114.29%)
Mutual labels:  scraper, scrapy
Public Instagram
Tool to fetch Instagram's public content.
Stars: ✭ 43 (+514.29%)
Mutual labels:  api, scraper
Linkedin
Linkedin Scraper using Selenium Web Driver, Chromium headless, Docker and Scrapy
Stars: ✭ 309 (+4314.29%)
Mutual labels:  scraper, scrapy
Goribot
[Crawler/Scraper for Golang]🕷A lightweight distributed friendly Golang crawler framework.一个轻量的分布式友好的 Golang 爬虫框架。
Stars: ✭ 190 (+2614.29%)
Mutual labels:  scraper, scrapy
Ruiji.net
crawler framework, distributed crawler extractor
Stars: ✭ 220 (+3042.86%)
Mutual labels:  scraper, scrapy
Finviz
Unofficial API for finviz.com
Stars: ✭ 493 (+6942.86%)
Mutual labels:  api, scraper
Scrapyrt
HTTP API for Scrapy spiders
Stars: ✭ 637 (+9000%)
Mutual labels:  scraper, scrapy

Voyages-SNCF API

a scrapyrt API to retrieve data from voyages-sncf.mobi website

Local setup

$ mkvirtualenv sncflolapi
$ workon sncflolapi
$ pip install -r requirements.txt

If you run into problems with the cryptography package on a osx machine, run this:

$ brew install pkg-config libffi openssl
$ env LDFLAGS="-L$(brew --prefix openssl)/lib" CFLAGS="-I$(brew --prefix openssl)/include" pip install cryptography

(cf this SO thread)

Run Server

if you want livereload, run pip install watchdog and then :

$ workon sncflolapi
$ ./bin/run_local_server

Run crawl from CLI

Useful for debugging, especially to use the inspect_response helper.

$ cd voyagessncf
$ scrapy crawl voyagessncf_mobi -a departure_city=paris -a arrival_city=avignon -a departure_date="01/04/2016" -a departure_hour="18h"

or

$ scrapy crawl voyagessncf_com -a departure_city=paris -a arrival_city=avignon -a departure_date="10/03/2017" -a departure_hour="8"

Call API

$ brew install httpie
$ http "http://localhost:9080/crawl.json" departure_city==marseille arrival_city==paris departure_date=="22/03/2017" card==TGVMAX precise_departure_time=="06h11" price_below==80

or with raw curl :

$ curl "http://localhost:9080/crawl.json?departure_date=11%2F03%2F2017&departure_city=paris&arrival_city=rennes&departure_hour=8&card=TGVMAX"
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].