adipasquale / Voyages Sncf Api
A scrapy spider that scraps times and prices from Voyages Sncf. It uses scrapyrt to provide an API interface.
Programming Languages
python
139335 projects - #7 most used programming language
Projects that are alternatives of or similar to Voyages Sncf Api
OpenScraper
An open source webapp for scraping: towards a public service for webscraping
Stars: ✭ 80 (+1042.86%)
Mutual labels: scraper, scrapy
Advanced Web Scraping Tutorial
The Zipru scraper developed in the Advanced Web Scraping Tutorial.
Stars: ✭ 384 (+5385.71%)
Mutual labels: scraper, scrapy
scrapy facebooker
Collection of scrapy spiders which can scrape posts, images, and so on from public Facebook Pages.
Stars: ✭ 22 (+214.29%)
Mutual labels: scraper, scrapy
Bandcamp Scraper
A scraper for https://bandcamp.com
Stars: ✭ 137 (+1857.14%)
Mutual labels: api, scraper
scrapy-LBC
Araignée LeBonCoin avec Scrapy et ElasticSearch
Stars: ✭ 14 (+100%)
Mutual labels: scraper, scrapy
Post Tuto Deployment
Build and deploy a machine learning app from scratch 🚀
Stars: ✭ 368 (+5157.14%)
Mutual labels: api, scrapy
Anitop
Anitop is an unofficial simple API from https://anitrendz.net/ site
Stars: ✭ 30 (+328.57%)
Mutual labels: api, scraper
Jikan
Unofficial MyAnimeList PHP+REST API which provides functions other than the official API
Stars: ✭ 531 (+7485.71%)
Mutual labels: api, scraper
Googledictionaryapi
Google does not provide Google Dictionary API so I created one.
Stars: ✭ 528 (+7442.86%)
Mutual labels: api, scraper
Google Play Scraper
Node.js scraper to get data from Google Play
Stars: ✭ 1,606 (+22842.86%)
Mutual labels: api, scraper
Pymarketcap
Python3 API wrapper and web scraper for https://coinmarketcap.com
Stars: ✭ 73 (+942.86%)
Mutual labels: api, scraper
OLX Scraper
📻 An OLX Scraper using Scrapy + MongoDB. It Scrapes recent ads posted regarding requested product and dumps to NOSQL MONGODB.
Stars: ✭ 15 (+114.29%)
Mutual labels: scraper, scrapy
Public Instagram
Tool to fetch Instagram's public content.
Stars: ✭ 43 (+514.29%)
Mutual labels: api, scraper
Linkedin
Linkedin Scraper using Selenium Web Driver, Chromium headless, Docker and Scrapy
Stars: ✭ 309 (+4314.29%)
Mutual labels: scraper, scrapy
Goribot
[Crawler/Scraper for Golang]🕷A lightweight distributed friendly Golang crawler framework.一个轻量的分布式友好的 Golang 爬虫框架。
Stars: ✭ 190 (+2614.29%)
Mutual labels: scraper, scrapy
Ruiji.net
crawler framework, distributed crawler extractor
Stars: ✭ 220 (+3042.86%)
Mutual labels: scraper, scrapy
Voyages-SNCF API
a scrapyrt API to retrieve data from voyages-sncf.mobi website
Local setup
$ mkvirtualenv sncflolapi
$ workon sncflolapi
$ pip install -r requirements.txt
If you run into problems with the cryptography
package on a osx machine, run this:
$ brew install pkg-config libffi openssl
$ env LDFLAGS="-L$(brew --prefix openssl)/lib" CFLAGS="-I$(brew --prefix openssl)/include" pip install cryptography
(cf this SO thread)
Run Server
if you want livereload, run pip install watchdog
and then :
$ workon sncflolapi
$ ./bin/run_local_server
Run crawl from CLI
Useful for debugging, especially to use the inspect_response
helper.
$ cd voyagessncf
$ scrapy crawl voyagessncf_mobi -a departure_city=paris -a arrival_city=avignon -a departure_date="01/04/2016" -a departure_hour="18h"
or
$ scrapy crawl voyagessncf_com -a departure_city=paris -a arrival_city=avignon -a departure_date="10/03/2017" -a departure_hour="8"
Call API
$ brew install httpie
$ http "http://localhost:9080/crawl.json" departure_city==marseille arrival_city==paris departure_date=="22/03/2017" card==TGVMAX precise_departure_time=="06h11" price_below==80
or with raw curl :
$ curl "http://localhost:9080/crawl.json?departure_date=11%2F03%2F2017&departure_city=paris&arrival_city=rennes&departure_hour=8&card=TGVMAX"
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].