toddwschneider / Nyc Citibike Data

License: MIT
NYC Citi Bike system data and analysis

NYC Citi Bike Data

Code originally in support of the post "A Tale of Twenty-Two Million Citi Bike Rides: Analyzing the NYC Bike Share System". Also used in conjunction with the nyc-taxi-data repo for the post "When Are Citi Bikes Faster Than Taxis in New York City?"

This repo provides scripts to download, process, and analyze NYC's Citi Bike share system data. The data is stored in a PostgreSQL database, with PostGIS used for spatial calculations and R for data analysis.

Instructions

1. Install PostgreSQL and PostGIS

Both are available via Homebrew on macOS.

2. Download raw Citi Bike data

./download_raw_data.sh

3. Initialize database and set up schema

./initialize_database.sh

4. Import Citi Bike trip data into the database and map to census tracts

./import_trips.sh

5. Analysis

Additional Postgres and R scripts for analysis are in the analysis/ folder
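
Steps 2 through 4 above have to run in order, and each one depends on the previous one succeeding. As a minimal sketch, the three scripts could be chained with a small wrapper that stops at the first failure. The `run_pipeline` function is a hypothetical helper, not part of the repository; it only assumes the three script names listed in the instructions exist and are executable in the given directory.

```shell
#!/usr/bin/env bash
# run_pipeline: hypothetical helper (not part of the repository).
# Runs the download, initialize, and import scripts in order and
# stops at the first one that exits with a non-zero status.
run_pipeline() {
  local repo_dir="${1:-.}"   # directory containing the scripts; defaults to cwd
  local step
  for step in download_raw_data.sh initialize_database.sh import_trips.sh; do
    echo "Running ${step}"
    # Run in a subshell so the caller's working directory is unchanged.
    ( cd "$repo_dir" && "./${step}" ) || {
      echo "Step ${step} failed" >&2
      return 1
    }
  done
}

# Usage, from the repository root:
#   run_pipeline .
```

Running each step in a subshell keeps the wrapper from changing the caller's directory, and returning on the first failure avoids importing into a database that was never initialized.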

Other data sources

These are bundled with the repository, so no need to download separately, but:

Questions/issues/contact

[email protected], or open a GitHub issue