All Projects → tsdataclinic → newerhoods

tsdataclinic / newerhoods

Licence: Apache-2.0 License
A Data Clinic project that aggregates NYC Open Data at the tract-level and uses Machine Learning techniques to re-imagine neighborhood boundaries.

Programming Languages

HTML
75241 projects
r
7636 projects

Projects that are alternatives of or similar to newerhoods

genie
Genie: A Fast and Robust Hierarchical Clustering Algorithm (this R package has now been superseded by genieclust)
Stars: ✭ 21 (-41.67%)
Mutual labels:  clustering
dbscan-python
[New Version] Theoretically Efficient and Practical Parallel DBSCAN
Stars: ✭ 18 (-50%)
Mutual labels:  clustering
calib
Calib clusters barcode tagged paired-end reads based on their barcode and sequence similarity.
Stars: ✭ 29 (-19.44%)
Mutual labels:  clustering
Linux-admin
Shell scripts to automate download of GitHub traffic statistics, cluster administration, and create an animated GIF.
Stars: ✭ 23 (-36.11%)
Mutual labels:  clustering
kmeans
K-Means clustering
Stars: ✭ 51 (+41.67%)
Mutual labels:  clustering
rabbitmq-peer-discovery-aws
AWS-based peer discovery backend for RabbitMQ 3.7.0+
Stars: ✭ 23 (-36.11%)
Mutual labels:  clustering
SparseLSH
A Locality Sensitive Hashing (LSH) library with an emphasis on large, highly-dimensional datasets.
Stars: ✭ 127 (+252.78%)
Mutual labels:  clustering
ex united
Easily spawn Elixir nodes (supervising, Mix configured, easy asserted / refuted) within ExUnit tests
Stars: ✭ 40 (+11.11%)
Mutual labels:  clustering
magento-cluster
Highly Available and Auto-scalable Magento Cluster
Stars: ✭ 21 (-41.67%)
Mutual labels:  clustering
retinal-exudates-detection
exudates detection using hybrid approach (Image Morphology & Machine Learning)
Stars: ✭ 53 (+47.22%)
Mutual labels:  clustering
pytorch kmeans
Implementation of the k-means algorithm in PyTorch that works for large datasets
Stars: ✭ 38 (+5.56%)
Mutual labels:  clustering
scclusteval
Single Cell Cluster Evaluation
Stars: ✭ 57 (+58.33%)
Mutual labels:  clustering
LIUM
Scripts for LIUM SpkDiarization tools
Stars: ✭ 28 (-22.22%)
Mutual labels:  clustering
Fred
A fast, scalable and light-weight C++ Fréchet distance library, exposed to python and focused on (k,l)-clustering of polygonal curves.
Stars: ✭ 13 (-63.89%)
Mutual labels:  clustering
pulseha
PulseHA is a active-passive high availability cluster daemon that uses GRPC and is written in GO.
Stars: ✭ 15 (-58.33%)
Mutual labels:  clustering
machine-learning-course
Machine Learning Course @ Santa Clara University
Stars: ✭ 17 (-52.78%)
Mutual labels:  clustering
dropClust
Version 2.1.0 released
Stars: ✭ 19 (-47.22%)
Mutual labels:  clustering
realtimemap-dotnet
A showcase for Proto.Actor - an ultra-fast distributed actors solution for Go, C#, and Java/Kotlin.
Stars: ✭ 47 (+30.56%)
Mutual labels:  clustering
rsMove
Remote Sensing for Movement Ecology
Stars: ✭ 25 (-30.56%)
Mutual labels:  spatial-analysis
lannister
A lightweight MQTT broker w/ full spec,Clustering,WebSocket,SSL written in Java
Stars: ✭ 20 (-44.44%)
Mutual labels:  clustering

NewerHoods

New York City’s (NYC’s) neighborhoods are a driving force in the lives of New Yorkers—their identities are closely intertwined and a source of pride. However, the history and evolution of NYC’s neighborhoods don’t follow the rigid, cold lines of statistical and administrative boundaries. Instead, the neighborhoods we live and work in are the result of a more organic confluence of factors.

Data Clinic developed NewerHoods with the goal of helping individuals and organizations better advocate for their communities by enabling them to tailor insights to meet their specific needs. NewerHoods is an interactive web-app that uses open data to generate localized features at the census tract-level and machine learning to create homogeneous clusters. Users are able to select characteristics of interest (currently open data on housing, crime, and 311 complaints), visualize NewerHood clusters on an interactive map, find similar neighborhoods, and compare them against existing administrative boundaries. The tool is designed to enable users without in-depth data expertise to compare and incorporate these redefined neighborhoods into their work and life.

The application is live and available to use here.

Getting Started

The below steps will help you get started and setting up and running NewerHoods locally. Since this is a RShiny application, install RStudio on your machine if you haven't already from here.

Directory Structure

newerhoods/clean_data contains just the cleaned/transformed data sets used directly by the Shiny App.

/src contains all the code to merge and clean the data sets, extract features from it, and cluster the features.

/newerhoods contains the code for the RShiny WebApp.

Running the App

First, the R environment needs to be set up with all the necessary packages.

source("newerhoods/setup.R")

The project uses several APIs from loading data using the APIs developed by NYC Developer Portal and Mapbox for the underlying map visualization in the Shiny App. Getting all of these token are free by signing up here and here. Follow the instructions in the settings.R file which can be found in the newerhoods folder and source the local version of the file to get all the tokens stored in the environment. You would have to source this settings file everytime you start a new session.

Note: If you intend to run only the RShiny App, filling in just the MapBox API Token would suffice.

source("newerhoods/settings_local.R")

Run the App

library(shiny)
runApp("newerhoods")

Alternatively, you can run the application in docker. To build the docker container run

docker build -t newerhoods . 

Then to run the docker container simply run

docker run -it --rm -p 3000:3000 -v $(pwd):/app newerhoods 

any changes you make in the code should trigger an application reload so all you should need to do is refresh your browser to see them.

Contributing to NewerHoods

We invite feedback on the tool and encourage users to contribute. Check out this page for some ways in which you can contribute. To contact Data Clinic about NewerHoods, please email us at [email protected].

Data Sources

  1. NYC Annualized Property Sales Data (2012-2017)
  2. MapPLUTO (18v1)
  3. Geoclient API v1.1
  4. Property Assessment Roll Archives
  5. NYPD Complaint Data Historic
  6. 311 Service Requests from 2010 to Present
  7. NYC 2010 Census Tracts

References

  1. ClustGeo: an R package for hierarchical clustering with spatial constraints
  2. Making Neighborhoods - Understanding New York City Transitions 2000-2010

License

This project is licensed under the Apache 2.0 License - see the LICENSE.md file for details

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].