All Projects β†’ prateekiiest β†’ titanic_survival_exploration

prateekiiest / titanic_survival_exploration

Licence: MIT license
Udacity Machine Learning Nano degree Program Project Predicting Passenger Survival

Programming Languages

Jupyter Notebook
11667 projects
HTML
75241 projects
python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to titanic survival exploration

srqm
An introductory statistics course for social scientists, using Stata
Stars: ✭ 43 (+138.89%)
Mutual labels:  statistical-analysis, regression-models
Gitinspector
πŸ“Š The statistical analysis tool for git repositories
Stars: ✭ 2,058 (+11333.33%)
Mutual labels:  analysis, statistical-analysis
mousetrap
Process and Analyze Mouse-Tracking Data
Stars: ✭ 33 (+83.33%)
Mutual labels:  analysis
AppliedStats
A repo with homeworks and labs from a course on applied stats taken by me during my bachelor's degree in MIPT, Ru. Course authors: Andrii Hraboviy, @andriygav and Oleg Bakhteev, @bahleg.
Stars: ✭ 16 (-11.11%)
Mutual labels:  statistical-analysis
pytest-notebook
A pytest plugin for regression testing and regenerating Jupyter Notebooks
Stars: ✭ 35 (+94.44%)
Mutual labels:  ipython-notebook
SqlServer.Rules
SQL Server static code analysis rules for SSDT database projects
Stars: ✭ 20 (+11.11%)
Mutual labels:  analysis
liground
A free, open-source and modern Chess Variant Analysis GUI for the 21st century
Stars: ✭ 41 (+127.78%)
Mutual labels:  analysis
MixingBear
Package for automatic beat-mixing of music files in Python 🐻🎚
Stars: ✭ 73 (+305.56%)
Mutual labels:  analysis
pyemma tutorials
How to analyze molecular dynamics data with PyEMMA
Stars: ✭ 49 (+172.22%)
Mutual labels:  analysis
should-i-play-f6
Chess project to analyze the statistical effect of playing f3 (as white) or f6 (as black) on the outcome of the game.
Stars: ✭ 15 (-16.67%)
Mutual labels:  analysis
pointcloud-processing
A point cloud visualisation and analysis library for Processing.
Stars: ✭ 20 (+11.11%)
Mutual labels:  analysis
root pandas
A Python module for conveniently loading/saving ROOT files as pandas DataFrames
Stars: ✭ 108 (+500%)
Mutual labels:  analysis
fornalder
Visualize long-term trends in collections of Git repositories.
Stars: ✭ 80 (+344.44%)
Mutual labels:  analysis
Kaggle-Competition-Sberbank
Top 1% rankings (22/3270) code sharing for Kaggle competition Sberbank Russian Housing Market: https://www.kaggle.com/c/sberbank-russian-housing-market
Stars: ✭ 31 (+72.22%)
Mutual labels:  regression-models
cis
Home of the Community Intercomparison Suite.
Stars: ✭ 30 (+66.67%)
Mutual labels:  analysis
HumanPilot
Spatial Transcriptomics human DLPFC pilot study part of the spatialLIBD project
Stars: ✭ 22 (+22.22%)
Mutual labels:  analysis
wink-statistics
Fast & numerically stable statistical analysis
Stars: ✭ 36 (+100%)
Mutual labels:  statistical-analysis
Springboard-Data-Science-Immersive
No description or website provided.
Stars: ✭ 52 (+188.89%)
Mutual labels:  statistical-analysis
RAnalysisSkeleton
Files and settings commonly used in analysis projects with R
Stars: ✭ 15 (-16.67%)
Mutual labels:  analysis
micro-code-analyser
A tiny Node.js microservice to detect the language of a code snippet
Stars: ✭ 21 (+16.67%)
Mutual labels:  analysis

Titanic_Survival_Exploration

Codacy Badge Build status Maintainability

chat on Slack made with &hearts in Python

DOI

 Fork this repo

This repository contains project file for Project 0 - Titanic Survival Exploration as part of Udacity's Machine Learning Nanodegree.


KWOC

We are glad to partner with IIT Kharagpur as a part of the Kharagpur Winter of Code. We are proud to host this Open Source event during the winter months and we hope you have a great winter this year.

See Project Ideas here


Description

The sinking of the RMS Titanic is one of the most infamous shipwrecks in history. On April 15, 1912, during her maiden voyage, the Titanic sank after colliding with an iceberg, killing 1502 out of 2224 passengers and crew. This sensational tragedy shocked the international community and led to better safety regulations for ships.

One of the reasons that the shipwreck led to such loss of life was that there were not enough lifeboats for the passengers and crew. Although there was some element of luck involved in surviving the sinking, some groups of people were more likely to survive than others, such as women, children, and the upper-class.

In this problem, we ask you to complete the analysis of what sorts of people were likely to survive. In particular, we ask you to apply the tools of machine learning to predict which passengers survived the tragedy.

In this optional project, you will create decision functions that attempt to predict survival outcomes from the 1912 Titanic disaster based on each passenger’s features, such as sex and age. Start with a simple algorithm and increase its complexity until you are able to accurately predict the outcomes for at least 80% of the passengers in the provided data. This project will introduce you to some of the concepts of machine learning as you start the Nanodegree program.

Install

This project requires Python 2.7 and the following Python libraries installed:

You will also need to have software installed to run and execute an iPython Notebook

Udacity recommends our students install Anaconda, i pre-packaged Python distribution that contains all of the necessary libraries and software for this project.

Code

Template code is provided in the notebook titanic_survival_exploration.ipynb notebook file. Additional supporting code can be found in titanic_visualizations.py. While some code has already been implemented to get you started, you will need to implement additional functionality when requested to successfully complete the project.

This Notebook will show basic examples of:

Data Handling

  • Importing Data with Pandas
  • Cleaning Data
  • Exploring Data through Visualizations with Matplotlib

Data Analysis

  • Supervised Machine learning Techniques: + Logit Regression Model + Plotting results + Support Vector Machine (SVM) using 3 kernels + Basic Random Forest + Plotting results

Valuation of the Analysis

  • K-folds cross validation to valuate results locally
  • Output the results from the IPython Notebook to Kaggle

Run

In a terminal or command window, navigate to the top-level project directory titanic_survival_exploration/ (that contains this README) and run one of the following commands:

ipython notebook titanic_survival_exploration.ipynb jupyter notebook titanic_survival_exploration.ipynb

This will open the iPython Notebook software and project file in your browser.

Data

The dataset used in this project is included as titanic_data.csv. This dataset is provided by Udacity and contains the following attributes:

  • survival ? Survival (0 = No; 1 = Yes)
  • pclass ? Passenger Class (1 = 1st; 2 = 2nd; 3 = 3rd)
  • name ? Name
  • sex ? Sex
  • age ? Age
  • sibsp ? Number of Siblings/Spouses Aboard
  • parch ? Number of Parents/Children Aboard
  • ticket ? Ticket Number
  • fare ? Passenger Fare
  • cabin ? Cabin
  • embarked ? Port of Embarkation (C = Cherbourg; Q = Queenstown; S = Southampton)

Results

Check here Udacity Reviews

Contribution

See CONTRIBUTING.md

Some Video Resources

Online Reading Material

Happy Coding -- Prateek Chanda

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].