All Projects → Jean-njoroge → Breast-cancer-risk-prediction

Jean-njoroge / Breast-cancer-risk-prediction

Licence: MIT license
Classification of Breast Cancer diagnosis Using Support Vector Machines

Programming Languages

Jupyter Notebook
11667 projects

Projects that are alternatives of or similar to Breast-cancer-risk-prediction

Data Science Your Way
Ways of doing Data Science Engineering and Machine Learning in R and Python
Stars: ✭ 530 (+270.63%)
Mutual labels:  notebook, exploratory-data-analysis
Edge2Guard
Code for PerCom Workshop paper title 'Edge2Guard: Botnet Attacks Detecting Offline Models for Resource-Constrained IoT Devices'
Stars: ✭ 16 (-88.81%)
Mutual labels:  exploratory-data-analysis, supervised-learning
adenine
ADENINE: A Data ExploratioN PipelINE
Stars: ✭ 15 (-89.51%)
Mutual labels:  exploratory-data-analysis, pipelines
turbofan failure
Aircraft engine failure prediction model
Stars: ✭ 23 (-83.92%)
Mutual labels:  svm, prediction-model
Kaio-machine-learning-human-face-detection
Machine Learning project a case study focused on the interaction with digital characters, using a character called "Kaio", which, based on the automatic detection of facial expressions and classification of emotions, interacts with humans by classifying emotions and imitating expressions
Stars: ✭ 18 (-87.41%)
Mutual labels:  notebook, supervised-learning
Gru Svm
[ICMLC 2018] A Neural Network Architecture Combining Gated Recurrent Unit (GRU) and Support Vector Machine (SVM) for Intrusion Detection
Stars: ✭ 76 (-46.85%)
Mutual labels:  svm, supervised-learning
Complete Life Cycle Of A Data Science Project
Complete-Life-Cycle-of-a-Data-Science-Project
Stars: ✭ 140 (-2.1%)
Mutual labels:  exploratory-data-analysis, supervised-learning
Polyaxon
Machine Learning Platform for Kubernetes (MLOps tools for experimentation and automation)
Stars: ✭ 2,966 (+1974.13%)
Mutual labels:  notebook, pipelines
Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-23.78%)
Mutual labels:  notebook, exploratory-data-analysis
loon
A Toolkit for Interactive Statistical Data Visualization
Stars: ✭ 45 (-68.53%)
Mutual labels:  exploratory-data-analysis
sunrise
NumPy, SciPy, MRI and Music | Presented at ISMRM 2021 Sunrise Educational Session
Stars: ✭ 20 (-86.01%)
Mutual labels:  notebook
idr-notebooks
Jupyter Notebooks for the Image Data Resource
Stars: ✭ 14 (-90.21%)
Mutual labels:  notebook
vent
Vent is a light-weight platform built to automate network collection and analysis pipelines using a flexible set of popular open source tools and technologies. Vent is python-based, extensible, leverages docker containers, and provides both an API and CLI.
Stars: ✭ 73 (-48.95%)
Mutual labels:  pipelines
Machine-Learning-Models
In This repository I made some simple to complex methods in machine learning. Here I try to build template style code.
Stars: ✭ 30 (-79.02%)
Mutual labels:  svm
Machine-Learning-Notebooks
15+ Machine/Deep Learning Projects in Ipython Notebooks
Stars: ✭ 66 (-53.85%)
Mutual labels:  notebook
ML-Experiments
整理记录本人担任课程助教设计的四个机器学习实验,主要涉及简单的线性回归、朴素贝叶斯分类器、支持向量机、CNN做文本分类。内附实验指导书、讲解PPT、参考代码,欢迎各位码友讨论交流。
Stars: ✭ 85 (-40.56%)
Mutual labels:  svm
kana
Single cell analysis in the browser
Stars: ✭ 81 (-43.36%)
Mutual labels:  exploratory-data-analysis
keras-audio
keras project for audio deep learning
Stars: ✭ 37 (-74.13%)
Mutual labels:  supervised-learning
Parametric-Contrastive-Learning
Parametric Contrastive Learning (ICCV2021)
Stars: ✭ 155 (+8.39%)
Mutual labels:  supervised-learning
Deception-Detection-on-Amazon-reviews-dataset
A SVM model that classifies the reviews as real or fake. Used both the review text and the additional features contained in the data set to build a model that predicted with over 85% accuracy without using any deep learning techniques.
Stars: ✭ 42 (-70.63%)
Mutual labels:  svm

Breast-cancer-risk-prediction

Necessity, who is the mother of invention. – Plato*

Welcome to my GitHub repository on Using Predictive Analytics model to diagnose breast cancer.


Objective:

The repository is a learning exercise to:

  • Apply the fundamental concepts of machine learning from an available dataset
  • Evaluate and interpret my results and justify my interpretation based on observed data set
  • Create notebooks that serve as computational records and document my thought process.

The analysis is divided into four sections, saved in juypter notebooks in this repository

  1. Identifying the problem and Data Sources
  2. Exploratory Data Analysis
  3. Pre-Processing the Data
  4. Build model to predict whether breast cell tissue is malignant or Benign

Notebook 1: Identifying the problem and Getting data.

Notebook goal:Identify the types of information contained in our data set In this notebook I used Python modules to import external data sets for the purpose of getting to know/familiarize myself with the data to get a good grasp of the data and think about how to handle the data in different ways. 

Notebook 2 Exploratory Data Analysis

Notebook goal:  Explore the variables to assess how they relate to the response variable In this notebook, I am getting familiar with the data using data exploration and visualization techniques using python libraries (Pandas, matplotlib, seaborn. Familiarity with the data is important which will provide useful knowledge for data pre-processing)

Notebook 3 Pre-Processing the data

Notebook goal:Find the most predictive features of the data and filter it so it will enhance the predictive power of the analytics model. In this notebook I use feature selection to reduce high-dimension data, feature extraction and transformation for dimensionality reduction. This is essential in preparing the data before predictive models are developed.

Notebook 4 Predictive model using Support Vector Machine (svm)

Notebook goal: Construct predictive models to predict the diagnosis of a breast tumor. In this notebook, I construct a predictive model using SVM machine learning algorithm to predict the diagnosis of a breast tumor. The diagnosis of a breast tumor is a binary variable (benign or malignant). I also evaluate the model using confusion matrix the receiver operating curves (ROC), which are essential in assessing and interpreting the fitted model.

Notebook 5: Optimizing the Support Vector Classifier

Notebook goal: Construct predictive models to predict the diagnosis of a breast tumor. In this notebook, I aim to tune parameters of the SVM Classification model using scikit-learn.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].