All Projects → anujdutt9 → Disease Prediction From Symptoms

anujdutt9 / Disease Prediction From Symptoms

Disease Prediction based on Symptoms.

Programming Languages

python3
1442 projects

Projects that are alternatives of or similar to Disease Prediction From Symptoms

Ds and ml projects
Data Science & Machine Learning projects and tutorials in python from beginner to advanced level.
Stars: ✭ 56 (-20%)
Mutual labels:  jupyter-notebook, pandas, scikit-learn
Code
Compilation of R and Python programming codes on the Data Professor YouTube channel.
Stars: ✭ 287 (+310%)
Mutual labels:  jupyter-notebook, pandas, scikit-learn
Cheatsheets.pdf
📚 Various cheatsheets in PDF
Stars: ✭ 159 (+127.14%)
Mutual labels:  jupyter-notebook, pandas, scikit-learn
Data Science Complete Tutorial
For extensive instructor led learning
Stars: ✭ 1,027 (+1367.14%)
Mutual labels:  jupyter-notebook, pandas, scikit-learn
Pythondatasciencehandbook
The book was written and tested with Python 3.5, though other Python versions (including Python 2.7) should work in nearly all cases.
Stars: ✭ 31,995 (+45607.14%)
Mutual labels:  jupyter-notebook, pandas, scikit-learn
Practical Machine Learning With Python
Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.
Stars: ✭ 1,868 (+2568.57%)
Mutual labels:  jupyter-notebook, pandas, scikit-learn
Kagglestruggle
Kaggle Struggle
Stars: ✭ 228 (+225.71%)
Mutual labels:  jupyter-notebook, pandas, scikit-learn
Credit Risk Modelling
Credit Risk analysis by using Python and ML
Stars: ✭ 91 (+30%)
Mutual labels:  jupyter-notebook, pandas, scikit-learn
Crime Analysis
Association Rule Mining from Spatial Data for Crime Analysis
Stars: ✭ 20 (-71.43%)
Mutual labels:  jupyter-notebook, pandas, scikit-learn
Data Science Portfolio
Portfolio of data science projects completed by me for academic, self learning, and hobby purposes.
Stars: ✭ 559 (+698.57%)
Mutual labels:  jupyter-notebook, pandas, scikit-learn
Pbpython
Code, Notebooks and Examples from Practical Business Python
Stars: ✭ 1,724 (+2362.86%)
Mutual labels:  jupyter-notebook, pandas, scikit-learn
Machine Learning Alpine
Alpine Container for Machine Learning
Stars: ✭ 30 (-57.14%)
Mutual labels:  jupyter-notebook, pandas, scikit-learn
Dat8
General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+2065.71%)
Mutual labels:  jupyter-notebook, pandas, scikit-learn
Machine Learning With Python
Practice and tutorial-style notebooks covering wide variety of machine learning techniques
Stars: ✭ 2,197 (+3038.57%)
Mutual labels:  jupyter-notebook, pandas, scikit-learn
Pymc Example Project
Example PyMC3 project for performing Bayesian data analysis using a probabilistic programming approach to machine learning.
Stars: ✭ 90 (+28.57%)
Mutual labels:  jupyter-notebook, pandas, scikit-learn
Data Science Projects With Python
A Case Study Approach to Successful Data Science Projects Using Python, Pandas, and Scikit-Learn
Stars: ✭ 198 (+182.86%)
Mutual labels:  jupyter-notebook, pandas, scikit-learn
Zat
Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Stars: ✭ 303 (+332.86%)
Mutual labels:  jupyter-notebook, pandas, scikit-learn
Python for ml
brief introduction to Python for machine learning
Stars: ✭ 29 (-58.57%)
Mutual labels:  jupyter-notebook, pandas, scikit-learn
Machinelearningcourse
A collection of notebooks of my Machine Learning class written in python 3
Stars: ✭ 35 (-50%)
Mutual labels:  jupyter-notebook, pandas, scikit-learn
The Hello World Of Machine Learning
Learn to build a basic machine learning model from scratch with this repo and tutorial series.
Stars: ✭ 41 (-41.43%)
Mutual labels:  jupyter-notebook, scikit-learn

Disease Prediction from Symptoms

This project explores the use of machine learning algorithms to predict diseases from symptoms.

Algorithms Explored

The following algorithms have been explored in code:

  1. Naive Bayes
  2. Decision Tree
  3. Random Forest
  4. Gradient Boosting

Dataset

Source-1

The dataset for this problem used with the main.py script is downloaded from here:

https://www.kaggle.com/kaushil268/disease-prediction-using-machine-learning

This dataset has 133 total columns, 132 of them being symptoms experienced by patiend and last column in prognosis for the same.

Source-2

The dataset for this problem used with the Jupyter notebook is downloaded from here:

https://impact.dbmi.columbia.edu/~friedma/Projects/DiseaseSymptomKB/index.html

This dataset has 3 columns:

Disease  | Count of Disease Occurrence | Symptom

You can either copy paste the whole table from here to an excel sheet or scrape it out using Beautifulsoup.

Directory Structure

|_ dataset/
         |_ training_data.csv
         |_ test_data.csv

|_ saved_model/
         |_ [ pre-trained models ]

|_ main.py [ code for laoding kaggle dataset, training & saving the model]

|_ notebook/
         |_ dataset/
                  |_ raw_data.xlsx [Columbia dataset for notebook]
         |_ Disease-Prediction-from-Symptoms-checkpoint.ipynb [ IPython Notebook for loading Columbia dataset, training model and Inference ]
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].