All Projects → zeglam → Countries-GDP-prediction

zeglam / Countries-GDP-prediction

Licence: other
Developed a supervised machine learning system that can estimate a country's GDP per capita using regression algorithms.

Programming Languages

Jupyter Notebook
11667 projects
python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Countries-GDP-prediction

rollRegres
R package for fast rolling and expanding linear regression models
Stars: ✭ 20 (+33.33%)
Mutual labels:  regression, regression-models
regression-python
In this repository you can find many different, small, projects which demonstrate regression techniques using python programming language
Stars: ✭ 15 (+0%)
Mutual labels:  regression, regression-models
code-review-estimator
Estimate cost of code review using Machine Learning
Stars: ✭ 49 (+226.67%)
Mutual labels:  regression, estimation
broomExtra
Helpers for regression analyses using `{broom}` & `{easystats}` packages 📈 🔍
Stars: ✭ 45 (+200%)
Mutual labels:  regression, regression-models
brglm2
Estimation and inference from generalized linear models using explicit and implicit methods for bias reduction
Stars: ✭ 18 (+20%)
Mutual labels:  regression, estimation
ARCHModels.jl
A Julia package for estimating ARMA-GARCH models.
Stars: ✭ 63 (+320%)
Mutual labels:  regression
projection-pursuit
An implementation of multivariate projection pursuit regression and univariate classification
Stars: ✭ 24 (+60%)
Mutual labels:  regression
ridge
CRAN R Package: Ridge Regression with automatic selection of the penalty parameter
Stars: ✭ 17 (+13.33%)
Mutual labels:  regression
HurdleDMR.jl
Hurdle Distributed Multinomial Regression (HDMR) implemented in Julia
Stars: ✭ 19 (+26.67%)
Mutual labels:  regression
pywedge
Makes Interactive Chart Widget, Cleans raw data, Runs baseline models, Interactive hyperparameter tuning & tracking
Stars: ✭ 49 (+226.67%)
Mutual labels:  regression
uav core
The main integrator of MRS UAV packages in ROS, part of the "mrs_uav_system".
Stars: ✭ 28 (+86.67%)
Mutual labels:  estimation
Python-Machine-Learning-Fundamentals
D-Lab's 6 hour introduction to machine learning in Python. Learn how to perform classification, regression, clustering, and do model selection using scikit-learn and TPOT.
Stars: ✭ 46 (+206.67%)
Mutual labels:  regression
titanic survival exploration
Udacity Machine Learning Nano degree Program Project Predicting Passenger Survival
Stars: ✭ 18 (+20%)
Mutual labels:  regression-models
Python-Machine-Learning
Python Machine Learning Algorithms
Stars: ✭ 80 (+433.33%)
Mutual labels:  regression
RegressionTables.jl
Journal-style regression tables
Stars: ✭ 82 (+446.67%)
Mutual labels:  regression
DeepPDE
Deep Learning application to the partial differential equations
Stars: ✭ 27 (+80%)
Mutual labels:  regression-models
Kaggle-Competition-Sberbank
Top 1% rankings (22/3270) code sharing for Kaggle competition Sberbank Russian Housing Market: https://www.kaggle.com/c/sberbank-russian-housing-market
Stars: ✭ 31 (+106.67%)
Mutual labels:  regression-models
Predictive-Maintenance-of-Aircraft-Engine
In this project I aim to apply Various Predictive Maintenance Techniques to accurately predict the impending failure of an aircraft turbofan engine.
Stars: ✭ 48 (+220%)
Mutual labels:  regression
data-science-notes
Open-source project hosted at https://makeuseofdata.com to crowdsource a robust collection of notes related to data science (math, visualization, modeling, etc)
Stars: ✭ 52 (+246.67%)
Mutual labels:  regression
hdfe
No description or website provided.
Stars: ✭ 22 (+46.67%)
Mutual labels:  regression

Countries-GDP-prediction Open in Streamlit

Data Source

We have 'Countries of The World' data set (from kaggle: Fernando Lasso: https://www.kaggle.com/fernandol/countries-of-the-world).

Data Description

This dataset have each country as a data point (227 countries in total), and for each, we have 20 columns, each column represents a different aspect or measure of the specific country.

Project Goal

The goal of the project is to understand this dataset, get some insights from it, and finally to train a model that can predict GDP per capita for each country.

Conclusion

4 different learning regressors (Linear Regression, SVM, Random Forest, and Gradient Boosting) were tested, and we have achieved the best prediction performance using Random Forest, followed by Gradient Boosting, then Linear Regression, while SVM achieved the worst performance of the four.

The best prediction performance was achieved with a Random Forest regressor, using all features in the dataset, and resulted in the following metrics:

  • Mean Absolute Error (MAE): 2142.13
  • Root mean squared error (RMSE): 3097.19
  • R-squared Score (R2_Score): 0.8839

(gdp_per_capita values in the dataset ranges from 500 to 55100 USD).

Deployment

I have created a web app using streamlit library to deploy the final model. In the app, the user can input the different attributes of a certain country, and click a button in order to get the estimated GDP per capita for that country.

please feel free to try my app here, and send me an email if you have any questions or feedback.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].