All Projects → itsmuriuki → Fifa 2018 World Cup Predictions

itsmuriuki / Fifa 2018 World Cup Predictions

I used Machine Learning to make a Logistic Regression model using scikit-learn, pandas, numpy, seaborn and matplotlib to predict the results of FIFA 2018 World Cup.

Projects that are alternatives of or similar to Fifa 2018 World Cup Predictions

Learning by association
This repository contains code for the paper Learning by Association - A versatile semi-supervised training method for neural networks (CVPR 2017) and the follow-up work Associative Domain Adaptation (ICCV 2017).
Stars: ✭ 150 (-0.66%)
Mutual labels:  jupyter-notebook
Book nbs
Notebooks for upcoming fastai book (draft / incomplete)
Stars: ✭ 151 (+0%)
Mutual labels:  jupyter-notebook
Ml From Scratch
All the ML algorithms, ML models are coded from scratch by pure Python/Numpy with the Math under the hood. It works well on CPU.
Stars: ✭ 151 (+0%)
Mutual labels:  jupyter-notebook
Stereoconvnet
Stereo convolutional neural network for depth map prediction from stereo images
Stars: ✭ 150 (-0.66%)
Mutual labels:  jupyter-notebook
Homework fall2020
Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2020)
Stars: ✭ 149 (-1.32%)
Mutual labels:  jupyter-notebook
The Python Workshop
A New, Interactive Approach to Learning Python
Stars: ✭ 150 (-0.66%)
Mutual labels:  jupyter-notebook
Ml Workspace
🛠 All-in-one web-based IDE specialized for machine learning and data science.
Stars: ✭ 2,337 (+1447.68%)
Mutual labels:  jupyter-notebook
Sphereface Plus
SphereFace+ Implementation for <Learning towards Minimum Hyperspherical Energy> in NIPS'18.
Stars: ✭ 151 (+0%)
Mutual labels:  jupyter-notebook
Mprgdeeplearninglecturenotebook
Stars: ✭ 148 (-1.99%)
Mutual labels:  jupyter-notebook
Data Engineering Nanodegree
Projects done in the Data Engineering Nanodegree by Udacity.com
Stars: ✭ 151 (+0%)
Mutual labels:  jupyter-notebook
Parcels
Main code for Parcels (Probably A Really Computationally Efficient Lagrangian Simulator)
Stars: ✭ 148 (-1.99%)
Mutual labels:  jupyter-notebook
Freecodecamp Pandas Real Life Example
Stars: ✭ 148 (-1.99%)
Mutual labels:  jupyter-notebook
Hands On Machine Learning With Scikit Learn Keras And Tensorflow
Notes & exercise solutions of Part I from the book: "Hands-On ML with Scikit-Learn, Keras & TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems" by Aurelien Geron
Stars: ✭ 151 (+0%)
Mutual labels:  jupyter-notebook
Ml Mooc Nptel
This repository contains the Tutorials for the NPTEL MOOC on Machine Learning.
Stars: ✭ 150 (-0.66%)
Mutual labels:  jupyter-notebook
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-0.66%)
Mutual labels:  jupyter-notebook
Deeplearning keras2
Modification of fast.ai deep learning course notebooks for usage with Keras 2 and Python 3.
Stars: ✭ 150 (-0.66%)
Mutual labels:  jupyter-notebook
Newspaper Navigator
Stars: ✭ 150 (-0.66%)
Mutual labels:  jupyter-notebook
Designing Data Intensive Applications Notes
Reading notes on the excellent "Designing Data-Intensive Applications"
Stars: ✭ 151 (+0%)
Mutual labels:  jupyter-notebook
Ai Matrix
To make it easy to benchmark AI accelerators
Stars: ✭ 151 (+0%)
Mutual labels:  jupyter-notebook
Analyzingalpha
Stars: ✭ 148 (-1.99%)
Mutual labels:  jupyter-notebook

FIFA-2018-World-cup-predictions

I used Machine Learning to make a Logistic Regression model using scikit-learn, pandas, numpy, seaborn and matplotlib to predict the results of FIFA 2018 World Cup.

FIFA World Cup 2018 Winner Predictions

Goal

  1. The goal is to use Machine Learning to predict who is going to win the FIFA World Cup 2018.

  2. Predict the outcome of individual matches for the entire competition.

  3. Run simulation of the next matches i.e quarter finals, semi finals and finals.

These goals present a unique real-world Machine Learning prediction problem and involve solving various Machine Learning tasks: data integration, feature modelling and outcome prediction.

Data

I used two data sets from Kaggle - Results of the matches since 1930 and the World Cup 2018 Dataset. I used results of historical matches since the beginning of the championship (1930) for all participating teams.

Environment and tools

  1. Jupyter Notebook
  2. Numpy
  3. Pandas
  4. Seaborn
  5. Matplotlib
  6. Scikit-learn

I chose Logistic Regression in my model and got an accuracy of 57% on the training set and 55% accuracy on the test set. I also used the FIFA ranking as of April 2018 dataset and a dataset containing the fixture of the group stages of the tournament.

According to this model Brazil is likely to win this World Cup.

Areas of further Research/ Improvement

  1. Dataset - to improve dataset you could use FIFA, the game not the organisation, to assess the quality of each team player.

  2. A confusion matrix would be great to analyse which games the model got wrong.

  3. We could ensemble that is, we could try stacking more models together to improve the accuracy.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].