All Projects → wikistat → AI-Frameworks

wikistat / AI-Frameworks

Licence: MIT License
Science des Données Saison 5: Technologies pour l'apprentissage automatique / statistique de données massives et l'Intelligence Artificielle

Programming Languages

Jupyter Notebook
11667 projects
python
139335 projects - #7 most used programming language
c
50402 projects - #5 most used programming language

Projects that are alternatives of or similar to AI-Frameworks

MachineLearning
机器学习教程,本教程包含基于numpy、sklearn与tensorflow机器学习,也会包含利用spark、flink加快模型训练等用法。本着能够较全的引导读者入门机器学习。
Stars: ✭ 23 (-36.11%)
Mutual labels:  spark-mllib
Machine-Learning-Model-Deployment
This is a machine learning model deployment project of Iris classification model in a minimal UI using flask web framework and deployed it in Azure cloud using Azure app service. We initially made this project as a requirement for an internship at Indian Servers. We are now making it open to contribution.
Stars: ✭ 70 (+94.44%)
Mutual labels:  scikitlearn-machine-learning
from-fat-controllers-to-use-cases
Rails (API) app that shows different kinds of architecture (one per commit), and in the last one, how to use the Micro::Case gem to handle the application business logic.
Stars: ✭ 74 (+105.56%)
Mutual labels:  use-cases
Uber-Rider-Churn-Analysis
Uber is interested in predicting rider retention. To help explore this question, they have provided a sample dataset of a cohort of users.
Stars: ✭ 20 (-44.44%)
Mutual labels:  scikitlearn-machine-learning
Heart-disease-prediction
No description or website provided.
Stars: ✭ 41 (+13.89%)
Mutual labels:  scikitlearn-machine-learning
SparkMlpDow30
A new stock trading and prediction model based on a MLP neural network utilizing technical analysis indicator values as features (using Apache Spark MLlib)
Stars: ✭ 34 (-5.56%)
Mutual labels:  spark-mllib
Word-Embeddings-and-Document-Vectors
An evaluation of word-embeddings for classification
Stars: ✭ 32 (-11.11%)
Mutual labels:  scikitlearn-machine-learning
Spark ALS
基于spark-ml,spark-mllib,spark-streaming的推荐算法实现
Stars: ✭ 89 (+147.22%)
Mutual labels:  spark-mllib
Machine-Learning
Examples of all Machine Learning Algorithm in Apache Spark
Stars: ✭ 15 (-58.33%)
Mutual labels:  spark-mllib
Trajectory-Analysis-and-Classification-in-Python-Pandas-and-Scikit-Learn
Formed trajectories of sets of points.Experimented on finding similarities between trajectories based on DTW (Dynamic Time Warping) and LCSS (Longest Common SubSequence) algorithms.Modeled trajectories as strings based on a Grid representation.Benchmarked KNN, Random Forest, Logistic Regression classification algorithms to classify efficiently t…
Stars: ✭ 41 (+13.89%)
Mutual labels:  scikitlearn-machine-learning
Grokking-Machine-Learning
This repo aims to contain different machine learning use cases along with the descriptions to the model architectures
Stars: ✭ 54 (+50%)
Mutual labels:  scikitlearn-machine-learning
Igel
a delightful machine learning tool that allows you to train, test, and use models without writing code
Stars: ✭ 2,956 (+8111.11%)
Mutual labels:  scikitlearn-machine-learning
Movie recommend
基于Spark的电影推荐系统,包含爬虫项目、web网站、后台管理系统以及spark推荐系统
Stars: ✭ 2,092 (+5711.11%)
Mutual labels:  spark-mllib
Spotify-Song-Recommendation-ML
UC Berkeley team's submission for RecSys Challenge 2018
Stars: ✭ 70 (+94.44%)
Mutual labels:  spark-mllib

INSA | Applied mathematics, Data Science

Artificial Intelligence Frameworks

This course follows the Machine Learning and the High Dimensional & Deep Learning courses. In theses courses, you have acquired knowledge in machine and deep learning algoritms and their application on various type of data. This knowledge is primordial to become a DataScientist.

This course has three main objectives. You will

  1. learn how to apply efficiently these algorithms using

  2. discover new field of artificial intelligence applied on (real) datasets that require specific algorithms:

    • Text.
    • Video Game
      • Algorithms: Reinforcement learning, (Policy Gradient algorithm, Q-Learning, Deep Q-learning)
      • *Libraries : AI Gym, Tensorflow.
    • Movies Notations
      • Algorithms: Recommendation system, (User/User and Item/Item filters, NMF, Neural recomendation system)
      • *Libraries : Surprise, Tensorflow.
  3. how to efficiently share reproducible code.

    • Build a Github repository.

NB: Some contents from previous years are still available on the repository (like Spark) but are not covered during theses courses anymore.

Knowledge requirements

Schedule

  • Lectures : 10 hours
  • Practical Works : 30 hours.

The course is divided in 5 topics (of various lentgh) over 5 days.

Course introduction + Github Reminder: Slides/Video

  • Session 1 - 02-11-20

  • Session 2 - 16-11-20

    • Text: Recurrent Network
    • Development for Data Scientist: Python environment + Github Repo + Python Script.
  • Session 3 - 30-11-20

    • Development for Data Scientist: Introduction to Google Cloud Computing.
    • Development for Data Scientist: Docker
  • Session 4 - 07-12-20

    • Introduction to deep Reinforcement learning: Deep Q-learning
      • Slides / Video
      • Q Learning reminder: TP
      • Deep Q Learning on cartpole: TP
      • Deep Q Learning on Gridworld: TP
  • Session 5 14-12-20

  • Session 6 04-01-20

    • Free time on project.

Evaluation

The evaluation is associated to the DEFI-IA

Objective

You will be evaluated on your capacity of acting like a Data Scientist, i.e.

  • Handle a new dataset and explore it.
  • Find a solution to address the defi's problem with a high score (above baseline).
  • Explain the choosen algorithm.
  • Write a complete pipeline to easily reproduce the results.
  • Justify the choice of the algorithms and the environment (CPU/GPU, Cloud etc..).
  • Share it and make your results easily reproducible (Git - docker, conda environment.).

Notations

  1. Project - (60%): a Git repository.

    • The git should contain a clear markdown Readme, which describes (33%)
      • Which result you achieved? In which computation time? On which engine?
      • What do I have to install to be able to reproduce the code?
      • Which command do I have to run to reproduce the results?
    • The code has to be easily reproducible. (33%)
      • Packages required has to be well described. (a requirements.txt files is the best)
      • Conda command or docker command can be furnish
    • The code should be clear and easily readable. (33%)
      • Final results can be run in a script and not a notebook.
      • Only final code can be found in this script.
    • Deadline : January 29 2021.
  2. Rapport - (40%) 10 pages maximum:

    • Quality of the presentation. 25%
    • In-Deep explanation of the chosen algorithm. 25%
    • Choice of the tools-infrastructure used. 25%
    • Results you obtained. 25%
    • Date : January 29, 2021.

Other details

  • Group of 4 to 5 people (DEFI IA's team).

Technical requirements.

All the libraries required for these modules are listed in the requirements.txt (IN CONSTRUCTION/ ONLY SESSION 1 IS OK)

To build a functional environment in pandas execute the following lines:

conda create -n AIF python=3.8
conda activate AIF
pip install -r requirements.txt 
jupyter labextension install [email protected]
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].