All Projects → oluies → Tedsds

oluies / Tedsds

Licence: apache-2.0
Apache Spark - Turbofan Engine Degradation Simulation Data Set example in Apache Spark

Projects that are alternatives of or similar to Tedsds

Cdap
An open source framework for building data analytic applications.
Stars: ✭ 509 (+3535.71%)
Mutual labels:  dataset, spark
Caffenet Benchmark
Evaluation of the CNN design choices performance on ImageNet-2012.
Stars: ✭ 700 (+4900%)
Mutual labels:  jupyter-notebook, dataset
Justenoughscalaforspark
A tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.
Stars: ✭ 538 (+3742.86%)
Mutual labels:  jupyter-notebook, spark
Comma2k19
A driving dataset for the development and validation of fused pose estimators and mapping algorithms
Stars: ✭ 391 (+2692.86%)
Mutual labels:  jupyter-notebook, dataset
Yandex Big Data Engineering
Stars: ✭ 17 (+21.43%)
Mutual labels:  jupyter-notebook, spark
Enterprise gateway
A lightweight, multi-tenant, scalable and secure gateway that enables Jupyter Notebooks to share resources across distributed clusters such as Apache Spark, Kubernetes and others.
Stars: ✭ 412 (+2842.86%)
Mutual labels:  jupyter-notebook, spark
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+40300%)
Mutual labels:  jupyter-notebook, spark
Whylogs
Profile and monitor your ML data pipeline end-to-end
Stars: ✭ 328 (+2242.86%)
Mutual labels:  jupyter-notebook, dataset
Covid Ct
COVID-CT-Dataset: A CT Scan Dataset about COVID-19
Stars: ✭ 820 (+5757.14%)
Mutual labels:  jupyter-notebook, dataset
Spark Movie Lens
An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (+5221.43%)
Mutual labels:  jupyter-notebook, spark
Vpgnet
VPGNet: Vanishing Point Guided Network for Lane and Road Marking Detection and Recognition (ICCV 2017)
Stars: ✭ 382 (+2628.57%)
Mutual labels:  jupyter-notebook, dataset
Spark Tdd Example
A simple Spark TDD example
Stars: ✭ 23 (+64.29%)
Mutual labels:  jupyter-notebook, spark
Medmnist
[ISBI'21] MedMNIST Classification Decathlon: A Lightweight AutoML Benchmark for Medical Image Analysis
Stars: ✭ 338 (+2314.29%)
Mutual labels:  jupyter-notebook, dataset
Agile data code 2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+2850%)
Mutual labels:  jupyter-notebook, spark
Dsprites Dataset
Dataset to assess the disentanglement properties of unsupervised learning methods
Stars: ✭ 340 (+2328.57%)
Mutual labels:  jupyter-notebook, dataset
Hate Speech And Offensive Language
Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017
Stars: ✭ 543 (+3778.57%)
Mutual labels:  jupyter-notebook, dataset
Covid19 twitter
Covid-19 Twitter dataset for non-commercial research use and pre-processing scripts - under active development
Stars: ✭ 304 (+2071.43%)
Mutual labels:  jupyter-notebook, dataset
Transportationnetworks
Transportation Networks for Research
Stars: ✭ 312 (+2128.57%)
Mutual labels:  jupyter-notebook, dataset
Elasticsearch Spark Recommender
Use Jupyter Notebooks to demonstrate how to build a Recommender with Apache Spark & Elasticsearch
Stars: ✭ 707 (+4950%)
Mutual labels:  jupyter-notebook, spark
Spark Scala Tutorial
A free tutorial for Apache Spark.
Stars: ✭ 907 (+6378.57%)
Mutual labels:  jupyter-notebook, spark

tedsds

Turbofan Engine Degradation Simulation Data Set example in Apache Spark

Uses the dataset from [1] to create a demostration of a machine learning setup for a predictive maintainance scenario for Turbofan Engines.

References:

  1. A. Saxena, K. Goebel, D. Simon, and N. Eklund, "Damage Propagation Modeling for Aircraft Engine Run-to-Failure Simulation", in the Proceedings of the Ist International Conference on Prognostics and Health Management (PHM08), Denver CO, Oct 2008., retrieved feb. 2016
  2. NASA Ames Prognostics data repository, retrieved feb. 2016, http://ti.arc.nasa.gov/tech/dash/pcoe/prognostic-data-repository/
  3. Major Challenges in Prognostics: Study on Benchmarking Prognostics Datasets, O. F. Eker1, F. Camci, and I. K. Jennions1, retrieved feb. 2016
  4. Big Data Analytics for eMaintenance : Modeling of high-dimensional data streams. / Zhang, Liangwei. Luleå : Luleå tekniska universitet, 2015. 46 p. (Licentiate thesis / Luleå University of Technology). Publication: Research › Licentiate thesis, retrieved feb. 2016
  5. Microsoft Cortana example with the same dataset, retrieved feb. 2016 Link
  6. H2o.io example with the same dataset, retrieved feb. 2016 Link Presentation
  7. Advanced Analytics with Spark - Patterns for Learning from Data at Scale By Sandy Ryza, Uri Laserson, Sean Owen, Josh Wills. Link Examples
  8. The use of the area under the ROC curve in the evaluation of machine learning algorithms,Andrew P Bradley Link
  9. A Few Useful Things to Know about Machine Learning, Pedro Domingos, Link

Spark libraries

  1. https://github.com/databricks/spark-csv

SBT plugins

  1. https://github.com/databricks/sbt-spark-package
  2. https://github.com/sbt/sbt-git
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].