oluies / Tedsds
Licence: apache-2.0
Apache Spark - Turbofan Engine Degradation Simulation Data Set example in Apache Spark
Stars: ✭ 14
tedsds
Turbofan Engine Degradation Simulation Data Set example in Apache Spark
Uses the dataset from [1] to demonstrate a machine learning setup for a predictive maintenance scenario for turbofan engines.
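As an illustration of the kind of labelling a predictive maintenance setup typically needs, here is a minimal sketch in plain Python (not the project's Spark code). It assumes run-to-failure records given as (engine id, cycle) pairs, where each engine's last recorded cycle is its failure point. The function names, the record layout, and the 30-cycle window are hypothetical choices for illustration and are not taken from this repository.

```python
from collections import defaultdict


def remaining_useful_life(records):
    """Map each (engine_id, cycle) record to the cycles left before failure.

    Assumption: every engine is observed until it fails, so its highest
    cycle number marks the failure point.
    """
    records = list(records)  # allow two passes over any iterable
    last_cycle = defaultdict(int)
    for engine_id, cycle in records:
        last_cycle[engine_id] = max(last_cycle[engine_id], cycle)
    return {(e, c): last_cycle[e] - c for e, c in records}


def label_failure_within(rul_by_record, window=30):
    """Binary label: 1 if the engine fails within `window` cycles, else 0.

    The default window of 30 is an arbitrary illustrative value.
    """
    return {key: int(rul <= window) for key, rul in rul_by_record.items()}
```

For a single engine observed for three cycles, `remaining_useful_life([(1, 1), (1, 2), (1, 3)])` assigns remaining-life values 2, 1, and 0 to the three records. A classifier trained on the binary labels could then be scored with the area under the ROC curve.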
References:
- A. Saxena, K. Goebel, D. Simon, and N. Eklund, "Damage Propagation Modeling for Aircraft Engine Run-to-Failure Simulation", in the Proceedings of the 1st International Conference on Prognostics and Health Management (PHM08), Denver CO, Oct 2008, retrieved Feb. 2016
- NASA Ames Prognostics data repository, retrieved Feb. 2016, http://ti.arc.nasa.gov/tech/dash/pcoe/prognostic-data-repository/
- Major Challenges in Prognostics: Study on Benchmarking Prognostics Datasets, O. F. Eker, F. Camci, and I. K. Jennions, retrieved Feb. 2016
- Big Data Analytics for eMaintenance: Modeling of high-dimensional data streams, Liangwei Zhang. Luleå: Luleå tekniska universitet, 2015. 46 p. (Licentiate thesis / Luleå University of Technology), retrieved Feb. 2016
- Microsoft Cortana example with the same dataset, retrieved Feb. 2016, Link
- H2o.io example with the same dataset, retrieved Feb. 2016, Link Presentation
- Advanced Analytics with Spark - Patterns for Learning from Data at Scale, by Sandy Ryza, Uri Laserson, Sean Owen, Josh Wills, Link Examples
- The use of the area under the ROC curve in the evaluation of machine learning algorithms, Andrew P. Bradley, Link
- A Few Useful Things to Know about Machine Learning, Pedro Domingos, Link
Spark libraries
SBT plugins