lamastex / Scalable Data Science

Licence: unlicense
Scalable Data Science: course sets in big data using Apache Spark on Databricks, together with their mathematical, statistical and computational foundations using SageMath.

Programming Languages

Scala

Projects that are alternatives to, or similar to, Scalable Data Science

Pysparkling
A pure Python implementation of Apache Spark's RDD and DStream interfaces.
Stars: ✭ 231 (+62.68%)
Mutual labels:  data-science, apache-spark
Dist Keras
Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
Stars: ✭ 613 (+331.69%)
Mutual labels:  data-science, apache-spark
Spark Notebook
Interactive and Reactive Data Science using Scala and Spark.
Stars: ✭ 3,081 (+2069.72%)
Mutual labels:  data-science, apache-spark
Pulsar Spark
When Apache Pulsar meets Apache Spark
Stars: ✭ 55 (-61.27%)
Mutual labels:  data-science, apache-spark
Agile data code 2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+190.85%)
Mutual labels:  data-science, apache-spark
Griffon Vm
Griffon Data Science Virtual Machine
Stars: ✭ 128 (-9.86%)
Mutual labels:  data-science, apache-spark
Scilab
Free and Open Source software for numerical computation providing a powerful computing environment for engineering and scientific applications.
Stars: ✭ 138 (-2.82%)
Mutual labels:  data-science
Python Cheat Sheet
Python Cheat Sheet NumPy, Matplotlib
Stars: ✭ 1,739 (+1124.65%)
Mutual labels:  data-science
Youtube Like Predictor
YouTube Like Count Predictions using Machine Learning
Stars: ✭ 137 (-3.52%)
Mutual labels:  data-science
Accelerator
The Accelerator is a tool for fast and reproducible processing of large amounts of data.
Stars: ✭ 137 (-3.52%)
Mutual labels:  data-science
Doddle Model
🍰 doddle-model: machine learning in Scala.
Stars: ✭ 142 (+0%)
Mutual labels:  data-science
Book
This book serves as an introduction to a whole new way of thinking systematically about geographic data, using geographical analysis and computation to unlock new insights hidden within data.
Stars: ✭ 141 (-0.7%)
Mutual labels:  data-science
Interactive machine learning
IPython widgets, interactive plots, interactive machine learning
Stars: ✭ 140 (-1.41%)
Mutual labels:  data-science
Traffic
A toolbox for processing and analysing air traffic data
Stars: ✭ 138 (-2.82%)
Mutual labels:  data-science
Nlpaug
Data augmentation for NLP
Stars: ✭ 2,761 (+1844.37%)
Mutual labels:  data-science
Machine Learning And Data Science
A repository containing all my work related to Machine Learning, AI and Data Science. This includes my graduate projects, machine learning competition code, algorithm implementations and reading material.
Stars: ✭ 137 (-3.52%)
Mutual labels:  data-science
Coffee Quality Database
Building the Coffee Quality Institute Database
Stars: ✭ 141 (-0.7%)
Mutual labels:  data-science
Spark On Lambda
Apache Spark on AWS Lambda
Stars: ✭ 137 (-3.52%)
Mutual labels:  apache-spark
Toma
Helps you write algorithms in PyTorch that adapt to the available (CUDA) memory
Stars: ✭ 139 (-2.11%)
Mutual labels:  data-science
Matrixprofile
A Python 3 library making time series data mining tasks, utilizing matrix profile algorithms, accessible to everyone.
Stars: ✭ 141 (-0.7%)
Mutual labels:  data-science

This project does not contain a readme.
