lamastex / Scalable Data Science
Licence: unlicense
Scalable Data Science, course sets in big data Using Apache Spark over databricks and their mathematical, statistical and computational foundations using SageMath.
Stars: ✭ 142
Programming Languages
scala
5932 projects
Projects that are alternatives of or similar to Scalable Data Science
Pysparkling
A pure Python implementation of Apache Spark's RDD and DStream interfaces.
Stars: ✭ 231 (+62.68%)
Mutual labels: data-science, apache-spark
Dist Keras
Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
Stars: ✭ 613 (+331.69%)
Mutual labels: data-science, apache-spark
Spark Notebook
Interactive and Reactive Data Science using Scala and Spark.
Stars: ✭ 3,081 (+2069.72%)
Mutual labels: data-science, apache-spark
Pulsar Spark
When Apache Pulsar meets Apache Spark
Stars: ✭ 55 (-61.27%)
Mutual labels: data-science, apache-spark
Agile data code 2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+190.85%)
Mutual labels: data-science, apache-spark
Griffon Vm
Griffon Data Science Virtual Machine
Stars: ✭ 128 (-9.86%)
Mutual labels: data-science, apache-spark
Scilab
Free and Open Source software for numerical computation providing a powerful computing environment for engineering and scientific applications.
Stars: ✭ 138 (-2.82%)
Mutual labels: data-science
Python Cheat Sheet
Python Cheat Sheet NumPy, Matplotlib
Stars: ✭ 1,739 (+1124.65%)
Mutual labels: data-science
Youtube Like Predictor
YouTube Like Count Predictions using Machine Learning
Stars: ✭ 137 (-3.52%)
Mutual labels: data-science
Accelerator
The Accelerator is a tool for fast and reproducible processing of large amounts of data.
Stars: ✭ 137 (-3.52%)
Mutual labels: data-science
Doddle Model
🍰 doddle-model: machine learning in Scala.
Stars: ✭ 142 (+0%)
Mutual labels: data-science
Book
This book serves as an introduction to a whole new way of thinking systematically about geographic data, using geographical analysis and computation to unlock new insights hidden within data.
Stars: ✭ 141 (-0.7%)
Mutual labels: data-science
Interactive machine learning
IPython widgets, interactive plots, interactive machine learning
Stars: ✭ 140 (-1.41%)
Mutual labels: data-science
Traffic
A toolbox for processing and analysing air traffic data
Stars: ✭ 138 (-2.82%)
Mutual labels: data-science
Machine Learning And Data Science
This is a repository which contains all my work related Machine Learning, AI and Data Science. This includes my graduate projects, machine learning competition codes, algorithm implementations and reading material.
Stars: ✭ 137 (-3.52%)
Mutual labels: data-science
Coffee Quality Database
Building the Coffee Quality Institute Database
Stars: ✭ 141 (-0.7%)
Mutual labels: data-science
Toma
Helps you write algorithms in PyTorch that adapt to the available (CUDA) memory
Stars: ✭ 139 (-2.11%)
Mutual labels: data-science
Matrixprofile
A Python 3 library making time series data mining tasks, utilizing matrix profile algorithms, accessible to everyone.
Stars: ✭ 141 (-0.7%)
Mutual labels: data-science
This project does not contain a readme.
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].