All Projects → Scalable Data Science → Similar Projects or Alternatives

977 Open source projects that are alternatives of or similar to Scalable Data Science

Spark Notebook
Interactive and Reactive Data Science using Scala and Spark.
Stars: ✭ 3,081 (+2069.72%)
Mutual labels:  data-science, apache-spark
Dist Keras
Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
Stars: ✭ 613 (+331.69%)
Mutual labels:  data-science, apache-spark
Griffon Vm
Griffon Data Science Virtual Machine
Stars: ✭ 128 (-9.86%)
Mutual labels:  data-science, apache-spark
Pysparkling
A pure Python implementation of Apache Spark's RDD and DStream interfaces.
Stars: ✭ 231 (+62.68%)
Mutual labels:  data-science, apache-spark
Agile data code 2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+190.85%)
Mutual labels:  data-science, apache-spark
Pulsar Spark
When Apache Pulsar meets Apache Spark
Stars: ✭ 55 (-61.27%)
Mutual labels:  data-science, apache-spark
Seq2seq tutorial
Code For Medium Article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models"
Stars: ✭ 132 (-7.04%)
Mutual labels:  data-science
Youtube Like Predictor
YouTube Like Count Predictions using Machine Learning
Stars: ✭ 137 (-3.52%)
Mutual labels:  data-science
Ds Ai Tech Notes
📖 [译] 数据科学和人工智能技术笔记
Stars: ✭ 131 (-7.75%)
Mutual labels:  data-science
Spark
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+1111.97%)
Mutual labels:  apache-spark
Interactive machine learning
IPython widgets, interactive plots, interactive machine learning
Stars: ✭ 140 (-1.41%)
Mutual labels:  data-science
2016 Ml Contest
Machine learning contest - October 2016 TLE
Stars: ✭ 135 (-4.93%)
Mutual labels:  data-science
Awesome Scientific Python
A curated list of awesome scientific Python resources
Stars: ✭ 127 (-10.56%)
Mutual labels:  data-science
Automl alex
State-of-the art Automated Machine Learning python library for Tabular Data
Stars: ✭ 132 (-7.04%)
Mutual labels:  data-science
Scilab
Free and Open Source software for numerical computation providing a powerful computing environment for engineering and scientific applications.
Stars: ✭ 138 (-2.82%)
Mutual labels:  data-science
Awesome Datascience Colleges
A list of colleges and universities offering degrees in data science.
Stars: ✭ 131 (-7.75%)
Mutual labels:  data-science
Python Cheat Sheet
Python Cheat Sheet NumPy, Matplotlib
Stars: ✭ 1,739 (+1124.65%)
Mutual labels:  data-science
Awesome Community Detection
A curated list of community detection research papers with implementations.
Stars: ✭ 1,874 (+1219.72%)
Mutual labels:  data-science
Accelerator
The Accelerator is a tool for fast and reproducible processing of large amounts of data.
Stars: ✭ 137 (-3.52%)
Mutual labels:  data-science
Dtale Desktop
Build a data visualization dashboard with simple snippets of python code
Stars: ✭ 128 (-9.86%)
Mutual labels:  data-science
Book
This book serves as an introduction to a whole new way of thinking systematically about geographic data, using geographical analysis and computation to unlock new insights hidden within data.
Stars: ✭ 141 (-0.7%)
Mutual labels:  data-science
Unicode plot.rb
Plot your data by Unicode characters
Stars: ✭ 127 (-10.56%)
Mutual labels:  data-science
Qlik Py Tools
Data Science algorithms for Qlik implemented as a Python Server Side Extension (SSE).
Stars: ✭ 135 (-4.93%)
Mutual labels:  data-science
Butterfree
A tool for building feature stores.
Stars: ✭ 126 (-11.27%)
Mutual labels:  data-science
Stock Prediction
Smart Algorithms to predict buying and selling of stocks on the basis of Mutual Funds Analysis, Stock Trends Analysis and Prediction, Portfolio Risk Factor, Stock and Finance Market News Sentiment Analysis and Selling profit ratio. Project developed as a part of NSE-FutureTech-Hackathon 2018, Mumbai. Team : Semicolon
Stars: ✭ 125 (-11.97%)
Mutual labels:  data-science
Complete Life Cycle Of A Data Science Project
Complete-Life-Cycle-of-a-Data-Science-Project
Stars: ✭ 140 (-1.41%)
Mutual labels:  data-science
Hermione
ML made simple
Stars: ✭ 135 (-4.93%)
Mutual labels:  data-science
Dive Into Machine Learning
Dive into Machine Learning with Python Jupyter notebook and scikit-learn! First posted in 2016, maintained as of 2021. Pull requests welcome.
Stars: ✭ 10,810 (+7512.68%)
Mutual labels:  data-science
Tntorch
Tensor Network Learning with PyTorch
Stars: ✭ 133 (-6.34%)
Mutual labels:  data-science
Traffic
A toolbox for processing and analysing air traffic data
Stars: ✭ 138 (-2.82%)
Mutual labels:  data-science
Pecan
The Predictive Ecosystem Analyzer (PEcAn) is an integrated ecological bioinformatics toolbox.
Stars: ✭ 132 (-7.04%)
Mutual labels:  data-science
Nlpaug
Data augmentation for NLP
Stars: ✭ 2,761 (+1844.37%)
Mutual labels:  data-science
Rpy2
Interface to use R from Python
Stars: ✭ 132 (-7.04%)
Mutual labels:  data-science
Machine Learning And Data Science
This is a repository which contains all my work related Machine Learning, AI and Data Science. This includes my graduate projects, machine learning competition codes, algorithm implementations and reading material.
Stars: ✭ 137 (-3.52%)
Mutual labels:  data-science
Datascicomp
A collection of popular Data Science Challenges/Competitions || Countdown timers to keep track of the entry deadlines.
Stars: ✭ 1,636 (+1052.11%)
Mutual labels:  data-science
Coffee Quality Database
Building the Coffee Quality Institute Database
Stars: ✭ 141 (-0.7%)
Mutual labels:  data-science
Stats337
Readings in applied data science
Stars: ✭ 1,625 (+1044.37%)
Mutual labels:  data-science
Spark On Lambda
Apache Spark on AWS Lambda
Stars: ✭ 137 (-3.52%)
Mutual labels:  apache-spark
Ripser.py
A Lean Persistent Homology Library for Python
Stars: ✭ 139 (-2.11%)
Mutual labels:  data-science
Torchbear
🔥🐻 The Speakeasy Scripting Engine Which Combines Speed, Safety, and Simplicity
Stars: ✭ 128 (-9.86%)
Mutual labels:  data-science
Data Science Wg
SF Brigade's Data Science Working Group.
Stars: ✭ 135 (-4.93%)
Mutual labels:  data-science
Lifelines
Survival analysis in Python
Stars: ✭ 1,766 (+1143.66%)
Mutual labels:  data-science
Doddle Model
🍰 doddle-model: machine learning in Scala.
Stars: ✭ 142 (+0%)
Mutual labels:  data-science
Data Science For Marketing Analytics
Achieve your marketing goals with the data analytics power of Python
Stars: ✭ 127 (-10.56%)
Mutual labels:  data-science
Pandasschema
A validation library for Pandas data frames using user-friendly schemas
Stars: ✭ 135 (-4.93%)
Mutual labels:  data-science
Pipelinex
PipelineX: Python package to build ML pipelines for experimentation with Kedro, MLflow, and more
Stars: ✭ 127 (-10.56%)
Mutual labels:  data-science
Toma
Helps you write algorithms in PyTorch that adapt to the available (CUDA) memory
Stars: ✭ 139 (-2.11%)
Mutual labels:  data-science
Cape Python
Collaborate on privacy-preserving policy for data science projects in Pandas and Apache Spark
Stars: ✭ 125 (-11.97%)
Mutual labels:  data-science
Beyond Jupyter
🐍💻📊 All material from the PyCon.DE 2018 Talk "Beyond Jupyter Notebooks - Building your own data science platform with Python & Docker" (incl. Slides, Video, Udemy MOOC & other References)
Stars: ✭ 135 (-4.93%)
Mutual labels:  data-science
Deeplearning Notes
Notes for Deep Learning Specialization Courses led by Andrew Ng.
Stars: ✭ 126 (-11.27%)
Mutual labels:  data-science
Matrixprofile
A Python 3 library making time series data mining tasks, utilizing matrix profile algorithms, accessible to everyone.
Stars: ✭ 141 (-0.7%)
Mutual labels:  data-science
Aws Data Wrangler
Pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Stars: ✭ 2,385 (+1579.58%)
Mutual labels:  data-science
Blockchain2graph
Blockchain2graph extracts blockchain data (bitcoin) and insert them into a graph database (neo4j).
Stars: ✭ 134 (-5.63%)
Mutual labels:  data-science
Rightmove webscraper.py
Python class to scrape data from rightmove.co.uk and return listings in a pandas DataFrame object
Stars: ✭ 125 (-11.97%)
Mutual labels:  data-science
Dbg Pds
Deutsche Boerse's Financial Trading Public Data Set
Stars: ✭ 124 (-12.68%)
Mutual labels:  data-science
Datasciencecoursera
Data Science Repo and blog for John Hopkins Coursera Courses. Please let me know if you have any questions.
Stars: ✭ 1,928 (+1257.75%)
Mutual labels:  data-science
Accelerators
Data science and AI solution accelerator suite that provides templates for prototyping, reporting, and presenting data science analytics of specific domains
Stars: ✭ 134 (-5.63%)
Mutual labels:  data-science
Awesome Materials Informatics
Curated list of known efforts in materials informatics
Stars: ✭ 123 (-13.38%)
Mutual labels:  data-science
Open Solution Salt Identification
Open solution to the TGS Salt Identification Challenge
Stars: ✭ 124 (-12.68%)
Mutual labels:  data-science
Datasciencer
a curated list of R tutorials for Data Science, NLP and Machine Learning
Stars: ✭ 1,727 (+1116.2%)
Mutual labels:  data-science
1-60 of 977 similar projects