All Projects → Agile_data_code_2 → Similar Projects or Alternatives

8782 Open source projects that are alternatives of or similar to Agile_data_code_2

Wirbelsturm
Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a focus on big data tech like Kafka.
Stars: ✭ 332 (-19.61%)
Mutual labels:  apache-kafka, kafka, spark, apache-spark, vagrant
Kafka Storm Starter
Code examples that show to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
Stars: ✭ 728 (+76.27%)
Mutual labels:  apache-kafka, kafka, spark, apache-spark
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-63.68%)
Introduction Datascience Python Book
Introduction to Data Science: A Python Approach to Concepts, Techniques and Applications
Stars: ✭ 275 (-33.41%)
Pixiedust
Python Helper library for Jupyter Notebooks
Stars: ✭ 998 (+141.65%)
Mutual labels:  jupyter-notebook, data-science, spark
Spring2017 proffosterprovost
Introduction to Data Science
Stars: ✭ 18 (-95.64%)
25daysinmachinelearning
I will update this repository to learn Machine learning with python with statistics content and materials
Stars: ✭ 53 (-87.17%)
Machinelearning
A repo with tutorials for algorithms from scratch
Stars: ✭ 96 (-76.76%)
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+223.97%)
Mutual labels:  jupyter-notebook, data-science, spark
Python Bigdata
Data science and Big Data with Python
Stars: ✭ 112 (-72.88%)
Mutual labels:  jupyter-notebook, data-science, spark
Spark Notebook
Interactive and Reactive Data Science using Scala and Spark.
Stars: ✭ 3,081 (+646%)
Mutual labels:  data-science, spark, apache-spark
Sciblog support
Support content for my blog
Stars: ✭ 694 (+68.04%)
Machine learning refined
Notes, examples, and Python demos for the textbook "Machine Learning Refined" (published by Cambridge University Press).
Stars: ✭ 750 (+81.6%)
Zat
Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Stars: ✭ 303 (-26.63%)
Mutual labels:  jupyter-notebook, kafka, spark
Minerva Training Materials
Learn advanced data science on real-life, curated problems
Stars: ✭ 37 (-91.04%)
Covid19 Dashboard
A site that displays up to date COVID-19 stats, powered by fastpages.
Stars: ✭ 1,212 (+193.46%)
Openml R
R package to interface with OpenML
Stars: ✭ 81 (-80.39%)
Hass Data Detective
Explore and analyse your Home Assistant data
Stars: ✭ 109 (-73.61%)
Mutual labels:  jupyter-notebook, data-science, data
Ds and ml projects
Data Science & Machine Learning projects and tutorials in python from beginner to advanced level.
Stars: ✭ 56 (-86.44%)
Azure Cosmosdb Spark
Apache Spark Connector for Azure Cosmos DB
Stars: ✭ 165 (-60.05%)
Mutual labels:  jupyter-notebook, spark, apache-spark
Scalable Data Science Platform
Content for architecting a data science platform for products using Luigi, Spark & Flask.
Stars: ✭ 158 (-61.74%)
Mutual labels:  jupyter-notebook, data-science, spark
Articles
A repository for the source code, notebooks, data, files, and other assets used in the data science and machine learning articles on LearnDataSci
Stars: ✭ 350 (-15.25%)
Mydatascienceportfolio
Applying Data Science and Machine Learning to Solve Real World Business Problems
Stars: ✭ 227 (-45.04%)
Mutual labels:  jupyter-notebook, data-science, spark
100 Days Of Ml Code
A day to day plan for this challenge. Covers both theoritical and practical aspects
Stars: ✭ 172 (-58.35%)
Awesome Pulsar
A curated list of Pulsar tools, integrations and resources.
Stars: ✭ 57 (-86.2%)
Mutual labels:  apache-kafka, spark, apache-spark
Oryx
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
Stars: ✭ 1,785 (+332.2%)
Mutual labels:  apache-kafka, kafka, apache-spark
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+1269.49%)
Mutual labels:  jupyter-notebook, data-science, spark
Free Ai Resources
🚀 FREE AI Resources - 🎓 Courses, 👷 Jobs, 📝 Blogs, 🔬 AI Research, and many more - for everyone!
Stars: ✭ 192 (-53.51%)
Skdata
Python tools for data analysis
Stars: ✭ 16 (-96.13%)
Mutual labels:  jupyter-notebook, data-science, data
Datacompy
Pandas and Spark DataFrame comparison for humans
Stars: ✭ 147 (-64.41%)
Mutual labels:  data-science, spark, data
Cartola
Extração de dados da API do CartolaFC, análise exploratória dos dados e modelos preditivos em R e Python - 2014-20. [EN] Data munging, analysis and modeling of CartolaFC - the most popular fantasy football game in Brazil and maybe in the world. Data cover years 2014-19.
Stars: ✭ 304 (-26.39%)
Mutual labels:  jupyter-notebook, data-science, data
Optimus
🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+138.74%)
Mutual labels:  jupyter-notebook, data-science, spark
Machine Learning From Scratch
Succinct Machine Learning algorithm implementations from scratch in Python, solving real-world problems (Notebooks and Book). Examples of Logistic Regression, Linear Regression, Decision Trees, K-means clustering, Sentiment Analysis, Recommender Systems, Neural Networks and Reinforcement Learning.
Stars: ✭ 42 (-89.83%)
Pyspark Cheatsheet
🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (-73.85%)
Mutual labels:  data-science, spark, data
Suspeitando
Projeto de análise de contratos com suspeita de superfaturamento e má qualidade na prestação de serviços.
Stars: ✭ 76 (-81.6%)
W2v
Word2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (-84.5%)
Mutual labels:  jupyter-notebook, data-science, spark
Ds With Pysimplegui
Data science and Machine Learning GUI programs/ desktop apps with PySimpleGUI package
Stars: ✭ 93 (-77.48%)
Data Science Cookbook
🎓 Jupyter notebooks from UFC data science course
Stars: ✭ 60 (-85.47%)
Mutual labels:  jupyter-notebook, data-science, spark
Codesearchnet
Datasets, tools, and benchmarks for representation learning of code.
Stars: ✭ 1,378 (+233.66%)
Mutual labels:  jupyter-notebook, data-science, data
Data Science Stack Cookiecutter
🐳📊🤓Cookiecutter template to launch an awesome dockerized Data Science toolstack (incl. Jupyster, Superset, Postgres, Minio, AirFlow & API Star)
Stars: ✭ 153 (-62.95%)
Loandefault Prediction
Lending Club Loan data analysis
Stars: ✭ 113 (-72.64%)
Pulsar Spark
When Apache Pulsar meets Apache Spark
Stars: ✭ 55 (-86.68%)
Mutual labels:  data-science, spark, apache-spark
Spark Jupyter Aws
A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support
Stars: ✭ 259 (-37.29%)
Mutual labels:  jupyter-notebook, spark, apache-spark
Data Science Hacks
Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
Stars: ✭ 273 (-33.9%)
Mutual labels:  jupyter-notebook, data-science, data
Python Machine Learning Book
The "Python Machine Learning (1st edition)" book code repository and info resource
Stars: ✭ 11,428 (+2667.07%)
Data Science Resources
👨🏽‍🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (-58.6%)
Mutual labels:  jupyter-notebook, data-science, data
Data science blogs
A repository to keep track of all the code that I end up writing for my blog posts.
Stars: ✭ 139 (-66.34%)
Mutual labels:  jupyter-notebook, spark, data
Web Database Analytics
Web scrapping and related analytics using Python tools
Stars: ✭ 175 (-57.63%)
Stats Maths With Python
General statistics, mathematical programming, and numerical/scientific computing scripts and notebooks in Python
Stars: ✭ 381 (-7.75%)
Interactive machine learning
IPython widgets, interactive plots, interactive machine learning
Stars: ✭ 140 (-66.1%)
Beyond Jupyter
🐍💻📊 All material from the PyCon.DE 2018 Talk "Beyond Jupyter Notebooks - Building your own data science platform with Python & Docker" (incl. Slides, Video, Udemy MOOC & other References)
Stars: ✭ 135 (-67.31%)
Udacity Data Engineering
Udacity Data Engineering Nano Degree (DEND)
Stars: ✭ 89 (-78.45%)
Mutual labels:  airflow, jupyter-notebook, spark
Notebooks Statistics And Machinelearning
Jupyter Notebooks from the old UnsupervisedLearning.com (RIP) machine learning and statistics blog
Stars: ✭ 270 (-34.62%)
Goodreads etl pipeline
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Stars: ✭ 793 (+92.01%)
Mutual labels:  airflow, spark, apache-spark
Awesome Streamlit
The purpose of this project is to share knowledge on how awesome Streamlit is and can be
Stars: ✭ 769 (+86.2%)
Mutual labels:  data-science, analytics, data
Model Describer
model-describer : Making machine learning interpretable to humans
Stars: ✭ 22 (-94.67%)
Griffon Vm
Griffon Data Science Virtual Machine
Stars: ✭ 128 (-69.01%)
Azkarra Streams
🚀 Azkarra is a lightweight java framework to make it easy to develop, deploy and manage cloud-native streaming microservices based on Apache Kafka Streams.
Stars: ✭ 146 (-64.65%)
Mutual labels:  apache-kafka, kafka, data
Datascience course
Curso de Data Science em Português
Stars: ✭ 294 (-28.81%)
Mutual labels:  jupyter-notebook, data-science, data
Awesome Kafka
A list about Apache Kafka
Stars: ✭ 397 (-3.87%)
Mutual labels:  apache-kafka, kafka, apache-spark
1-60 of 8782 similar projects