All Projects → maxis42 → Big Data Engineering Coursera Yandex

maxis42 / Big Data Engineering Coursera Yandex

Licence: mit
Big Data for Data Engineers Coursera Specialization from Yandex

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Big Data Engineering Coursera Yandex

Bigdata Notes
大数据入门指南 ⭐
Stars: ✭ 10,991 (+15380.28%)
Mutual labels:  spark, big-data, bigdata, mapreduce, hdfs
Yandex Big Data Engineering
Stars: ✭ 17 (-76.06%)
Mutual labels:  jupyter-notebook, spark, mapreduce, hdfs
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+111.27%)
Mutual labels:  jupyter-notebook, spark, big-data, hdfs
leaflet heatmap
简单的可视化湖州通话数据 假设数据量很大,没法用浏览器直接绘制热力图,把绘制热力图这一步骤放到线下计算分析。使用Apache Spark并行计算数据之后,再使用Apache Spark绘制热力图,然后用leafletjs加载OpenStreetMap图层和热力图图层,以达到良好的交互效果。现在使用Apache Spark实现绘制,可能是Apache Spark不擅长这方面的计算或者是我没有设计好算法,并行计算的速度比不上单机计算。Apache Spark绘制热力图和计算代码在这 https://github.com/yuanzhaokang/ParallelizeHeatmap.git .
Stars: ✭ 13 (-81.69%)
Mutual labels:  big-data, spark, bigdata, hdfs
Spark Movie Lens
An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (+949.3%)
Mutual labels:  jupyter-notebook, spark, big-data, bigdata
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+1784.51%)
Mutual labels:  jupyter-notebook, spark, big-data, bigdata
Bigdata Interview
🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结
Stars: ✭ 857 (+1107.04%)
Mutual labels:  spark, bigdata, mapreduce, hdfs
bigdata-doc
大数据学习笔记,学习路线,技术案例整理。
Stars: ✭ 37 (-47.89%)
Mutual labels:  bigdata, hdfs, mapreduce
big data
A collection of tutorials on Hadoop, MapReduce, Spark, Docker
Stars: ✭ 34 (-52.11%)
Mutual labels:  big-data, bigdata, mapreduce
Optimus
🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+1288.73%)
Mutual labels:  jupyter-notebook, spark, bigdata
God Of Bigdata
专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...
Stars: ✭ 6,008 (+8361.97%)
Mutual labels:  spark, bigdata, hdfs
Ml Da Coursera Yandex Mipt
Machine Learning and Data Analysis Coursera Specialization from Yandex and MIPT
Stars: ✭ 108 (+52.11%)
Mutual labels:  yandex, jupyter-notebook, coursera
Bigdata docker
Big Data Ecosystem Docker
Stars: ✭ 161 (+126.76%)
Mutual labels:  jupyter-notebook, spark, hdfs
Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (+53.52%)
Mutual labels:  jupyter-notebook, big-data, bigdata
bigdata-fun
A complete (distributed) BigData stack, running in containers
Stars: ✭ 14 (-80.28%)
Mutual labels:  big-data, spark, hdfs
Data Science Ipython Notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+30953.52%)
Mutual labels:  spark, big-data, mapreduce
Cortx
CORTX Community Object Storage is 100% open source object storage uniquely optimized for mass capacity storage devices.
Stars: ✭ 426 (+500%)
Mutual labels:  jupyter-notebook, big-data, bigdata
Dpark
Python clone of Spark, a MapReduce alike framework in Python
Stars: ✭ 2,668 (+3657.75%)
Mutual labels:  spark, bigdata, mapreduce
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+7866.2%)
Mutual labels:  jupyter-notebook, spark, big-data
Hadoop For Geoevent
ArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.
Stars: ✭ 5 (-92.96%)
Mutual labels:  big-data, bigdata, hdfs
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].