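A simple visualization of Huzhou call data. Assuming the data volume is too large for the browser to draw a heatmap directly, the heatmap rendering step is moved to offline computation. After Apache Spark processes the data in parallel, Apache Spark is also used to draw the heatmap, and leafletjs then loads an OpenStreetMap layer and the heatmap layer to give a good interactive experience. Rendering is currently implemented with Apache Spark; either because Apache Spark is not well suited to this kind of computation or because I did not design the algorithm well, the parallel computation is slower than running on a single machine. The Apache Spark heatmap rendering and computation code is at https://github.com/yuanzhaokang/ParallelizeHeatmap.git .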

Stars: ✭ 13 (-90.65%)

Mutual labels: spark, bigdata
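
The offline step described in the entry above (aggregate the call records first, then hand a small heatmap layer to the browser) can be illustrated with a minimal single-machine sketch. This is not the code from the linked repository: the function names `cell_of`, `bin_points`, and `to_heat_layer`, the 0.01-degree cell size, and the `[lat, lon, intensity]` output shape are all assumptions made for illustration. On Spark, the same counting would typically be expressed as a `map` to cell keys followed by `reduceByKey`.

```python
import math
from collections import Counter
from typing import Dict, Iterable, List, Tuple

Cell = Tuple[int, int]


def cell_of(lat: float, lon: float, cell_deg: float) -> Cell:
    """Map a coordinate to the integer (row, col) index of its grid cell."""
    return (math.floor(lat / cell_deg), math.floor(lon / cell_deg))


def bin_points(points: Iterable[Tuple[float, float]], cell_deg: float = 0.01) -> Dict[Cell, int]:
    """Count how many (lat, lon) points fall into each grid cell.

    Hypothetical helper: this is the "map to key, then sum per key" step
    that a parallel engine would distribute across workers.
    """
    counts: Counter = Counter()
    for lat, lon in points:
        counts[cell_of(lat, lon, cell_deg)] += 1
    return dict(counts)


def to_heat_layer(counts: Dict[Cell, int], cell_deg: float = 0.01) -> List[List[float]]:
    """Turn cell counts into [lat, lon, intensity] triples.

    Each triple uses the cell centre as its position and scales the
    count to the range (0, 1] relative to the busiest cell.
    The output shape is an assumption, not the repository's format.
    """
    peak = max(counts.values(), default=1)
    return [
        [(row + 0.5) * cell_deg, (col + 0.5) * cell_deg, n / peak]
        for (row, col), n in sorted(counts.items())
    ]
```

Because only one triple per non-empty cell reaches the browser, the size of the layer depends on the grid resolution rather than on the number of raw call records, which is the reason for moving the aggregation offline.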

Spark Movie Lens

An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset

Stars: ✭ 745 (+435.97%)

Mutual labels: spark, bigdata

Coding Now

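Study notes, along with eBooks I have read, video resources, and blogs, websites, and tools I have collected and found useful. Covers the major big data components, Python machine learning and data analysis, Linux, operating systems, algorithms, networking, and more.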

Stars: ✭ 750 (+439.57%)

Mutual labels: spark, bigdata

Every Single Day I Tldr

A daily digest of the articles or videos I've found interesting, that I want to share with you.

Stars: ✭ 249 (+79.14%)

Mutual labels: spark, bigdata

TiBigData

TiDB connectors for Flink/Hive/Presto

Stars: ✭ 192 (+38.13%)

Mutual labels: bigdata, flink

flink-learn

Learning Flink: Flink CEP, Flink Core, Flink SQL

Stars: ✭ 70 (-49.64%)

Mutual labels: bigdata, flink

dockerfiles

Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ... )

Stars: ✭ 29 (-79.14%)

Mutual labels: bigdata, flink

Spline

Data Lineage Tracking And Visualization Solution

Stars: ✭ 306 (+120.14%)

Mutual labels: spark, bigdata

Sidekick

High Performance HTTP Sidecar Load Balancer

Stars: ✭ 366 (+163.31%)

Mutual labels: spark, bigdata

Zeppelin

Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.

Stars: ✭ 5,513 (+3866.19%)

Mutual labels: spark, flink

Sparkstreaming

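💥 🚀 Wraps sparkstreaming to adjust the batch time dynamically (computation runs as soon as data arrives); 🚀 supports adding and removing topics at runtime; 🚀 wraps sparkstreaming 1.6 - kafka 010 to support SSL.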

Stars: ✭ 179 (+28.78%)

Mutual labels: spark, flink

Pulsar Spark

When Apache Pulsar meets Apache Spark

Stars: ✭ 55 (-60.43%)

Mutual labels: spark, flink

Model Serving Tutorial

Code and presentation for Strata Model Serving tutorial

Stars: ✭ 57 (-58.99%)

Mutual labels: spark, flink

Kamu Cli

Next generation tool for decentralized exchange and transformation of semi-structured data

Stars: ✭ 69 (-50.36%)

Mutual labels: spark, flink

Hops Examples

Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops

Stars: ✭ 84 (-39.57%)

Mutual labels: spark, flink

Apache Spark Hands On

Educational notes and hands-on problems with solutions for the Hadoop ecosystem

Stars: ✭ 74 (-46.76%)

Mutual labels: spark, bigdata

Repository

Personal learning knowledge base covering data warehouse modeling, real-time computing, big data, Java, algorithms, and more.

Stars: ✭ 92 (-33.81%)

Mutual labels: spark, flink

Bigdata Notes

A beginner's guide to big data ⭐

Stars: ✭ 10,991 (+7807.19%)

Mutual labels: spark, bigdata

Dpark

Python clone of Spark, a MapReduce-like framework in Python

Stars: ✭ 2,668 (+1819.42%)

Mutual labels: spark, bigdata

Sparkrdma

RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark

Stars: ✭ 215 (+54.68%)

Mutual labels: spark, bigdata

bigdata-doc

Big data study notes, a learning roadmap, and a collection of technical case studies.

Stars: ✭ 37 (-73.38%)

Mutual labels: bigdata, flink

Javaorbigdata Interview

A collection of interview topics for Java developers and big data developers

Stars: ✭ 203 (+46.04%)

Mutual labels: spark, bigdata

hadoopoffice

HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)

Stars: ✭ 56 (-59.71%)

Mutual labels: bigdata, flink

coolplayflink

Flink: Stateful Computations over Data Streams

Stars: ✭ 14 (-89.93%)

Mutual labels: bigdata, flink

data processing course

Some class materials for a data processing course using PySpark

Stars: ✭ 50 (-64.03%)

Mutual labels: spark, bigdata

Kotlin Spark Api

This project gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x

Stars: ✭ 183 (+31.65%)

Mutual labels: spark, bigdata

Cloudflow

Cloudflow enables users to quickly develop, orchestrate, and operate distributed streaming applications on Kubernetes.

Stars: ✭ 278 (+100%)

Mutual labels: spark, flink

Docker Spark Cluster

A simple Spark standalone cluster for your testing environment purposes

Stars: ✭ 261 (+87.77%)

Mutual labels: spark, bigdata

Big data architect skills

Skills a big data architect should master

Stars: ✭ 400 (+187.77%)

Mutual labels: spark, bigdata

Big Data Rosetta Code

Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code

Stars: ✭ 254 (+82.73%)

Mutual labels: spark, bigdata

Bdp Dataplatform

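Data platform for big data ecosystem solutions: a big data solution built on big data, data platform, microservices, machine learning, e-commerce, automated operations, DevOps, a container deployment platform, and data platform collection, storage, computing, development, and application building.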

Stars: ✭ 456 (+228.06%)

Mutual labels: spark, flink

Bigdataie

A collection of big data blogs, written test questions, tutorials, projects, and interview experiences

Stars: ✭ 445 (+220.14%)

Mutual labels: spark, bigdata

Flink Learning

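Flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink basics, concepts, principles, hands-on practice, performance tuning, source code analysis, and more. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, Table API & SQL, and related topics, plus case studies of large production Flink projects (PVUV, log storage, real-time deduplication of tens of billions of records, monitoring and alerting). You are welcome to support my column 《大数据实时计算引擎 Flink 实战与性能优化》 (Big Data Real-Time Computing Engine: Flink in Practice and Performance Optimization).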

Stars: ✭ 11,378 (+8085.61%)

Mutual labels: spark, flink

Data Ingestion Platform

Stars: ✭ 39 (-71.94%)

Mutual labels: spark, flink

Optimus

🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark

Stars: ✭ 986 (+609.35%)

Mutual labels: spark, bigdata

Big Data Engineering Coursera Yandex

Big Data for Data Engineers Coursera Specialization from Yandex

Stars: ✭ 71 (-48.92%)

Mutual labels: spark, bigdata

Flinkstreamsql

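Based on open-source flink, extends its real-time SQL; mainly implements joins between streams and dimension tables, and supports all native flink SQL syntax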

Stars: ✭ 1,682 (+1110.07%)

Mutual labels: bigdata, flink

Java learning practice

The road to advanced Java: frequently asked interview algorithms, akka, multithreading, NIO, Netty, SpringBoot, Spark&&Flink, and more

Stars: ✭ 110 (-20.86%)

Mutual labels: spark, flink

Cleanframes

type-class based data cleansing library for Apache Spark SQL

Stars: ✭ 75 (-46.04%)

Mutual labels: spark, bigdata

Dataspherestudio

DataSphereStudio is a one-stop data application development & management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.

Stars: ✭ 1,195 (+759.71%)

Mutual labels: spark, flink

Spark Py Notebooks

Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks

Stars: ✭ 1,338 (+862.59%)

Mutual labels: spark, bigdata

Mobius

C# and F# language binding and extensions to Apache Spark

Stars: ✭ 929 (+568.35%)

Mutual labels: spark, bigdata

Flink Notes

flink study notes