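A simple visualization of call data from Huzhou. Assuming the data volume is too large for a browser to draw the heatmap directly, the heatmap rendering step is moved to offline computation. The data is first processed in parallel with Apache Spark, the heatmap is then rendered with Apache Spark as well, and leafletjs loads an OpenStreetMap layer and the heatmap layer to provide good interactivity. The rendering is currently implemented with Apache Spark, but the parallel version is slower than a single-machine computation, either because Apache Spark is not well suited to this kind of work or because I did not design the algorithm well. The Apache Spark heatmap rendering and computation code is at https://github.com/yuanzhaokang/ParallelizeHeatmap.git .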
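
The offline aggregation step described above can be illustrated with a minimal single-machine sketch. This is not the repository's code: the function names `bin_points` and `to_heat_triples`, the `cell_deg` parameter, and the `[lat, lon, count]` output shape are all assumptions made for illustration. The idea is to snap each call location to a grid cell and count points per cell, so the browser only receives a small aggregated table instead of the raw records. In a Spark job, the same cell key could be fed to an operation such as `reduceByKey`.

```python
import math
from collections import Counter
from typing import Dict, Iterable, List, Tuple

Cell = Tuple[int, int]


def bin_points(points: Iterable[Tuple[float, float]], cell_deg: float) -> Dict[Cell, int]:
    """Count (lat, lon) points per grid cell of size cell_deg degrees.

    Hypothetical helper: the grid-cell key is an assumption, not the
    scheme used by the original project.
    """
    counts: Counter = Counter()
    for lat, lon in points:
        # Integer row/column index of the cell containing the point.
        key = (math.floor(lat / cell_deg), math.floor(lon / cell_deg))
        counts[key] += 1
    return dict(counts)


def to_heat_triples(counts: Dict[Cell, int], cell_deg: float) -> List[List[float]]:
    """Convert cell counts to [lat, lon, count] triples at cell centres.

    The triple format is an assumed exchange format for a front-end
    heatmap layer; adapt it to whatever the layer actually expects.
    """
    return [
        [(row + 0.5) * cell_deg, (col + 0.5) * cell_deg, n]
        for (row, col), n in sorted(counts.items())
    ]


if __name__ == "__main__":
    # Made-up sample coordinates, for illustration only.
    sample = [(30.1, 120.2), (30.3, 120.4), (30.6, 120.2)]
    cells = bin_points(sample, cell_deg=0.5)
    print(to_heat_triples(cells, cell_deg=0.5))
```

A coarser `cell_deg` yields fewer cells and a smaller payload at the cost of spatial detail, which is the main trade-off when choosing the grid size.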

Stars: ✭ 13 (-90.78%)

Mutual labels: big-data, bigdata

gan deeplearning4j

Automatic feature engineering with Generative Adversarial Networks, using Deeplearning4j and Apache Spark.

Stars: ✭ 19 (-86.52%)

Mutual labels: big-data, bigdata

big data

A collection of tutorials on Hadoop, MapReduce, Spark, Docker

Stars: ✭ 34 (-75.89%)

Mutual labels: big-data, bigdata

Countly Sdk Cordova

Countly Product Analytics SDK for Cordova, Icenium and Phonegap

Stars: ✭ 69 (-51.06%)

Mutual labels: big-data, bigdata

Big Data Engineering Coursera Yandex

Big Data for Data Engineers Coursera Specialization from Yandex

Stars: ✭ 71 (-49.65%)

Mutual labels: big-data, bigdata

Spark Py Notebooks

Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks

Stars: ✭ 1,338 (+848.94%)

Mutual labels: big-data, bigdata

v6.dooring.public

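A large-screen visualization solution that provides a visual editing engine, helping individuals or companies easily build their own large-screen visualization applications.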

Stars: ✭ 323 (+129.08%)

Mutual labels: big-data, bigdata

Hadoop For Geoevent

ArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.

Stars: ✭ 5 (-96.45%)

Mutual labels: big-data, bigdata

twitter-archive-reader

Full featured TypeScript Twitter archive reader and browser

Stars: ✭ 43 (-69.5%)

Mutual labels: big-data, bigdata

Aws Etl Orchestrator

A serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.

Stars: ✭ 245 (+73.76%)

Mutual labels: big-data, bigdata

Cortx

CORTX Community Object Storage is 100% open source object storage uniquely optimized for mass capacity storage devices.

Stars: ✭ 426 (+202.13%)

Mutual labels: big-data, bigdata

Tennis Crystal Ball

Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction

Stars: ✭ 107 (-24.11%)

Mutual labels: big-data, bigdata

young-examples

Sample code for typical application scenarios encountered in Java learning and projects.

Stars: ✭ 21 (-85.11%)

Mutual labels: study, bigdata

Clustering4Ever

C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.

Stars: ✭ 126 (-10.64%)

Mutual labels: big-data, bigdata

Spark R Notebooks

R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks

Stars: ✭ 109 (-22.7%)

Mutual labels: big-data, bigdata

meetups-archivos

Slides, code, and videos from the meetups, data science days, video calls, and workshops. Data Science Research is a non-profit organization that seeks to disseminate and decentralize knowledge of Data Science and Artificial Intelligence in Peru, giving opportunities to new talent through MeetUps, Workshops, and Semilleros …

Stars: ✭ 60 (-57.45%)

Mutual labels: big-data, bigdata

Uproot3

ROOT I/O in pure Python and NumPy.

Stars: ✭ 312 (+121.28%)

Mutual labels: big-data, bigdata

Uproot4

ROOT I/O in pure Python and NumPy.

Stars: ✭ 80 (-43.26%)

Mutual labels: big-data, bigdata

Bigdata Notes

A beginner's guide to big data ⭐

Stars: ✭ 10,991 (+7695.04%)

Mutual labels: big-data, bigdata

Hama

Mirror of Apache Hama

Stars: ✭ 129 (-8.51%)

Mutual labels: big-data

Scala Spark Tutorial

Project for James' Apache Spark with Scala course

Stars: ✭ 121 (-14.18%)

Mutual labels: big-data

Mindforger

Thinking notebook and Markdown editor.

Stars: ✭ 1,695 (+1102.13%)

Mutual labels: study

Sigmf

The Signal Metadata Format Specification

Stars: ✭ 120 (-14.89%)

Mutual labels: big-data

Spark On Lambda

Apache Spark on AWS Lambda

Stars: ✭ 137 (-2.84%)

Mutual labels: big-data

Spark

.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.

Stars: ✭ 1,721 (+1120.57%)

Mutual labels: bigdata

Hdfs Shell

HDFS Shell is an HDFS manipulation tool that works with the functions integrated in Hadoop DFS

Stars: ✭ 117 (-17.02%)

Mutual labels: big-data

Drill

Apache Drill is a distributed MPP query layer for self describing data

Stars: ✭ 1,619 (+1048.23%)

Mutual labels: big-data

Gaffer

A large-scale entity and relation database supporting aggregation of properties

Stars: ✭ 1,642 (+1064.54%)

Mutual labels: big-data

Cmak

CMAK is a tool for managing Apache Kafka clusters

Stars: ✭ 10,544 (+7378.01%)

Mutual labels: big-data

Fe Foundation

A study guide for front-end development

Stars: ✭ 113 (-19.86%)

Mutual labels: study

Hazelcast Go Client

Hazelcast IMDG Go Client

Stars: ✭ 140 (-0.71%)

Mutual labels: big-data

Accelerator

The Accelerator is a tool for fast and reproducible processing of large amounts of data.

Stars: ✭ 137 (-2.84%)

Mutual labels: big-data

Couchdb Documentation

Apache CouchDB Documentation

Stars: ✭ 128 (-9.22%)

Mutual labels: big-data

Amazon S3 Find And Forget

Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)

Stars: ✭ 115 (-18.44%)

Mutual labels: big-data

Asakusafw

Asakusa Framework

Stars: ✭ 114 (-19.15%)

Mutual labels: big-data

Tajo

Mirror of Apache Tajo

Stars: ✭ 128 (-9.22%)

Mutual labels: big-data

Just Dashboard

📊 📋 Dashboards using YAML or JSON files