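leaflet heatmap: A simple visualization of Huzhou call data. It assumes the data volume is too large for the browser to draw the heatmap directly, so the heatmap rendering step is moved to offline computation. Apache Spark first processes the data in parallel and is then also used to render the heatmap; leafletjs loads the OpenStreetMap layer and the heatmap layer to give good interactivity. In the current Apache Spark implementation the parallel computation is slower than the single-machine computation, either because Apache Spark is not well suited to this kind of work or because my algorithm is poorly designed. The Apache Spark heatmap rendering and computation code is at https://github.com/yuanzhaokang/ParallelizeHeatmap.git .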
Stars: ✭ 13 (-7.14%)
SynapseML: Simple and Distributed Machine Learning
Stars: ✭ 3,355 (+23864.29%)
Scriptis: Scriptis is for interactive data analysis with script development (SQL, Pyspark, HiveQL), task submission (Spark, Hive), UDF, function, resource management and intelligent diagnosis.
Stars: ✭ 696 (+4871.43%)
mmtf-workshop-2018: Structural Bioinformatics Training Workshop & Hackathon 2018
Stars: ✭ 50 (+257.14%)
Kyuubi: Kyuubi is a unified multi-tenant JDBC interface for large-scale data processing and analytics, built on top of Apache Spark
Stars: ✭ 363 (+2492.86%)
Sparkmagic: Jupyter magics and kernels for working with remote Spark clusters
Stars: ✭ 954 (+6714.29%)
Quinn: pyspark methods to enhance developer productivity 📣 👯 🎉
Stars: ✭ 217 (+1450%)
Optimus: 🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+6942.86%)
Pyspark Stubs: Apache (Py)Spark type annotations (stub files).
Stars: ✭ 98 (+600%)
Spark Tda: SparkTDA is a package for Apache Spark providing Topological Data Analysis Functionalities.
Stars: ✭ 45 (+221.43%)
Spark Nkp: Natural Korean Processor for Apache Spark
Stars: ✭ 50 (+257.14%)
Cuesheet: A framework for writing Spark 2.x applications in a pretty way
Stars: ✭ 86 (+514.29%)
Spark States: Custom state store providers for Apache Spark
Stars: ✭ 83 (+492.86%)
W2v: Word2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (+357.14%)
Hnswlib: Java library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
Stars: ✭ 108 (+671.43%)
Spark On K8s Operator: Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Stars: ✭ 1,780 (+12614.29%)
Azure Event Hubs Spark: Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (+900%)
Opaque: An encrypted data analytics platform
Stars: ✭ 129 (+821.43%)
Linkis: Linkis helps easily connect to various back-end computation/storage engines (Spark, Python, TiDB...), exposes various interfaces (REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,323 (+16492.86%)
pyspark-cheatsheet: PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster
Stars: ✭ 115 (+721.43%)
Redash: Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Stars: ✭ 20,147 (+143807.14%)
Devops Python Tools: 80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Stars: ✭ 406 (+2800%)
Data Accelerator: Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (+1664.29%)
Gimel: Big Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (+1442.86%)
isarn-sketches-spark: Routines and data structures for using isarn-sketches idiomatically in Apache Spark
Stars: ✭ 28 (+100%)
Sparkora: Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟
Stars: ✭ 51 (+264.29%)
Sparkrdma: RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (+1435.71%)
Spark Practice: Apache Spark (PySpark) Practice on Real Data
Stars: ✭ 200 (+1328.57%)
spark-extension: A library that provides useful extensions to Apache Spark and PySpark.
Stars: ✭ 25 (+78.57%)
Kafka Storm Starter: Code examples that show how to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
Stars: ✭ 728 (+5100%)
Frameless: Expressive types for Spark.
Stars: ✭ 717 (+5021.43%)
Model Describer: model-describer : Making machine learning interpretable to humans
Stars: ✭ 22 (+57.14%)
Hail: Scalable genomic data analysis.
Stars: ✭ 706 (+4942.86%)
Walkoff: A flexible, easy to use, automation framework allowing users to integrate their capabilities and devices to cut through the repetitive, tedious tasks slowing them down. #nsacyber
Stars: ✭ 855 (+6007.14%)
Rotki: A portfolio tracking, analytics, accounting and tax reporting application that protects your privacy
Stars: ✭ 689 (+4821.43%)
Metabase: The simplest, fastest way to get business intelligence and analytics to everyone in your company 😋
Stars: ✭ 26,803 (+191350%)
Ostrio Analytics: 📊 Visitor's analytics tracking code for ostr.io service
Stars: ✭ 9 (-35.71%)
Top: The daily list of Wikipedia's most-visited articles
Stars: ✭ 19 (+35.71%)
Snowplow: The enterprise-grade behavioral data engine (web, mobile, server-side, webhooks), running cloud-natively on AWS and GCP
Stars: ✭ 5,935 (+42292.86%)
Mixpanel Js: Official Mixpanel JavaScript Client Library
Stars: ✭ 656 (+4585.71%)
Redux Beacon: Analytics integration for Redux and ngrx/store
Stars: ✭ 645 (+4507.14%)
Fathom: Fathom Lite. Simple, privacy-focused website analytics. Built with Golang & Preact.
Stars: ✭ 6,989 (+49821.43%)