Awesome Ada - A curated list of awesome resources related to the Ada and SPARK programming languages
Stars: ✭ 299 (-14.33%)
spark-acid - ACID Data Source for Apache Spark based on Hive ACID
Stars: ✭ 91 (-73.93%)
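Book - This project collects some good books I have read or heard about over the years. I came across them while organizing my files and felt that deleting them would be a pity, while leaving them lying around wasted too much space. So, in the spirit of sharing, I am making them available: partly to provide these books to readers who need them, and partly as a kind of knowledge-base accumulation.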
Stars: ✭ 47 (-86.53%)
spark-word2vec - A parallel implementation of word2vec based on Spark
Stars: ✭ 24 (-93.12%)
Cook - Fair job scheduler on Kubernetes and Mesos for batch workloads and Spark
Stars: ✭ 314 (-10.03%)
shamash - Autoscaling for Google Cloud Dataproc
Stars: ✭ 31 (-91.12%)
spark-http-stream - Spark Structured Streaming via HTTP communication
Stars: ✭ 17 (-95.13%)
yuzhouwan - Code Library for My Blog
Stars: ✭ 39 (-88.83%)
Spark Hbase Connector - Connect Spark to HBase for reading and writing data with ease
Stars: ✭ 299 (-14.33%)
spark-util - Low-level helpers for Apache Spark libraries and tests
Stars: ✭ 16 (-95.42%)
daf-kylo - Kylo integration with PDND (previously DAF).
Stars: ✭ 20 (-94.27%)
Iql - An ad hoc query service based on the Spark SQL engine.
Stars: ✭ 341 (-2.29%)
spark-druid-olap - Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform (http://bit.ly/2oBJSpP), an integrated BI platform on Apache Spark.
Stars: ✭ 286 (-18.05%)
swordfish - Open-source distributed workflow scheduling tool that also supports streaming tasks.
Stars: ✭ 35 (-89.97%)
Spark Druid Olap - Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform (http://bit.ly/2oBJSpP), an integrated BI platform on Apache Spark.
Stars: ✭ 282 (-19.2%)
Spark-Ar - Resources for Spark AR
Stars: ✭ 43 (-87.68%)
spark-data-sources - Developing Spark External Data Sources using the V2 API
Stars: ✭ 36 (-89.68%)
fastdata-cluster - Fast Data Cluster (Apache Cassandra, Kafka, Spark, Flink, YARN and HDFS with Vagrant and VirtualBox)
Stars: ✭ 20 (-94.27%)
Coolplayspark - CoolPlay Spark: Spark source code analysis, Spark libraries, and more
Stars: ✭ 3,318 (+850.72%)
spark-stringmetric - Spark functions to run popular phonetic and string matching algorithms
Stars: ✭ 51 (-85.39%)
bigkube - Minikube for big data with Scala and Spark
Stars: ✭ 16 (-95.42%)
Koalas - Koalas: pandas API on Apache Spark
Stars: ✭ 3,044 (+772.21%)
Hbase Rdd - Spark RDD to read, write and delete from HBase
Stars: ✭ 277 (-20.63%)
Every Single Day I Tldr - A daily digest of the articles and videos I've found interesting and want to share with you.
Stars: ✭ 249 (-28.65%)
Covid19Tracker - A Robinhood-style COVID-19 🦠 Android tracking app for the US. Open source and built with Kotlin.
Stars: ✭ 65 (-81.38%)
Data Accelerator - Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy-to-use experience for creating, editing, and managing Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (-29.23%)
Sparklens - Qubole Sparklens tool for performance tuning Apache Spark
Stars: ✭ 345 (-1.15%)
Neo4j Spark Connector - Neo4j Connector for Apache Spark, which provides bi-directional read/write access to Neo4j from Spark, using the Spark DataSource APIs
Stars: ✭ 245 (-29.8%)
SparkV - 🤖⚡ | The most POWERFUL multipurpose chat/meme bot that will boost the activity in your server.
Stars: ✭ 24 (-93.12%)
Recommendationsystem - Book recommender system using collaborative filtering based on Spark
Stars: ✭ 244 (-30.09%)
Helk - The Hunting ELK
Stars: ✭ 3,097 (+787.39%)
Hadoop Docker - A Docker-based Hadoop development and test environment, including Hadoop, Hive, HBase, and Spark
Stars: ✭ 238 (-31.81%)
trembita - Model complex data transformation pipelines easily
Stars: ✭ 44 (-87.39%)
Crayon - Simple framework-agnostic UI router for SPAs
Stars: ✭ 310 (-11.17%)
Mydatascienceportfolio - Applying Data Science and Machine Learning to Solve Real World Business Problems
Stars: ✭ 227 (-34.96%)
bigdata-fun - A complete (distributed) BigData stack, running in containers
Stars: ✭ 14 (-95.99%)
Spark Workshop - Apache Spark™ and Scala Workshops
Stars: ✭ 224 (-35.82%)
Sagemaker Spark - A Spark library for Amazon SageMaker.
Stars: ✭ 219 (-37.25%)
smolder - HL7 Apache Spark Datasource
Stars: ✭ 33 (-90.54%)
Gimel - Big Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (-38.11%)
Wirbelsturm - Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a focus on big data tech like Kafka.
Stars: ✭ 332 (-4.87%)
spark-demos - Collection of different demo applications using Apache Spark
Stars: ✭ 15 (-95.7%)
Spark Knn - k-Nearest Neighbors algorithm on Spark
Stars: ✭ 205 (-41.26%)
Spark Jupyter Aws - A guide on how to set up Jupyter with PySpark painlessly on AWS EC2 clusters, with S3 I/O support
Stars: ✭ 259 (-25.79%)
Mmlspark - Simple and Distributed Machine Learning
Stars: ✭ 2,899 (+730.66%)
tpch-spark - TPC-H queries in Apache Spark SQL using native DataFrames API
Stars: ✭ 63 (-81.95%)
Ballista - Distributed compute platform implemented in Rust, and powered by Apache Arrow.
Stars: ✭ 2,274 (+551.58%)
Spline - Data Lineage Tracking And Visualization Solution
Stars: ✭ 306 (-12.32%)
frovedis - Framework of vectorized and distributed data analytics
Stars: ✭ 59 (-83.09%)
Oap - Optimized Analytics Package for Spark* Platform
Stars: ✭ 343 (-1.72%)
Scalnet - A Scala wrapper for Deeplearning4j, inspired by Keras. Scala + DL + Spark + GPUs
Stars: ✭ 342 (-2.01%)
Sparklint - A tool for monitoring and tuning Spark jobs for efficiency.
Stars: ✭ 316 (-9.46%)
Zat - Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Stars: ✭ 303 (-13.18%)
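leaflet heatmap - A simple visualization of Huzhou call data. Assuming the data volume is too large to draw the heatmap directly in the browser, the heatmap-drawing step is moved offline for computation and analysis. After the data is computed in parallel with Apache Spark, Apache Spark is also used to draw the heatmap, and leafletjs then loads the OpenStreetMap layer and the heatmap layer to achieve good interactivity. The drawing is currently implemented with Apache Spark, but either Apache Spark is not well suited to this kind of computation or I did not design the algorithm well, because the parallel computation is slower than the single-machine computation. The Apache Spark heatmap drawing and computation code is at https://github.com/yuanzhaokang/ParallelizeHeatmap.git .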
Stars: ✭ 13 (-96.28%)