Spark StatesCustom state store providers for Apache Spark
Stars: ✭ 83 (+112.82%)
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+284.62%)
Spark-PMoFSpark Shuffle Optimization with RDMA+AEP
Stars: ✭ 28 (-28.21%)
Pyspark ExamplesCode examples on Apache Spark using python
Stars: ✭ 58 (+48.72%)
Azure Event Hubs SparkEnabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (+258.97%)
Azure Event Hubs☁️ Cloud-scale telemetry ingestion from any stream of data with Azure Event Hubs
Stars: ✭ 233 (+497.44%)
ApacheDocker container running Apache running on Ubuntu, Composer, Lavavel, TDD via Shippable & CircleCI
Stars: ✭ 15 (-61.54%)
spark-word2vecA parallel implementation of word2vec based on Spark
Stars: ✭ 24 (-38.46%)
vhost-genConfigurable vHost generator for Apache 2.2, Apache 2.4 and Nginx
Stars: ✭ 111 (+184.62%)
CasperA compiler for automatically re-targeting sequential Java code to Apache Spark.
Stars: ✭ 45 (+15.38%)
incubator-linkisLinkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,459 (+6205.13%)
sentry-sparkApache Spark Sentry Integration
Stars: ✭ 14 (-64.1%)
semalt-blocker⛔ Self-updating PHP library which blocks referral spam from ruining your website statistics
Stars: ✭ 67 (+71.79%)
zeppelinApache Zeppelin with support for SQL Server
Stars: ✭ 17 (-56.41%)
yuzhouwanCode Library for My Blog
Stars: ✭ 39 (+0%)
mod auth radiusThe FreeRADIUS Apache module for RADIUS authentication
Stars: ✭ 35 (-10.26%)
openverse-catalogIdentifies and collects data on cc-licensed content across web crawl data and public apis.
Stars: ✭ 27 (-30.77%)
frovedisFramework of vectorized and distributed data analytics
Stars: ✭ 59 (+51.28%)
h2goApache H2 Go SQL Driver
Stars: ✭ 35 (-10.26%)
ksmbdksmbd kernel server(SMB/CIFS server)
Stars: ✭ 98 (+151.28%)
smolderHL7 Apache Spark Datasource
Stars: ✭ 33 (-15.38%)
spark-druid-olapSparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit.ly/2oBJSpP) an Integrated BI platform on Apache Spark.
Stars: ✭ 286 (+633.33%)
startpagea cute little home for my browser
Stars: ✭ 26 (-33.33%)
spark-acidACID Data Source for Apache Spark based on Hive ACID
Stars: ✭ 91 (+133.33%)
autThe Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (+184.62%)
qpid-proton-jMirror of Apache Qpid Proton-J
Stars: ✭ 28 (-28.21%)
spark-extensionA library that provides useful extensions to Apache Spark and PySpark.
Stars: ✭ 25 (-35.9%)
baikal-dockerProvides a ready-to-go Baikal server, incl. docker-compose.yml & Systemd service file
Stars: ✭ 85 (+117.95%)
shamashAutoscaling for Google Cloud Dataproc
Stars: ✭ 31 (-20.51%)
bigdata-funA complete (distributed) BigData stack, running in containers
Stars: ✭ 14 (-64.1%)
Search Ads Web ServiceOnline search advertisement platform & Realtime Campaign Monitoring [Maybe Deprecated]
Stars: ✭ 30 (-23.08%)
tpch-sparkTPC-H queries in Apache Spark SQL using native DataFrames API
Stars: ✭ 63 (+61.54%)
spark-utillow-level helpers for Apache Spark libraries and tests
Stars: ✭ 16 (-58.97%)
qpid-jmsMirror of Apache Qpid JMS
Stars: ✭ 60 (+53.85%)
b2ntpKanban style New Tab Page extension with your bookmarks and powerful search
Stars: ✭ 50 (+28.21%)
apache-baselineDevSec Apache Baseline - InSpec Profile
Stars: ✭ 37 (-5.13%)
awesome-AI-kubernetes❄️ 🐳 Awesome tools and libs for AI, Deep Learning, Machine Learning, Computer Vision, Data Science, Data Analytics and Cognitive Computing that are baked in the oven to be Native on Kubernetes and Docker with Python, R, Scala, Java, C#, Go, Julia, C++ etc
Stars: ✭ 95 (+143.59%)
error-log-parserSimple PHP library to parse Apache or Nginx error-log file entries for further usage.
Stars: ✭ 19 (-51.28%)
modulesMesos modules examples and open source modules outside of the Apache Mesos source tree.
Stars: ✭ 26 (-33.33%)
leaflet heatmap简单的可视化湖州通话数据 假设数据量很大,没法用浏览器直接绘制热力图,把绘制热力图这一步骤放到线下计算分析。使用Apache Spark并行计算数据之后,再使用Apache Spark绘制热力图,然后用leafletjs加载OpenStreetMap图层和热力图图层,以达到良好的交互效果。现在使用Apache Spark实现绘制,可能是Apache Spark不擅长这方面的计算或者是我没有设计好算法,并行计算的速度比不上单机计算。Apache Spark绘制热力图和计算代码在这 https://github.com/yuanzhaokang/ParallelizeHeatmap.git .
Stars: ✭ 13 (-66.67%)
ODSC India 2018My presentation at ODSC India 2018 about Deep Learning with Apache Spark
Stars: ✭ 26 (-33.33%)
swordfishOpen-source distribute workflow schedule tools, also support streaming task.
Stars: ✭ 35 (-10.26%)
visionsType System for Data Analysis in Python
Stars: ✭ 136 (+248.72%)
kafka-compose🎼 Docker compose files for various kafka stacks
Stars: ✭ 32 (-17.95%)
jota-cert-checkerCheck SSL certificate expiration date of a list of sites.
Stars: ✭ 45 (+15.38%)
sparkar-voltsAn extensive non-reactive Typescript framework that eases the development experience in Spark AR
Stars: ✭ 15 (-61.54%)
ap-airflowAstronomer Core Docker Images
Stars: ✭ 87 (+123.08%)