telleryTellery lets you build metrics using SQL and bring them to your team. As easy as using a document. As powerful as a data modeling tool.
Stars: ✭ 219 (-22.34%)
yuzhouwanCode Library for My Blog
Stars: ✭ 39 (-86.17%)
ODSC India 2018My presentation at ODSC India 2018 about Deep Learning with Apache Spark
Stars: ✭ 26 (-90.78%)
sparkar-voltsAn extensive non-reactive Typescript framework that eases the development experience in Spark AR
Stars: ✭ 15 (-94.68%)
experimentsCode examples for my blog posts
Stars: ✭ 21 (-92.55%)
spark-extensionA library that provides useful extensions to Apache Spark and PySpark.
Stars: ✭ 25 (-91.13%)
splinkImplementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including EM algorithm to estimate parameters
Stars: ✭ 181 (-35.82%)
spark-http-streamspark structured streaming via HTTP communication
Stars: ✭ 17 (-93.97%)
visualize-data-with-pythonA Jupyter notebook using some standard techniques for data science and data engineering to analyze data for the 2017 flooding in Houston, TX.
Stars: ✭ 60 (-78.72%)
CasperA compiler for automatically re-targeting sequential Java code to Apache Spark.
Stars: ✭ 45 (-84.04%)
OLAP-cubeis an hypercube of data
Stars: ✭ 23 (-91.84%)
BlazerBusiness intelligence made simple
Stars: ✭ 3,102 (+1000%)
visionsType System for Data Analysis in Python
Stars: ✭ 136 (-51.77%)
EDAEnterprise Data Analytics by Jortilles ( EDA )
Stars: ✭ 59 (-79.08%)
daf-kyloKylo integration with PDND (previously DAF).
Stars: ✭ 20 (-92.91%)
query2reportQuery2Report is a simple open source business intelligence platform that allows users to build report/dashboard for business analytics or enterprise reporting
Stars: ✭ 43 (-84.75%)
spark-demosCollection of different demo applications using Apache Spark
Stars: ✭ 15 (-94.68%)
dashinatorDashinator the daringly delightful dashboard. A replacement for dashing
Stars: ✭ 56 (-80.14%)
Spark Jupyter AwsA guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support
Stars: ✭ 259 (-8.16%)
datartDatart is a next generation Data Visualization Open Platform
Stars: ✭ 1,042 (+269.5%)
tpch-sparkTPC-H queries in Apache Spark SQL using native DataFrames API
Stars: ✭ 63 (-77.66%)
BETL-oldBETL. Meta data driven ETL generation using T-SQL
Stars: ✭ 17 (-93.97%)
AulasUEMGMaterial usado nas aulas da UEMG, a partir de 2018-2
Stars: ✭ 19 (-93.26%)
frovedisFramework of vectorized and distributed data analytics
Stars: ✭ 59 (-79.08%)
Data-VisualizationsData Visualizations is emerging as one of the most essential skills in almost all of the IT and Non IT Background Sectors and Jobs. Using Data Visualizations to make wiser decisions which could land the Business to make bigger profits and understand the root cause and behavioral analysis of people and customers associated to it. In this Reposito…
Stars: ✭ 55 (-80.5%)
Hbase RddSpark RDD to read, write and delete from HBase
Stars: ✭ 277 (-1.77%)
Spark-PMoFSpark Shuffle Optimization with RDMA+AEP
Stars: ✭ 28 (-90.07%)
Spark Fast TestsApache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)
Stars: ✭ 249 (-11.7%)
HyperspaceAn open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
Stars: ✭ 246 (-12.77%)
leaflet heatmap简单的可视化湖州通话数据 假设数据量很大,没法用浏览器直接绘制热力图,把绘制热力图这一步骤放到线下计算分析。使用Apache Spark并行计算数据之后,再使用Apache Spark绘制热力图,然后用leafletjs加载OpenStreetMap图层和热力图图层,以达到良好的交互效果。现在使用Apache Spark实现绘制,可能是Apache Spark不擅长这方面的计算或者是我没有设计好算法,并行计算的速度比不上单机计算。Apache Spark绘制热力图和计算代码在这 https://github.com/yuanzhaokang/ParallelizeHeatmap.git .
Stars: ✭ 13 (-95.39%)
DparkPython clone of Spark, a MapReduce alike framework in Python
Stars: ✭ 2,668 (+846.1%)
Big Data Rosetta CodeCode snippets for solving common big data problems in various platforms. Inspired by Rosetta Code
Stars: ✭ 254 (-9.93%)
docker-sparkApache Spark docker container image (Standalone mode)
Stars: ✭ 34 (-87.94%)
Azure Event Hubs☁️ Cloud-scale telemetry ingestion from any stream of data with Azure Event Hubs
Stars: ✭ 233 (-17.38%)
HelkThe Hunting ELK
Stars: ✭ 3,097 (+998.23%)
Ruby SparkRuby wrapper for Apache Spark
Stars: ✭ 221 (-21.63%)
Spark ExcelA Spark plugin for reading Excel files via Apache POI
Stars: ✭ 216 (-23.4%)
confluent-spark-avroSpark UDFs to deserialize Avro messages with schemas stored in Schema Registry.
Stars: ✭ 18 (-93.62%)
SparkrdmaRDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (-23.76%)
Example SparkSpark, Spark Streaming and Spark SQL unit testing strategies
Stars: ✭ 205 (-27.3%)
Search Ads Web ServiceOnline search advertisement platform & Realtime Campaign Monitoring [Maybe Deprecated]
Stars: ✭ 30 (-89.36%)
CloudflowCloudflow enables users to quickly develop, orchestrate, and operate distributed streaming applications on Kubernetes.
Stars: ✭ 278 (-1.42%)
Knowage ServerKnowage is the professional open source suite for modern business analytics over traditional sources and big data systems.
Stars: ✭ 276 (-2.13%)
Docker Spark ClusterA simple spark standalone cluster for your testing environment purposses
Stars: ✭ 261 (-7.45%)
blogblog entries
Stars: ✭ 39 (-86.17%)