LikelikeAn implementation of locality sensitive hashing with Hadoop
Stars: ✭ 58 (-25.64%)
Spark FlamegraphEasy CPU Profiling for Apache Spark applications
Stars: ✭ 30 (-61.54%)
Vagrant ProjectsVagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR
Stars: ✭ 34 (-56.41%)
Zemberek Nlp ServerREST Docker server built on the Zemberek Turkish NLP Java library
Stars: ✭ 60 (-23.08%)
AkkeeperAn easy way to deploy your Akka services to a distributed environment.
Stars: ✭ 30 (-61.54%)
Docker HadoopApache Hadoop docker image
Stars: ✭ 1,190 (+1425.64%)
SparkmagicJupyter magics and kernels for working with remote Spark clusters
Stars: ✭ 954 (+1123.08%)
Pyspark ExamplesCode examples on Apache Spark using python
Stars: ✭ 58 (-25.64%)
PucketBucketing and partitioning system for Parquet
Stars: ✭ 29 (-62.82%)
AtsdAxibase Time Series Database Documentation
Stars: ✭ 68 (-12.82%)
Rumble⛈️ Rumble 1.11.0 "Banyan Tree"🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Stars: ✭ 58 (-25.64%)
Storm Camel ExampleReal-time analysis and visualization with Storm-AMQ-Camel-Websockets-Highcharts integration.
Stars: ✭ 28 (-64.1%)
HeraclesHigh performance HBase / Spark SQL engine
Stars: ✭ 27 (-65.38%)
SparkApache Spark - A unified analytics engine for large-scale data processing
Stars: ✭ 31,618 (+40435.9%)
Fast MrmrAn improved implementation of the classical feature selection method: minimum Redundancy and Maximum Relevance (mRMR).
Stars: ✭ 67 (-14.1%)
FlintA Time Series Library for Apache Spark
Stars: ✭ 878 (+1025.64%)
TedsdsApache Spark - Turbofan Engine Degradation Simulation Data Set example in Apache Spark
Stars: ✭ 14 (-82.05%)
Awesome PulsarA curated list of Pulsar tools, integrations and resources.
Stars: ✭ 57 (-26.92%)
Live log analyzer sparkSpark application for analyzing Apache access logs and detecting anomalies, with an accompanying Medium article.
Stars: ✭ 14 (-82.05%)
KontextfreiWriting application logic for Spark jobs that can be unit-tested without a SparkContext
Stars: ✭ 67 (-14.1%)
UrhoxUrho3D extension library
Stars: ✭ 13 (-83.33%)
Sparkling TitanicTraining models with Apache Spark, PySpark for Titanic Kaggle competition
Stars: ✭ 12 (-84.62%)
Pulsar SparkWhen Apache Pulsar meets Apache Spark
Stars: ✭ 55 (-29.49%)
MlfeatureFeature engineering toolkit for Spark MLlib.
Stars: ✭ 12 (-84.62%)
Lpa DetectorOptimize and improve the Label propagation algorithm
Stars: ✭ 75 (-3.85%)
SrcA light-weight distributed stream computing framework for Golang
Stars: ✭ 67 (-14.1%)
MareMaRe leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion.
Stars: ✭ 11 (-85.9%)
SparkjniA heterogeneous Apache Spark framework.
Stars: ✭ 11 (-85.9%)
Utils4sA collection of test cases and related reference material gathered while using Scala and Spark
Stars: ✭ 1,070 (+1271.79%)
Hadoop PotA scalable Apache Hadoop-based implementation of the Pooled Time Series video similarity algorithm, based on the CVPR 2015 paper by M. Ryoo et al.
Stars: ✭ 8 (-89.74%)
ThingsboardOpen-source IoT Platform - Device management, data collection, processing and visualization.
Stars: ✭ 10,526 (+13394.87%)
Spark Submit UiA Play Framework-based application for submitting Spark apps
Stars: ✭ 53 (-32.05%)
Tiledb VcfEfficient variant-call data storage and retrieval library using the TileDB storage library.
Stars: ✭ 26 (-66.67%)
Stormtweetssentimentd3vizComputes and visualizes the sentiment analysis of tweets of US States in real-time using Storm.
Stars: ✭ 25 (-67.95%)
Spark SwaggerSpark (http://sparkjava.com/) support for Swagger (https://swagger.io/)
Stars: ✭ 25 (-67.95%)
MobiusC# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (+1091.03%)
HomeApacheCN open source organization: announcements, introduction, members, activities, and contact channels
Stars: ✭ 1,199 (+1437.18%)
CleanframesType-class based data cleansing library for Apache Spark SQL
Stars: ✭ 75 (-3.85%)
RsparklingRSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (-16.67%)
Hadoop SolrCode to index HDFS to Solr using MapReduce
Stars: ✭ 51 (-34.62%)
ChroniclerScala toolchain for InfluxDB
Stars: ✭ 24 (-69.23%)
Spark NkpNatural Korean Processor for Apache Spark
Stars: ✭ 50 (-35.9%)
DigitrecognizerJava Convolutional Neural Network example for Hand Writing Digit Recognition
Stars: ✭ 23 (-70.51%)
Spark BigqueryGoogle BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.
Stars: ✭ 65 (-16.67%)
Basehttps://www.researchgate.net/profile/Rajah_Iyer
Stars: ✭ 48 (-38.46%)
LabsResearch on distributed system
Stars: ✭ 73 (-6.41%)
JumbuneJumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,
Stars: ✭ 64 (-17.95%)