Example SparkSpark, Spark Streaming and Spark SQL unit testing strategies
Stars: ✭ 205 (-80.84%)
Every Single Day I TldrA daily digest of the articles or videos I've found interesting, that I want to share with you.
Stars: ✭ 249 (-76.73%)
Spark.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+60.84%)
MobiusC# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (-13.18%)
Data AcceleratorData Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (-76.92%)
ThingsboardOpen-source IoT Platform - Device management, data collection, processing and visualization.
Stars: ✭ 10,526 (+883.74%)
LearningsparkScala examples for learning to use Spark
Stars: ✭ 421 (-60.65%)
Spark StatesCustom state store providers for Apache Spark
Stars: ✭ 83 (-92.24%)
Azure Event Hubs SparkEnabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (-86.92%)
Coolplayspark酷玩 Spark: Spark 源代码解析、Spark 类库等
Stars: ✭ 3,318 (+210.09%)
Kinesis SqlKinesis Connector for Structured Streaming
Stars: ✭ 120 (-88.79%)
Pyspark ExamplesCode examples on Apache Spark using python
Stars: ✭ 58 (-94.58%)
waspWASP is a framework to build complex real time big data applications. It relies on a kind of Kappa/Lambda architecture mainly leveraging Kafka and Spark. If you need to ingest huge amount of heterogeneous data and analyze them through complex pipelines, this is the framework for you.
Stars: ✭ 19 (-98.22%)
SpartaReal Time Analytics and Data Pipelines based on Spark Streaming
Stars: ✭ 513 (-52.06%)
WaterdropProduction Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+73.46%)
cassandra.realtimeDifferent ways to process data into Cassandra in realtime with technologies such as Kafka, Spark, Akka, Flink
Stars: ✭ 25 (-97.66%)
CloudflowCloudflow enables users to quickly develop, orchestrate, and operate distributed streaming applications on Kubernetes.
Stars: ✭ 278 (-74.02%)
GimelBig Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (-79.81%)
CdapAn open source framework for building data analytic applications.
Stars: ✭ 509 (-52.43%)
AngelA Flexible and Powerful Parameter Server for large-scale machine learning
Stars: ✭ 6,458 (+503.55%)
SparkmagicJupyter magics and kernels for working with remote Spark clusters
Stars: ✭ 954 (-10.84%)
GatkOfficial code repository for GATK versions 4 and up
Stars: ✭ 1,002 (-6.36%)
PucketBucketing and partitioning system for Parquet
Stars: ✭ 29 (-97.29%)
Data Algorithms Book MapReduce, Spark, Java, and Scala for Data Algorithms Book
Stars: ✭ 949 (-11.31%)
Akka Typed Sessionadd-on to Akka Typed that tracks effects for use with Session Types
Stars: ✭ 47 (-95.61%)
HeraclesHigh performance HBase / Spark SQL engine
Stars: ✭ 27 (-97.48%)
Project FortisRepository for all parts of the Fortis architecture
Stars: ✭ 27 (-97.48%)
PixiedustPython Helper library for Jupyter Notebooks
Stars: ✭ 998 (-6.73%)
SparkApache Spark - A unified analytics engine for large-scale data processing
Stars: ✭ 31,618 (+2854.95%)
Spark NkpNatural Korean Processor for Apache Spark
Stars: ✭ 50 (-95.33%)
Psf LoginserverEmulated PlanetSide 1 world and login server by the PSForever project.
Stars: ✭ 46 (-95.7%)
Lagom ExampleExample usage of the Lagom Framework for writing Java-based microservices
Stars: ✭ 20 (-98.13%)
SnappydataProject SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster
Stars: ✭ 995 (-7.01%)
FlintA Time Series Library for Apache Spark
Stars: ✭ 878 (-17.94%)
TedsdsApache Spark - Turbofan Engine Degradation Simulation Data Set example in Apache Spark
Stars: ✭ 14 (-98.69%)
HeimdallrHeimdallr, a Large-scale chat application server based on Redis Pubsub and Akka's actor model.
Stars: ✭ 38 (-96.45%)
Live log analyzer sparkSpark Application for analysis of Apache Access logs and detect anamolies! Along with Medium Article.
Stars: ✭ 14 (-98.69%)
UrhoxUrho3D extension library
Stars: ✭ 13 (-98.79%)
Optimus🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (-7.85%)
Sparkling TitanicTraining models with Apache Spark, PySpark for Titanic Kaggle competition
Stars: ✭ 12 (-98.88%)
NsdbNatural Series Database
Stars: ✭ 49 (-95.42%)
Akka WampWAMP - Web Application Messaging Protocol implementation written with Akka
Stars: ✭ 45 (-95.79%)
MlfeatureFeature engineering toolkit for Spark MLlib.
Stars: ✭ 12 (-98.88%)
MareMaRe leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion.
Stars: ✭ 11 (-98.97%)
SparkjniA heterogeneous Apache Spark framework.
Stars: ✭ 11 (-98.97%)
WormholeWormhole is a SPaaS (Stream Processing as a Service) Platform
Stars: ✭ 863 (-19.35%)
Spark TdaSparkTDA is a package for Apache Spark providing Topological Data Analysis Functionalities.
Stars: ✭ 45 (-95.79%)