Pyspark ExamplesCode examples on Apache Spark using python
Stars: ✭ 58 (-75.73%)
GimelBig Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (-9.62%)
Spark StatesCustom state store providers for Apache Spark
Stars: ✭ 83 (-65.27%)
Data AcceleratorData Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (+3.35%)
Utils4sscala、spark使用过程中,各种测试用例以及相关资料整理
Stars: ✭ 1,070 (+347.7%)
Tweet-Analysis-With-Kafka-and-SparkA real time analytics dashboard to analyze the trending hashtags and @ mentions at any location using kafka and spark streaming.
Stars: ✭ 18 (-92.47%)
Example SparkSpark, Spark Streaming and Spark SQL unit testing strategies
Stars: ✭ 205 (-14.23%)
Project FortisRepository for all parts of the Fortis architecture
Stars: ✭ 27 (-88.7%)
interview-refresh-java-bigdataa one-stop repo to lookup for code snippets of core java concepts, sql, data structures as well as big data. It also consists of interview questions asked in real-life.
Stars: ✭ 25 (-89.54%)
RegistrySchema Registry
Stars: ✭ 184 (-23.01%)
WormholeWormhole is a SPaaS (Stream Processing as a Service) Platform
Stars: ✭ 863 (+261.09%)
MobiusC# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (+288.7%)
Bandar LogMonitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.
Stars: ✭ 19 (-92.05%)
AngelA Flexible and Powerful Parameter Server for large-scale machine learning
Stars: ✭ 6,458 (+2602.09%)
Spark ALS基于spark-ml,spark-mllib,spark-streaming的推荐算法实现
Stars: ✭ 89 (-62.76%)
Bigdata PlaygroundA complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (-25.94%)
SpartaReal Time Analytics and Data Pipelines based on Spark Streaming
Stars: ✭ 513 (+114.64%)
ScramjetSimple yet powerful live data computation framework
Stars: ✭ 171 (-28.45%)
CdapAn open source framework for building data analytic applications.
Stars: ✭ 509 (+112.97%)
LearningsparkScala examples for learning to use Spark
Stars: ✭ 421 (+76.15%)
Movie recommend基于Spark的电影推荐系统,包含爬虫项目、web网站、后台管理系统以及spark推荐系统
Stars: ✭ 2,092 (+775.31%)
SylphStream computing platform for bigdata
Stars: ✭ 362 (+51.46%)
Coolplayspark酷玩 Spark: Spark 源代码解析、Spark 类库等
Stars: ✭ 3,318 (+1288.28%)
StreamlineStreamLine - Streaming Analytics
Stars: ✭ 151 (-36.82%)
bandar-logMonitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.
Stars: ✭ 20 (-91.63%)
fdp-modelserverAn umbrella project for multiple implementations of model serving
Stars: ✭ 47 (-80.33%)