Utils4s: Test cases and related reference material collected while working with Scala and Spark
Stars: ✭ 1,070 (+791.67%)
Spark States: Custom state store providers for Apache Spark
Stars: ✭ 83 (-30.83%)
Gimel: Big Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (+80%)
Sparta: Real Time Analytics and Data Pipelines based on Spark Streaming
Stars: ✭ 513 (+327.5%)
Spark: .NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+1334.17%)
Waterdrop: Production Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+1446.67%)
Example Spark: Spark, Spark Streaming and Spark SQL unit testing strategies
Stars: ✭ 205 (+70.83%)
Coolplayspark: Coolplay Spark (酷玩 Spark), covering Spark source code analysis, Spark libraries, and more
Stars: ✭ 3,318 (+2665%)
Angel: A Flexible and Powerful Parameter Server for large-scale machine learning
Stars: ✭ 6,458 (+5281.67%)
Mobius: C# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (+674.17%)
Learningspark: Scala examples for learning to use Spark
Stars: ✭ 421 (+250.83%)
Cdap: An open source framework for building data analytic applications.
Stars: ✭ 509 (+324.17%)
Azure Event Hubs Spark: Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (+16.67%)
Data Accelerator: Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (+105.83%)
Pyspark Examples: Code examples on Apache Spark using Python
Stars: ✭ 58 (-51.67%)
Spark Py Notebooks: Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+1015%)
Pyspark Cheatsheet: 🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (-10%)
Archivespark: An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.
Stars: ✭ 111 (-7.5%)
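Flink Learning: Flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink fundamentals, concepts, internals, hands-on practice, performance tuning, and source code analysis. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, Table API & SQL, and more, plus case studies of large production Flink projects (PV/UV, log storage, real-time deduplication of ten-billion-scale data, monitoring and alerting). Support for the author's column 《大数据实时计算引擎 Flink 实战与性能优化》 (Big Data Real-Time Computing Engine Flink: Practice and Performance Optimization) is welcome.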
Stars: ✭ 11,378 (+9381.67%)
Ammonite Spark: Run Spark calculations from Ammonite
Stars: ✭ 88 (-26.67%)
Repository: Personal learning knowledge base covering data warehouse modeling, real-time computing, big data, Java, algorithms, and more.
Stars: ✭ 92 (-23.33%)
Python Bigdata: Data science and Big Data with Python
Stars: ✭ 112 (-6.67%)
Big Data: 🔧 Use dplyr to analyze Big Data 🐘
Stars: ✭ 93 (-22.5%)
Hnswlib: Java library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
Stars: ✭ 108 (-10%)
Spark Lucenerdd: Spark RDD with Lucene's query and entity linkage capabilities
Stars: ✭ 114 (-5%)
Spark Nlp Models: Models and Pipelines for the Spark NLP library
Stars: ✭ 88 (-26.67%)
Seldon Server: Machine Learning Platform and Recommendation Engine built on Kubernetes
Stars: ✭ 1,435 (+1095.83%)
Elephas: Distributed Deep learning with Keras & Spark
Stars: ✭ 1,521 (+1167.5%)
Logigsk: A Linux-based software package to control LEDs on Logitech G910, G810, G610 and G410.
Stars: ✭ 107 (-10.83%)
Cuesheet: A framework for writing Spark 2.x applications in a pretty way
Stars: ✭ 86 (-28.33%)
Flint: Webex Bot SDK for Node.js (deprecated in favor of https://github.com/webex/webex-bot-node-framework)
Stars: ✭ 85 (-29.17%)
Spark On K8s Operator: Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Stars: ✭ 1,780 (+1383.33%)
Hops Examples: Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops
Stars: ✭ 84 (-30%)
Ibis: A pandas-like deferred expression system, with first-class SQL support
Stars: ✭ 1,630 (+1258.33%)
Spring Shiro Spark: Spring-Shiro-Spark is an attempt at integrating Spring-Boot, Hibernate, Spark, Spark-SQL, Shiro, iView, VueJs... ...
Stars: ✭ 114 (-5%)
Lambda Arch: Applying Lambda Architecture with Spark, Kafka, and Cassandra.
Stars: ✭ 111 (-7.5%)
Sparktutorial: Source code for James Lee's Apache Spark with Java course
Stars: ✭ 105 (-12.5%)
Hadoop cookbook: Cookbook to install Hadoop 2.0+ using Chef
Stars: ✭ 82 (-31.67%)
Splash: Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
Stars: ✭ 105 (-12.5%)
Mleap: MLeap: Deploy ML Pipelines to Production
Stars: ✭ 1,232 (+926.67%)
Lehar: Visualize data using relative ordering
Stars: ✭ 81 (-32.5%)
Spark Gbtlr: Hybrid model of Gradient Boosting Trees and Logistic Regression (GBDT+LR) on Spark
Stars: ✭ 81 (-32.5%)
Setl: A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-34.17%)
Spark Ffm: FFM (Field-aware Factorization Machine) on Spark
Stars: ✭ 101 (-15.83%)
Docker Spark: 🚢 Docker image for Apache Spark
Stars: ✭ 78 (-35%)