Coolplayspark酷玩 Spark: Spark 源代码解析、Spark 类库等
Stars: ✭ 3,318 (+1518.54%)
LearningsparkScala examples for learning to use Spark
Stars: ✭ 421 (+105.37%)
AngelA Flexible and Powerful Parameter Server for large-scale machine learning
Stars: ✭ 6,458 (+3050.24%)
MobiusC# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (+353.17%)
Azure Event Hubs SparkEnabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (-31.71%)
Pyspark ExamplesCode examples on Apache Spark using python
Stars: ✭ 58 (-71.71%)
GimelBig Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (+5.37%)
SpartaReal Time Analytics and Data Pipelines based on Spark Streaming
Stars: ✭ 513 (+150.24%)
CdapAn open source framework for building data analytic applications.
Stars: ✭ 509 (+148.29%)
Spark StatesCustom state store providers for Apache Spark
Stars: ✭ 83 (-59.51%)
WaterdropProduction Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+805.37%)
Utils4sscala、spark使用过程中,各种测试用例以及相关资料整理
Stars: ✭ 1,070 (+421.95%)
Data AcceleratorData Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (+20.49%)
Kinesis SqlKinesis Connector for Structured Streaming
Stars: ✭ 120 (-41.46%)
Spark.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+739.51%)
Kraps RpcA RPC framework leveraging Spark RPC module
Stars: ✭ 175 (-14.63%)
GeniA Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-25.85%)
SparkmonitorMonitor Apache Spark from Jupyter Notebook
Stars: ✭ 154 (-24.88%)
RegistrySchema Registry
Stars: ✭ 184 (-10.24%)
ScramjetSimple yet powerful live data computation framework
Stars: ✭ 171 (-16.59%)
QuillCompile-time Language Integrated Queries for Scala
Stars: ✭ 1,998 (+874.63%)
GlowAn open-source toolkit for large-scale genomic analysis
Stars: ✭ 159 (-22.44%)
HandysparkHandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ✭ 158 (-22.93%)
AzuredatabricksbestpracticesVersion 1 of Technical Best Practices of Azure Databricks based on real world Customer and Technical SME inputs
Stars: ✭ 186 (-9.27%)
SparkFirely's open source FHIR server
Stars: ✭ 174 (-15.12%)
Movie recommend基于Spark的电影推荐系统,包含爬虫项目、web网站、后台管理系统以及spark推荐系统
Stars: ✭ 2,092 (+920.49%)
Spark PracticeApache Spark (PySpark) Practice on Real Data
Stars: ✭ 200 (-2.44%)
Spark.jlJulia binding for Apache Spark
Stars: ✭ 153 (-25.37%)
Spark NlpState of the Art Natural Language Processing
Stars: ✭ 2,518 (+1128.29%)
PowderkegLive-coding the cluster!
Stars: ✭ 152 (-25.85%)
StreamlineStreamLine - Streaming Analytics
Stars: ✭ 151 (-26.34%)
Kotlin Spark ApiThis projects gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x
Stars: ✭ 183 (-10.73%)
Deeplearning4jSuite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learni…
Stars: ✭ 12,277 (+5888.78%)
Spark TsneDistributed t-SNE via Apache Spark
Stars: ✭ 151 (-26.34%)
TransmogrifaiTransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Stars: ✭ 2,084 (+916.59%)
Benchm MlA minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).
Stars: ✭ 1,835 (+795.12%)
AztkAZTK powered by Azure Batch: On-demand, Dockerized, Spark Jobs on Azure
Stars: ✭ 152 (-25.85%)
BallistaDistributed compute platform implemented in Rust, and powered by Apache Arrow.
Stars: ✭ 2,274 (+1009.27%)
RoaringbitmapA better compressed bitset in Java
Stars: ✭ 2,460 (+1100%)
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-26.83%)
Cc PysparkProcess Common Crawl data with Python and Spark
Stars: ✭ 147 (-28.29%)
DatacompyPandas and Spark DataFrame comparison for humans
Stars: ✭ 147 (-28.29%)
GeopysparkGeoTrellis for PySpark
Stars: ✭ 167 (-18.54%)
Technology Talk汇总java生态圈常用技术框架、开源中间件,系统架构、数据库、大公司架构案例、常用三方类库、项目管理、线上问题排查、个人成长、思考等知识
Stars: ✭ 12,136 (+5820%)
Nd4jFast, Scientific and Numerical Computing for the JVM (NDArrays)
Stars: ✭ 1,742 (+749.76%)