SparkFirely's open source FHIR server
Stars: ✭ 174 (-30.12%)
Example SparkSpark, Spark Streaming and Spark SQL unit testing strategies
Stars: ✭ 205 (-17.67%)
Sparkstreaming💥 🚀 封装sparkstreaming动态调节batch time(有数据就执行计算);🚀 支持运行过程中增删topic;🚀 封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。
Stars: ✭ 179 (-28.11%)
HandysparkHandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ✭ 158 (-36.55%)
GimelBig Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (-13.25%)
TransmogrifaiTransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Stars: ✭ 2,084 (+736.95%)
LinkisLinkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,323 (+832.93%)
Spark PracticeApache Spark (PySpark) Practice on Real Data
Stars: ✭ 200 (-19.68%)
RoaringbitmapA better compressed bitset in Java
Stars: ✭ 2,460 (+887.95%)
SparkmonitorMonitor Apache Spark from Jupyter Notebook
Stars: ✭ 154 (-38.15%)
Sagemaker SparkA Spark library for Amazon SageMaker.
Stars: ✭ 219 (-12.05%)
Azure Event Hubs☁️ Cloud-scale telemetry ingestion from any stream of data with Azure Event Hubs
Stars: ✭ 233 (-6.43%)
VividusVividus is all in one test automation tool
Stars: ✭ 170 (-31.73%)
RecommendationsystemBook recommender system using collaborative filtering based on Spark
Stars: ✭ 244 (-2.01%)
Whylogs JavaProfile and monitor your ML data pipeline end-to-end
Stars: ✭ 164 (-34.14%)
GlowAn open-source toolkit for large-scale genomic analysis
Stars: ✭ 159 (-36.14%)
DatageneDataGene - Identify How Similar TS Datasets Are to One Another (by @firmai)
Stars: ✭ 156 (-37.35%)
ScannsA scalable nearest neighbor search library in Apache Spark
Stars: ✭ 190 (-23.69%)
Kotlin Spark ApiThis projects gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x
Stars: ✭ 183 (-26.51%)
QuillCompile-time Language Integrated Queries for Scala
Stars: ✭ 1,998 (+702.41%)
Ruby SparkRuby wrapper for Apache Spark
Stars: ✭ 221 (-11.24%)
Hadoop Docker基于Docker构建的Hadoop开发测试环境,包含Hadoop,Hive,HBase,Spark
Stars: ✭ 238 (-4.42%)
XsqlUnified SQL Analytics Engine Based on SparkSQL
Stars: ✭ 176 (-29.32%)
Spark ExcelA Spark plugin for reading Excel files via Apache POI
Stars: ✭ 216 (-13.25%)
Kraps RpcA RPC framework leveraging Spark RPC module
Stars: ✭ 175 (-29.72%)
DparkPython clone of Spark, a MapReduce alike framework in Python
Stars: ✭ 2,668 (+971.49%)
Spark NlpState of the Art Natural Language Processing
Stars: ✭ 2,518 (+911.24%)
SparkrdmaRDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (-13.65%)
Deeplearning4jSuite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learni…
Stars: ✭ 12,277 (+4830.52%)
Vim VspecVim plugin: Testing framework for Vim script
Stars: ✭ 207 (-16.87%)
GeopysparkGeoTrellis for PySpark
Stars: ✭ 167 (-32.93%)
HyperspaceAn open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
Stars: ✭ 246 (-1.2%)
Big WhaleSpark、Flink等离线任务的调度以及实时任务的监控
Stars: ✭ 163 (-34.54%)
Spark Knnk-Nearest Neighbors algorithm on Spark
Stars: ✭ 205 (-17.67%)
MydatascienceportfolioApplying Data Science and Machine Learning to Solve Real World Business Problems
Stars: ✭ 227 (-8.84%)
Vue Info CardSimple and beautiful card component with an elegant spark line, for VueJS.
Stars: ✭ 159 (-36.14%)
MmlsparkSimple and Distributed Machine Learning
Stars: ✭ 2,899 (+1064.26%)
BanditHuman-friendly unit testing for C++11
Stars: ✭ 240 (-3.61%)
GeniA Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-38.96%)
BallistaDistributed compute platform implemented in Rust, and powered by Apache Arrow.
Stars: ✭ 2,274 (+813.25%)
Spark WorkshopApache Spark™ and Scala Workshops
Stars: ✭ 224 (-10.04%)
Js SparkRealtime calculation distributed system. AKA distributed lodash
Stars: ✭ 187 (-24.9%)
Data AcceleratorData Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (-0.8%)
Neo4j Spark ConnectorNeo4j Connector for Apache Spark, which provides bi-directional read/write access to Neo4j from Spark, using the Spark DataSource APIs
Stars: ✭ 245 (-1.61%)
Recheck Webrecheck for web apps – change comparison tool with local Golden Masters, Git-like ignore syntax and "Unbreakable Selenium" tests.
Stars: ✭ 224 (-10.04%)
AzuredatabricksbestpracticesVersion 1 of Technical Best Practices of Azure Databricks based on real world Customer and Technical SME inputs
Stars: ✭ 186 (-25.3%)