openverse-catalog
Identifies and collects data on CC-licensed content across web crawl data and public APIs.
Stars: ✭ 27 (-3.57%)
Example Spark
Spark, Spark Streaming and Spark SQL unit testing strategies
Stars: ✭ 205 (+632.14%)
Spark Practice
Apache Spark (PySpark) Practice on Real Data
Stars: ✭ 200 (+614.29%)
darpc
DaRPC: Data Center Remote Procedure Call
Stars: ✭ 49 (+75%)
Scanns
A scalable nearest neighbor search library in Apache Spark
Stars: ✭ 190 (+578.57%)
ksmbd
ksmbd kernel server (SMB/CIFS server)
Stars: ✭ 98 (+250%)
Azuredatabricksbestpractices
Version 1 of Technical Best Practices of Azure Databricks, based on real-world Customer and Technical SME inputs
Stars: ✭ 186 (+564.29%)
pDPM
Passive Disaggregated Persistent Memory at USENIX ATC 2020.
Stars: ✭ 38 (+35.71%)
Roaringbitmap
A better compressed bitset in Java
Stars: ✭ 2,460 (+8685.71%)
docker-spark
Apache Spark docker container image (Standalone mode)
Stars: ✭ 34 (+21.43%)
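Sparkstreaming
💥 🚀 Wraps sparkstreaming to adjust the batch time dynamically (computation runs whenever data is available); 🚀 supports adding and removing topics at runtime; 🚀 wraps sparkstreaming 1.6 - kafka 010 to support SSL.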
Stars: ✭ 179 (+539.29%)
ashuffle
Automatic library-wide shuffle for mpd.
Stars: ✭ 64 (+128.57%)
spark-druid-olap
Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform (http://bit.ly/2oBJSpP), an integrated BI platform on Apache Spark.
Stars: ✭ 286 (+921.43%)
Spark
Firely's open source FHIR server
Stars: ✭ 174 (+521.43%)
Coyote
Framework providing operating system abstractions and a range of shared networking (RDMA, TCP/IP) and memory services to common modern heterogeneous platforms.
Stars: ✭ 80 (+185.71%)
Deeplearning4j
Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learni…
Stars: ✭ 12,277 (+43746.43%)
Turbo-Transpose
Transpose: SIMD Integer+Floating Point Compression Filter
Stars: ✭ 50 (+78.57%)
Geopyspark
GeoTrellis for PySpark
Stars: ✭ 167 (+496.43%)
swordfish
Open-source distributed workflow scheduling tool that also supports streaming tasks.
Stars: ✭ 35 (+25%)
Big Whale
Scheduling of offline jobs (Spark, Flink, etc.) and monitoring of real-time jobs
Stars: ✭ 163 (+482.14%)
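leaflet heatmap
A simple visualization of Huzhou call data. Assuming the data volume is too large for the browser to draw the heatmap directly, the heatmap rendering step is moved to offline computation and analysis. Apache Spark processes the data in parallel and then draws the heatmap, and leafletjs loads the OpenStreetMap layer and the heatmap layer for good interactivity. With rendering currently implemented in Apache Spark, the parallel computation is slower than single-machine computation, possibly because Apache Spark is not well suited to this kind of computation or because I did not design the algorithm well. The Apache Spark heatmap rendering and computation code is at https://github.com/yuanzhaokang/ParallelizeHeatmap.git .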
Stars: ✭ 13 (-53.57%)
Vue Info Card
Simple and beautiful card component with an elegant spark line, for VueJS.
Stars: ✭ 159 (+467.86%)
Spark Fast Tests
Apache Spark testing helpers (dependency-free; works with Scalatest, uTest, and MUnit)
Stars: ✭ 249 (+789.29%)
Spark-Ar
Resources for Spark AR
Stars: ✭ 43 (+53.57%)
Geni
A Clojure dataframe library that runs on Spark
Stars: ✭ 152 (+442.86%)
Hyperspace
An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
Stars: ✭ 246 (+778.57%)
Sparkmonitor
Monitor Apache Spark from Jupyter Notebook
Stars: ✭ 154 (+450%)
Search Ads Web Service
Online search advertisement platform & Realtime Campaign Monitoring [Maybe Deprecated]
Stars: ✭ 30 (+7.14%)
Spark.jl
Julia binding for Apache Spark
Stars: ✭ 153 (+446.43%)
Dpark
Python clone of Spark, a MapReduce-like framework in Python
Stars: ✭ 2,668 (+9428.57%)
Spark Tsne
Distributed t-SNE via Apache Spark
Stars: ✭ 151 (+439.29%)
fastdata-cluster
Fast Data Cluster (Apache Cassandra, Kafka, Spark, Flink, YARN and HDFS with Vagrant and VirtualBox)
Stars: ✭ 20 (-28.57%)
Benchm Ml
A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib, etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks, etc.).
Stars: ✭ 1,835 (+6453.57%)
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+435.71%)
Azure Event Hubs
☁️ Cloud-scale telemetry ingestion from any stream of data with Azure Event Hubs
Stars: ✭ 233 (+732.14%)
spark-stringmetric
Spark functions to run popular phonetic and string matching algorithms
Stars: ✭ 51 (+82.14%)
Nd4j
Fast, Scientific and Numerical Computing for the JVM (NDArrays)
Stars: ✭ 1,742 (+6121.43%)
Rasterframes
Geospatial Raster support for Spark DataFrames
Stars: ✭ 142 (+407.14%)
Ruby Spark
Ruby wrapper for Apache Spark
Stars: ✭ 221 (+689.29%)
kafka-compose
🎼 Docker compose files for various kafka stacks
Stars: ✭ 32 (+14.29%)
spark-acid
ACID Data Source for Apache Spark based on Hive ACID
Stars: ✭ 91 (+225%)
spark-util
Low-level helpers for Apache Spark libraries and tests
Stars: ✭ 16 (-42.86%)
scst
No description or website provided.
Stars: ✭ 61 (+117.86%)