SplashSplash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
Stars: ✭ 105 (-14.63%)
Spring Shiro SparkSpring-Shiro-Spark is an attempt at integrating Spring-Boot Hibernate Spark Spark-SQL Shiro iView VueJs... ...
Stars: ✭ 114 (-7.32%)
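Alfred WorkflowAlfred Workflow tutorials and examples; CDto: open Terminal and change to the directory containing any folder or file; Effective IP: look up the local and public IP addresses, resolve the IP address of any URL or domain name, and query geolocation and carrier information; UpdateAllNPM: update all global Node.js modules; UpdateAllPIP: update all Python modules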
Stars: ✭ 98 (-20.33%)
Cube.js📊 Cube — Open-Source Analytics API for Building Data Apps
Stars: ✭ 11,983 (+9642.28%)
Seldon ServerMachine Learning Platform and Recommendation Engine built on Kubernetes
Stars: ✭ 1,435 (+1066.67%)
TeddySpark Streaming monitoring platform with support for job deployment, alerting, and automatic startup
Stars: ✭ 120 (-2.44%)
Python BigdataData science and Big Data with Python
Stars: ✭ 112 (-8.94%)
RevogridPowerful virtual data grid smartsheet with advanced customization. Best features from Excel plus incredible performance 🔋
Stars: ✭ 1,870 (+1420.33%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+987.8%)
Hosts BlocklistsAutomatically updated, moderated and optimized lists for blocking ads, trackers, malware and other garbage
Stars: ✭ 1,749 (+1321.95%)
Filter GraftingFilter Grafting for Deep Neural Networks (CVPR 2020)
Stars: ✭ 110 (-10.57%)
HnswlibJava library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
Stars: ✭ 108 (-12.2%)
Spark LucenerddSpark RDD with Lucene's query and entity linkage capabilities
Stars: ✭ 114 (-7.32%)
Spark On K8s OperatorKubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Stars: ✭ 1,780 (+1347.15%)
Ng2 Smart TableAngular Smart Data Table component
Stars: ✭ 1,590 (+1192.68%)
BoswatchPython Script to process input data from rtl_fm and multimon-NG - multiple Plugin support
Stars: ✭ 101 (-17.89%)
ElassandraElassandra = Elasticsearch + Apache Cassandra
Stars: ✭ 1,610 (+1208.94%)
SchemerSchema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
Stars: ✭ 97 (-21.14%)
Lambda ArchApplying Lambda Architecture with Spark, Kafka, and Cassandra.
Stars: ✭ 111 (-9.76%)
BoomfiltersProbabilistic data structures for processing continuous, unbounded streams.
Stars: ✭ 1,333 (+983.74%)
IbisA pandas-like deferred expression system, with first-class SQL support
Stars: ✭ 1,630 (+1225.2%)
WaterdropProduction Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+1408.94%)
BigdataclassTwo-day workshop that covers how to use R to interact with databases and Spark
Stars: ✭ 110 (-10.57%)
Jsonapi.rbLightweight, simple and maintained JSON:API support for your next Ruby HTTP API.
Stars: ✭ 116 (-5.69%)
Parquet IndexSpark SQL index for Parquet tables
Stars: ✭ 109 (-11.38%)
DeequDeequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Stars: ✭ 2,020 (+1542.28%)
Pyspark Cheatsheet🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (-12.2%)
Active hash relationActiveHash Relation: Simple gem that allows you to run multiple ActiveRecord::Relation using hash. Perfect for APIs.
Stars: ✭ 115 (-6.5%)
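Flink Learningflink learning blog. http://www.54tianzhisheng.cn/ Covers Flink basics, concepts, internals, hands-on practice, performance tuning, and source-code analysis. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, Table API & SQL, and more, plus case studies of large Flink production projects (PVUV, log storage, real-time deduplication of tens of billions of records, monitoring and alerting). Support for my column "Big Data Real-Time Computing Engine: Flink in Practice and Performance Optimization" is welcome.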
Stars: ✭ 11,378 (+9150.41%)
Vue2 Bootstrap TableA sortable and searchable table, as a Vue2 component, using bootstrap styling.
Stars: ✭ 120 (-2.44%)
LogigskA Linux-based software package to control LEDs on Logitech G910, G810, G610 and G410.
Stars: ✭ 107 (-13.01%)
SparktutorialSource code for James Lee's Apache Spark with Java course
Stars: ✭ 105 (-14.63%)
FungenReplace boilerplate code with functional patterns using 'go generate'
Stars: ✭ 122 (-0.81%)
SearchableSearch/filter functionality for Laravel's Eloquent models
Stars: ✭ 113 (-8.13%)
Spark FfmFFM (Field-aware Factorization Machine) on Spark
Stars: ✭ 101 (-17.89%)
Kinesis SqlKinesis Connector for Structured Streaming
Stars: ✭ 120 (-2.44%)
MuuriInfinite responsive, sortable, filterable and draggable layouts
Stars: ✭ 9,797 (+7865.04%)
Xlearning Xdmlextremely distributed machine learning
Stars: ✭ 113 (-8.13%)
AlmondA Scala kernel for Jupyter
Stars: ✭ 1,354 (+1000.81%)
GlslsmartdenoiseFast GLSL deNoise spatial filter with circular Gaussian kernel, fully configurable
Stars: ✭ 121 (-1.63%)
LogislandScalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (-21.14%)
FaltuSearch, sort, filter, and limit an array of objects in Mongo style.
Stars: ✭ 112 (-8.94%)
Wavelets.jlA Julia package for fast discrete wavelet transforms and utilities
Stars: ✭ 118 (-4.07%)
ArchivesparkAn Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.
Stars: ✭ 111 (-9.76%)
SieveA simple, clean and elegant way to filter Eloquent models.
Stars: ✭ 123 (+0%)
Spark AlchemyCollection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
Stars: ✭ 122 (-0.81%)
ZparkioBoilerplate framework to use Spark and ZIO together.
Stars: ✭ 121 (-1.63%)
ElephasDistributed Deep learning with Keras & Spark
Stars: ✭ 1,521 (+1136.59%)