Spring Shiro Spark
Spring-Shiro-Spark is an attempt to integrate Spring-Boot, Hibernate, Spark, Spark-SQL, Shiro, iView, VueJs... ...
Stars: ✭ 114 (-31.33%)
Lift
The LinkedIn Fairness Toolkit (LiFT) is a Scala/Spark library that enables the measurement of fairness in large scale machine learning workflows.
Stars: ✭ 127 (-23.49%)
Cape Python
Collaborate on privacy-preserving policy for data science projects in Pandas and Apache Spark
Stars: ✭ 125 (-24.7%)
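Technology Talk
A collection of knowledge on commonly used technical frameworks in the Java ecosystem, open-source middleware, system architecture, databases, architecture case studies from large companies, commonly used third-party libraries, project management, troubleshooting production issues, personal growth, reflections, and more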
Stars: ✭ 12,136 (+7210.84%)
Spark Bigquery Connector
BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
Stars: ✭ 126 (-24.1%)
Vue Info Card
Simple and beautiful card component with an elegant spark line, for VueJS.
Stars: ✭ 159 (-4.22%)
Logdeep
log anomaly detection toolkit including DeepLog
Stars: ✭ 125 (-24.7%)
Stumpy
STUMPY is a powerful and scalable Python library for modern time series analysis
Stars: ✭ 2,019 (+1116.27%)
Spark Infotheoretic Feature Selection
This package contains a generic implementation of greedy Information Theoretic Feature Selection (FS) methods. The implementation is based on the common theoretic framework presented by Gavin Brown. Implementations of mRMR, InfoGain, JMI and other commonly used FS filters are provided.
Stars: ✭ 123 (-25.9%)
Quill
Compile-time Language Integrated Queries for Scala
Stars: ✭ 1,998 (+1103.61%)
Deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Stars: ✭ 2,020 (+1116.87%)
Rasterframes
Geospatial Raster support for Spark DataFrames
Stars: ✭ 142 (-14.46%)
Big Whale
Scheduling of offline jobs such as Spark and Flink, and monitoring of real-time jobs
Stars: ✭ 163 (-1.81%)
Teddy
Spark Streaming monitoring platform, supporting job deployment, alerting, and automatic startup
Stars: ✭ 120 (-27.71%)
Pyodds
An End-to-end Outlier Detection System
Stars: ✭ 141 (-15.06%)
Elassandra
Elassandra = Elasticsearch + Apache Cassandra
Stars: ✭ 1,610 (+869.88%)
Kitnet Py
KitNET is a lightweight online anomaly detection algorithm, which uses an ensemble of autoencoders.
Stars: ✭ 152 (-8.43%)
Cube.js
📊 Cube: Open-Source Analytics API for Building Data Apps
Stars: ✭ 11,983 (+7118.67%)
Azure Event Hubs Spark
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (-15.66%)
Spark Lucenerdd
Spark RDD with Lucene's query and entity linkage capabilities
Stars: ✭ 114 (-31.33%)
Remixautoml
R package for automation of machine learning, forecasting, feature engineering, model evaluation, model interpretation, data generation, and recommenders.
Stars: ✭ 159 (-4.22%)
Sparkling Graph
SparklingGraph provides an easy-to-use set of features that let you process large-scale graphs using Spark and GraphX.
Stars: ✭ 139 (-16.27%)
Pytorch cpp
Deep Learning sample programs using PyTorch in C++
Stars: ✭ 114 (-31.33%)
Xlearning Xdml
extremely distributed machine learning
Stars: ✭ 113 (-31.93%)
Powderkeg
Live-coding the cluster!
Stars: ✭ 152 (-8.43%)
Python Bigdata
Data science and Big Data with Python
Stars: ✭ 112 (-32.53%)
Gpnd
Generative Probabilistic Novelty Detection with Adversarial Autoencoders
Stars: ✭ 112 (-32.53%)
Archivespark
An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.
Stars: ✭ 111 (-33.13%)
Whylogs Java
Profile and monitor your ML data pipeline end-to-end
Stars: ✭ 164 (-1.2%)
Glow
An open-source toolkit for large-scale genomic analysis
Stars: ✭ 159 (-4.22%)
Adaptive Alerting
Anomaly detection for streaming time series, featuring automated model selection.
Stars: ✭ 152 (-8.43%)
Elephas
Distributed Deep learning with Keras & Spark
Stars: ✭ 1,521 (+816.27%)
Lambda Arch
Applying Lambda Architecture with Spark, Kafka, and Cassandra.
Stars: ✭ 111 (-33.13%)
Waterdrop
Production Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+1018.07%)
Quicksql
A Flexible, Fast, Federated (3F) SQL Analysis Middleware for Multiple Data Sources
Stars: ✭ 1,821 (+996.99%)
Bigdataclass
Two-day workshop that covers how to use R to interact with databases and Spark
Stars: ✭ 110 (-33.73%)
Novelty Detection
Latent space autoregression for novelty detection.
Stars: ✭ 152 (-8.43%)
Parquet Index
Spark SQL index for Parquet tables
Stars: ✭ 109 (-34.34%)
Apache Spark Node
Node.js bindings for Apache Spark DataFrame APIs
Stars: ✭ 136 (-18.07%)
Spark Tsne
Distributed t-SNE via Apache Spark
Stars: ✭ 151 (-9.04%)
Horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Stars: ✭ 11,943 (+7094.58%)
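Flink Learning
Flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink basics, concepts, principles, hands-on practice, performance tuning, and source code analysis. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, Table API & SQL, and more, as well as shared case studies of large production Flink projects (PVUV, log storage, real-time deduplication of tens of billions of records, monitoring and alerting). You are welcome to support my column "Big Data Real-Time Computing Engine: Flink in Practice and Performance Optimization".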
Stars: ✭ 11,378 (+6754.22%)
Seldon Server
Machine Learning Platform and Recommendation Engine built on Kubernetes
Stars: ✭ 1,435 (+764.46%)
Logigsk
A Linux-based software package to control LEDs on Logitech G910, G810, G610 and G410.
Stars: ✭ 107 (-35.54%)
Skip Ganomaly
Source code for Skip-GANomaly paper
Stars: ✭ 107 (-35.54%)
Spark On K8s Operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Stars: ✭ 1,780 (+972.29%)
Sparktutorial
Source code for James Lee's Apache Spark with Java course
Stars: ✭ 105 (-36.75%)
Deepai
Detection of Accounting Anomalies using Deep Autoencoder Neural Networks - A lab we prepared for NVIDIA's GPU Technology Conference 2018 that will walk you through the detection of accounting anomalies using deep autoencoder neural networks. The majority of the lab content is based on Jupyter Notebook, Python and PyTorch.
Stars: ✭ 104 (-37.35%)
Splash
Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
Stars: ✭ 105 (-36.75%)