All Projects → Spark → Similar Projects or Alternatives

399 Open source projects that are alternatives of or similar to Spark

Isolation Forest
A Spark/Scala implementation of the isolation forest unsupervised outlier detection algorithm.
Stars: ✭ 139 (-20.11%)
Mutual labels:  spark
Opaque
An encrypted data analytics platform
Stars: ✭ 129 (-25.86%)
Mutual labels:  spark
Spark Ml Source Analysis
spark ml 算法原理剖析以及具体的源码实现分析
Stars: ✭ 1,873 (+976.44%)
Mutual labels:  spark
Data science blogs
A repository to keep track of all the code that I end up writing for my blog posts.
Stars: ✭ 139 (-20.11%)
Mutual labels:  spark
Lift
The LinkedIn Fairness Toolkit (LiFT) is a Scala/Spark library that enables the measurement of fairness in large scale machine learning workflows.
Stars: ✭ 127 (-27.01%)
Mutual labels:  spark
Quill
Compile-time Language Integrated Queries for Scala
Stars: ✭ 1,998 (+1048.28%)
Mutual labels:  spark
Horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Stars: ✭ 11,943 (+6763.79%)
Mutual labels:  spark
Linkis
Linkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,323 (+1235.06%)
Mutual labels:  spark
Airflow Pipeline
An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR
Stars: ✭ 128 (-26.44%)
Mutual labels:  spark
Cc Pyspark
Process Common Crawl data with Python and Spark
Stars: ✭ 147 (-15.52%)
Mutual labels:  spark
Spark Authorizer
A Spark SQL extension which provides SQL Standard Authorization for Apache Spark
Stars: ✭ 141 (-18.97%)
Mutual labels:  spark
Scala Samples
There are pieces of scala code that explain Scala syntax and related things - like what you can do with all this
Stars: ✭ 125 (-28.16%)
Mutual labels:  spark
Learningapachespark
LearningApacheSpark
Stars: ✭ 155 (-10.92%)
Mutual labels:  spark
Ecommercerecommendsystem
商品大数据实时推荐系统。前端:Vue + TypeScript + ElementUI,后端 Spring + Spark
Stars: ✭ 139 (-20.11%)
Mutual labels:  spark
Whylogs Java
Profile and monitor your ML data pipeline end-to-end
Stars: ✭ 164 (-5.75%)
Mutual labels:  spark
Spark On Lambda
Apache Spark on AWS Lambda
Stars: ✭ 137 (-21.26%)
Mutual labels:  spark
Powderkeg
Live-coding the cluster!
Stars: ✭ 152 (-12.64%)
Mutual labels:  spark
Iot Traffic Monitor
Stars: ✭ 131 (-24.71%)
Mutual labels:  spark
Spark Iforest
Isolation Forest on Spark
Stars: ✭ 166 (-4.6%)
Mutual labels:  spark
Spark
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+889.08%)
Mutual labels:  spark
Aztk
AZTK powered by Azure Batch: On-demand, Dockerized, Spark Jobs on Azure
Stars: ✭ 152 (-12.64%)
Mutual labels:  spark
Spring Boot Quick
🌿 基于springboot的快速学习示例,整合自己遇到的开源框架,如:rabbitmq(延迟队列)、Kafka、jpa、redies、oauth2、swagger、jsp、docker、spring-batch、异常处理、日志输出、多模块开发、多环境打包、缓存cache、爬虫、jwt、GraphQL、dubbo、zookeeper和Async等等📌
Stars: ✭ 1,819 (+945.4%)
Mutual labels:  spark
Glow
An open-source toolkit for large-scale genomic analysis
Stars: ✭ 159 (-8.62%)
Mutual labels:  spark
Hadoopcryptoledger
Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive
Stars: ✭ 126 (-27.59%)
Mutual labels:  spark
Datacompy
Pandas and Spark DataFrame comparison for humans
Stars: ✭ 147 (-15.52%)
Mutual labels:  spark
Nd4j
Fast, Scientific and Numerical Computing for the JVM (NDArrays)
Stars: ✭ 1,742 (+901.15%)
Mutual labels:  spark
Spark Infotheoretic Feature Selection
This package contains a generic implementation of greedy Information Theoretic Feature Selection (FS) methods. The implementation is based on the common theoretic framework presented by Gavin Brown. Implementations of mRMR, InfoGain, JMI and other commonly used FS filters are provided.
Stars: ✭ 123 (-29.31%)
Mutual labels:  spark
Geni
A Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-12.64%)
Mutual labels:  spark
Rasterframes
Geospatial Raster support for Spark DataFrames
Stars: ✭ 142 (-18.39%)
Mutual labels:  spark
Big Whale
Spark、Flink等离线任务的调度以及实时任务的监控
Stars: ✭ 163 (-6.32%)
Mutual labels:  spark
Azure Event Hubs Spark
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (-19.54%)
Mutual labels:  spark
Sparkmonitor
Monitor Apache Spark from Jupyter Notebook
Stars: ✭ 154 (-11.49%)
Mutual labels:  spark
Sparkling Graph
SparklingGraph provides easy to use set of features that will give you ability to proces large scala graphs using Spark and GraphX.
Stars: ✭ 139 (-20.11%)
Mutual labels:  spark
Spark Structured Streaming Examples
Spark Structured Streaming / Kafka / Cassandra / Elastic
Stars: ✭ 168 (-3.45%)
Mutual labels:  spark
Quicksql
A Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources
Stars: ✭ 1,821 (+946.55%)
Mutual labels:  spark
Spark.jl
Julia binding for Apache Spark
Stars: ✭ 153 (-12.07%)
Mutual labels:  spark
Apache Spark Node
Node.js bindings for Apache Spark DataFrame APIs
Stars: ✭ 136 (-21.84%)
Mutual labels:  spark
Bigdata docker
Big Data Ecosystem Docker
Stars: ✭ 161 (-7.47%)
Mutual labels:  spark
Aliyun Emapreduce Datasources
Extended datasource support for Spark/Hadoop on Aliyun E-MapReduce.
Stars: ✭ 132 (-24.14%)
Mutual labels:  spark
Spark Tsne
Distributed t-SNE via Apache Spark
Stars: ✭ 151 (-13.22%)
Mutual labels:  spark
Abris
Avro SerDe for Apache Spark structured APIs.
Stars: ✭ 130 (-25.29%)
Mutual labels:  spark
Deeplearning4j
Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learni…
Stars: ✭ 12,277 (+6955.75%)
Mutual labels:  spark
Spylon Kernel
Jupyter kernel for scala and spark
Stars: ✭ 129 (-25.86%)
Mutual labels:  spark
Benchm Ml
A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).
Stars: ✭ 1,835 (+954.6%)
Mutual labels:  spark
Gaffer
A large-scale entity and relation database supporting aggregation of properties
Stars: ✭ 1,642 (+843.68%)
Mutual labels:  spark
Vue Info Card
Simple and beautiful card component with an elegant spark line, for VueJS.
Stars: ✭ 159 (-8.62%)
Mutual labels:  spark
Feast
Feature Store for Machine Learning
Stars: ✭ 2,576 (+1380.46%)
Mutual labels:  spark
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-13.79%)
Mutual labels:  spark
Openuba
A robust, and flexible open source User & Entity Behavior Analytics (UEBA) framework used for Security Analytics. Developed with luv by Data Scientists & Security Analysts from the Cyber Security Industry. [PRE-ALPHA]
Stars: ✭ 127 (-27.01%)
Mutual labels:  spark
Geopyspark
GeoTrellis for PySpark
Stars: ✭ 167 (-4.02%)
Mutual labels:  spark
Cape Python
Collaborate on privacy-preserving policy for data science projects in Pandas and Apache Spark
Stars: ✭ 125 (-28.16%)
Mutual labels:  spark
Pyspark Learning
Updated repository
Stars: ✭ 147 (-15.52%)
Mutual labels:  spark
Spark Bigquery Connector
BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
Stars: ✭ 126 (-27.59%)
Mutual labels:  spark
Scalable Data Science Platform
Content for architecting a data science platform for products using Luigi, Spark & Flask.
Stars: ✭ 158 (-9.2%)
Mutual labels:  spark
Spark Cassandra Connector
DataStax Spark Cassandra Connector
Stars: ✭ 1,816 (+943.68%)
Mutual labels:  spark
Spark Nlp
State of the Art Natural Language Processing
Stars: ✭ 2,518 (+1347.13%)
Mutual labels:  spark
Transmogrifai
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Stars: ✭ 2,084 (+1097.7%)
Mutual labels:  spark
Azure Cosmosdb Spark
Apache Spark Connector for Azure Cosmos DB
Stars: ✭ 165 (-5.17%)
Mutual labels:  spark
Handyspark
HandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ✭ 158 (-9.2%)
Mutual labels:  spark
Technology Talk
汇总java生态圈常用技术框架、开源中间件,系统架构、数据库、大公司架构案例、常用三方类库、项目管理、线上问题排查、个人成长、思考等知识
Stars: ✭ 12,136 (+6874.71%)
Mutual labels:  spark
1-60 of 399 similar projects