All Projects → Example Spark → Similar Projects or Alternatives

435 Open source projects that are alternatives of or similar to Example Spark

Glow
An open-source toolkit for large-scale genomic analysis
Stars: ✭ 159 (-22.44%)
Mutual labels:  spark
Iot Traffic Monitor
Stars: ✭ 131 (-36.1%)
Mutual labels:  spark
Spark Kafka Writer
Write your Spark data to Kafka seamlessly
Stars: ✭ 175 (-14.63%)
Mutual labels:  spark
Opaque
An encrypted data analytics platform
Stars: ✭ 129 (-37.07%)
Mutual labels:  spark
Handyspark
HandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ✭ 158 (-22.93%)
Mutual labels:  spark
Azuredatabricksbestpractices
Version 1 of Technical Best Practices of Azure Databricks based on real world Customer and Technical SME inputs
Stars: ✭ 186 (-9.27%)
Mutual labels:  spark
Airflow Pipeline
An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR
Stars: ✭ 128 (-37.56%)
Mutual labels:  spark
Learningapachespark
LearningApacheSpark
Stars: ✭ 155 (-24.39%)
Mutual labels:  spark
Spring Boot Quick
🌿 基于springboot的快速学习示例,整合自己遇到的开源框架,如:rabbitmq(延迟队列)、Kafka、jpa、redies、oauth2、swagger、jsp、docker、spring-batch、异常处理、日志输出、多模块开发、多环境打包、缓存cache、爬虫、jwt、GraphQL、dubbo、zookeeper和Async等等📌
Stars: ✭ 1,819 (+787.32%)
Mutual labels:  spark
Spark
Firely's open source FHIR server
Stars: ✭ 174 (-15.12%)
Mutual labels:  spark
Lift
The LinkedIn Fairness Toolkit (LiFT) is a Scala/Spark library that enables the measurement of fairness in large scale machine learning workflows.
Stars: ✭ 127 (-38.05%)
Mutual labels:  spark
Movie recommend
基于Spark的电影推荐系统,包含爬虫项目、web网站、后台管理系统以及spark推荐系统
Stars: ✭ 2,092 (+920.49%)
Mutual labels:  spark-streaming
Hadoopcryptoledger
Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive
Stars: ✭ 126 (-38.54%)
Mutual labels:  spark
Spark Practice
Apache Spark (PySpark) Practice on Real Data
Stars: ✭ 200 (-2.44%)
Mutual labels:  spark
Scala Samples
There are pieces of scala code that explain Scala syntax and related things - like what you can do with all this
Stars: ✭ 125 (-39.02%)
Mutual labels:  spark
Spark.jl
Julia binding for Apache Spark
Stars: ✭ 153 (-25.37%)
Mutual labels:  spark
Spark Alchemy
Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
Stars: ✭ 122 (-40.49%)
Mutual labels:  spark
Spark Nlp
State of the Art Natural Language Processing
Stars: ✭ 2,518 (+1128.29%)
Mutual labels:  spark
Zparkio
Boiler plate framework to use Spark and ZIO together.
Stars: ✭ 121 (-40.98%)
Mutual labels:  spark
Streamline
StreamLine - Streaming Analytics
Stars: ✭ 151 (-26.34%)
Mutual labels:  spark-streaming
Kotlin Spark Api
This projects gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x
Stars: ✭ 183 (-10.73%)
Mutual labels:  spark
Pyspark Cheatsheet
🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (-47.32%)
Mutual labels:  spark
Spark Ml Source Analysis
spark ml 算法原理剖析以及具体的源码实现分析
Stars: ✭ 1,873 (+813.66%)
Mutual labels:  spark
Ibis
A pandas-like deferred expression system, with first-class SQL support
Stars: ✭ 1,630 (+695.12%)
Mutual labels:  spark
Transmogrifai
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Stars: ✭ 2,084 (+916.59%)
Mutual labels:  spark
Spark Lucenerdd
Spark RDD with Lucene's query and entity linkage capabilities
Stars: ✭ 114 (-44.39%)
Mutual labels:  spark
Aztk
AZTK powered by Azure Batch: On-demand, Dockerized, Spark Jobs on Azure
Stars: ✭ 152 (-25.85%)
Mutual labels:  spark
Nd4j
Fast, Scientific and Numerical Computing for the JVM (NDArrays)
Stars: ✭ 1,742 (+749.76%)
Mutual labels:  spark
Flink Learning
flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》
Stars: ✭ 11,378 (+5450.24%)
Mutual labels:  spark
Javaorbigdata Interview
Java开发者或者大数据开发者面试知识点整理
Stars: ✭ 203 (-0.98%)
Mutual labels:  spark
Python Bigdata
Data science and Big Data with Python
Stars: ✭ 112 (-45.37%)
Mutual labels:  spark
Cc Pyspark
Process Common Crawl data with Python and Spark
Stars: ✭ 147 (-28.29%)
Mutual labels:  spark
Spark Iforest
Isolation Forest on Spark
Stars: ✭ 166 (-19.02%)
Mutual labels:  spark
Logigsk
A Linux based software package to control led's on Logitech G910, G810, G610 and G410.
Stars: ✭ 107 (-47.8%)
Mutual labels:  spark
Datacompy
Pandas and Spark DataFrame comparison for humans
Stars: ✭ 147 (-28.29%)
Mutual labels:  spark
Bigdataclass
Two-day workshop that covers how to use R to interact databases and Spark
Stars: ✭ 110 (-46.34%)
Mutual labels:  spark
Sparkstreaming
💥 🚀 封装sparkstreaming动态调节batch time(有数据就执行计算);🚀 支持运行过程中增删topic;🚀 封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。
Stars: ✭ 179 (-12.68%)
Mutual labels:  spark
Big Whale
Spark、Flink等离线任务的调度以及实时任务的监控
Stars: ✭ 163 (-20.49%)
Mutual labels:  spark
Rasterframes
Geospatial Raster support for Spark DataFrames
Stars: ✭ 142 (-30.73%)
Mutual labels:  spark
Sparktutorial
Source code for James Lee's Aparch Spark with Java course
Stars: ✭ 105 (-48.78%)
Mutual labels:  spark
Distributed Dataset
A distributed data processing framework in Haskell.
Stars: ✭ 108 (-47.32%)
Mutual labels:  spark
Technology Talk
汇总java生态圈常用技术框架、开源中间件,系统架构、数据库、大公司架构案例、常用三方类库、项目管理、线上问题排查、个人成长、思考等知识
Stars: ✭ 12,136 (+5820%)
Mutual labels:  spark
Hnswlib
Java library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
Stars: ✭ 108 (-47.32%)
Mutual labels:  spark
Azure Cosmosdb Spark
Apache Spark Connector for Azure Cosmos DB
Stars: ✭ 165 (-19.51%)
Mutual labels:  spark
Seldon Server
Machine Learning Platform and Recommendation Engine built on Kubernetes
Stars: ✭ 1,435 (+600%)
Mutual labels:  spark
Spark Authorizer
A Spark SQL extension which provides SQL Standard Authorization for Apache Spark
Stars: ✭ 141 (-31.22%)
Mutual labels:  spark
Spark On K8s Operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Stars: ✭ 1,780 (+768.29%)
Mutual labels:  spark
Scanns
A scalable nearest neighbor search library in Apache Spark
Stars: ✭ 190 (-7.32%)
Mutual labels:  spark
Splash
Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
Stars: ✭ 105 (-48.78%)
Mutual labels:  spark
Data science blogs
A repository to keep track of all the code that I end up writing for my blog posts.
Stars: ✭ 139 (-32.2%)
Mutual labels:  spark
Spark Terasort
Spark Terasort
Stars: ✭ 101 (-50.73%)
Mutual labels:  spark
Whylogs Java
Profile and monitor your ML data pipeline end-to-end
Stars: ✭ 164 (-20%)
Mutual labels:  spark
Spark Ffm
FFM (Field-Awared Factorization Machine) on Spark
Stars: ✭ 101 (-50.73%)
Mutual labels:  spark
Bigdata Notebook
Stars: ✭ 100 (-51.22%)
Mutual labels:  spark
Bigdata Notes
大数据入门指南 ⭐
Stars: ✭ 10,991 (+5261.46%)
Mutual labels:  spark
Ecommercerecommendsystem
商品大数据实时推荐系统。前端:Vue + TypeScript + ElementUI,后端 Spring + Spark
Stars: ✭ 139 (-32.2%)
Mutual labels:  spark
Almond
A Scala kernel for Jupyter
Stars: ✭ 1,354 (+560.49%)
Mutual labels:  spark
Logisland
Scalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (-52.68%)
Mutual labels:  spark
Bigdata Playground
A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (-13.66%)
Mutual labels:  spark-streaming
Bigdata docker
Big Data Ecosystem Docker
Stars: ✭ 161 (-21.46%)
Mutual labels:  spark
61-120 of 435 similar projects