All Projects → Kotlin Spark Api → Similar Projects or Alternatives

529 Open source projects that are alternatives of or similar to Kotlin Spark Api

Opaque
An encrypted data analytics platform
Stars: ✭ 129 (-29.51%)
Mutual labels:  spark
Bigdata practice
大数据分析可视化实践
Stars: ✭ 166 (-9.29%)
Mutual labels:  bigdata
Spark Tsne
Distributed t-SNE via Apache Spark
Stars: ✭ 151 (-17.49%)
Mutual labels:  spark
Airflow Pipeline
An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR
Stars: ✭ 128 (-30.05%)
Mutual labels:  spark
Spark Nlp
State of the Art Natural Language Processing
Stars: ✭ 2,518 (+1275.96%)
Mutual labels:  spark
Spring Boot Quick
🌿 基于springboot的快速学习示例,整合自己遇到的开源框架,如:rabbitmq(延迟队列)、Kafka、jpa、redies、oauth2、swagger、jsp、docker、spring-batch、异常处理、日志输出、多模块开发、多环境打包、缓存cache、爬虫、jwt、GraphQL、dubbo、zookeeper和Async等等📌
Stars: ✭ 1,819 (+893.99%)
Mutual labels:  spark
Benchm Ml
A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).
Stars: ✭ 1,835 (+902.73%)
Mutual labels:  spark
Lift
The LinkedIn Fairness Toolkit (LiFT) is a Scala/Spark library that enables the measurement of fairness in large scale machine learning workflows.
Stars: ✭ 127 (-30.6%)
Mutual labels:  spark
Whylogs Java
Profile and monitor your ML data pipeline end-to-end
Stars: ✭ 164 (-10.38%)
Mutual labels:  spark
Volcano
A Cloud Native Batch System (Project under CNCF)
Stars: ✭ 2,114 (+1055.19%)
Mutual labels:  bigdata
Aztk
AZTK powered by Azure Batch: On-demand, Dockerized, Spark Jobs on Azure
Stars: ✭ 152 (-16.94%)
Mutual labels:  spark
Spring Shiro Spark
Spring-Shiro-Spark是Spring-Boot Hibernate Spark Spark-SQL Shiro iView VueJs... ...的集成尝试
Stars: ✭ 114 (-37.7%)
Mutual labels:  spark
Xsql
Unified SQL Analytics Engine Based on SparkSQL
Stars: ✭ 176 (-3.83%)
Mutual labels:  spark
Scala Samples
There are pieces of scala code that explain Scala syntax and related things - like what you can do with all this
Stars: ✭ 125 (-31.69%)
Mutual labels:  spark
Athenacli
AthenaCLI is a CLI tool for AWS Athena service that can do auto-completion and syntax highlighting.
Stars: ✭ 151 (-17.49%)
Mutual labels:  bigdata
Spark Alchemy
Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
Stars: ✭ 122 (-33.33%)
Mutual labels:  spark
Linkis
Linkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,323 (+1169.4%)
Mutual labels:  spark
Zparkio
Boiler plate framework to use Spark and ZIO together.
Stars: ✭ 121 (-33.88%)
Mutual labels:  spark
Avro
Apache Avro is a data serialization system.
Stars: ✭ 2,005 (+995.63%)
Mutual labels:  bigdata
Example Spark Kafka
Apache Spark and Apache Kafka integration example
Stars: ✭ 120 (-34.43%)
Mutual labels:  spark
Transmogrifai
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Stars: ✭ 2,084 (+1038.8%)
Mutual labels:  spark
Kinesis Sql
Kinesis Connector for Structured Streaming
Stars: ✭ 120 (-34.43%)
Mutual labels:  spark
Datacompy
Pandas and Spark DataFrame comparison for humans
Stars: ✭ 147 (-19.67%)
Mutual labels:  spark
Ibis
A pandas-like deferred expression system, with first-class SQL support
Stars: ✭ 1,630 (+790.71%)
Mutual labels:  spark
Java Notes
☕️ Java 基础 👫 面向对象思想✏️ 算法 📝 操作系统 ☁️ 网络 💾 数据库 🙊 Spring 💡 系统架构🐘大数据
Stars: ✭ 160 (-12.57%)
Mutual labels:  bigdata
Spark Lucenerdd
Spark RDD with Lucene's query and entity linkage capabilities
Stars: ✭ 114 (-37.7%)
Mutual labels:  spark
Poli
An easy-to-use BI server built for SQL lovers. Power data analysis in SQL and gain faster business insights.
Stars: ✭ 1,850 (+910.93%)
Mutual labels:  bigdata
Spark Streaming With Kafka
Self-contained examples of Apache Spark streaming integrated with Apache Kafka.
Stars: ✭ 180 (-1.64%)
Mutual labels:  spark
Spark Mllib Twitter Sentiment Analysis
🌟 ✨ Analyze and visualize Twitter Sentiment on a world map using Spark MLlib
Stars: ✭ 113 (-38.25%)
Mutual labels:  spark
Spark Kafka Writer
Write your Spark data to Kafka seamlessly
Stars: ✭ 175 (-4.37%)
Mutual labels:  spark
Spark Structured Streaming Examples
Spark Structured Streaming / Kafka / Cassandra / Elastic
Stars: ✭ 168 (-8.2%)
Mutual labels:  spark
Glow
An open-source toolkit for large-scale genomic analysis
Stars: ✭ 159 (-13.11%)
Mutual labels:  spark
Technology Talk
汇总java生态圈常用技术框架、开源中间件,系统架构、数据库、大公司架构案例、常用三方类库、项目管理、线上问题排查、个人成长、思考等知识
Stars: ✭ 12,136 (+6531.69%)
Mutual labels:  spark
Xlearning Xdml
extremely distributed machine learning
Stars: ✭ 113 (-38.25%)
Mutual labels:  spark
Liteflow
liteflow是一个基于任务版本来实现的分布式任务流调度系统
Stars: ✭ 112 (-38.8%)
Mutual labels:  bigdata
Nd4j
Fast, Scientific and Numerical Computing for the JVM (NDArrays)
Stars: ✭ 1,742 (+851.91%)
Mutual labels:  spark
Python Bigdata
Data science and Big Data with Python
Stars: ✭ 112 (-38.8%)
Mutual labels:  spark
Genie
Distributed Big Data Orchestration Service
Stars: ✭ 1,544 (+743.72%)
Mutual labels:  bigdata
Scalable Data Science Platform
Content for architecting a data science platform for products using Luigi, Spark & Flask.
Stars: ✭ 158 (-13.66%)
Mutual labels:  spark
Spark Authorizer
A Spark SQL extension which provides SQL Standard Authorization for Apache Spark
Stars: ✭ 141 (-22.95%)
Mutual labels:  spark
Archivespark
An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.
Stars: ✭ 111 (-39.34%)
Mutual labels:  spark
Rasterframes
Geospatial Raster support for Spark DataFrames
Stars: ✭ 142 (-22.4%)
Mutual labels:  spark
Elephas
Distributed Deep learning with Keras & Spark
Stars: ✭ 1,521 (+731.15%)
Mutual labels:  spark
Waterdrop
Production Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+914.21%)
Mutual labels:  spark
Spark Iforest
Isolation Forest on Spark
Stars: ✭ 166 (-9.29%)
Mutual labels:  spark
Handyspark
HandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ✭ 158 (-13.66%)
Mutual labels:  spark
Big Data Study
🐳 big data study
Stars: ✭ 141 (-22.95%)
Mutual labels:  bigdata
Java learning practice
java 进阶之路:面试高频算法、akka、多线程、NIO、Netty、SpringBoot、Spark&&Flink 等
Stars: ✭ 110 (-39.89%)
Mutual labels:  spark
Bigdataclass
Two-day workshop that covers how to use R to interact databases and Spark
Stars: ✭ 110 (-39.89%)
Mutual labels:  spark
Data science blogs
A repository to keep track of all the code that I end up writing for my blog posts.
Stars: ✭ 139 (-24.04%)
Mutual labels:  spark
Books
技术书籍等
Stars: ✭ 110 (-39.89%)
Mutual labels:  bigdata
Geni
A Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-16.94%)
Mutual labels:  spark
Flinkstreamsql
基于开源的flink,对其实时sql进行扩展;主要实现了流与维表的join,支持原生flink SQL所有的语法
Stars: ✭ 1,682 (+819.13%)
Mutual labels:  bigdata
Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-40.44%)
Mutual labels:  bigdata
Parquet Index
Spark SQL index for Parquet tables
Stars: ✭ 109 (-40.44%)
Mutual labels:  spark
Daudit
🌲 Configuration flaws detector for Hadoop, MongoDB, MySQL, and more!
Stars: ✭ 108 (-40.98%)
Mutual labels:  bigdata
Distributed Dataset
A distributed data processing framework in Haskell.
Stars: ✭ 108 (-40.98%)
Mutual labels:  spark
Kraps Rpc
A RPC framework leveraging Spark RPC module
Stars: ✭ 175 (-4.37%)
Mutual labels:  spark
Geopyspark
GeoTrellis for PySpark
Stars: ✭ 167 (-8.74%)
Mutual labels:  spark
Learningapachespark
LearningApacheSpark
Stars: ✭ 155 (-15.3%)
Mutual labels:  spark
61-120 of 529 similar projects