All Projects → Ballista → Similar Projects or Alternatives

1203 Open source projects that are alternatives of or similar to Ballista

arrow-datafusion
Apache Arrow DataFusion SQL Query Engine
Stars: ✭ 2,360 (+3.78%)
Mutual labels:  arrow, dataframe, datafusion
Datafusion
DataFusion has now been donated to the Apache Arrow project
Stars: ✭ 611 (-73.13%)
Mutual labels:  dataframe, spark, arrow
delta
DDD-centric event-sourcing library for the JVM
Stars: ✭ 15 (-99.34%)
Mutual labels:  jvm, distributed
Net.jgp.labs.spark
Apache Spark examples exclusively in Java
Stars: ✭ 55 (-97.58%)
Mutual labels:  dataframe, spark
Xlearning Xdml
extremely distributed machine learning
Stars: ✭ 113 (-95.03%)
Mutual labels:  spark, distributed
Titanoboa
Titanoboa makes complex workflows easy. It is a low-code workflow orchestration platform for JVM - distributed, highly scalable and fault tolerant.
Stars: ✭ 787 (-65.39%)
Mutual labels:  jvm, distributed
Mobius
C# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (-59.15%)
Mutual labels:  dataframe, spark
Koalas
Koalas: pandas API on Apache Spark
Stars: ✭ 3,044 (+33.86%)
Mutual labels:  dataframe, spark
Modin
Modin: Speed up your Pandas workflows by changing a single line of code
Stars: ✭ 6,639 (+191.95%)
Mutual labels:  dataframe, distributed
aut
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (-95.12%)
Mutual labels:  spark, dataframe
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+148.72%)
Mutual labels:  spark, distributed
Sparklyr
R interface for Apache Spark
Stars: ✭ 775 (-65.92%)
Mutual labels:  spark, distributed
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-93.4%)
Mutual labels:  dataframe, spark
Distributed Dataset
A distributed data processing framework in Haskell.
Stars: ✭ 108 (-95.25%)
Mutual labels:  spark, distributed
Js Spark
Realtime calculation distributed system. AKA distributed lodash
Stars: ✭ 187 (-91.78%)
Mutual labels:  spark, distributed
Pdf
编程电子书,电子书,编程书籍,包括C,C#,Docker,Elasticsearch,Git,Hadoop,HeadFirst,Java,Javascript,jvm,Kafka,Linux,Maven,MongoDB,MyBatis,MySQL,Netty,Nginx,Python,RabbitMQ,Redis,Scala,Solr,Spark,Spring,SpringBoot,SpringCloud,TCPIP,Tomcat,Zookeeper,人工智能,大数据类,并发编程,数据库类,数据挖掘,新面试题,架构设计,算法系列,计算机类,设计模式,软件测试,重构优化,等更多分类
Stars: ✭ 12,009 (+428.1%)
Mutual labels:  spark, jvm
bow
Go data analysis / manipulation library built on top of Apache Arrow
Stars: ✭ 20 (-99.12%)
Mutual labels:  arrow, dataframe
Ruby Spark
Ruby wrapper for Apache Spark
Stars: ✭ 221 (-90.28%)
Mutual labels:  spark, distributed
Java Notes
📚 计算机科学基础知识、Java开发、后端/服务端、面试相关 📚 computer-science/Java-development/backend/interview
Stars: ✭ 1,284 (-43.54%)
Mutual labels:  jvm, distributed
scalecube-config
ScaleCube Config is a configuration access management library for JVM based distributed applications
Stars: ✭ 15 (-99.34%)
Mutual labels:  jvm, distributed
polars
Fast multi-threaded DataFrame library in Rust | Python | Node.js
Stars: ✭ 6,368 (+180.04%)
Mutual labels:  arrow, dataframe
Spark Daria
Essential Spark extensions and helper methods ✨😲
Stars: ✭ 553 (-75.68%)
Mutual labels:  dataframe, spark
Ytk Learn
Ytk-learn is a distributed machine learning library which implements most of popular machine learning algorithms(GBDT, GBRT, Mixture Logistic Regression, Gradient Boosting Soft Tree, Factorization Machines, Field-aware Factorization Machines, Logistic Regression, Softmax).
Stars: ✭ 337 (-85.18%)
Mutual labels:  spark, distributed
Spark Redis
A connector for Spark that allows reading and writing to/from Redis cluster
Stars: ✭ 773 (-66.01%)
Mutual labels:  dataframe, spark
Nd4j
Fast, Scientific and Numerical Computing for the JVM (NDArrays)
Stars: ✭ 1,742 (-23.39%)
Mutual labels:  spark, jvm
Geni
A Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-93.32%)
Mutual labels:  dataframe, spark
Diaspora
A privacy-aware, distributed, open source social network.
Stars: ✭ 12,937 (+468.91%)
Mutual labels:  distributed
Xsql
Unified SQL Analytics Engine Based on SparkSQL
Stars: ✭ 176 (-92.26%)
Mutual labels:  spark
Spark Kafka Writer
Write your Spark data to Kafka seamlessly
Stars: ✭ 175 (-92.3%)
Mutual labels:  spark
Kraps Rpc
A RPC framework leveraging Spark RPC module
Stars: ✭ 175 (-92.3%)
Mutual labels:  spark
Zi5book
book.zi5.me全站kindle电子书籍爬取,按照作者书籍名分类,每本书有mobi和equb两种格式,采用分布式进行全站爬取
Stars: ✭ 191 (-91.6%)
Mutual labels:  distributed
Learningnotes
Enjoy Learning.
Stars: ✭ 12,682 (+457.7%)
Mutual labels:  jvm
Spark
Firely's open source FHIR server
Stars: ✭ 174 (-92.35%)
Mutual labels:  spark
Bigben
BigBen - a generic, multi-tenant, time-based event scheduler and cron scheduling framework
Stars: ✭ 174 (-92.35%)
Mutual labels:  distributed
Kotlin Spark Api
This projects gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x
Stars: ✭ 183 (-91.95%)
Mutual labels:  spark
Spoon
🥄 A package for building specific Proxy Pool for different Sites.
Stars: ✭ 173 (-92.39%)
Mutual labels:  distributed
Yvm
[yvm] low performance garbage-collectable jvm
Stars: ✭ 173 (-92.39%)
Mutual labels:  jvm
Tfmesos
Tensorflow in Docker on Mesos #tfmesos #tensorflow #mesos
Stars: ✭ 194 (-91.47%)
Mutual labels:  distributed
Scanns
A scalable nearest neighbor search library in Apache Spark
Stars: ✭ 190 (-91.64%)
Mutual labels:  spark
Roaringbitmap
A better compressed bitset in Java
Stars: ✭ 2,460 (+8.18%)
Mutual labels:  spark
Spark Nlp
State of the Art Natural Language Processing
Stars: ✭ 2,518 (+10.73%)
Mutual labels:  spark
Ditching Excel For Python
Functionalities in Excel translated to Python
Stars: ✭ 172 (-92.44%)
Mutual labels:  dataframe
Bastion
Highly-available Distributed Fault-tolerant Runtime
Stars: ✭ 2,333 (+2.59%)
Mutual labels:  distributed
Idworker
idworker 是一个基于zookeeper和snowflake算法的分布式ID生成工具,通过zookeeper自动注册机器(最多1024台),无需手动指定workerId和datacenterId
Stars: ✭ 171 (-92.48%)
Mutual labels:  distributed
Lightgbm
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
Stars: ✭ 13,293 (+484.56%)
Mutual labels:  distributed
Arewedistributedyet
Website + Community effort to unlock the peer-to-peer web at arewedistributedyet.com ⚡🌐🔑
Stars: ✭ 189 (-91.69%)
Mutual labels:  distributed
Xiaomiadbfastboottools
A simple tool for managing Xiaomi devices on desktop using ADB and Fastboot
Stars: ✭ 2,810 (+23.57%)
Mutual labels:  jvm
Deeplearning4j
Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learni…
Stars: ✭ 12,277 (+439.89%)
Mutual labels:  spark
Node Jvm
java virtual machine in pure node.js
Stars: ✭ 2,053 (-9.72%)
Mutual labels:  jvm
Pandasgui
PandasGUI is a GUI for viewing, plotting and analyzing Pandas DataFrames.
Stars: ✭ 2,495 (+9.72%)
Mutual labels:  dataframe
Transmogrifai
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Stars: ✭ 2,084 (-8.36%)
Mutual labels:  spark
Onyx
Distributed, masterless, high performance, fault tolerant data processing
Stars: ✭ 2,019 (-11.21%)
Mutual labels:  distributed
Inspectdf
🛠️ 📊 Tools for Exploring and Comparing Data Frames
Stars: ✭ 195 (-91.42%)
Mutual labels:  dataframe
Interviewguide
《大厂面试指北》——包括Java基础、JVM、数据库、mysql、redis、计算机网络、算法、数据结构、操作系统、设计模式、系统设计、框架原理。最佳阅读地址:http://notfound9.github.io/interviewGuide/
Stars: ✭ 3,117 (+37.07%)
Mutual labels:  jvm
Miraiandroid
QQ机器人 /(实验性)在Android上运行Mirai-console,支持插件
Stars: ✭ 188 (-91.73%)
Mutual labels:  jvm
Groovyinaction
Source code of the book Groovy in Action, 2nd edition
Stars: ✭ 181 (-92.04%)
Mutual labels:  jvm
Panthera
Data-frames & arrays on Clojure
Stars: ✭ 168 (-92.61%)
Mutual labels:  dataframe
Spark Structured Streaming Examples
Spark Structured Streaming / Kafka / Cassandra / Elastic
Stars: ✭ 168 (-92.61%)
Mutual labels:  spark
Katana
Lightweight, minimalistic dependency injection library for Kotlin & Android
Stars: ✭ 181 (-92.04%)
Mutual labels:  jvm
Spark Iforest
Isolation Forest on Spark
Stars: ✭ 166 (-92.7%)
Mutual labels:  spark
1-60 of 1203 similar projects