arrow-datafusionApache Arrow DataFusion SQL Query Engine
Stars: ✭ 2,360 (+3.78%)
DatafusionDataFusion has now been donated to the Apache Arrow project
Stars: ✭ 611 (-73.13%)
deltaDDD-centric event-sourcing library for the JVM
Stars: ✭ 15 (-99.34%)
Xlearning Xdmlextremely distributed machine learning
Stars: ✭ 113 (-95.03%)
TitanoboaTitanoboa makes complex workflows easy. It is a low-code workflow orchestration platform for JVM - distributed, highly scalable and fault tolerant.
Stars: ✭ 787 (-65.39%)
MobiusC# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (-59.15%)
KoalasKoalas: pandas API on Apache Spark
Stars: ✭ 3,044 (+33.86%)
ModinModin: Speed up your Pandas workflows by changing a single line of code
Stars: ✭ 6,639 (+191.95%)
autThe Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (-95.12%)
H2o 3H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+148.72%)
SparklyrR interface for Apache Spark
Stars: ✭ 775 (-65.92%)
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-93.4%)
Js SparkRealtime calculation distributed system. AKA distributed lodash
Stars: ✭ 187 (-91.78%)
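PdfProgramming e-books and programming books, covering C, C#, Docker, Elasticsearch, Git, Hadoop, HeadFirst, Java, Javascript, jvm, Kafka, Linux, Maven, MongoDB, MyBatis, MySQL, Netty, Nginx, Python, RabbitMQ, Redis, Scala, Solr, Spark, Spring, SpringBoot, SpringCloud, TCPIP, Tomcat, Zookeeper, artificial intelligence, big data, concurrent programming, databases, data mining, new interview questions, architecture design, algorithms, computer science, design patterns, software testing, refactoring and optimization, and more categories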
Stars: ✭ 12,009 (+428.1%)
bowGo data analysis / manipulation library built on top of Apache Arrow
Stars: ✭ 20 (-99.12%)
Ruby SparkRuby wrapper for Apache Spark
Stars: ✭ 221 (-90.28%)
Java Notes📚 Computer science fundamentals, Java development, backend/server-side, and interview material 📚 computer-science/Java-development/backend/interview
Stars: ✭ 1,284 (-43.54%)
scalecube-configScaleCube Config is a configuration access management library for JVM based distributed applications
Stars: ✭ 15 (-99.34%)
polarsFast multi-threaded DataFrame library in Rust | Python | Node.js
Stars: ✭ 6,368 (+180.04%)
Spark DariaEssential Spark extensions and helper methods ✨😲
Stars: ✭ 553 (-75.68%)
Ytk LearnYtk-learn is a distributed machine learning library which implements most of popular machine learning algorithms(GBDT, GBRT, Mixture Logistic Regression, Gradient Boosting Soft Tree, Factorization Machines, Field-aware Factorization Machines, Logistic Regression, Softmax).
Stars: ✭ 337 (-85.18%)
Spark RedisA connector for Spark that allows reading and writing to/from Redis cluster
Stars: ✭ 773 (-66.01%)
Nd4jFast, Scientific and Numerical Computing for the JVM (NDArrays)
Stars: ✭ 1,742 (-23.39%)
GeniA Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-93.32%)
DiasporaA privacy-aware, distributed, open source social network.
Stars: ✭ 12,937 (+468.91%)
XsqlUnified SQL Analytics Engine Based on SparkSQL
Stars: ✭ 176 (-92.26%)
Kraps RpcAn RPC framework leveraging Spark RPC module
Stars: ✭ 175 (-92.3%)
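Zi5bookCrawler for all Kindle e-books on book.zi5.me, organized by author and book title; each book comes in mobi and epub formats, and the whole site is crawled in a distributed fashion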
Stars: ✭ 191 (-91.6%)
SparkFirely's open source FHIR server
Stars: ✭ 174 (-92.35%)
BigbenBigBen - a generic, multi-tenant, time-based event scheduler and cron scheduling framework
Stars: ✭ 174 (-92.35%)
Kotlin Spark ApiThis project gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x
Stars: ✭ 183 (-91.95%)
Spoon🥄 A package for building specific Proxy Pool for different Sites.
Stars: ✭ 173 (-92.39%)
Yvm[yvm] low performance garbage-collectable jvm
Stars: ✭ 173 (-92.39%)
TfmesosTensorflow in Docker on Mesos #tfmesos #tensorflow #mesos
Stars: ✭ 194 (-91.47%)
ScannsA scalable nearest neighbor search library in Apache Spark
Stars: ✭ 190 (-91.64%)
RoaringbitmapA better compressed bitset in Java
Stars: ✭ 2,460 (+8.18%)
Spark NlpState of the Art Natural Language Processing
Stars: ✭ 2,518 (+10.73%)
BastionHighly-available Distributed Fault-tolerant Runtime
Stars: ✭ 2,333 (+2.59%)
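Idworkeridworker is a distributed ID generation tool based on zookeeper and the snowflake algorithm; it registers machines automatically through zookeeper (up to 1024), with no need to specify workerId and datacenterId manually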
Stars: ✭ 171 (-92.48%)
LightgbmA fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
Stars: ✭ 13,293 (+484.56%)
ArewedistributedyetWebsite + Community effort to unlock the peer-to-peer web at arewedistributedyet.com ⚡🌐🔑
Stars: ✭ 189 (-91.69%)
XiaomiadbfastboottoolsA simple tool for managing Xiaomi devices on desktop using ADB and Fastboot
Stars: ✭ 2,810 (+23.57%)
Deeplearning4jSuite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learni…
Stars: ✭ 12,277 (+439.89%)
Node Jvmjava virtual machine in pure node.js
Stars: ✭ 2,053 (-9.72%)
PandasguiPandasGUI is a GUI for viewing, plotting and analyzing Pandas DataFrames.
Stars: ✭ 2,495 (+9.72%)
TransmogrifaiTransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Stars: ✭ 2,084 (-8.36%)
OnyxDistributed, masterless, high performance, fault tolerant data processing
Stars: ✭ 2,019 (-11.21%)
Inspectdf🛠️ 📊 Tools for Exploring and Comparing Data Frames
Stars: ✭ 195 (-91.42%)
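Interviewguide"Big Tech Interview Guide": covers Java basics, JVM, databases, mysql, redis, computer networking, algorithms, data structures, operating systems, design patterns, system design, and framework internals. Best read at: http://notfound9.github.io/interviewGuide/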
Stars: ✭ 3,117 (+37.07%)
MiraiandroidQQ bot / (experimental) runs Mirai-console on Android, with plugin support
Stars: ✭ 188 (-91.73%)
GroovyinactionSource code of the book Groovy in Action, 2nd edition
Stars: ✭ 181 (-92.04%)
PantheraData-frames & arrays on Clojure
Stars: ✭ 168 (-92.61%)
KatanaLightweight, minimalistic dependency injection library for Kotlin & Android
Stars: ✭ 181 (-92.04%)