MmlsparkSimple and Distributed Machine Learning
Stars: ✭ 2,899 (+1242.13%)
Spark TsneDistributed t-SNE via Apache Spark
Stars: ✭ 151 (-30.09%)
Kraps RpcA RPC framework leveraging Spark RPC module
Stars: ✭ 175 (-18.98%)
Benchm MlA minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).
Stars: ✭ 1,835 (+749.54%)
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-30.56%)
Spark NlpState of the Art Natural Language Processing
Stars: ✭ 2,518 (+1065.74%)
Cc PysparkProcess Common Crawl data with Python and Spark
Stars: ✭ 147 (-31.94%)
Spark PracticeApache Spark (PySpark) Practice on Real Data
Stars: ✭ 200 (-7.41%)
DocxtemplaterGenerate docx pptx and xlsx (Microsoft Word, Powerpoint, Excel documents) from templates, from Node.js, the Browser and the command line / Demo: https://www.docxtemplater.com/demo
Stars: ✭ 1,990 (+821.3%)
Deeplearning4jSuite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learni…
Stars: ✭ 12,277 (+5583.8%)
DatacompyPandas and Spark DataFrame comparison for humans
Stars: ✭ 147 (-31.94%)
SparkrdmaRDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (-0.46%)
Technology Talk汇总java生态圈常用技术框架、开源中间件,系统架构、数据库、大公司架构案例、常用三方类库、项目管理、线上问题排查、个人成长、思考等知识
Stars: ✭ 12,136 (+5518.52%)
DocumentserverONLYOFFICE Document Server is an online office suite comprising viewers and editors for texts, spreadsheets and presentations, fully compatible with Office Open XML formats: .docx, .xlsx, .pptx and enabling collaborative editing in real time.
Stars: ✭ 2,335 (+981.02%)
Xlsx.jlExcel file reader and writer coded in pure Julia.
Stars: ✭ 145 (-32.87%)
Calx.jsjQuery Calx - a jQuery plugin for creating formula-based calculation form
Stars: ✭ 190 (-12.04%)
Spark AuthorizerA Spark SQL extension which provides SQL Standard Authorization for Apache Spark
Stars: ✭ 141 (-34.72%)
RasterframesGeospatial Raster support for Spark DataFrames
Stars: ✭ 142 (-34.26%)
Vue Blog🎉 基于vue全家桶 + element-ui 构建的一个后台管理集成解决方案
Stars: ✭ 208 (-3.7%)
Azure Event Hubs SparkEnabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (-35.19%)
ExcelmapperMap POCO objects to Excel files
Stars: ✭ 166 (-23.15%)
Sparkling GraphSparklingGraph provides easy to use set of features that will give you ability to proces large scala graphs using Spark and GraphX.
Stars: ✭ 139 (-35.65%)
Js SparkRealtime calculation distributed system. AKA distributed lodash
Stars: ✭ 187 (-13.43%)
Tui.grid🍞🔡 The Powerful Component to Display and Edit Data. Experience the Ultimate Data Transformer!
Stars: ✭ 1,859 (+760.65%)
YargYet Another Report Generator - CUBA Platform reporting engine
Stars: ✭ 215 (-0.46%)
TransformalizeConfigurable Extract, Transform, and Load
Stars: ✭ 125 (-42.13%)
Sqlite2xlLibrary to Convert SQLite to Excel and Vice-Versa
Stars: ✭ 136 (-37.04%)
Xlwingsxlwings is a BSD-licensed Python library that makes it easy to call Python from Excel and vice versa. It works with Microsoft Excel on Windows and macOS.
Stars: ✭ 2,181 (+909.72%)
AriawaseAriawase is free library for VBA cowboys.
Stars: ✭ 185 (-14.35%)
HorovodDistributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Stars: ✭ 11,943 (+5429.17%)
Whylogs JavaProfile and monitor your ML data pipeline end-to-end
Stars: ✭ 164 (-24.07%)
PylightxlA light weight, zero dependency, minimal functionality excel read/writer python library
Stars: ✭ 134 (-37.96%)
Spark Knnk-Nearest Neighbors algorithm on Spark
Stars: ✭ 205 (-5.09%)
XlsxFast and reliable way to work with Microsoft Excel™ [xlsx] files in Golang
Stars: ✭ 132 (-38.89%)
Xresloader跨平台Excel导表工具(Excel=>protobuf/msgpack/lua/javascript/json/xml)
Stars: ✭ 161 (-25.46%)
AbrisAvro SerDe for Apache Spark structured APIs.
Stars: ✭ 130 (-39.81%)
RoaringbitmapA better compressed bitset in Java
Stars: ✭ 2,460 (+1038.89%)
Spylon KernelJupyter kernel for scala and spark
Stars: ✭ 129 (-40.28%)
LinkisLinkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,323 (+975.46%)
GafferA large-scale entity and relation database supporting aggregation of properties
Stars: ✭ 1,642 (+660.19%)
Excel to codeRoughly translate some Excel spreadsheets to Ruby or C.
Stars: ✭ 214 (-0.93%)
FeastFeature Store for Machine Learning
Stars: ✭ 2,576 (+1092.59%)
Excel Plus❇️ Improve the productivity of the Excel operation library. https://git.io/vNjQy
Stars: ✭ 160 (-25.93%)
OpenubaA robust, and flexible open source User & Entity Behavior Analytics (UEBA) framework used for Security Analytics. Developed with luv by Data Scientists & Security Analysts from the Cyber Security Industry. [PRE-ALPHA]
Stars: ✭ 127 (-41.2%)
LiftThe LinkedIn Fairness Toolkit (LiFT) is a Scala/Spark library that enables the measurement of fairness in large scale machine learning workflows.
Stars: ✭ 127 (-41.2%)
HadoopcryptoledgerHadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive
Stars: ✭ 126 (-41.67%)
Excel Parser ProcessorSimply does the tedious, repetitive operations for all rows of excel files step by step and reports after the job is done. It can download files from URL(s) in a column of Excel files. If a new filename is provided at column B it will rename the file before saving. It will even create sub folders if column C is full with a valid folder name.
Stars: ✭ 177 (-18.06%)
GeniA Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-29.63%)
HandysparkHandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ✭ 158 (-26.85%)
GimelBig Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (+0%)
FastexcelFast Excel Reading and Writing in .Net
Stars: ✭ 213 (-1.39%)