Poseidon: A search engine which can hold 100 trillion lines of log data.
Stars: ✭ 1,793 (+2616.67%)
storm-ml: an online learning algorithm library for Storm
Stars: ✭ 18 (-72.73%)
muster: Massively Scalable Clustering
Stars: ✭ 22 (-66.67%)
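iot-master: IoT Master (物联大师) is a free, open-source IoT smart gateway system. It integrates standard Modbus, mainstream PLC protocols, and other protocols, and supports data acquisition, formula calculation, scheduled control, automatic control, fault alarms, traffic monitoring, web-based configuration, remote debugging, and more. It suits most IoT and industrial internet application scenarios.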
Stars: ✭ 119 (+80.3%)
yask: YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-difference methods and similar applications.
Stars: ✭ 81 (+22.73%)
MLBD: Materials for the "Machine Learning on Big Data" course
Stars: ✭ 20 (-69.7%)
bftkv: A distributed key-value store that tolerates Byzantine faults.
Stars: ✭ 27 (-59.09%)
couchdb-mango: Mirror of Apache CouchDB Mango
Stars: ✭ 34 (-48.48%)
mpiGraph: MPI benchmark to generate network bandwidth images
Stars: ✭ 17 (-74.24%)
matrix multiplication: Parallel Matrix Multiplication Using OpenMP, Pthreads, and MPI
Stars: ✭ 41 (-37.88%)
clusterdock: clusterdock is a framework for creating Docker-based container clusters
Stars: ✭ 26 (-60.61%)
fuzzball: Ongoing development of the Fuzzball MUCK server software and associated functionality.
Stars: ✭ 38 (-42.42%)
classifai: 🔥 One of the most comprehensive open-source data annotation platforms.
Stars: ✭ 99 (+50%)
PartitionedArrays.jl: Vectors and sparse matrices partitioned into pieces for parallel distributed-memory computations.
Stars: ✭ 45 (-31.82%)
ByteSlice: "Byteslice: Pushing the envelop of main memory data processing with a new storage layout" (SIGMOD'15)
Stars: ✭ 24 (-63.64%)
pyspark-cheatsheet: PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster
Stars: ✭ 115 (+74.24%)
mloperator: Machine Learning Operator & Controller for Kubernetes
Stars: ✭ 85 (+28.79%)
future.mapreduce: [EXPERIMENTAL] R package: future.mapreduce - Utility Functions for Future Map-Reduce API Packages
Stars: ✭ 12 (-81.82%)
xcast: A High-Performance Data Science Toolkit for the Earth Sciences
Stars: ✭ 28 (-57.58%)
hack parallel: The core parallel and shared memory library used by Hack, Flow, and Pyre
Stars: ✭ 39 (-40.91%)
arrow-datafusion: Apache Arrow DataFusion SQL Query Engine
Stars: ✭ 2,360 (+3475.76%)
SGDLibrary: MATLAB/Octave library for stochastic optimization algorithms: Version 1.0.20
Stars: ✭ 165 (+150%)
pystella: A code generator for grid-based PDE solving on CPUs and GPUs
Stars: ✭ 18 (-72.73%)
dbcsr: DBCSR: Distributed Block Compressed Sparse Row matrix library
Stars: ✭ 65 (-1.52%)
check-engine: Data validation library for PySpark 3.0.0
Stars: ✭ 29 (-56.06%)
EFDCPlus: www.eemodelingsystem.com
Stars: ✭ 9 (-86.36%)
opendc: Collaborative Datacenter Simulation and Exploration for Everybody
Stars: ✭ 40 (-39.39%)
PencilArrays.jl: Distributed Julia arrays using the MPI protocol
Stars: ✭ 40 (-39.39%)
subsemble: subsemble R package for ensemble learning on subsets of data
Stars: ✭ 40 (-39.39%)
tbslas: A parallel, fast solver for the scalar advection-diffusion and the incompressible Navier-Stokes equations based on a semi-Lagrangian/volume-integral method.
Stars: ✭ 21 (-68.18%)
SynapseML: Simple and Distributed Machine Learning
Stars: ✭ 3,355 (+4983.33%)
FoldsCUDA.jl: Data-parallelism on CUDA using Transducers.jl and for loops (FLoops.jl)
Stars: ✭ 48 (-27.27%)
hyper-engine: Python library for Bayesian hyper-parameter optimization
Stars: ✭ 80 (+21.21%)
es pytorch: High-performance implementation of deep neuroevolution in PyTorch using mpi4py. Intended for use on HPC clusters.
Stars: ✭ 20 (-69.7%)
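Big-Data-Demo: A data visualization showcase project based on Vue, three.js, and echarts, including interactive 3D model import, 3D model annotation, and other features.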
Stars: ✭ 146 (+121.21%)
pytorch kmeans: Implementation of the k-means algorithm in PyTorch that works for large datasets
Stars: ✭ 38 (-42.42%)
ImplicitGlobalGrid.jl: Almost trivial distributed parallelization of stencil-based GPU and CPU applications on a regular staggered grid
Stars: ✭ 88 (+33.33%)
talaria: TalariaDB is a distributed, highly available, and low latency time-series database for Presto
Stars: ✭ 148 (+124.24%)
big-data-lite: Samples for the Oracle Big Data Lite VM
Stars: ✭ 41 (-37.88%)
meetups-archivos: Slides, code, and videos from meetups, data science days, video calls, and workshops. Data Science Research is a non-profit organization that seeks to disseminate and decentralize knowledge of Data Science and Artificial Intelligence in Peru, giving opportunities to new talent through MeetUps, Workshops, and Semilleros …
Stars: ✭ 60 (-9.09%)
open-soql: Open source implementation of the SOQL.
Stars: ✭ 15 (-77.27%)
big data: A collection of tutorials on Hadoop, MapReduce, Spark, Docker
Stars: ✭ 34 (-48.48%)
libquo: Dynamic execution environments for coupled, thread-heterogeneous MPI+X applications
Stars: ✭ 21 (-68.18%)
wrangler: Wrangler Transform: A DMD system for transforming Big Data
Stars: ✭ 63 (-4.55%)
LoL-Match-Prediction: Win probability predictions for League of Legends matches using neural networks
Stars: ✭ 34 (-48.48%)
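v6.dooring.public: A large-screen data visualization dashboard solution that provides a visual editing engine, helping individuals and companies easily build their own custom dashboard applications.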
Stars: ✭ 323 (+389.39%)
predictionio: PredictionIO, a machine learning server for developers and ML engineers.
Stars: ✭ 12,510 (+18854.55%)
egis: Egis - a handy Ruby interface for AWS Athena
Stars: ✭ 38 (-42.42%)
big-sorter: Java library that sorts very large files of records by splitting into smaller sorted files and merging
Stars: ✭ 49 (-25.76%)
wxparaver: wxParaver is a trace-based visualization and analysis tool designed to study quantitative detailed metrics and obtain qualitative knowledge of the performance of applications, libraries, processors and whole architectures.
Stars: ✭ 23 (-65.15%)
falcon: Mirror of Apache Falcon
Stars: ✭ 95 (+43.94%)