xcast
A High-Performance Data Science Toolkit for the Earth Sciences
Stars: ✭ 28 (-42.86%)
image-sorter2
One-click image sorting/labelling script
Stars: ✭ 65 (+32.65%)
check-engine
Data validation library for PySpark 3.0.0
Stars: ✭ 29 (-40.82%)
Big-Data-Demo
A data visualization showcase project built on Vue, three.js, and echarts, featuring interactive 3D model import, 3D model annotation, and more
Stars: ✭ 146 (+197.96%)
pysorter
A command line utility for organizing files and directories according to regex patterns.
Stars: ✭ 40 (-18.37%)
wrangler
Wrangler Transform: A DMD system for transforming Big Data
Stars: ✭ 63 (+28.57%)
SGDLibrary
MATLAB/Octave library for stochastic optimization algorithms: Version 1.0.20
Stars: ✭ 165 (+236.73%)
priora
An Object Prioritization Utility for Ruby
Stars: ✭ 30 (-38.78%)
sparkucx
A high-performance, scalable and efficient ShuffleManager plugin for Apache Spark, utilizing UCX communication layer
Stars: ✭ 32 (-34.69%)
subsemble
subsemble R package for ensemble learning on subsets of data
Stars: ✭ 40 (-18.37%)
MLBD
Materials for "Machine Learning on Big Data" course
Stars: ✭ 20 (-59.18%)
CS Book
🔥 Latest computer science e-books. Downloads of the latest technical e-books. “All I want is to out-compete every one of you, or be out-competed by you!”
Stars: ✭ 40 (-18.37%)
pimpable
No description or website provided.
Stars: ✭ 102 (+108.16%)
talaria
TalariaDB is a distributed, highly available, and low latency time-series database for Presto
Stars: ✭ 148 (+202.04%)
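v6.dooring.public
A large-screen data visualization solution that provides a visual editing engine, helping individuals and companies easily build their own large-screen visualization applications.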
Stars: ✭ 323 (+559.18%)
arrow-datafusion
Apache Arrow DataFusion SQL Query Engine
Stars: ✭ 2,360 (+4716.33%)
predictionio
PredictionIO, a machine learning server for developers and ML engineers.
Stars: ✭ 12,510 (+25430.61%)
cloudberry
Big Data Visualization
Stars: ✭ 89 (+81.63%)
hyper-engine
Python library for Bayesian hyper-parameters optimization
Stars: ✭ 80 (+63.27%)
softpool
SoftPoolNet: Shape Descriptor for Point Cloud Completion and Classification - ECCV 2020 oral
Stars: ✭ 62 (+26.53%)
opendc
Collaborative Datacenter Simulation and Exploration for Everybody
Stars: ✭ 40 (-18.37%)
rastercube
rastercube is a Python library for big data analysis of georeferenced time series data (e.g. MODIS NDVI)
Stars: ✭ 15 (-69.39%)
cakephp-sequence
CakePHP plugin for maintaining a contiguous sequence of records
Stars: ✭ 41 (-16.33%)
SynapseML
Simple and Distributed Machine Learning
Stars: ✭ 3,355 (+6746.94%)
algorithms
Algorithms in Python and C
Stars: ✭ 71 (+44.9%)
ag-grid
The best JavaScript Data Table for building Enterprise Applications. Supports React / Angular / Vue / Plain JavaScript.
Stars: ✭ 8,743 (+17742.86%)
ByteSlice
"Byteslice: Pushing the envelop of main memory data processing with a new storage layout" (SIGMOD'15)
Stars: ✭ 24 (-51.02%)
couchdb-mango
Mirror of Apache CouchDB Mango
Stars: ✭ 34 (-30.61%)
meetups-archivos
Slides, code, and videos from the meetups, data science days, video calls, and workshops. Data Science Research is a non-profit organization that seeks to spread and decentralize knowledge of data science and artificial intelligence in Peru, giving opportunities to new talent through meetups, workshops, and research incubator groups …
Stars: ✭ 60 (+22.45%)
big data
A collection of tutorials on Hadoop, MapReduce, Spark, Docker
Stars: ✭ 34 (-30.61%)
LoL-Match-Prediction
Win probability predictions for League of Legends matches using neural networks
Stars: ✭ 34 (-30.61%)
falcon
Mirror of Apache Falcon
Stars: ✭ 95 (+93.88%)
clusterdock
clusterdock is a framework for creating Docker-based container clusters
Stars: ✭ 26 (-46.94%)
incubator-liminal
Apache Liminal's goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful experiment to an automated pipeline of model training, validation, deployment and inference in production. Liminal provides a Domain Specific Language to build ML workflows on top of Apache Airflow.
Stars: ✭ 117 (+138.78%)
picsort
Organize your photos by date in one click 👏
Stars: ✭ 22 (-55.1%)
nebula
A distributed block-based data storage and compute engine
Stars: ✭ 127 (+159.18%)
lua-algorithms
Lua algorithms library that covers commonly used data structures and algorithms
Stars: ✭ 57 (+16.33%)
beekeeper
Service for automatically managing and cleaning up unreferenced data
Stars: ✭ 43 (-12.24%)
type-comparator
Useful comparator functions written in TypeScript
Stars: ✭ 56 (+14.29%)
siembol
An open-source, real-time Security Information & Event Management tool based on big data technologies, providing a scalable, advanced security analytics framework.
Stars: ✭ 153 (+212.24%)
classifai
🔥 One of the most comprehensive open-source data annotation platforms.
Stars: ✭ 99 (+102.04%)
bftkv
A distributed key-value store that is tolerant of Byzantine faults.
Stars: ✭ 27 (-44.9%)
ultra-sort
DSL for SIMD Sorting on AVX2 & AVX512
Stars: ✭ 29 (-40.82%)
storm-ml
An online learning algorithm library for Storm
Stars: ✭ 18 (-63.27%)
JRCLUST
JRCLUST
Stars: ✭ 32 (-34.69%)
pyspark-cheatsheet
PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster
Stars: ✭ 115 (+134.69%)
big-data-lite
Samples to the Oracle Big Data Lite VM
Stars: ✭ 41 (-16.33%)
directed graph
Dart implementation of a directed graph. Provides algorithms for sorting vertices, retrieving a topological ordering or detecting cycles.
Stars: ✭ 37 (-24.49%)
ecto ranked
Ranking models for Ecto
Stars: ✭ 37 (-24.49%)