GlusterfsGluster Filesystem : Build your distributed storage in minutes
Stars: ✭ 3,437 (+235.32%)
InfinitThe Infinit policy-based software-defined storage platform.
Stars: ✭ 363 (-64.59%)
HazelcastOpen-source distributed computation and storage platform
Stars: ✭ 4,662 (+354.83%)
lustre-releaseMirror of official Lustre development repository http://git.whamcloud.com/
Stars: ✭ 35 (-96.59%)
Clustering4EverC4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.
Stars: ✭ 126 (-87.71%)
LizardfsLizardFS is an Open Source Distributed File System licensed under GPLv3.
Stars: ✭ 793 (-22.63%)
fusell-seedFUSE (the low-level interface) file system boilerplate 📂 🔌 💾
Stars: ✭ 13 (-98.73%)
Goofysa high-performance, POSIX-ish Amazon S3 file system written in Go
Stars: ✭ 3,932 (+283.61%)
OzoneScalable, redundant, and distributed object store for Apache Hadoop
Stars: ✭ 330 (-67.8%)
CortxCORTX Community Object Storage is 100% open source object storage uniquely optimized for mass capacity storage devices.
Stars: ✭ 426 (-58.44%)
Go Fastdfsgo-fastdfs 是一个简单的分布式文件系统(私有云存储),具有无中心、高性能,高可靠,免维护等优点,支持断点续传,分块上传,小文件合并,自动同步,自动修复。Go-fastdfs is a simple distributed file system (private cloud storage), with no center, high performance, high reliability, maintenance free and other advantages, support breakpoint continuation, block upload, small file merge, automatic synchronization, automatic r…
Stars: ✭ 2,923 (+185.17%)
ChubaofsChubaoFS (abbrev. CBFS) is a cloud native distributed file system and object store.
Stars: ✭ 2,482 (+142.15%)
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-85.37%)
SeaweedfsSeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding.
Stars: ✭ 13,380 (+1205.37%)
JuicefsJuiceFS is a distributed POSIX file system built on top of Redis and S3.
Stars: ✭ 4,262 (+315.8%)
cubefsCubeFS is a cloud native distributed storage platform.
Stars: ✭ 3,062 (+198.73%)
Fusell SeedFUSE (the low-level interface) file system boilerplate 📂 🔌 💾
Stars: ✭ 9 (-99.12%)
Flydrive☁️ Flexible and Fluent framework-agnostic driver based system to manage storage in Node.js
Stars: ✭ 275 (-73.17%)
TrinoOfficial repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Stars: ✭ 4,581 (+346.93%)
BtfsA bittorrent filesystem based on FUSE.
Stars: ✭ 2,984 (+191.12%)
CloudbreakA tool for provisioning and managing Apache Hadoop clusters in the cloud. Cloudbreak, as part of the Hortonworks Data Platform, makes it easy to provision, configure and elastically grow HDP clusters on cloud infrastructure. Cloudbreak can be used to provision Hadoop across cloud infrastructure providers including AWS, Azure, GCP and OpenStack.
Stars: ✭ 301 (-70.63%)
Jnr FuseFUSE implementation in Java using Java Native Runtime (JNR)
Stars: ✭ 266 (-74.05%)
ShrineFile Attachment toolkit for Ruby applications
Stars: ✭ 2,903 (+183.22%)
WinfspWindows File System Proxy - FUSE for Windows
Stars: ✭ 4,071 (+297.17%)
TezApache Tez
Stars: ✭ 313 (-69.46%)
DiskoverFile system crawler, disk space usage, file search engine and file system analytics powered by Elasticsearch
Stars: ✭ 977 (-4.68%)
ElasticlusterCreate clusters of VMs on the cloud and configure them with Ansible.
Stars: ✭ 298 (-70.93%)
CoherenceOracle Coherence Community Edition
Stars: ✭ 328 (-68%)
vzvolvzvol is a general use ZFS zvol management tool, that handles creation, destruction, listing, and formatting with various FSes, in an easy to use single program
Stars: ✭ 27 (-97.37%)
Protoactor GoProto Actor - Ultra fast distributed actors for Go, C# and Java/Kotlin
Stars: ✭ 3,934 (+283.8%)
MetorikkuA simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (-64.78%)
SvfsThe Swift Virtual File System
Stars: ✭ 375 (-63.41%)
HiveApache Hive
Stars: ✭ 4,031 (+293.27%)
Linstor ServerHigh Performance Software-Defined Block Storage for container, cloud and virtualisation. Fully integrated with Docker, Kubernetes, Openstack, Proxmox etc.
Stars: ✭ 374 (-63.51%)
IgniteApache Ignite
Stars: ✭ 4,027 (+292.88%)
BigdlBuilding Large-Scale AI Applications for Distributed Big Data
Stars: ✭ 3,813 (+272%)
FilerNode-like file system for browsers
Stars: ✭ 389 (-62.05%)
fbindA versatile Android mounting utility for folders, EXT4 images, LUKS/LUKS2 encrypted volumes, regular partitions and more.
Stars: ✭ 42 (-95.9%)
DokanyUser mode file system library for windows with FUSE Wrapper
Stars: ✭ 4,055 (+295.61%)
OrcApache ORC - the smallest, fastest columnar storage for Hadoop workloads
Stars: ✭ 389 (-62.05%)
ThrillThrill - An EXPERIMENTAL Algorithmic Distributed Big Data Batch Processing Framework in C++
Stars: ✭ 528 (-48.49%)
Data Science Ipython NotebooksData science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+2051.02%)
SecurefsFilesystem in userspace (FUSE) with transparent authenticated encryption
Stars: ✭ 518 (-49.46%)
ExfatFree exFAT file system implementation
Stars: ✭ 528 (-48.49%)
TinydirLightweight, portable and easy to integrate C directory and file reader
Stars: ✭ 575 (-43.9%)
S3fs FuseFUSE-based file system backed by Amazon S3
Stars: ✭ 5,733 (+459.32%)
FilegatorPowerful Multi-User File Manager
Stars: ✭ 587 (-42.73%)
SirixSirixDB is a temporal, evolutionary database system, which uses an accumulate only approach. It keeps the full history of each resource. Every commit stores a space-efficient snapshot through structural sharing. It is log-structured and never overwrites data. SirixDB uses a novel page-level versioning approach called sliding snapshot.
Stars: ✭ 638 (-37.76%)
S5cmdParallel S3 and local filesystem execution tool.
Stars: ✭ 565 (-44.88%)
H2o 3H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+451.8%)