CortxCORTX Community Object Storage is 100% open source object storage uniquely optimized for mass capacity storage devices.
Stars: ✭ 426 (+29.09%)
MoosefsMooseFS – Open Source, Petabyte, Fault-Tolerant, Highly Performing, Scalable Network Distributed File System (Software-Defined Storage)
Stars: ✭ 1,025 (+210.61%)
FoundatioPluggable foundation blocks for building distributed apps.
Stars: ✭ 1,365 (+313.64%)
TezApache Tez
Stars: ✭ 313 (-5.15%)
ThanosHighly available Prometheus setup with long term storage capabilities. A CNCF Incubating project.
Stars: ✭ 9,820 (+2875.76%)
Go StorageAn application-oriented unified storage layer for Golang.
Stars: ✭ 87 (-73.64%)
TrinoOfficial repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Stars: ✭ 4,581 (+1288.18%)
UptocA static file deployment tool that supports multiple platforms./ 一个支持多家云厂商的静态文件部署工具
Stars: ✭ 159 (-51.82%)
esopCloud-enabled backup and restore tool for Apache Cassandra
Stars: ✭ 40 (-87.88%)
minio-rclone-webdav-serverA @rclone served WebDAV server with @minio as the s3 storage backend docker example
Stars: ✭ 17 (-94.85%)
S4S4 is 100% S3 compatible storage, accessed through Tor and distributed using IPFS.
Stars: ✭ 67 (-79.7%)
JuicefsJuiceFS is a distributed POSIX file system built on top of Redis and S3.
Stars: ✭ 4,262 (+1191.52%)
Radosgw Admin4jA Ceph Object Storage Admin SDK / Client Library for Java ✨🍰✨
Stars: ✭ 50 (-84.85%)
Cloud VolumeRead and write Neuroglancer datasets programmatically.
Stars: ✭ 63 (-80.91%)
Noobaa CoreNooBaa is a Dynamic Data Gateway for cloud-native, hybrid and multi cloud environments ☁️🚀
Stars: ✭ 131 (-60.3%)
McMinIO Client is a replacement for ls, cp, mkdir, diff and rsync commands for filesystems and object storage.
Stars: ✭ 1,962 (+494.55%)
Cakephp File StorageAbstract file storage and upload plugin for CakePHP. Write to local disk, FTP, S3, Dropbox and more through a single interface. It's not just yet another uploader but a complete storage solution.
Stars: ✭ 202 (-38.79%)
Amazon S3 Find And ForgetAmazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)
Stars: ✭ 115 (-65.15%)
sparkucxA high-performance, scalable and efficient ShuffleManager plugin for Apache Spark, utilizing UCX communication layer
Stars: ✭ 32 (-90.3%)
beekeeperService for automatically managing and cleaning up unreferenced data
Stars: ✭ 43 (-86.97%)
CloudbreakA tool for provisioning and managing Apache Hadoop clusters in the cloud. Cloudbreak, as part of the Hortonworks Data Platform, makes it easy to provision, configure and elastically grow HDP clusters on cloud infrastructure. Cloudbreak can be used to provision Hadoop across cloud infrastructure providers including AWS, Azure, GCP and OpenStack.
Stars: ✭ 301 (-8.79%)
go-storageA vendor-neutral storage library for Golang: Write once, run on every storage service.
Stars: ✭ 387 (+17.27%)
Less3Less3 is an S3-compatible object storage server that runs on your laptop, servers, just about anywhere!
Stars: ✭ 16 (-95.15%)
ShrineFile Attachment toolkit for Ruby applications
Stars: ✭ 2,903 (+779.7%)
MinioHigh Performance, Kubernetes Native Object Storage
Stars: ✭ 30,698 (+9202.42%)
S5cmdParallel S3 and local filesystem execution tool.
Stars: ✭ 565 (+71.21%)
Arc📎 Flexible file upload and attachment library for Elixir
Stars: ✭ 1,087 (+229.39%)
AkubraSimple solution to keep a independent S3 storages in sync
Stars: ✭ 79 (-76.06%)
Terraform Aws S3 Log StorageThis module creates an S3 bucket suitable for receiving logs from other AWS services such as S3, CloudFront, and CloudTrail
Stars: ✭ 65 (-80.3%)
Streamxkafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)
Stars: ✭ 96 (-70.91%)
MortStorage and image processing server written in Go
Stars: ✭ 420 (+27.27%)
CashHTTP response caching for Koa. Supports Redis, in-memory store, and more!
Stars: ✭ 122 (-63.03%)
PinsPin, Discover and Share Resources
Stars: ✭ 149 (-54.85%)
S3 Uploader🍎 macOS Electron+React App for uploading files to S3 directly from Status Bar
Stars: ✭ 119 (-63.94%)
Tus Ruby ServerRuby server for tus resumable upload protocol
Stars: ✭ 172 (-47.88%)
iisInformation Inference Service of the OpenAIRE system
Stars: ✭ 16 (-95.15%)
InfinitThe Infinit policy-based software-defined storage platform.
Stars: ✭ 363 (+10%)
rastercuberastercube is a python library for big data analysis of georeferenced time series data (e.g. MODIS NDVI)
Stars: ✭ 15 (-95.45%)
BlobHelperBlobHelper is a common, consistent storage interface for Microsoft Azure, Amazon S3, Komodo, Kvpbase, and local filesystem written in C#.
Stars: ✭ 23 (-93.03%)
Flydrive☁️ Flexible and Fluent framework-agnostic driver based system to manage storage in Node.js
Stars: ✭ 275 (-16.67%)
awesome-storageA curated list of storage open source tools. Backups, redundancy, sharing, distribution, encryption, etc.
Stars: ✭ 324 (-1.82%)
clusterdockclusterdock is a framework for creating Docker-based container clusters
Stars: ✭ 26 (-92.12%)
storageGo library providing common interface for working across multiple cloud storage backends
Stars: ✭ 154 (-53.33%)
datalake-etl-pipelineSimplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (-88.18%)
alluxio-pyAlluxio Python client - Access Any Data Source with Python
Stars: ✭ 18 (-94.55%)
big dataA collection of tutorials on Hadoop, MapReduce, Spark, Docker
Stars: ✭ 34 (-89.7%)
leaflet heatmap简单的可视化湖州通话数据 假设数据量很大,没法用浏览器直接绘制热力图,把绘制热力图这一步骤放到线下计算分析。使用Apache Spark并行计算数据之后,再使用Apache Spark绘制热力图,然后用leafletjs加载OpenStreetMap图层和热力图图层,以达到良好的交互效果。现在使用Apache Spark实现绘制,可能是Apache Spark不擅长这方面的计算或者是我没有设计好算法,并行计算的速度比不上单机计算。Apache Spark绘制热力图和计算代码在这 https://github.com/yuanzhaokang/ParallelizeHeatmap.git .
Stars: ✭ 13 (-96.06%)
big-data-liteSamples to the Oracle Big Data Lite VM
Stars: ✭ 41 (-87.58%)
bigdata-funA complete (distributed) BigData stack, running in containers
Stars: ✭ 14 (-95.76%)
SparkrdmaRDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (-34.85%)
EdgefsEdgeFS - decentralized, scalable data fabric platform for Edge/IoT Computing and Kubernetes apps
Stars: ✭ 358 (+8.48%)
nifiDeploy a secured, clustered, auto-scaling NiFi service in AWS.
Stars: ✭ 37 (-88.79%)
autThe Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (-66.36%)