All Projects → Cluster Pack → Similar Projects or Alternatives

517 Open source projects that are alternatives of or similar to Cluster Pack

Devops Python Tools
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Stars: ✭ 406 (+1665.22%)
Mutual labels:  pyspark, hdfs
Tiledb Py
Python interface to the TileDB storage manager
Stars: ✭ 78 (+239.13%)
Mutual labels:  s3, hdfs
jobAnalytics and search
JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Stars: ✭ 25 (+8.7%)
Mutual labels:  s3, pyspark
Tiledb
The Universal Storage Engine
Stars: ✭ 1,072 (+4560.87%)
Mutual labels:  s3, hdfs
Smart open
Utils for streaming large files (S3, HDFS, gzip, bz2...)
Stars: ✭ 2,306 (+9926.09%)
Mutual labels:  s3, hdfs
Seaweedfs
SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding.
Stars: ✭ 13,380 (+58073.91%)
Mutual labels:  s3, hdfs
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+552.17%)
Mutual labels:  pyspark, hdfs
Storagetapper
StorageTapper is a scalable realtime MySQL change data streaming, logical backup and logical replication service
Stars: ✭ 232 (+908.7%)
Mutual labels:  s3, hdfs
Kafka Connect Ui
Web tool for Kafka Connect |
Stars: ✭ 388 (+1586.96%)
Mutual labels:  s3, hdfs
Rumble
⛈️ Rumble 1.11.0 "Banyan Tree"🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Stars: ✭ 58 (+152.17%)
Mutual labels:  s3, hdfs
kafka-connect-fs
Kafka Connect FileSystem Connector
Stars: ✭ 107 (+365.22%)
Mutual labels:  s3, hdfs
Juicefs
JuiceFS is a distributed POSIX file system built on top of Redis and S3.
Stars: ✭ 4,262 (+18430.43%)
Mutual labels:  s3, hdfs
Romfont
VGA and BIOS rom font extraction
Stars: ✭ 443 (+1826.09%)
Mutual labels:  s3
Stock Analysis Engine
Backtest 1000s of minute-by-minute trading algorithms for training AI with automated pricing data from: IEX, Tradier and FinViz. Datasets and trading performance automatically published to S3 for building AI training datasets for teaching DNNs how to trade. Runs on Kubernetes and docker-compose. >150 million trading history rows generated from +5000 algorithms. Heads up: Yahoo's Finance API was disabled on 2019-01-03 https://developer.yahoo.com/yql/
Stars: ✭ 605 (+2530.43%)
Mutual labels:  s3
Cortx
CORTX Community Object Storage is 100% open source object storage uniquely optimized for mass capacity storage devices.
Stars: ✭ 426 (+1752.17%)
Mutual labels:  s3
Spark Syntax
This is a repo documenting the best practices in PySpark.
Stars: ✭ 412 (+1691.3%)
Mutual labels:  pyspark
Pgbackrest
Reliable PostgreSQL Backup & Restore
Stars: ✭ 766 (+3230.43%)
Mutual labels:  s3
S3fs Fuse
FUSE-based file system backed by Amazon S3
Stars: ✭ 5,733 (+24826.09%)
Mutual labels:  s3
S3rver
A fake S3 server written in NodeJs
Stars: ✭ 410 (+1682.61%)
Mutual labels:  s3
S3monkey
A Python library that allows you to interact with Amazon S3 Buckets as if they are your local filesystem.
Stars: ✭ 399 (+1634.78%)
Mutual labels:  s3
S5cmd
Parallel S3 and local filesystem execution tool.
Stars: ✭ 565 (+2356.52%)
Mutual labels:  s3
Gulp Awspublish
gulp plugin to publish files to amazon s3
Stars: ✭ 398 (+1630.43%)
Mutual labels:  s3
Infinit
The Infinit policy-based software-defined storage platform.
Stars: ✭ 363 (+1478.26%)
Mutual labels:  s3
Aws Toolkit Vscode
AWS Toolkit for Visual Studio Code, an extension for working with AWS services including AWS Lambda.
Stars: ✭ 823 (+3478.26%)
Mutual labels:  s3
Rome
Carthage cache for S3, Minio, Ceph, Google Storage, Artifactory and many others
Stars: ✭ 724 (+3047.83%)
Mutual labels:  s3
Backup
Easy full stack backup operations on UNIX-like systems.
Stars: ✭ 4,682 (+20256.52%)
Mutual labels:  s3
S3cmd
Official s3cmd repo -- Command line tool for managing Amazon S3 and CloudFront services
Stars: ✭ 3,767 (+16278.26%)
Mutual labels:  s3
Filestash
🦄 A modern web client for SFTP, S3, FTP, WebDAV, Git, Minio, LDAP, CalDAV, CardDAV, Mysql, Backblaze, ...
Stars: ✭ 5,231 (+22643.48%)
Mutual labels:  s3
Pyspark Example Project
Example project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (+2652.17%)
Mutual labels:  pyspark
God Of Bigdata
专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...
Stars: ✭ 6,008 (+26021.74%)
Mutual labels:  hdfs
Hasura Backend Plus
🔑Auth and 📦Storage for Hasura. The quickest way to get Auth and Storage working for your next app based on Hasura.
Stars: ✭ 776 (+3273.91%)
Mutual labels:  s3
Mort
Storage and image processing server written in Go
Stars: ✭ 420 (+1726.09%)
Mutual labels:  s3
Kodexplorer
A web based file manager,web IDE / browser based code editor
Stars: ✭ 5,490 (+23769.57%)
Mutual labels:  s3
Hadoop For Geoevent
ArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.
Stars: ✭ 5 (-78.26%)
Mutual labels:  hdfs
Kafka Connect Hdfs
Kafka Connect HDFS connector
Stars: ✭ 400 (+1639.13%)
Mutual labels:  hdfs
Django S3direct
Directly upload files to S3 compatible services with Django.
Stars: ✭ 570 (+2378.26%)
Mutual labels:  s3
Aws Sdk Js V3
Modularized AWS SDK for JavaScript.
Stars: ✭ 737 (+3104.35%)
Mutual labels:  s3
Commuter
🚎 Notebook sharing hub
Stars: ✭ 353 (+1434.78%)
Mutual labels:  s3
Helm S3
Helm plugin that allows to set up a chart repository in AWS S3.
Stars: ✭ 372 (+1517.39%)
Mutual labels:  s3
S3 Benchmark
Measure Amazon S3's performance from any location.
Stars: ✭ 525 (+2182.61%)
Mutual labels:  s3
Clickhouse Backup
Tool for easy ClickHouse backup and restore with cloud storages support
Stars: ✭ 359 (+1460.87%)
Mutual labels:  s3
Yandex Big Data Engineering
Stars: ✭ 17 (-26.09%)
Mutual labels:  hdfs
Edgefs
EdgeFS - decentralized, scalable data fabric platform for Edge/IoT Computing and Kubernetes apps
Stars: ✭ 358 (+1456.52%)
Mutual labels:  s3
Sparta
Real Time Analytics and Data Pipelines based on Spark Streaming
Stars: ✭ 513 (+2130.43%)
Mutual labels:  hdfs
Scriptis
Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, resource management and intelligent diagnosis.
Stars: ✭ 696 (+2926.09%)
Mutual labels:  pyspark
Nodb
NoDB isn't a database.. but it sort of looks like one.
Stars: ✭ 353 (+1434.78%)
Mutual labels:  s3
Javakeeper
✍️ Java 工程师必备架构体系知识总结:涵盖分布式、微服务、RPC等互联网公司常用架构,以及数据存储、缓存、搜索等必备技能
Stars: ✭ 502 (+2082.61%)
Mutual labels:  s3
Goofys
a high-performance, POSIX-ish Amazon S3 file system written in Go
Stars: ✭ 3,932 (+16995.65%)
Mutual labels:  s3
Ozone
Scalable, redundant, and distributed object store for Apache Hadoop
Stars: ✭ 330 (+1334.78%)
Mutual labels:  s3
Aws
A collection of bash shell scripts for automating various tasks with Amazon Web Services using the AWS CLI and jq.
Stars: ✭ 493 (+2043.48%)
Mutual labels:  s3
S3mock
A simple mock implementation of the AWS S3 API startable as Docker image, JUnit 4 rule, or JUnit Jupiter extension
Stars: ✭ 332 (+1343.48%)
Mutual labels:  s3
Pyspark Boilerplate
A boilerplate for writing PySpark Jobs
Stars: ✭ 318 (+1282.61%)
Mutual labels:  pyspark
Winscp
WinSCP is a popular free SFTP and FTP client for Windows, a powerful file manager that will improve your productivity. It supports also Amazon S3, FTPS, SCP and WebDAV protocols. Power users can automate WinSCP using .NET assembly.
Stars: ✭ 794 (+3352.17%)
Mutual labels:  s3
Minio
High Performance, Kubernetes Native Object Storage
Stars: ✭ 30,698 (+133369.57%)
Mutual labels:  s3
S3 Sync Action
🔄 GitHub Action to sync a directory with a remote S3 bucket 🧺
Stars: ✭ 497 (+2060.87%)
Mutual labels:  s3
Wal E
Continuous Archiving for Postgres
Stars: ✭ 3,313 (+14304.35%)
Mutual labels:  s3
Spark Gotchas
Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks
Stars: ✭ 308 (+1239.13%)
Mutual labels:  pyspark
Moto
A library that allows you to easily mock out tests based on AWS infrastructure.
Stars: ✭ 5,428 (+23500%)
Mutual labels:  s3
Vue Cli Plugin S3 Deploy
A vue-cli plugin that uploads your built Vue.js project to an S3 bucket
Stars: ✭ 304 (+1221.74%)
Mutual labels:  s3
Aws.s3
Amazon Simple Storage Service (S3) API Client
Stars: ✭ 302 (+1213.04%)
Mutual labels:  s3
1-60 of 517 similar projects