Devops Python Tools80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Stars: ✭ 406 (+1665.22%)
Tiledb PyPython interface to the TileDB storage manager
Stars: ✭ 78 (+239.13%)
jobAnalytics and searchJobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Stars: ✭ 25 (+8.7%)
TiledbThe Universal Storage Engine
Stars: ✭ 1,072 (+4560.87%)
Smart openUtils for streaming large files (S3, HDFS, gzip, bz2...)
Stars: ✭ 2,306 (+9926.09%)
SeaweedfsSeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding.
Stars: ✭ 13,380 (+58073.91%)
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+552.17%)
StoragetapperStorageTapper is a scalable realtime MySQL change data streaming, logical backup and logical replication service
Stars: ✭ 232 (+908.7%)
Rumble⛈️ Rumble 1.11.0 "Banyan Tree"🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Stars: ✭ 58 (+152.17%)
JuicefsJuiceFS is a distributed POSIX file system built on top of Redis and S3.
Stars: ✭ 4,262 (+18430.43%)
RomfontVGA and BIOS rom font extraction
Stars: ✭ 443 (+1826.09%)
Stock Analysis EngineBacktest 1000s of minute-by-minute trading algorithms for training AI with automated pricing data from: IEX, Tradier and FinViz. Datasets and trading performance automatically published to S3 for building AI training datasets for teaching DNNs how to trade. Runs on Kubernetes and docker-compose. >150 million trading history rows generated from +5000 algorithms. Heads up: Yahoo's Finance API was disabled on 2019-01-03 https://developer.yahoo.com/yql/
Stars: ✭ 605 (+2530.43%)
CortxCORTX Community Object Storage is 100% open source object storage uniquely optimized for mass capacity storage devices.
Stars: ✭ 426 (+1752.17%)
Spark SyntaxThis is a repo documenting the best practices in PySpark.
Stars: ✭ 412 (+1691.3%)
PgbackrestReliable PostgreSQL Backup & Restore
Stars: ✭ 766 (+3230.43%)
S3fs FuseFUSE-based file system backed by Amazon S3
Stars: ✭ 5,733 (+24826.09%)
S3rverA fake S3 server written in NodeJs
Stars: ✭ 410 (+1682.61%)
S3monkeyA Python library that allows you to interact with Amazon S3 Buckets as if they are your local filesystem.
Stars: ✭ 399 (+1634.78%)
S5cmdParallel S3 and local filesystem execution tool.
Stars: ✭ 565 (+2356.52%)
Gulp Awspublishgulp plugin to publish files to amazon s3
Stars: ✭ 398 (+1630.43%)
InfinitThe Infinit policy-based software-defined storage platform.
Stars: ✭ 363 (+1478.26%)
Aws Toolkit VscodeAWS Toolkit for Visual Studio Code, an extension for working with AWS services including AWS Lambda.
Stars: ✭ 823 (+3478.26%)
RomeCarthage cache for S3, Minio, Ceph, Google Storage, Artifactory and many others
Stars: ✭ 724 (+3047.83%)
BackupEasy full stack backup operations on UNIX-like systems.
Stars: ✭ 4,682 (+20256.52%)
S3cmdOfficial s3cmd repo -- Command line tool for managing Amazon S3 and CloudFront services
Stars: ✭ 3,767 (+16278.26%)
Filestash🦄 A modern web client for SFTP, S3, FTP, WebDAV, Git, Minio, LDAP, CalDAV, CardDAV, Mysql, Backblaze, ...
Stars: ✭ 5,231 (+22643.48%)
Pyspark Example ProjectExample project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (+2652.17%)
God Of Bigdata专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...
Stars: ✭ 6,008 (+26021.74%)
Hasura Backend Plus🔑Auth and 📦Storage for Hasura. The quickest way to get Auth and Storage working for your next app based on Hasura.
Stars: ✭ 776 (+3273.91%)
MortStorage and image processing server written in Go
Stars: ✭ 420 (+1726.09%)
KodexplorerA web based file manager,web IDE / browser based code editor
Stars: ✭ 5,490 (+23769.57%)
Hadoop For GeoeventArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.
Stars: ✭ 5 (-78.26%)
Django S3directDirectly upload files to S3 compatible services with Django.
Stars: ✭ 570 (+2378.26%)
Aws Sdk Js V3Modularized AWS SDK for JavaScript.
Stars: ✭ 737 (+3104.35%)
Commuter🚎 Notebook sharing hub
Stars: ✭ 353 (+1434.78%)
Helm S3Helm plugin that allows to set up a chart repository in AWS S3.
Stars: ✭ 372 (+1517.39%)
S3 BenchmarkMeasure Amazon S3's performance from any location.
Stars: ✭ 525 (+2182.61%)
Clickhouse BackupTool for easy ClickHouse backup and restore with cloud storages support
Stars: ✭ 359 (+1460.87%)
EdgefsEdgeFS - decentralized, scalable data fabric platform for Edge/IoT Computing and Kubernetes apps
Stars: ✭ 358 (+1456.52%)
SpartaReal Time Analytics and Data Pipelines based on Spark Streaming
Stars: ✭ 513 (+2130.43%)
ScriptisScriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, resource management and intelligent diagnosis.
Stars: ✭ 696 (+2926.09%)
NodbNoDB isn't a database.. but it sort of looks like one.
Stars: ✭ 353 (+1434.78%)
Javakeeper✍️ Java 工程师必备架构体系知识总结:涵盖分布式、微服务、RPC等互联网公司常用架构,以及数据存储、缓存、搜索等必备技能
Stars: ✭ 502 (+2082.61%)
Goofysa high-performance, POSIX-ish Amazon S3 file system written in Go
Stars: ✭ 3,932 (+16995.65%)
OzoneScalable, redundant, and distributed object store for Apache Hadoop
Stars: ✭ 330 (+1334.78%)
AwsA collection of bash shell scripts for automating various tasks with Amazon Web Services using the AWS CLI and jq.
Stars: ✭ 493 (+2043.48%)
S3mockA simple mock implementation of the AWS S3 API startable as Docker image, JUnit 4 rule, or JUnit Jupiter extension
Stars: ✭ 332 (+1343.48%)
WinscpWinSCP is a popular free SFTP and FTP client for Windows, a powerful file manager that will improve your productivity. It supports also Amazon S3, FTPS, SCP and WebDAV protocols. Power users can automate WinSCP using .NET assembly.
Stars: ✭ 794 (+3352.17%)
MinioHigh Performance, Kubernetes Native Object Storage
Stars: ✭ 30,698 (+133369.57%)
S3 Sync Action🔄 GitHub Action to sync a directory with a remote S3 bucket 🧺
Stars: ✭ 497 (+2060.87%)
Wal EContinuous Archiving for Postgres
Stars: ✭ 3,313 (+14304.35%)
Spark GotchasSpark Gotchas. A subjective compilation of the Apache Spark tips and tricks
Stars: ✭ 308 (+1239.13%)
MotoA library that allows you to easily mock out tests based on AWS infrastructure.
Stars: ✭ 5,428 (+23500%)
Vue Cli Plugin S3 DeployA vue-cli plugin that uploads your built Vue.js project to an S3 bucket
Stars: ✭ 304 (+1221.74%)
Aws.s3Amazon Simple Storage Service (S3) API Client
Stars: ✭ 302 (+1213.04%)