AirflowETLBlog post on ETL pipelines with Airflow
Stars: ✭ 20 (+0%)
DataspherestudioDataSphereStudio is a one-stop data application development and management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Stars: ✭ 1,195 (+5875%)
incremental trainingRepo that relates to the Medium blog 'Keeping your ML model in shape with Kafka, Airflow and MLFlow'
Stars: ✭ 110 (+450%)
DiscreetlyETLy is an add-on dashboard service on top of Apache Airflow.
Stars: ✭ 60 (+200%)
aircan💨🥫 A Data Factory system for running data processing pipelines built on Airflow and tailored to CKAN. Includes an evolution of DataPusher and Xloader for loading data to DataStore.
Stars: ✭ 24 (+20%)
XeneA distributed workflow runner focusing on performance and simplicity.
Stars: ✭ 56 (+180%)
sigilAWS SSM Session manager client
Stars: ✭ 67 (+235%)
Argo WorkflowsWorkflow engine for Kubernetes
Stars: ✭ 10,024 (+50020%)
ExDeMonA general purpose metrics monitor implemented with Apache Spark. Kafka source, Elastic sink, aggregate metrics, different analysis, notifications, actions, live configuration update, missing metrics, ...
Stars: ✭ 19 (-5%)
amazon-cloudwatch-auto-alarmsAutomatically create and configure Amazon CloudWatch alarms for EC2 instances, RDS, and AWS Lambda using tags for standard and custom CloudWatch Metrics.
Stars: ✭ 52 (+160%)
ElyraElyra extends JupyterLab Notebooks with an AI centric approach.
Stars: ✭ 839 (+4095%)
akka-persistence-s3akka-persistence journal/snapshot plugin for AWS S3 (supports AWS SDK for Java v2)
Stars: ✭ 19 (-5%)
Goodreads etl pipelineAn end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Stars: ✭ 793 (+3865%)
amigen7Set of tools to provide automation of tasks for creating STIG-partitioned EL7 AMIs
Stars: ✭ 33 (+65%)
termscp🖥 A feature rich terminal UI file transfer and explorer with support for SCP/SFTP/FTP/S3
Stars: ✭ 707 (+3435%)
Agile data code 2Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+1965%)
ossperfA lightweight tool for analyzing the performance and data integrity of object-based storage services
Stars: ✭ 67 (+235%)
Aws Airflow StackTurbine: the bare metal that gets you Airflow
Stars: ✭ 352 (+1660%)
trackit2-homeTrackIt helps you to optimize your AWS cloud
Stars: ✭ 46 (+130%)
ec2detailsAPI providing AWS EC2 Instance Type Data
Stars: ✭ 37 (+85%)
Airflow ChartA Helm chart to install Apache Airflow on Kubernetes
Stars: ✭ 137 (+585%)
Airflow OperatorKubernetes custom controller and CRDs for managing Airflow
Stars: ✭ 278 (+1290%)
Example Airflow DagsExample DAGs using hooks and operators from Airflow Plugins
Stars: ✭ 243 (+1115%)
helpdeskYet another helpdesk based on multiple providers
Stars: ✭ 14 (-30%)
kedro-airflowKedro-Airflow makes it easy to deploy Kedro projects to Airflow.
Stars: ✭ 121 (+505%)
bigkubeMinikube for big data with Scala and Spark
Stars: ✭ 16 (-20%)
Spark ALSRecommendation algorithm implementations based on spark-ml, spark-mllib, and spark-streaming
Stars: ✭ 89 (+345%)
Soda SqlMetric collection, data testing and monitoring for SQL accessible data
Stars: ✭ 173 (+765%)
airflow-code-editorA plugin for Apache Airflow that allows you to edit DAGs in browser
Stars: ✭ 195 (+875%)
image-uploaderJavaScript Image Uploader Library for use with Amazon S3
Stars: ✭ 19 (-5%)
airflow-dbtApache Airflow integration for dbt
Stars: ✭ 233 (+1065%)
s3 uploaderMultithreaded recursive directory upload to S3 using FOG
Stars: ✭ 36 (+80%)
astroAstro allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Stars: ✭ 79 (+295%)
Data Science Stack Cookiecutter🐳📊🤓Cookiecutter template to launch an awesome dockerized Data Science toolstack (incl. Jupyter, Superset, Postgres, Minio, Airflow & API Star)
Stars: ✭ 153 (+665%)
lessGo serverless website on AWS Lambda.
Stars: ✭ 22 (+10%)
jobAnalytics and searchJobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Stars: ✭ 25 (+25%)
viewflowViewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.
Stars: ✭ 110 (+450%)
nexus-blobstore-s3[*No longer maintained*] Nexus Repository S3 Blobstores
Stars: ✭ 59 (+195%)
s3uploadUpload multiple files to AWS S3, make them public, and get their URLs easily from the command line.
Stars: ✭ 24 (+20%)
Beyond Jupyter🐍💻📊 All material from the PyCon.DE 2018 Talk "Beyond Jupyter Notebooks - Building your own data science platform with Python & Docker" (incl. Slides, Video, Udemy MOOC & other References)
Stars: ✭ 135 (+575%)
ml-opsGet your MLOps (Level 1) platform started and going fast.
Stars: ✭ 81 (+305%)
fab-oidcFlask-AppBuilder SecurityManager for OpenIDConnect
Stars: ✭ 28 (+40%)