jobAnalytics and searchJobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Stars: ✭ 25 (-50%)
airflow-ciApache Airflow CI pipeline
Stars: ✭ 18 (-64%)
bigkubeMinikube for big data with Scala and Spark
Stars: ✭ 16 (-68%)
astroAstro allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Stars: ✭ 79 (+58%)
polygon-etlETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (+6%)
Airflow OperatorKubernetes custom controller and CRDs to managing Airflow
Stars: ✭ 278 (+456%)
Udacity Data Engineering ProjectsFew projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Stars: ✭ 458 (+816%)
airflow-boilerplateA complete development environment setup for working with Airflow
Stars: ✭ 94 (+88%)
airflow-dbtApache Airflow integration for dbt
Stars: ✭ 233 (+366%)
T-WatchReal Time Twitter Sentiment Analysis Product
Stars: ✭ 20 (-60%)
Goodreads etl pipelineAn end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Stars: ✭ 793 (+1486%)
helpdeskYet another helpdesk based on multiple providers
Stars: ✭ 14 (-72%)
aircalVisualize Airflow's schedule by exporting future DAG runs as events to Google Calendar.
Stars: ✭ 66 (+32%)
ElyraElyra extends JupyterLab Notebooks with an AI centric approach.
Stars: ✭ 839 (+1578%)
torchxTorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.
Stars: ✭ 165 (+230%)
Agile data code 2Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+726%)
k3aiA lightweight tool to get an AI Infrastructure Stack up in minutes not days. K3ai will take care of setup K8s for You, deploy the AI tool of your choice and even run your code on it.
Stars: ✭ 105 (+110%)
airflow-code-editorA plugin for Apache Airflow that allows you to edit DAGs in browser
Stars: ✭ 195 (+290%)
openverse-catalogIdentifies and collects data on cc-licensed content across web crawl data and public apis.
Stars: ✭ 27 (-46%)
incremental trainingRepo that relates to the Medium blog 'Keeping your ML model in shape with Kafka, Airflow' and MLFlow'
Stars: ✭ 110 (+120%)
airflow-tutorialUse Airflow to move data from multiple MySQL databases to BigQuery
Stars: ✭ 96 (+92%)
Airflow Rest Api PluginA plugin for Apache Airflow that exposes rest end points for the Command Line Interfaces
Stars: ✭ 281 (+462%)
viewflowViewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.
Stars: ✭ 110 (+120%)
ml-opsGet your MLOps (Level 1) platform started and going fast.
Stars: ✭ 81 (+62%)
Incubator DolphinschedulerApache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.
Stars: ✭ 6,916 (+13732%)
incubator-liminalApache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful experiment to an automated pipeline of model training, validation, deployment and inference in production. Liminal provides a Domain Specific Language to build ML workflows on top of Apache Airflow.
Stars: ✭ 117 (+134%)
ObjinsyncContinuously synchronize directories from remote object store to local filesystem
Stars: ✭ 29 (-42%)
opentrials-airflowConfiguration and definitions of Airflow for OpenTrials
Stars: ✭ 18 (-64%)
FastETLPlugins do Airflow para implementação de pipelines de dados
Stars: ✭ 31 (-38%)
AirflowApache Airflow - A platform to programmatically author, schedule, and monitor workflows
Stars: ✭ 24,101 (+48102%)
ap-airflowAstronomer Core Docker Images
Stars: ✭ 87 (+74%)
qunomonTestbed of AI Systems Quality Management
Stars: ✭ 15 (-70%)
DatabookA facebook for data
Stars: ✭ 26 (-48%)
udacity-data-eng-proj2A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract data from S3, apply a series of transformations and load into S3 and Redshift.
Stars: ✭ 25 (-50%)
fairflowFunctional Airflow DAG definitions.
Stars: ✭ 38 (-24%)
Dag FactoryDynamically generate Apache Airflow DAGs from YAML configuration files
Stars: ✭ 385 (+670%)
ecs-airflowCloudformation templates for deploying Airflow in ECS
Stars: ✭ 37 (-26%)
Docker AirflowRepo for building docker based airflow image. Containers support multiple features like writing logs to local or S3 folder and Initializing GCP while container booting. https://abhioncbr.github.io/docker-airflow/
Stars: ✭ 29 (-42%)