Aws Ecs AirflowRun Airflow in AWS ECS(Elastic Container Service) using Fargate tasks
Stars: ✭ 107 (-55.97%)
DiscreetlyETLy is an add-on dashboard service on top of Apache Airflow.
Stars: ✭ 60 (-75.31%)
polygon-etlETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (-78.19%)
aircan💨🥫 A Data Factory system for running data processing pipelines built on AirFlow and tailored to CKAN. Includes evolution of DataPusher and Xloader for loading data to DataStore.
Stars: ✭ 24 (-90.12%)
aircalVisualize Airflow's schedule by exporting future DAG runs as events to Google Calendar.
Stars: ✭ 66 (-72.84%)
Goeat ApiRest API for a food delivery application - Built with Express, Postgres, Redis, MongoDB and Nodemailer
Stars: ✭ 36 (-85.19%)
Python Spider豆瓣电影top250、斗鱼爬取json数据以及爬取美女图片、淘宝、有缘、CrawlSpider爬取红娘网相亲人的部分基本信息以及红娘网分布式爬取和存储redis、爬虫小demo、Selenium、爬取多点、django开发接口、爬取有缘网信息、模拟知乎登录、模拟github登录、模拟图虫网登录、爬取多点商城整站数据、爬取微信公众号历史文章、爬取微信群或者微信好友分享的文章、itchat监听指定微信公众号分享的文章
Stars: ✭ 615 (+153.09%)
TransporterSync data between persistence engines, like ETL only not stodgy
Stars: ✭ 1,175 (+383.54%)
Steedos Platform华炎魔方低代码/无代码平台。内核采用了元数据、微服务、微前端、K8S等最新的技术架构。Steedos Low-Code / No-Code Platform in nodejs and mongodb.
Stars: ✭ 310 (+27.57%)
Argo WorkflowsWorkflow engine for Kubernetes
Stars: ✭ 10,024 (+4025.1%)
Monstachea go daemon that syncs MongoDB to Elasticsearch in realtime
Stars: ✭ 736 (+202.88%)
Luigi WarehouseA luigi powered analytics / warehouse stack
Stars: ✭ 72 (-70.37%)
MailerA light-weight, modular, message representation and mail delivery framework for Python.
Stars: ✭ 225 (-7.41%)
artefactory-connectors-kitACK is an E(T)L tool specialized in API data ingestion. It is accessible through a Command-Line Interface. The application allows you to easily extract, stream and load data (with minimum transformations), from the API source to the destination of your choice.
Stars: ✭ 34 (-86.01%)
hamiltonA scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+151.85%)
AirflowETLBlog post on ETL pipelines with Airflow
Stars: ✭ 20 (-91.77%)
lineageGenerate beautiful documentation for your data pipelines in markdown format
Stars: ✭ 16 (-93.42%)
astroAstro allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Stars: ✭ 79 (-67.49%)
MgobMongoDB dockerized backup agent. Runs schedule backups with retention, S3 & SFTP upload, notifications, instrumentation with Prometheus and more.
Stars: ✭ 573 (+135.8%)
DataspherestudioDataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Stars: ✭ 1,195 (+391.77%)
Etl.netMass processing data with a complete ETL for .net developers
Stars: ✭ 129 (-46.91%)
XeneA distributed workflow runner focusing on performance and simplicity.
Stars: ✭ 56 (-76.95%)
File StorageFile storage abstraction for Yii2
Stars: ✭ 116 (-52.26%)
Mongo EsA MongoDB to Elasticsearch connector
Stars: ✭ 185 (-23.87%)
Next GaNext.js HOC to integrate Google Analytics on every page change
Stars: ✭ 228 (-6.17%)
DockselpyDockerized Selenium and Python with support for Chrome, Firefox and PhantomJS
Stars: ✭ 237 (-2.47%)
DbData access layer for PostgreSQL, CockroachDB, MySQL, SQLite and MongoDB with ORM-like features.
Stars: ✭ 2,832 (+1065.43%)
Django SalesforceSalesforce integration for Django's ORM using the SF REST API.
Stars: ✭ 241 (-0.82%)
ElasticR client for the Elasticsearch HTTP API
Stars: ✭ 227 (-6.58%)
Mastering Junit5A comprehensive collection of test examples created with JUnit 5
Stars: ✭ 223 (-8.23%)
Spring Dubbo Service微服务 spring dubbo项目:dubbo rpc;druid数据源连接池;mybatis配置集成,多数据源;jmx监控MBean;定时任务;aop;ftp;测试;Metrics监控;参数验证;跨域处理;shiro权限控制;consul服务注册,发现;redis分布式锁;SPI服务机制;cat监控;netty服务代理;websocket;disconf;mongodb集成;rest;docker;fescar
Stars: ✭ 224 (-7.82%)
Node Typescript Api🚀Complete Node.js API built using 👉Typescript | Jest | MongoDB | Express
Stars: ✭ 234 (-3.7%)
Recheck Webrecheck for web apps – change comparison tool with local Golden Masters, Git-like ignore syntax and "Unbreakable Selenium" tests.
Stars: ✭ 224 (-7.82%)
FsqioA monorepo that holds all of Foursquare's opensource projects
Stars: ✭ 223 (-8.23%)
Machine LearningWeb-interface + rest API for classification and regression (https://jeff1evesque.github.io/machine-learning.docs)
Stars: ✭ 235 (-3.29%)
Scrape Linkedin Selenium`scrape_linkedin` is a python package that allows you to scrape personal LinkedIn profiles & company pages - turning the data into structured json.
Stars: ✭ 239 (-1.65%)
ElandPython Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Stars: ✭ 235 (-3.29%)
Mongolid LaravelEasy, powerful and ultrafast MongoDB ODM for Laravel.
Stars: ✭ 222 (-8.64%)
Full Reactive StackFull Reactive Stack with Spring Boot (WebFlux), MongoDB and Angular
Stars: ✭ 221 (-9.05%)
MongockLightweight MongoDB migration tool for Java
Stars: ✭ 220 (-9.47%)
Mongo2goMongo2Go - MongoDB for integration tests (.NET Core)
Stars: ✭ 240 (-1.23%)
Ppspiderweb spider built by puppeteer, support task-queue and task-scheduling by decorators,support nedb / mongodb, support data visualization; 基于puppeteer的web爬虫框架,提供灵活的任务队列管理调度方案,提供便捷的数据保存方案(nedb/mongodb),提供数据可视化和用户交互的实现方案
Stars: ✭ 237 (-2.47%)
Spring Security Pac4jpac4j security library for Spring Security: OAuth, CAS, SAML, OpenID Connect, LDAP, JWT...
Stars: ✭ 231 (-4.94%)
PaperboyA web frontend for scheduling Jupyter notebook reports
Stars: ✭ 221 (-9.05%)
VmimeVMime Mail Library
Stars: ✭ 218 (-10.29%)