DagsterAn orchestration platform for the development, production, and observation of data assets.
Stars: ✭ 4,099 (+1303.77%)
BigsliceA serverless cluster computing system for the Go programming language
Stars: ✭ 469 (+60.62%)
PantherDetect threats with log data and improve cloud security posture
Stars: ✭ 885 (+203.08%)
Aws Etl OrchestratorA serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.
Stars: ✭ 245 (-16.1%)
WedatasphereWeDataSphere is a financial level one-stop open-source suitcase for big data platforms. Currently the source code of Scriptis and Linkis has already been released to the open-source community. WeDataSphere, Big Data Made Easy!
Stars: ✭ 372 (+27.4%)
Aws Auto Terminate Idle EmrAWS Auto Terminate Idle AWS EMR Clusters Framework is an AWS based solution using AWS CloudWatch and AWS Lambda using a Python script that is using Boto3 to terminate AWS EMR clusters that have been idle for a specified period of time.
Stars: ✭ 21 (-92.81%)
thainThain is a distributed flow schedule platform.
Stars: ✭ 81 (-72.26%)
openrefine-batchShell script to run OpenRefine in batch mode (import, transform, export). It orchestrates OpenRefine (server) and a python client that communicates with the OpenRefine API.
Stars: ✭ 76 (-73.97%)
TiBigDataTiDB connectors for Flink/Hive/Presto
Stars: ✭ 192 (-34.25%)
awesome-coder-resources编程路上加油站!------【持续更新中...欢迎star,欢迎常回来看看......】【内容:编程/学习/阅读资源,开源项目,面试题,网站,书,博客,教程等等】
Stars: ✭ 54 (-81.51%)
lectures-hse-sparkМасштабируемое машинное обучение и анализ больших данных с Apache Spark
Stars: ✭ 20 (-93.15%)
dt-sql-parserSQL Parsers for BigData, built with antlr4.
Stars: ✭ 135 (-53.77%)
diffidoWatch web pages for changes
Stars: ✭ 19 (-93.49%)
amasAmas is recursive acronym for “Amas, monitor alert system”.
Stars: ✭ 77 (-73.63%)
aoc-dev-resourcesUseful repositories and articles related to developing software and analysis for Age of Empires II.
Stars: ✭ 40 (-86.3%)
counter-interview.deva collaborative collection of interview questions collected from both sides of the game: Interviewer(s) and Interviewee.
Stars: ✭ 102 (-65.07%)
cronerTrigger functions and/or evaluate cron expressions in JavaScript. No dependencies. Most features. All environments.
Stars: ✭ 169 (-42.12%)
naas⚙️ Schedule notebooks, run them like APIs, expose securely your assets: Jupyter as a viable ⚡️ Production environment
Stars: ✭ 219 (-25%)
shiftingA privacy-focused list of alternatives to mainstream services to help the competition.
Stars: ✭ 31 (-89.38%)
link-moveA model-driven dynamically-configurable framework to acquire data from external sources and save it to your database.
Stars: ✭ 32 (-89.04%)
indicium🔎 A simple in-memory search for collections and key-value stores.
Stars: ✭ 41 (-85.96%)
YaEtlYet Another ETL in PHP
Stars: ✭ 60 (-79.45%)
TT Jobs基于 Swoole 定时管理系统
Stars: ✭ 22 (-92.47%)
web-avant-gardes💇♀️ Collection of experimental, radical, or unorthodox websites
Stars: ✭ 48 (-83.56%)
derivejsDeriveJS is a reactive ODM - Object Document Mapper - framework, a "wrapper" around a database, that removes all the hassle of data-persistence by handling it transparently in the background, in a DRY manner.
Stars: ✭ 54 (-81.51%)
EasyAlbum📷 A lightweight, pure-Swift library for pick up photo from your album.
Stars: ✭ 31 (-89.38%)
hubot-scheduleA hubot script to schedule a message in both cron-style and datetime-based format pattern
Stars: ✭ 46 (-84.25%)
Kali-TXCustomized Kali Linux - Ansible playbook
Stars: ✭ 54 (-81.51%)
FlowMasterETL flow framework based on Yaml configs in Python
Stars: ✭ 19 (-93.49%)
humpback-centerHumpback Center 主要为 Humpback 平台提供集群容器调度服务,以集群中心角色实现各个 Group 的容器分配管理。
Stars: ✭ 37 (-87.33%)
morph-kgcPowerful RDF Knowledge Graph Generation with [R2]RML Mappings
Stars: ✭ 77 (-73.63%)
iex-stocksETL for the IEX Stocks API
Stars: ✭ 19 (-93.49%)
greycatGreyCat - Data Analytics, Temporal data, What-if, Live machine learning
Stars: ✭ 104 (-64.38%)
awesome-integrationA curated list of awesome system integration software and resources.
Stars: ✭ 117 (-59.93%)
time.cljtime util for Clojure(Script)
Stars: ✭ 45 (-84.59%)
HelloWorldsHello-World program in most programming languages
Stars: ✭ 102 (-65.07%)
neo4j-jdbcJDBC driver for Neo4j
Stars: ✭ 110 (-62.33%)
csv-cruncherTreats CSV and JSON files as SQL tables, and exports SQL SELECTs back to CSV or JSON.
Stars: ✭ 32 (-89.04%)
scala-datapipeline-dslDomain-specific language to help build and maintain AWS Data Pipelines
Stars: ✭ 25 (-91.44%)
dart-moreMore Dart — Literally.
Stars: ✭ 81 (-72.26%)
BETL-oldBETL. Meta data driven ETL generation using T-SQL
Stars: ✭ 17 (-94.18%)
schedulerMaintenance fork of Apache Aurora's Scheduler
Stars: ✭ 21 (-92.81%)
dtd2mysqlMySQL / MariaDB import for DTD feeds (fares, timetable and routeing)
Stars: ✭ 25 (-91.44%)
ptSchedulerPretty tiny Scheduler or ptScheduler is an Arduino library for writing non-blocking periodic tasks easily.
Stars: ✭ 14 (-95.21%)
Clustering4EverC4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.
Stars: ✭ 126 (-56.85%)
PersonNotes个人笔记集中营,快糙猛的形式记录技术性Notes .. 📚☕️⌨️🎧
Stars: ✭ 61 (-79.11%)
xamarin-forms-demo-appA demo application in this repository demonstrates the capabilities of the DevExpress Mobile UI for Xamarin.Forms: Data Grid, Editors, Charts, Scheduler, Data Form, and other controls.
Stars: ✭ 74 (-74.66%)
bigquery-data-lineageReference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
Stars: ✭ 112 (-61.64%)
intersect一道面试题的思考 - 6000万数据包和300万数据包在50M内存使用环境中求交集
Stars: ✭ 54 (-81.51%)
YACLibYet Another Concurrency Library
Stars: ✭ 193 (-33.9%)
wikirepoPython based Wikidata framework for easy dataframe extraction
Stars: ✭ 33 (-88.7%)