All Projects → Metl → Similar Projects or Alternatives

571 Open source projects that are alternatives of or similar to Metl

Bk Sops

蓝鲸智云标准运维(SOPS)

Stars: ✭ 632 (+313.07%)

Mutual labels: pipeline

web-click-flow

网站点击流离线日志分析

Stars: ✭ 14 (-90.85%)

Mutual labels: etl

Chain.jl

A Julia package for piping a value through a series of transformation expressions using a more convenient syntax than Julia's native piping functionality.

Stars: ✭ 118 (-22.88%)

Mutual labels: pipeline

dflib

In-memory Java DataFrame library

Stars: ✭ 50 (-67.32%)

Mutual labels: etl

Drake

An R-focused pipeline toolkit for reproducibility and high-performance computing

Stars: ✭ 1,301 (+750.33%)

Mutual labels: pipeline

Datakit

Connect processes into powerful data pipelines with a simple git-like filesystem interface

Stars: ✭ 951 (+521.57%)

Mutual labels: pipeline

open-semantic-desktop-search

Virtual Machine for Desktop Search with Open Semantic Search

Stars: ✭ 22 (-85.62%)

Mutual labels: etl

get phylomarkers

A pipeline to select optimal markers for microbial phylogenomics and species tree estimation using coalescent and concatenation approaches

Stars: ✭ 34 (-77.78%)

Mutual labels: pipeline

Mlbox

MLBox is a powerful Automated Machine Learning python library.

Stars: ✭ 1,199 (+683.66%)

Mutual labels: pipeline

Proposal Pipeline Operator

A proposal for adding a useful pipe operator to JavaScript.

Stars: ✭ 5,899 (+3755.56%)

Mutual labels: pipeline

Rangeless

c++ LINQ -like library of higher-order functions for data manipulation

Stars: ✭ 148 (-3.27%)

Mutual labels: pipeline

sagemaker-sparkml-serving-container

This code is used to build & run a Docker container for performing predictions against a Spark ML Pipeline.

Stars: ✭ 44 (-71.24%)

Mutual labels: pipeline

Ananas Desktop

A hackable data integration & analysis tool to enable non technical users to edit data processing jobs and visualise data on demand.

Stars: ✭ 551 (+260.13%)

Mutual labels: etl

classification

Catalyst.Classification

Stars: ✭ 35 (-77.12%)

Mutual labels: pipeline

Dataspherestudio

DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.

Stars: ✭ 1,195 (+681.05%)

Mutual labels: etl

predict-fraud-using-auto-ai

Use AutoAI to detect fraud

Stars: ✭ 27 (-82.35%)

Mutual labels: pipeline

Elegant error/exception handling in Elixir, with result monads.

Stars: ✭ 517 (+237.91%)

Mutual labels: pipeline

TOGGLE

Toolbox for generic NGS analyses - A framework to quickly build pipelines and to perform large-scale NGS analysis

Stars: ✭ 18 (-88.24%)

Mutual labels: pipeline

Lastbackend

System for containerized apps management. From build to scaling.

Stars: ✭ 1,536 (+903.92%)

Mutual labels: pipeline

Bigslice

A serverless cluster computing system for the Go programming language

Stars: ✭ 469 (+206.54%)

Mutual labels: etl

Addax

Addax is an open source universal ETL tool that supports most of those RDBMS and NoSQLs on the planet, helping you transfer data from any one place to another.

Stars: ✭ 615 (+301.96%)

Mutual labels: etl

go-bqloader

bqloader is a simple ETL framework to load data from Cloud Storage into BigQuery.

Stars: ✭ 16 (-89.54%)

Mutual labels: etl

Locopy

locopy: Loading/Unloading to Redshift and Snowflake using Python.

Stars: ✭ 73 (-52.29%)

Mutual labels: etl

cobrix

A COBOL parser and Mainframe/EBCDIC data source for Apache Spark

Stars: ✭ 109 (-28.76%)

Mutual labels: etl

Smartcode

SmartCode = IDataSource -> IBuildTask -> IOutput => Build Everything!!!

Stars: ✭ 464 (+203.27%)

Mutual labels: etl

A full-stack DevOps on AWS framework

Stars: ✭ 948 (+519.61%)

Mutual labels: pipeline

germline-DNA

A BioWDL variantcalling pipeline for germline DNA data. Starting with FASTQ files to produce VCF files. Category:Multi-Sample

Stars: ✭ 21 (-86.27%)

Mutual labels: pipeline

Pglogical

Logical Replication extension for PostgreSQL 13, 12, 11, 10, 9.6, 9.5, 9.4 (Postgres), providing much faster replication than Slony, Bucardo or Londiste, as well as cross-version upgrades.

Stars: ✭ 455 (+197.39%)

Mutual labels: etl

gamechanger-data

GAMECHANGER aspires to be the Department’s trusted solution for evidence-based, data-driven decision-making across the universe of DoD requirements

Stars: ✭ 17 (-88.89%)

Mutual labels: etl

lightflow

A lightweight, distributed workflow system

Stars: ✭ 67 (-56.21%)

Mutual labels: pipeline

Transporter

Sync data between persistence engines, like ETL only not stodgy

Stars: ✭ 1,175 (+667.97%)

Mutual labels: etl

towhee

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

Stars: ✭ 821 (+436.6%)

Mutual labels: pipeline

Pipeline

Pipeline is a package to build multi-staged concurrent workflows with a centralized logging output.

Stars: ✭ 433 (+183.01%)

Mutual labels: pipeline

wrangle

A data transformation package for deep learning with Autonomio, Keras and TensorFlow.

Stars: ✭ 15 (-90.2%)

Mutual labels: etl

Europa

Puppet Container Registry

Stars: ✭ 114 (-25.49%)

Mutual labels: pipeline

Open Solution Salt Identification

Open solution to the TGS Salt Identification Challenge

Stars: ✭ 124 (-18.95%)

Mutual labels: pipeline

Udacity Data Engineering

Udacity Data Engineering Nano Degree (DEND)

Stars: ✭ 89 (-41.83%)

Mutual labels: etl

Pytorch Toolbelt

PyTorch extensions for fast R&D prototyping and Kaggle farming

Stars: ✭ 942 (+515.69%)

Mutual labels: pipeline

image-processing-pipeline

An image build orchestrator for the modern web

Stars: ✭ 43 (-71.9%)

Mutual labels: pipeline

Rush

A cross-platform command-line tool for executing jobs in parallel

Stars: ✭ 421 (+175.16%)

Mutual labels: pipeline

html-pipeline

HTML processing filters and utilities in Go version

Stars: ✭ 18 (-88.24%)

Mutual labels: pipeline

Globalbioticinteractions

Global Biotic Interactions provides access to existing species interaction datasets

Stars: ✭ 71 (-53.59%)

Mutual labels: etl-framework

jenkins-terraform-pipeline

create a jenkins pipeline which uses terraform to manage AWS resources

Stars: ✭ 17 (-88.89%)

Mutual labels: pipeline

Serving

A flexible, high-performance carrier for machine learning models（『飞桨』服务化部署框架）

Stars: ✭ 403 (+163.4%)

Mutual labels: pipeline

architect big data solutions with spark

code, labs and lectures for the course

Stars: ✭ 40 (-73.86%)

Mutual labels: etl

Demo Jenkins Config As Code

Demo of Jenkins Configuration-As-Code with Docker and Groovy Hook Scripts

Stars: ✭ 143 (-6.54%)

Mutual labels: pipeline

MIPS-pipeline-processor

A pipelined implementation of the MIPS processor featuring hazard detection as well as forwarding

Stars: ✭ 92 (-39.87%)

Mutual labels: pipeline

Datacleaner

The premier open source Data Quality solution

Stars: ✭ 391 (+155.56%)

Mutual labels: etl

Apos.Content

Content builder library for MonoGame.

Stars: ✭ 14 (-90.85%)

Mutual labels: pipeline

Awesome Business Intelligence

Actively curated list of awesome BI tools. PRs welcome!

Stars: ✭ 1,157 (+656.21%)

Mutual labels: etl

hyperdrive

Extensible streaming ingestion pipeline on top of Apache Spark

Stars: ✭ 31 (-79.74%)

Mutual labels: pipeline

Flowex

Flow-Based Programming framework for Elixir

Stars: ✭ 383 (+150.33%)

Mutual labels: pipeline

smag

Show Me A Graph - Command Line Graphing

Stars: ✭ 78 (-49.02%)

Mutual labels: pipeline

Ugene

UGENE is free open-source cross-platform bioinformatics software

Stars: ✭ 112 (-26.8%)

Mutual labels: pipeline

skippa

SciKIt-learn Pipeline in PAndas

Stars: ✭ 33 (-78.43%)

Mutual labels: pipeline

Git Push Deploy

Simple Automated CI/CD Pipeline for GitHub and GitLab Projects

Stars: ✭ 21 (-86.27%)

Mutual labels: pipeline

Credit

An example project that predicts risk of credit card default using a Logistic Regression classifier and a 30,000 sample dataset.

Stars: ✭ 18 (-88.24%)

Mutual labels: pipeline

openrefine-docker

OpenRefine is a free, open source power tool for working with messy data and improving it. This repository contains Dockerbuild files for automated builds.

Stars: ✭ 19 (-87.58%)

Mutual labels: etl

Etl

LinkedPipes ETL is an RDF based, lightweight ETL tool

Stars: ✭ 88 (-42.48%)

Mutual labels: etl

Yunmai Data Extract

Extract your data from the Yunmai weighing scales cloud API so you can use it elsewhere

Stars: ✭ 21 (-86.27%)

Mutual labels: etl

301-360 of 571 similar projects

first

‹

›