465 open source projects that are alternatives to or similar to datajob

aws-pdf-textract-pipeline
🔍 Data pipeline for crawling PDFs from the Web and transforming their contents into structured data using AWS textract. Built with AWS CDK + TypeScript
Stars: ✭ 141 (+39.6%)
Mutual labels:  data-pipeline, aws-cdk
saisoku
Saisoku is a Python module that helps you build complex pipelines of batch file/directory transfer/sync jobs.
Stars: ✭ 40 (-60.4%)
Mutual labels:  pipeline, data-pipeline
instance-watcher
Get notified of instances mistakenly left running across all AWS regions for a specific AWS account
Stars: ✭ 90 (-10.89%)
Mutual labels:  glue, sagemaker
sagemaker-sparkml-serving-container
This code is used to build & run a Docker container for performing predictions against a Spark ML Pipeline.
Stars: ✭ 44 (-56.44%)
Mutual labels:  pipeline, sagemaker
GLUE-bert4keras
GLUE benchmark code based on bert4keras
Stars: ✭ 59 (-41.58%)
Mutual labels:  glue
Shifu
An end-to-end machine learning and data mining framework on Hadoop
Stars: ✭ 207 (+104.95%)
Mutual labels:  pipeline
Hands On Devops
A hands-on DevOps course covering the culture, methods and repeated practices of modern software development involving Packer, Vagrant, VirtualBox, Ansible, Kubernetes, K3s, MetalLB, Traefik, Docker-Compose, Docker, Taiga, GitLab, Drone CI, SonarQube, Selenium, InSpec, Alpine 3.10, Ubuntu-bionic, CentOS 7...
Stars: ✭ 196 (+94.06%)
Mutual labels:  pipeline
Zumis
zUMIs: A fast and flexible pipeline to process RNA sequencing data with UMIs
Stars: ✭ 178 (+76.24%)
Mutual labels:  pipeline
makepipe
Tools for constructing simple make-like pipelines in R.
Stars: ✭ 23 (-77.23%)
Mutual labels:  pipeline
cdk-chalice
AWS CDK construct for AWS Chalice
Stars: ✭ 41 (-59.41%)
Mutual labels:  aws-cdk
Rnaseq Workflow
A repository for setting up an RNAseq workflow
Stars: ✭ 170 (+68.32%)
Mutual labels:  pipeline
Bulk Writer
Provides guidance for fast ETL jobs, an IDataReader implementation for SqlBulkCopy (or the MySql or Oracle equivalents) that wraps an IEnumerable, and libraries for mapping entities to table columns.
Stars: ✭ 210 (+107.92%)
Mutual labels:  pipeline
targets-tutorial
Short course on the targets R package
Stars: ✭ 87 (-13.86%)
Mutual labels:  pipeline
Lightautoml
LAMA - automatic model creation framework
Stars: ✭ 196 (+94.06%)
Mutual labels:  pipeline
transtats
Track translations and automate workflow.
Stars: ✭ 31 (-69.31%)
Mutual labels:  pipeline
Pipeline.rs
☔️ => ⛅️ => ☀️
Stars: ✭ 188 (+86.14%)
Mutual labels:  pipeline
gofast
High performance transport protocol for distributed applications.
Stars: ✭ 19 (-81.19%)
Mutual labels:  pipeline
Pypyr
pypyr task-runner cli & api for automation pipelines. Automate anything by combining commands, different scripts in different languages & applications into one pipeline process.
Stars: ✭ 173 (+71.29%)
Mutual labels:  pipeline
bodywork-ml-pipeline-project
Deployment template for a continuous training pipeline.
Stars: ✭ 22 (-78.22%)
Mutual labels:  pipeline
Morphl Community Edition
MorphL Community Edition uses big data and machine learning to predict user behaviors in digital products and services with the end goal of increasing KPIs (click-through rates, conversion rates, etc.) through personalization
Stars: ✭ 253 (+150.5%)
Mutual labels:  pipeline
Plex
Open Source Pipeline for Maya, Houdini, 3ds Max and Nuke.
Stars: ✭ 170 (+68.32%)
Mutual labels:  pipeline
Unity resources
A list of resources and tutorials for those doing programming in Unity.
Stars: ✭ 170 (+68.32%)
Mutual labels:  pipeline
Cloud Dev
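Cloud R&D is a closed-loop, code-driven approach to software development that is born in the cloud. It lets business staff, developers, operations staff, and others collaborate on the same cloud platform and transparently complete the entire software life cycle (requirements, design, coding, build, deployment, operations), rather than working in isolation or relying on multiple separate tools to get the work done.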
Stars: ✭ 164 (+62.38%)
Mutual labels:  pipeline
pipe
Functional Pipeline in Go
Stars: ✭ 30 (-70.3%)
Mutual labels:  pipeline
Docker Android Build Box
An optimized Docker image that includes the Android, Kotlin, and Flutter SDKs.
Stars: ✭ 245 (+142.57%)
Mutual labels:  pipeline
Operator
Kubernetes operator to manage installation, updating, and uninstallation of tektoncd projects (pipeline, …)
Stars: ✭ 161 (+59.41%)
Mutual labels:  pipeline
Hkube
🐟 High Performance Computing over Kubernetes - Core Repo 🎣
Stars: ✭ 214 (+111.88%)
Mutual labels:  pipeline
Spacy Wordnet
spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface
Stars: ✭ 156 (+54.46%)
Mutual labels:  pipeline
Flowcraft
FlowCraft: a component-based pipeline composer for omics analysis using Nextflow. 🐳📦
Stars: ✭ 208 (+105.94%)
Mutual labels:  pipeline
lncpipe
UNDER DEVELOPMENT--- Analysis of long non-coding RNAs from RNA-seq datasets
Stars: ✭ 24 (-76.24%)
Mutual labels:  pipeline
Whispers
Identify hardcoded secrets and dangerous behaviours
Stars: ✭ 66 (-34.65%)
Mutual labels:  pipeline
scicloj.ml
A Clojure machine learning library
Stars: ✭ 152 (+50.5%)
Mutual labels:  data-pipeline
Jenkinsdocs
Jenkins practice documentation. Latest site address: http://www.idevops.site
Stars: ✭ 200 (+98.02%)
Mutual labels:  pipeline
AnimationDNA
Maya > Arnold > Nuke pipeline
Stars: ✭ 101 (+0%)
Mutual labels:  pipeline
Drone Cache
A Drone plugin for caching current workspace files between builds to reduce your build times
Stars: ✭ 194 (+92.08%)
Mutual labels:  pipeline
assume-role-arn
🤖🎩assume-role-arn allows you to easily assume an AWS IAM role in your CI/CD pipelines, without worrying about external dependencies.
Stars: ✭ 54 (-46.53%)
Mutual labels:  pipeline
Ssh Steps Plugin
Jenkins pipeline steps which provides SSH facilities such as command execution or file transfer for continuous delivery.
Stars: ✭ 183 (+81.19%)
Mutual labels:  pipeline
managed ml systems and iot
Managed Machine Learning Systems and Internet of Things Live Lesson
Stars: ✭ 35 (-65.35%)
Mutual labels:  sagemaker
Proposal Smart Pipelines
Old archived draft proposal for smart pipelines. Go to the new Hack-pipes proposal at js-choi/proposal-hack-pipes.
Stars: ✭ 177 (+75.25%)
Mutual labels:  pipeline
frizzle
The magic message bus
Stars: ✭ 14 (-86.14%)
Mutual labels:  pipeline
Faas Flow
Function Composition for OpenFaaS
Stars: ✭ 172 (+70.3%)
Mutual labels:  pipeline
nemesyst
Generalised and highly customisable, hybrid-parallelism, database based, deep learning framework.
Stars: ✭ 17 (-83.17%)
Mutual labels:  pipeline
Vectorsql
VectorSQL is a free analytics DBMS for IoT & Big Data, compatible with ClickHouse.
Stars: ✭ 171 (+69.31%)
Mutual labels:  pipeline
Al usdmaya
This repo is no longer updated. Please see https://github.com/Autodesk/maya-usd
Stars: ✭ 253 (+150.5%)
Mutual labels:  pipeline
Airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+4770.3%)
Mutual labels:  pipeline
scATAC-pro
A comprehensive tool for processing, analyzing and visualizing single-cell chromatin accessibility sequencing data
Stars: ✭ 63 (-37.62%)
Mutual labels:  pipeline
Dolphinbeat
A server that pulls and parses the MySQL binlog and pushes change data into different sinks such as Kafka.
Stars: ✭ 164 (+62.38%)
Mutual labels:  pipeline
Mipt Mips
Cycle-accurate pre-silicon simulator of RISC-V and MIPS CPUs
Stars: ✭ 250 (+147.52%)
Mutual labels:  pipeline
Core
The safe post-production pipeline - https://getavalon.github.io/2.0
Stars: ✭ 162 (+60.4%)
Mutual labels:  pipeline
AirflowETL
Blog post on ETL pipelines with Airflow
Stars: ✭ 20 (-80.2%)
Mutual labels:  data-pipeline
Aws Serverless Cicd Workshop
Learn how to build a CI/CD pipeline for SAM-based applications
Stars: ✭ 158 (+56.44%)
Mutual labels:  pipeline
Cli
A CLI for interacting with Tekton!
Stars: ✭ 229 (+126.73%)
Mutual labels:  pipeline
pd3f
🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based
Stars: ✭ 132 (+30.69%)
Mutual labels:  pipeline
Batchflow
BatchFlow helps you conveniently work with random or sequential batches of your data and define data processing and machine learning workflows even for datasets that do not fit into memory.
Stars: ✭ 156 (+54.46%)
Mutual labels:  pipeline
Automlpipeline.jl
A package that makes it trivial to create and evaluate machine learning pipeline architectures.
Stars: ✭ 223 (+120.79%)
Mutual labels:  pipeline
Ects
Elastic Crontab System: a simple, easy-to-use distributed scheduled task management system
Stars: ✭ 156 (+54.46%)
Mutual labels:  pipeline
Fluids
Fluid dynamics component of Chemical Engineering Design Library (ChEDL)
Stars: ✭ 154 (+52.48%)
Mutual labels:  pipeline
Bedops
🔬 BEDOPS: high-performance genomic feature operations
Stars: ✭ 215 (+112.87%)
Mutual labels:  pipeline
domino-research
Projects developed by Domino's R&D team
Stars: ✭ 74 (-26.73%)
Mutual labels:  sagemaker
http-api-aws-fargate-cdk
Build HTTP API Based Services using Amazon API Gateway, AWS PrivateLink, AWS Fargate and AWS CDK
Stars: ✭ 5 (-95.05%)
Mutual labels:  aws-cdk
1-60 of 465 similar projects