Bk Sops蓝鲸智云标准运维(SOPS)
Stars: ✭ 632 (+313.07%)
Chain.jlA Julia package for piping a value through a series of transformation expressions using a more convenient syntax than Julia's native piping functionality.
Stars: ✭ 118 (-22.88%)
dflibIn-memory Java DataFrame library
Stars: ✭ 50 (-67.32%)
DrakeAn R-focused pipeline toolkit for reproducibility and high-performance computing
Stars: ✭ 1,301 (+750.33%)
DatakitConnect processes into powerful data pipelines with a simple git-like filesystem interface
Stars: ✭ 951 (+521.57%)
get phylomarkersA pipeline to select optimal markers for microbial phylogenomics and species tree estimation using coalescent and concatenation approaches
Stars: ✭ 34 (-77.78%)
MlboxMLBox is a powerful Automated Machine Learning python library.
Stars: ✭ 1,199 (+683.66%)
Rangelessc++ LINQ -like library of higher-order functions for data manipulation
Stars: ✭ 148 (-3.27%)
Ananas DesktopA hackable data integration & analysis tool to enable non technical users to edit data processing jobs and visualise data on demand.
Stars: ✭ 551 (+260.13%)
DataspherestudioDataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Stars: ✭ 1,195 (+681.05%)
OkElegant error/exception handling in Elixir, with result monads.
Stars: ✭ 517 (+237.91%)
TOGGLEToolbox for generic NGS analyses - A framework to quickly build pipelines and to perform large-scale NGS analysis
Stars: ✭ 18 (-88.24%)
LastbackendSystem for containerized apps management. From build to scaling.
Stars: ✭ 1,536 (+903.92%)
BigsliceA serverless cluster computing system for the Go programming language
Stars: ✭ 469 (+206.54%)
AddaxAddax is an open source universal ETL tool that supports most of those RDBMS and NoSQLs on the planet, helping you transfer data from any one place to another.
Stars: ✭ 615 (+301.96%)
go-bqloaderbqloader is a simple ETL framework to load data from Cloud Storage into BigQuery.
Stars: ✭ 16 (-89.54%)
Locopylocopy: Loading/Unloading to Redshift and Snowflake using Python.
Stars: ✭ 73 (-52.29%)
cobrixA COBOL parser and Mainframe/EBCDIC data source for Apache Spark
Stars: ✭ 109 (-28.76%)
SmartcodeSmartCode = IDataSource -> IBuildTask -> IOutput => Build Everything!!!
Stars: ✭ 464 (+203.27%)
MuA full-stack DevOps on AWS framework
Stars: ✭ 948 (+519.61%)
germline-DNAA BioWDL variantcalling pipeline for germline DNA data. Starting with FASTQ files to produce VCF files. Category:Multi-Sample
Stars: ✭ 21 (-86.27%)
PglogicalLogical Replication extension for PostgreSQL 13, 12, 11, 10, 9.6, 9.5, 9.4 (Postgres), providing much faster replication than Slony, Bucardo or Londiste, as well as cross-version upgrades.
Stars: ✭ 455 (+197.39%)
gamechanger-dataGAMECHANGER aspires to be the Department’s trusted solution for evidence-based, data-driven decision-making across the universe of DoD requirements
Stars: ✭ 17 (-88.89%)
lightflowA lightweight, distributed workflow system
Stars: ✭ 67 (-56.21%)
TransporterSync data between persistence engines, like ETL only not stodgy
Stars: ✭ 1,175 (+667.97%)
towheeTowhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Stars: ✭ 821 (+436.6%)
PipelinePipeline is a package to build multi-staged concurrent workflows with a centralized logging output.
Stars: ✭ 433 (+183.01%)
wrangleA data transformation package for deep learning with Autonomio, Keras and TensorFlow.
Stars: ✭ 15 (-90.2%)
EuropaPuppet Container Registry
Stars: ✭ 114 (-25.49%)
Pytorch ToolbeltPyTorch extensions for fast R&D prototyping and Kaggle farming
Stars: ✭ 942 (+515.69%)
RushA cross-platform command-line tool for executing jobs in parallel
Stars: ✭ 421 (+175.16%)
html-pipelineHTML processing filters and utilities in Go version
Stars: ✭ 18 (-88.24%)
GlobalbioticinteractionsGlobal Biotic Interactions provides access to existing species interaction datasets
Stars: ✭ 71 (-53.59%)
ServingA flexible, high-performance carrier for machine learning models(『飞桨』服务化部署框架)
Stars: ✭ 403 (+163.4%)
MIPS-pipeline-processorA pipelined implementation of the MIPS processor featuring hazard detection as well as forwarding
Stars: ✭ 92 (-39.87%)
DatacleanerThe premier open source Data Quality solution
Stars: ✭ 391 (+155.56%)
Apos.ContentContent builder library for MonoGame.
Stars: ✭ 14 (-90.85%)
hyperdriveExtensible streaming ingestion pipeline on top of Apache Spark
Stars: ✭ 31 (-79.74%)
FlowexFlow-Based Programming framework for Elixir
Stars: ✭ 383 (+150.33%)
smagShow Me A Graph - Command Line Graphing
Stars: ✭ 78 (-49.02%)
UgeneUGENE is free open-source cross-platform bioinformatics software
Stars: ✭ 112 (-26.8%)
skippaSciKIt-learn Pipeline in PAndas
Stars: ✭ 33 (-78.43%)
Git Push DeploySimple Automated CI/CD Pipeline for GitHub and GitLab Projects
Stars: ✭ 21 (-86.27%)
CreditAn example project that predicts risk of credit card default using a Logistic Regression classifier and a 30,000 sample dataset.
Stars: ✭ 18 (-88.24%)
openrefine-dockerOpenRefine is a free, open source power tool for working with messy data and improving it. This repository contains Dockerbuild files for automated builds.
Stars: ✭ 19 (-87.58%)
EtlLinkedPipes ETL is an RDF based, lightweight ETL tool
Stars: ✭ 88 (-42.48%)
Yunmai Data ExtractExtract your data from the Yunmai weighing scales cloud API so you can use it elsewhere
Stars: ✭ 21 (-86.27%)