WedatasphereWeDataSphere is a financial level one-stop open-source suitcase for big data platforms. Currently the source code of Scriptis and Linkis has already been released to the open-source community. WeDataSphere, Big Data Made Easy!
Stars: ✭ 372 (+143.14%)
pypiperPython toolkit for building restartable pipelines
Stars: ✭ 34 (-77.78%)
MuA full-stack DevOps on AWS framework
Stars: ✭ 948 (+519.61%)
germline-DNAA BioWDL variantcalling pipeline for germline DNA data. Starting with FASTQ files to produce VCF files. Category:Multi-Sample
Stars: ✭ 21 (-86.27%)
Geoweavera web system to allow users to automatically record history and manage complicated scientific workflows in web browsers involving the online spatial data facilities, high-performance computation platforms, and open-source libraries.
Stars: ✭ 32 (-79.08%)
DQCS数据质量控制系统
Stars: ✭ 34 (-77.78%)
WaterdropProduction Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+1113.07%)
PipelinesAn experimental programming language for data flow
Stars: ✭ 354 (+131.37%)
skinnerSkin export / import tools for Autodesk Maya
Stars: ✭ 68 (-55.56%)
Scrapy S3pipelineScrapy pipeline to store chunked items into Amazon S3 or Google Cloud Storage bucket.
Stars: ✭ 57 (-62.75%)
MNISTHandwritten digit recognizer using a feed-forward neural network and the MNIST dataset of 70,000 human-labeled handwritten digits.
Stars: ✭ 28 (-81.7%)
Webkettle基于web版kettle开发的一套分布式综合调度,管理,ETL开发的用户专业版B/S架构工具
Stars: ✭ 334 (+118.3%)
gamechanger-dataGAMECHANGER aspires to be the Department’s trusted solution for evidence-based, data-driven decision-making across the universe of DoD requirements
Stars: ✭ 17 (-88.89%)
MotorwayCloud ready pure-python streaming data pipeline library
Stars: ✭ 150 (-1.96%)
RnaseqRNA sequencing analysis pipeline using STAR, RSEM, HISAT2 or Salmon with gene/isoform counts and extensive quality control.
Stars: ✭ 305 (+99.35%)
Pytorch ToolbeltPyTorch extensions for fast R&D prototyping and Kaggle farming
Stars: ✭ 942 (+515.69%)
TormesMaking whole bacterial genome sequencing data analysis easy
Stars: ✭ 56 (-63.4%)
plumbingThis repo holds configuration for infrastructure used across the tektoncd org 🏗️
Stars: ✭ 41 (-73.2%)
TargetsFunction-oriented Make-like declarative workflows for R
Stars: ✭ 293 (+91.5%)
PulsarOpen source VFX pipeline tool
Stars: ✭ 20 (-86.93%)
Gitlab Dashboard📺 TV dashboard for a global view on Gitlab Pipelines
Stars: ✭ 107 (-30.07%)
scrnaseqA single-cell RNAseq pipeline for 10X genomics data
Stars: ✭ 60 (-60.78%)
Production Level Deep LearningA guideline for building practical production-level deep learning systems to be deployed in real world applications.
Stars: ✭ 3,358 (+2094.77%)
hms-av-pipeline-demoHUAWEI AV Pipeline Kit sample code project, which contains the Java sample code to implement functions like video playback, video super-resolution and media asset management. C++ sample code is contained for calling MediaFilter to use the sound event detection plugin.
Stars: ✭ 14 (-90.85%)
Dawn🌅 Dawn is a lightweight task management and build tool for front-end and nodejs.
Stars: ✭ 1,057 (+590.85%)
DagsterAn orchestration platform for the development, production, and observation of data assets.
Stars: ✭ 4,099 (+2579.08%)
CVparserCVparser is software for parsing or extracting data out of CV/resumes.
Stars: ✭ 28 (-81.7%)
uptasticsearchAn Elasticsearch client tailored to data science workflows.
Stars: ✭ 47 (-69.28%)
SparkflowEasy to use library to bring Tensorflow on Apache Spark
Stars: ✭ 282 (+84.31%)
django-calaccess-raw-dataA Django app to download, extract and load campaign finance and lobbying activity data from the California Secretary of State's CAL-ACCESS database
Stars: ✭ 61 (-60.13%)
Kiba PlusKiba enhancement for Ruby ETL.
Stars: ✭ 47 (-69.28%)
redundansRedundans is a pipeline that assists an assembly of heterozygous/polymorphic genomes.
Stars: ✭ 90 (-41.18%)
LettersA tiny debugging library for Ruby
Stars: ✭ 273 (+78.43%)
Csv2dbThe CSV to database command line loader
Stars: ✭ 102 (-33.33%)
piper-nfRNA mapping pipeline
Stars: ✭ 18 (-88.24%)
DgshShell supporting pipelines to and from multiple processes
Stars: ✭ 261 (+70.59%)
streamalgExtensible stream pipelines with object algebras.
Stars: ✭ 26 (-83.01%)
SnsAnalysis pipelines for sequencing data
Stars: ✭ 43 (-71.9%)
pyrealtimeRealtime data processing and plotting pipelines in Python
Stars: ✭ 62 (-59.48%)
polygon-etlETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (-65.36%)
Go spider[爬虫框架 (golang)] An awesome Go concurrent Crawler(spider) framework. The crawler is flexible and modular. It can be expanded to an Individualized crawler easily or you can use the default crawl components only.
Stars: ✭ 1,745 (+1040.52%)
skippaSciKIt-learn Pipeline in PAndas
Stars: ✭ 33 (-78.43%)
pachinkomodular pluggable media sorter
Stars: ✭ 27 (-82.35%)
Ensembl HiveEnsEMBL Hive - a system for creating and running pipelines on a distributed compute resource
Stars: ✭ 44 (-71.24%)
Git Push DeploySimple Automated CI/CD Pipeline for GitHub and GitLab Projects
Stars: ✭ 21 (-86.27%)
CreditAn example project that predicts risk of credit card default using a Logistic Regression classifier and a 30,000 sample dataset.
Stars: ✭ 18 (-88.24%)
openrefine-dockerOpenRefine is a free, open source power tool for working with messy data and improving it. This repository contains Dockerbuild files for automated builds.
Stars: ✭ 19 (-87.58%)
EtlLinkedPipes ETL is an RDF based, lightweight ETL tool
Stars: ✭ 88 (-42.48%)
Yunmai Data ExtractExtract your data from the Yunmai weighing scales cloud API so you can use it elsewhere
Stars: ✭ 21 (-86.27%)
cardano-pyPython3 lib and cli for operating a Cardano Passive Node and using the API's. (PRE-ALPHA)
Stars: ✭ 17 (-88.89%)
SpewAutomatic Packaging and Distribution of Bioinformatics Pipelines
Stars: ✭ 21 (-86.27%)
mlbgamedayMulti-core processing of 'Gameday' data from Major League Baseball Advanced Media. Additional tools to parallelize large data sets and write them to a database.
Stars: ✭ 37 (-75.82%)
spot-termination-exporterPrometheus spot instance exporter to monitor AWS instance termination with Hollowtrees
Stars: ✭ 30 (-80.39%)