All Projects → Shifu → Similar Projects or Alternatives

840 Open source projects that are alternatives of or similar to Shifu

Pyfunctional
Python library for creating data pipelines with chain functional programming
Stars: ✭ 1,943 (+838.65%)
Mutual labels:  pipeline
Flinkx
Based on Apache Flink. support data synchronization/integration and streaming SQL computation.
Stars: ✭ 2,651 (+1180.68%)
Mutual labels:  bigdata
Rangeless
c++ LINQ -like library of higher-order functions for data manipulation
Stars: ✭ 148 (-28.5%)
Mutual labels:  pipeline
Zumis
zUMIs: A fast and flexible pipeline to process RNA sequencing data with UMIs
Stars: ✭ 178 (-14.01%)
Mutual labels:  pipeline
Pipcook
Machine learning platform for Web developers
Stars: ✭ 2,186 (+956.04%)
Mutual labels:  pipeline
Dolphinbeat
A server that pulls and parses MySQL binlog, pushs change data into different sinks like Kafka.
Stars: ✭ 164 (-20.77%)
Mutual labels:  pipeline
Hadoop
Apache Hadoop
Stars: ✭ 12,177 (+5782.61%)
Mutual labels:  hadoop
Drone Cache
A Drone plugin for caching current workspace files between builds to reduce your build times
Stars: ✭ 194 (-6.28%)
Mutual labels:  pipeline
Parquet Rs
Apache Parquet implementation in Rust
Stars: ✭ 144 (-30.43%)
Mutual labels:  hadoop
Big Whale
Spark、Flink等离线任务的调度以及实时任务的监控
Stars: ✭ 163 (-21.26%)
Mutual labels:  hadoop
Demo Jenkins Config As Code
Demo of Jenkins Configuration-As-Code with Docker and Groovy Hook Scripts
Stars: ✭ 143 (-30.92%)
Mutual labels:  pipeline
Proposal Smart Pipelines
Old archived draft proposal for smart pipelines. Go to the new Hack-pipes proposal at js-choi/proposal-hack-pipes.
Stars: ✭ 177 (-14.49%)
Mutual labels:  pipeline
Awesome Decision Tree Papers
A collection of research papers on decision, classification and regression trees with implementations.
Stars: ✭ 1,908 (+821.74%)
Mutual labels:  random-forest
Core
The safe post-production pipeline - https://getavalon.github.io/2.0
Stars: ✭ 162 (-21.74%)
Mutual labels:  pipeline
Eel Sdk
Big Data Toolkit for the JVM
Stars: ✭ 140 (-32.37%)
Mutual labels:  hadoop
Jenkinsdocs
Jenkins实践文档 最新站点地址: http://www.idevops.site
Stars: ✭ 200 (-3.38%)
Mutual labels:  pipeline
Go spider
[爬虫框架 (golang)] An awesome Go concurrent Crawler(spider) framework. The crawler is flexible and modular. It can be expanded to an Individualized crawler easily or you can use the default crawl components only.
Stars: ✭ 1,745 (+743%)
Mutual labels:  pipeline
Machine Learning Models
Decision Trees, Random Forest, Dynamic Time Warping, Naive Bayes, KNN, Linear Regression, Logistic Regression, Mixture Of Gaussian, Neural Network, PCA, SVD, Gaussian Naive Bayes, Fitting Data to Gaussian, K-Means
Stars: ✭ 160 (-22.71%)
Mutual labels:  random-forest
Xlearning
AI on Hadoop
Stars: ✭ 1,709 (+725.6%)
Mutual labels:  hadoop
Chefboost
A Lightweight Decision Tree Framework supporting regular algorithms: ID3, C4,5, CART, CHAID and Regression Trees; some advanced techniques: Gradient Boosting (GBDT, GBRT, GBM), Random Forest and Adaboost w/categorical features support for Python
Stars: ✭ 176 (-14.98%)
Mutual labels:  random-forest
Jenkins Pipeline Library
wcm.io Jenkins Pipeline Library for CI/CD
Stars: ✭ 134 (-35.27%)
Mutual labels:  pipeline
Aws Serverless Cicd Workshop
Learn how to build a CI/CD pipeline for SAM-based applications
Stars: ✭ 158 (-23.67%)
Mutual labels:  pipeline
Karton
Distributed malware processing framework based on Python, Redis and MinIO.
Stars: ✭ 134 (-35.27%)
Mutual labels:  pipeline
Pipeline.rs
☔️ => ⛅️ => ☀️
Stars: ✭ 188 (-9.18%)
Mutual labels:  pipeline
Mara Pipelines
A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
Stars: ✭ 1,841 (+789.37%)
Mutual labels:  pipeline
Spacy Wordnet
spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface
Stars: ✭ 156 (-24.64%)
Mutual labels:  pipeline
Randomforest
随机森林,Random Forest(RF)
Stars: ✭ 132 (-36.23%)
Mutual labels:  random-forest
Randomforestexplainer
A set of tools to understand what is happening inside a Random Forest
Stars: ✭ 175 (-15.46%)
Mutual labels:  random-forest
Tipdm
TipDM建模平台,开源的数据挖掘工具。
Stars: ✭ 130 (-37.2%)
Mutual labels:  bigdata
Ects
Elastic Crontab System 简单易用的分布式定时任务管理系统
Stars: ✭ 156 (-24.64%)
Mutual labels:  pipeline
Spark
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+731.4%)
Mutual labels:  bigdata
Scrapy demo
all kinds of scrapy demo
Stars: ✭ 128 (-38.16%)
Mutual labels:  pipeline
Fluids
Fluid dynamics component of Chemical Engineering Design Library (ChEDL)
Stars: ✭ 154 (-25.6%)
Mutual labels:  pipeline
Airflow Pipeline
An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR
Stars: ✭ 128 (-38.16%)
Mutual labels:  hadoop
Spydra
Ephemeral Hadoop clusters using Google Compute Platform
Stars: ✭ 128 (-38.16%)
Mutual labels:  hadoop
Pypyr
pypyr task-runner cli & api for automation pipelines. Automate anything by combining commands, different scripts in different languages & applications into one pipeline process.
Stars: ✭ 173 (-16.43%)
Mutual labels:  pipeline
Nmflibrary
MATLAB library for non-negative matrix factorization (NMF): Version 1.8.1
Stars: ✭ 153 (-26.09%)
Mutual labels:  bigdata
Griffon Vm
Griffon Data Science Virtual Machine
Stars: ✭ 128 (-38.16%)
Mutual labels:  hadoop
Fpart
Sort files and pack them into partitions
Stars: ✭ 127 (-38.65%)
Mutual labels:  bigdata
Open Solution Toxic Comments
Open solution to the Toxic Comment Classification Challenge
Stars: ✭ 154 (-25.6%)
Mutual labels:  pipeline
Volcano
A Cloud Native Batch System (Project under CNCF)
Stars: ✭ 2,114 (+921.26%)
Mutual labels:  bigdata
Handwritten Digit Recognition Using Deep Learning
Handwritten Digit Recognition using Machine Learning and Deep Learning
Stars: ✭ 127 (-38.65%)
Mutual labels:  random-forest
Ssh Steps Plugin
Jenkins pipeline steps which provides SSH facilities such as command execution or file transfer for continuous delivery.
Stars: ✭ 183 (-11.59%)
Mutual labels:  pipeline
Faas Flow
Function Composition for OpenFaaS
Stars: ✭ 172 (-16.91%)
Mutual labels:  pipeline
Javainterview
最全的Java技术知识点,以及Java源码分析。为开源贡献自己的一份力。
Stars: ✭ 154 (-25.6%)
Mutual labels:  bigdata
Ml Projects
ML based projects such as Spam Classification, Time Series Analysis, Text Classification using Random Forest, Deep Learning, Bayesian, Xgboost in Python
Stars: ✭ 127 (-38.65%)
Mutual labels:  random-forest
Squeezemeta
A complete pipeline for metagenomic analysis
Stars: ✭ 128 (-38.16%)
Mutual labels:  pipeline
Emlearn
Machine Learning inference engine for Microcontrollers and Embedded devices
Stars: ✭ 154 (-25.6%)
Mutual labels:  random-forest
Pipelinex
PipelineX: Python package to build ML pipelines for experimentation with Kedro, MLflow, and more
Stars: ✭ 127 (-38.65%)
Mutual labels:  pipeline
Rnaseq Workflow
A repository for setting up a RNAseq workflow
Stars: ✭ 170 (-17.87%)
Mutual labels:  pipeline
Pipeline Live
Pipeline Extension for Live Trading
Stars: ✭ 154 (-25.6%)
Mutual labels:  pipeline
Semsegpipeline
A simpler way of reading and augmenting image segmentation data into TensorFlow
Stars: ✭ 126 (-39.13%)
Mutual labels:  pipeline
Parquet4s
Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.
Stars: ✭ 125 (-39.61%)
Mutual labels:  hadoop
Movie recommend
基于Spark的电影推荐系统,包含爬虫项目、web网站、后台管理系统以及spark推荐系统
Stars: ✭ 2,092 (+910.63%)
Mutual labels:  hadoop
The Data Science Workshop
A New, Interactive Approach to Learning Data Science
Stars: ✭ 126 (-39.13%)
Mutual labels:  random-forest
Sarek
Detect germline or somatic variants from normal or tumour/normal whole-genome or targeted sequencing
Stars: ✭ 124 (-40.1%)
Mutual labels:  pipeline
Kotlin Spark Api
This projects gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x
Stars: ✭ 183 (-11.59%)
Mutual labels:  bigdata
Deeplearning4j
Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learni…
Stars: ✭ 12,277 (+5830.92%)
Mutual labels:  hadoop
Metl
mito ETL tool
Stars: ✭ 153 (-26.09%)
Mutual labels:  pipeline
Open Solution Salt Identification
Open solution to the TGS Salt Identification Challenge
Stars: ✭ 124 (-40.1%)
Mutual labels:  pipeline
61-120 of 840 similar projects