minerva-ml / Steppy

Licence: MIT
Lightweight Python library for fast and reproducible experimentation 🔬

Steppy

What is Steppy?

  1. Steppy is a lightweight, open-source Python 3 library for fast and reproducible experimentation.
  2. Steppy lets data scientists focus on data science, not on software development issues.
  3. Steppy's minimal interface does not impose constraints, yet it enables clean machine learning pipeline design.

What problem does Steppy solve?

Problems

In the course of a project, a data scientist faces two problems:

  1. Difficulties with reproducibility in data science / machine learning projects.
  2. Lack of the ability to prepare or extend experiments quickly.

Solution

Steppy addresses both problems by introducing two simple abstractions: Step and Transformer. We consider them a minimal interface for building machine learning pipelines.

  1. Step is a wrapper over the Transformer. It handles multiple aspects of pipeline execution, such as saving intermediate results (if needed), checkpointing the model during training, and much more.
  2. Transformer, in turn, is a purely computational piece defined by the data scientist. It takes some input data and produces some output data. Typical Transformers are neural networks, machine learning algorithms, and pre- or post-processing routines.
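
The split between the two abstractions can be sketched in plain Python. This is only an illustration of the idea and does not use Steppy's actual API: the names `ScaleTransformer`, `SketchStep`, `cache_dir` and `fit_transform` are hypothetical and were chosen for this example. The transformer holds only computation, while the wrapper takes care of saving and reusing the intermediate result.

```python
import pickle
import tempfile
from pathlib import Path


class ScaleTransformer:
    """Hypothetical transformer: pure computation, no I/O or bookkeeping."""

    def __init__(self):
        self.max_value = None

    def fit(self, values):
        # Learn the single parameter this toy transformer needs.
        self.max_value = max(values)
        return self

    def transform(self, values):
        # Scale every value by the maximum seen during fit.
        return [v / self.max_value for v in values]


class SketchStep:
    """Hypothetical wrapper that handles execution concerns for a transformer."""

    def __init__(self, name, transformer, cache_dir):
        self.name = name
        self.transformer = transformer
        self.cache_path = Path(cache_dir) / (name + ".pkl")

    def fit_transform(self, values):
        # Reuse a saved intermediate result when one exists.
        if self.cache_path.exists():
            with self.cache_path.open("rb") as f:
                return pickle.load(f)
        # Otherwise run the transformer and save its output for later runs.
        output = self.transformer.fit(values).transform(values)
        with self.cache_path.open("wb") as f:
            pickle.dump(output, f)
        return output


with tempfile.TemporaryDirectory() as cache_dir:
    step = SketchStep("scale", ScaleTransformer(), cache_dir)
    first = step.fit_transform([1.0, 2.0, 4.0])   # computed and saved
    second = step.fit_transform([1.0, 2.0, 4.0])  # loaded from the saved file
```

In this sketch, swapping in a different transformer leaves the saving logic untouched, which is the kind of separation the Step and Transformer pair is meant to provide.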

Start using steppy

Installation

Steppy requires Python 3.5 or above.

pip3 install steppy

(You will probably want to install it inside a virtualenv.)

Resources

  1. 📒 Documentation
  2. 💻 Source
  3. 📛 Bug reports
  4. 🚀 Feature requests
  5. 🌟 Tutorial notebooks (their repository)

Feature Requests

Please send us your ideas on how to improve the steppy library! We are looking for your comments here: Feature requests.

Roadmap

⏩ At this point steppy is an early-stage library that has been heavily tested on multiple machine learning challenges (data-science-bowl, toxic-comment-classification-challenge, mapping-challenge) and educational projects (minerva-advanced-data-scientific-training).

⏩ We are developing steppy into a practical tool that lets data scientists run their experiments easily and change their pipelines with just a few modifications to the code.

Related projects

We are also building steppy-toolkit, a collection of high-quality implementations of the top deep learning architectures, all of them with the same intuitive interface.

Contributing

You are welcome to contribute to the Steppy library. Please check CONTRIBUTING for more information.

Terms of use

Steppy is MIT-licensed.