All Projects → quantumblacklabs → Kedro

quantumblacklabs / Kedro

Licence: other
A Python framework for creating reproducible, maintainable and modular data science code.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Kedro

kedro
A Python framework for creating reproducible, maintainable and modular data science code.
Stars: ✭ 6,068 (+27.37%)
Mutual labels:  pipeline, pipelines-as-code, data-versioning, mlops, kedro
Lambdacd
a library to define a continuous delivery pipeline in code
Stars: ✭ 655 (-86.25%)
Mutual labels:  hacktoberfest, pipeline
Great expectations
Always know what to expect from your data.
Stars: ✭ 5,808 (+21.91%)
Mutual labels:  pipeline, mlops
Demo Jenkins Config As Code
Demo of Jenkins Configuration-As-Code with Docker and Groovy Hook Scripts
Stars: ✭ 143 (-97%)
Mutual labels:  hacktoberfest, pipeline
Pipelines
Machine Learning Pipelines for Kubeflow
Stars: ✭ 2,607 (-45.28%)
Mutual labels:  pipeline, mlops
Galaxy
Data intensive science for everyone.
Stars: ✭ 812 (-82.96%)
Mutual labels:  hacktoberfest, pipeline
Cimonitor
Displays CI statuses on a dashboard and triggers fun modules representing the status!
Stars: ✭ 34 (-99.29%)
Mutual labels:  hacktoberfest, pipeline
Drone Cache
A Drone plugin for caching current workspace files between builds to reduce your build times
Stars: ✭ 194 (-95.93%)
Mutual labels:  hacktoberfest, pipeline
bodywork-ml-pipeline-project
Deployment template for a continuous training pipeline.
Stars: ✭ 22 (-99.54%)
Mutual labels:  pipeline, mlops
kedro-airflow-k8s
Kedro Plugin to support running pipelines on Kubernetes using Airflow.
Stars: ✭ 22 (-99.54%)
Mutual labels:  mlops, kedro
krsh
A declarative KubeFlow Management Tool
Stars: ✭ 127 (-97.33%)
Mutual labels:  pipeline, mlops
pipelines-as-code
Pipelines as Code
Stars: ✭ 37 (-99.22%)
Mutual labels:  pipeline, pipelines-as-code
Jenkins
Jenkins automation server
Stars: ✭ 18,225 (+282.56%)
Mutual labels:  hacktoberfest, pipelines-as-code
Drools
Drools is a rule engine, DMN engine and complex event processing (CEP) engine for Java.
Stars: ✭ 4,062 (-14.74%)
Mutual labels:  hacktoberfest
Pg Mem
An in memory postgres DB instance for your unit tests
Stars: ✭ 350 (-92.65%)
Mutual labels:  hacktoberfest
Kube Bench
Checks whether Kubernetes is deployed according to security best practices as defined in the CIS Kubernetes Benchmark
Stars: ✭ 4,359 (-8.5%)
Mutual labels:  hacktoberfest
Tsuru
Open source and extensible Platform as a Service (PaaS).
Stars: ✭ 3,761 (-21.05%)
Mutual labels:  hacktoberfest
Exceptionless.net
Exceptionless clients for the .NET platform
Stars: ✭ 362 (-92.4%)
Mutual labels:  hacktoberfest
Metacpan Web
Web interface for MetaCPAN
Stars: ✭ 361 (-92.42%)
Mutual labels:  hacktoberfest
5e Database
Database for the D&D 5th Edition API
Stars: ✭ 354 (-92.57%)
Mutual labels:  hacktoberfest

Kedro Logo Banner Python version PyPI version Conda version License Discord Server CircleCI - Main Branch Develop Branch Build Documentation

What is Kedro?

Kedro is an open-source Python framework for creating reproducible, maintainable and modular data science code. It borrows concepts from software engineering and applies them to machine-learning code; applied concepts include modularity, separation of concerns and versioning.

How do I install Kedro?

To install Kedro from the Python Package Index (PyPI) simply run:

pip install kedro

It is also possible to install Kedro using conda:

conda install -c conda-forge kedro

Our Get Started guide contains full installation instructions, and includes how to set up Python virtual environments.

What are the main features of Kedro?

Kedro-Viz Pipeline Visualisation A pipeline visualisation generated using Kedro-Viz

Feature What is this?
Project Template A standard, modifiable and easy-to-use project template based on Cookiecutter Data Science.
Data Catalog A series of lightweight data connectors used to save and load data across many different file formats and file systems, including local and network file systems, cloud object stores, and HDFS. The Data Catalog also includes data and model versioning for file-based systems.
Pipeline Abstraction Automatic resolution of dependencies between pure Python functions and data pipeline visualisation using Kedro-Viz.
Coding Standards Test-driven development using pytest, produce well-documented code using Sphinx, create linted code with support for flake8, isort and black and make use of the standard Python logging library.
Flexible Deployment Deployment strategies that include single or distributed-machine deployment as well as additional support for deploying on Argo, Prefect, Kubeflow, AWS Batch and Databricks.

How do I use Kedro?

The Kedro documentation includes three examples to help get you started:

Why does Kedro exist?

Kedro is built upon our collective best-practice (and mistakes) trying to deliver real-world ML applications that have vast amounts of raw unvetted data. We developed Kedro to achieve the following:

  • To address the main shortcomings of Jupyter notebooks, one-off scripts, and glue-code because there is a focus on creating maintainable data science code
  • To enhance team collaboration when different team members have varied exposure to software engineering concepts
  • To increase efficiency, because applied concepts like modularity and separation of concerns inspire the creation of reusable analytics code

The humans behind Kedro

Kedro is maintained by a product team and a number of contributors from across the world.

Can I contribute?

Yes! Want to help build Kedro? Check out our guide to contributing to Kedro.

Where can I learn more?

There is a growing community around Kedro. Have a look at the Kedro FAQs to find projects using Kedro and links to articles, podcasts and talks.

Who likes Kedro?

There are Kedro users across the world, who work at start-ups, major enterprises and academic institutions like Absa, Acensi, Advanced Programming Solutions SL, AI Singapore, Augment Partners, AXA UK, Belfius, Beamery, Caterpillar, CRIM, Dendra Systems, Element AI, GetInData, GMO, Indicium, Imperial College London, ING, Jungle Scout, Helvetas, Leapfrog, McKinsey & Company, Mercado Libre Argentina, Modec, Mosaic Data Science, NaranjaX, NASA, Open Data Science LatAm, Prediqt, QuantumBlack, Retrieva, Roche, Sber, Société Générale, Telkomsel, Universidad Rey Juan Carlos, UrbanLogiq, Wildlife Studios, WovenLight and XP.

Kedro has also won Best Technical Tool or Framework for AI in the 2019 Awards AI competition and a merit award for the 2020 UK Technical Communication Awards. It is listed on the 2020 ThoughtWorks Technology Radar and the 2020 Data & AI Landscape.

How can I cite Kedro?

If you're an academic, Kedro can also help you, for example, as a tool to solve the problem of reproducible research. Use the "Cite this repository" button on our repository to generate a citation from the CITATION.cff file.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].