vtrokhymenko / dst

Licence: MIT license

yet another custom data science template via cookiecutter

Programming Languages

139335 projects - #7 most used programming language

Projects that are alternatives of or similar to dst

A professionally curated list of awesome Conformal Prediction videos, tutorials, books, papers, PhD and MSc theses, articles and open-source libraries.

Stars: ✭ 998 (+1591.53%)

Mutual labels: datascience, machinelearning, deeplearning

gan deeplearning4j

Automatic feature engineering using Generative Adversarial Networks using Deeplearning4j and Apache Spark.

Stars: ✭ 19 (-67.8%)

Mutual labels: datascience, machinelearning, deeplearning

python

Python codes from tutorials on the Data Professor YouTube channel

Stars: ✭ 51 (-13.56%)

Mutual labels: datascience, machinelearning, machinelearning-python

Free-Courses-on-Data-Science

No description or website provided.

Stars: ✭ 24 (-59.32%)

Mutual labels: datascience, deeplearning-ai, machinelearning-python

Data-Scientist-In-Python

This repository contains notes and projects of Data scientist track from dataquest course work.

Stars: ✭ 23 (-61.02%)

Mutual labels: datascience, machinelearning, deeplearning

Ludwig

Data-centric declarative deep learning framework

Stars: ✭ 8,018 (+13489.83%)

Mutual labels: datascience, machinelearning, deeplearning

Ai Series

Stars: ✭ 702 (+1089.83%)

Mutual labels: datascience, machinelearning, deeplearning

ML-For-Beginners

12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all

Stars: ✭ 40,023 (+67735.59%)

Mutual labels: machinelearning, machinelearning-python

Plant Disease Detection

Plant Disease Detector Web Application

Stars: ✭ 181 (+206.78%)

Mutual labels: machinelearning, machinelearning-python

Groundbreaking-Papers

ML Research paper summaries, annotated papers and implementation walkthroughs

Stars: ✭ 90 (+52.54%)

Mutual labels: machinelearning, deeplearning

66Days NaturalLanguageProcessing

I am sharing my Journey of 66DaysofData in Natural Language Processing.

Stars: ✭ 127 (+115.25%)

Mutual labels: datascience, deeplearning

continuous Bernoulli

There are C language computer programs about the simulator, transformation, and test statistic of continuous Bernoulli distribution. More than that, the book contains continuous Binomial distribution and continuous Trinomial distribution.

Stars: ✭ 22 (-62.71%)

Mutual labels: deeplearning, deeplearning-ai

mllint

`mllint` is a command-line utility to evaluate the technical quality of Python Machine Learning (ML) projects by means of static analysis of the project's repository.

Stars: ✭ 67 (+13.56%)

Mutual labels: machinelearning, machinelearning-python

cookiecutter-modern-datascience

Start a data science project with modern tools

Stars: ✭ 136 (+130.51%)

Mutual labels: cookiecutter, datascience

comet-for-mlflow

Comet-For-MLFlow Extension

Stars: ✭ 48 (-18.64%)

Mutual labels: datascience, machinelearning

ml-book

Codice sorgente ed Errata Corrige del mio libro "A tu per tu col Machine Learning"

Stars: ✭ 16 (-72.88%)

Mutual labels: datascience, machinelearning

RcppDynProg

Dynamic Programming implemented in Rcpp. Includes example partition and out of sample fitting applications.

Stars: ✭ 13 (-77.97%)

Mutual labels: datascience, machinelearning

Kapsul-Aglari-ile-Isaret-Dili-Tanima

Recognition of Sign Language using Capsule Networks

Stars: ✭ 42 (-28.81%)

Mutual labels: machinelearning, deeplearning

Data-Science-Resources

A guide to getting started with Data Science and ML.

Stars: ✭ 17 (-71.19%)

Mutual labels: datascience, machinelearning

type4py

Type4Py: Deep Similarity Learning-Based Type Inference for Python

Stars: ✭ 41 (-30.51%)

Mutual labels: machinelearning, deeplearning

View All Similar Projects ➔

data science template

in this repo u can look at default template for ds/ml/dl/.. projects or similar

how to use

before creating a new project from this template, u need to install the next dependencies
- cookiecutter
```
brew install cookiecutter
```
  or
```
pip install cookiecutter
```
- github cli
  - macos
    - install
      brew install gh
    - upgrade
      brew upgrade gh
  - linux
    
    look at the linux installation instructions
after go to the directory where u want to create your project and run
```
cookiecutter gh:vtrokhymenko/dst
```
follow the instruction

using the next project structure

├── .github                       <- some actions
│   ├── workflows
│   │   └── ci.yml
│   └── dependabot.yml
│
├── LICENSE                       <- will be created if u choose
├── README.md                     <- the main readme
│
├── config                        <- often it's yaml-files with some parameters
│
├── data
│   ├── external                  <- data from third party sources
│   ├── interim                   <- intermediate data that has been transformed
│   ├── processed                 <- the final, canonical data sets for modeling
│   ├── raw                       <- the original, immutable data dump
│   ├── features                  <- another
│   └── README.md
│
├── docs                          <- a default sphinx project (see sphinx-doc.org for details)
│
├── experiments                   <- for any experiments
│   └── README.md
│
├── models                        <- trained & serialized models, model predictions, or model summaries
│   └── README.md
│
├── notebooks                     <- notebooks for research
│                                    naming convention is a number (for ordering), the creator's initials, and a short `-`
│                                    delimited description, eg `1.0-jqp-initial-data-exploration`
│
├── references                    <- data dictionaries, manuals, and all other explanatory materials
│   └── README.md
│
├── tests                         <- test for project
│
├── {{ cookiecutter.repo_name }}  <- source code
│   ├── __init__.py               <- makes src a python module eg propose generate with `mkinit`
│   │
│   ├── data                      <- scripts to download or generate data
│   │
│   ├── models                    <- scripts to train models and then use trained models to make predictions
│   │
│   └── visualization             <- scripts to create exploratory and results oriented visualizations
│
├── .gitignore                    <- default for python
│
└── .pre-commit-config.yaml       <- custom pcc with `reorder_python_imports`, `black`, `flake8`, `pre-commit-pyright`, `pre-commit-hooks`

other similar templates

propose to use next tools

gh – github on the terminal
dvc – open-source version control system for ds projects
cml – continuous machine learning | ci/cd for ml/dl
renovate - yet another dependency management
hydra – to configuring complex applications
pipreqs – autogenerate pip requirements
pre-commit – framework for managing & maintaining multi-language pre-commit hooks
code style/review/formatter/typer
- codefactor
- snyk
- deepsource
- prettier
- pycodestyle
- pyre-check
- pyright
- restyled (autopep8, black, isort, prettier-markdown, reorder-python-imports, yapf)
- super-linter (pylint, flake8, awesome-flake8-extensions, black)
- yapf
- vulture
tests
- codecov
- coveragepy
- pytest (guide)
- pytest-cov
- mutmut
profiler/debugger
- birdseye
- heartrate
- palanteer
- py-spy
- pyheat
- snoop
- viztracer
spellcheckers

citation

@misc{dst,
  author = {trokhymenko viktor},
  title = {data science template},
  year = {2020},
  publisher = {github},
  howpublished = {\url{https://github.com/vtrokhymenko/dst}}
}

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

vtrokhymenko / dst

Programming Languages

Labels

Projects that are alternatives of or similar to dst

data science template

how to use

using the next project structure

other similar templates

propose to use next tools

citation