All Projects → aws-samples → python-data-science-template

aws-samples / python-data-science-template

Licence: MIT-0 license
Template for a modular, Python-based data science project.

Programming Languages

Jupyter Notebook
11667 projects
python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to python-data-science-template

pyramid-cookiecutter-alchemy
[DEPRECATED - Please use https://github.com/pylons/pyramid-cookiecutter-starter instead] A Cookiecutter (project template) for creating a Pyramid project using SQLite for persistent storage, SQLAlchemy for an ORM, Alembic for database migrations, URL dispatch for routing, and Jinja2 for templating.
Stars: ✭ 39 (+105.26%)
Mutual labels:  cookiecutter-template
cookiecutter-homeassistant-component
A cookiecutter project template for generating the structure for a new custom component.
Stars: ✭ 22 (+15.79%)
Mutual labels:  cookiecutter-template
theme-cookiecutter
A cookiecutter template to help you make new JupyterLab theme extensions
Stars: ✭ 47 (+147.37%)
Mutual labels:  cookiecutter-template
cookiecutter-qt-app
A cookiecutter to create Qt applications, with translations and packaging
Stars: ✭ 29 (+52.63%)
Mutual labels:  cookiecutter-template
cookiecutter-pypackage
A cookiecutter template for Python package with heavy use of Github actions
Stars: ✭ 19 (+0%)
Mutual labels:  cookiecutter-template
Cookiecutter Data Science
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
Stars: ✭ 5,271 (+27642.11%)
Mutual labels:  cookiecutter-template
cookiecutter-pyms
Cookiecutter template for a Python microservice.
Stars: ✭ 49 (+157.89%)
Mutual labels:  cookiecutter-template
cookiecutter-go
boilerplate, golang project starter tool, support go-zero/go-micro/gin
Stars: ✭ 63 (+231.58%)
Mutual labels:  cookiecutter-template
cookiedozer
📱 Cookiecutter for i18n Kivy Apps
Stars: ✭ 48 (+152.63%)
Mutual labels:  cookiecutter-template
python-template
Netherlands eScience Center Python Template
Stars: ✭ 136 (+615.79%)
Mutual labels:  cookiecutter-template
at-python-template
The official Python Project Template of Alexander Thamm GmbH
Stars: ✭ 34 (+78.95%)
Mutual labels:  cookiecutter-template
cookiecutter-flask-skeleton
Flask Starter Project
Stars: ✭ 56 (+194.74%)
Mutual labels:  cookiecutter-template
cookiecutter-python-cli
A cookiecutter template for creating a Python CLI application using click
Stars: ✭ 65 (+242.11%)
Mutual labels:  cookiecutter-template
qt-qml-project-template-with-ci
Template for a Qt/QML application with batteries included: GitHub C.I. for your QML app; automated gui testing with Xvfb; automatic code-format checks and more. Compiles for Desktop and Mobile (Linux, Mac, Windows, and Android).
Stars: ✭ 33 (+73.68%)
Mutual labels:  cookiecutter-template
FastAPI-template
Feature rich robust FastAPI template.
Stars: ✭ 660 (+3373.68%)
Mutual labels:  cookiecutter-template
cookiecutter-modern-pypackage
Cookiecutter template for a modern Python package.
Stars: ✭ 97 (+410.53%)
Mutual labels:  cookiecutter-template
cookiecutter-rimworld-mod-development
A cookiecutter project that builds the basic Rimworld mod development file structure and build environment.
Stars: ✭ 59 (+210.53%)
Mutual labels:  cookiecutter-template
cookiecutter-modern-datascience
Start a data science project with modern tools
Stars: ✭ 136 (+615.79%)
Mutual labels:  cookiecutter-template
cookiecutter-homeassistant-custom-component
Cookiecutter template for Home Assistant custom component
Stars: ✭ 37 (+94.74%)
Mutual labels:  cookiecutter-template
beeware-android-template
Template for starting a native Android app using Python and the BeeWare tools
Stars: ✭ 31 (+63.16%)
Mutual labels:  cookiecutter-template

A cookiecutter template for Python data science projects

Pre-requisite

You need to have the cli cookiecutter available in your Python environment. Please see its installation instructions here.

Usage

To generate a directory structure for a new data science project, you can run the following commands in your Python environment.

cookiecutter https://github.com/aws-samples/python-data-science-template

setup-example-640px

Alternatively, you can also clone this repository to use a local template:

# Clone to a local repository in the current directory.
git clone https://github.com/aws-samples/python-data-science-template.git

# The above command creates python-data-science-template/ in the current dir.

# Use the local repo to generate project structure
cookiecutter python-data-science-template

Project Structure

By using this template, your data science project is auto-generated as follows:

.
|-- bin/
|-- notebooks                    # A directory to place all notebooks files.
|   |-- *.ipynb
|   `-- my_nb_path.py            # Imported by *.ipynb to treat src/ as PYTHONPATH
|-- requirements/
|-- src
|   |-- my_custom_module         # Your custom module
|   |-- my_nb_color.py           # Imported by *.ipynb to colorize their outputs
|   `-- source_dir               # Additional codes such as SageMaker source dir
|-- tests/                       # Unit tests
|-- MANIFEST.in                  # Required by setup.py (if module name specified)
|-- setup.py                     # To pip install your Python module (if module name specified)

# These sample configuration files are auto-generated too:
|-- .editorconfig                # Sample editor config (for IDE / editor that supports this)
|-- .gitattributes               # Sample .gitattributes
|-- .gitleaks.toml               # Sample Gitleaks config (if pre_commit is advanced)
|-- .gitignore                   # Sample .gitignore
|-- .pre-commit-config.yaml      # Sample precommit hooks
|-- LICENSE                      # Boilperplate (auto-generated)
|-- README.md                    # Template for you to customize
|-- pyproject.toml               # Sample configurations for Python toolchains
`-- tox.ini                      # Sample configurations for Python toolchains

This structure has been used in a few other places as well, e.g., aws-samples/sagemaker-rl-energy-storage-system and aws-samples/amazon-sagemaker-gluonts-entrypoint. Feel free to look at those repositories and observe the project structure documented in their README.md.

Related Projects

Ready to start your new data science project on AWS? If so, you may want to check on these related samples.

  1. Do you like to work on EC2 instances? Then why don't you check out these simple template to setup basic Vim, Tmux, Zsh for the Deep Learning AMI Amazon Linux 2 for data scientsts.

  2. Do you like to work on SageMaker classic notebook instances? Then why don't you check out the one-liner customization command that quickly applies common tweaks on a fresh (i.e., newly created or rebooted) SageMaker classic notebook instance, to make the notebook instance a little bit more ergonomic for prolonged usage.

  3. Are you loooking for a quickstart to accelerate the delivery of custom ML solutions to production, without having to make too many design choices? Then why don't you check out the ML Max repo which includes templates for four pillars: training pipeline, inference pipeline, development environment and data management/ETL.

  4. Are you tired of repeatedly writing the same boilerplate codes for common, tactical data science tasks? Then why don't you check on the SageMaker meta-entrypoint utilities, and the smallmatter library.

Security

See CONTRIBUTING for more information.

License

This library is licensed under the MIT-0 License. See the LICENSE file.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].