
BlackHC / Toma

License: MIT
Helps you write algorithms in PyTorch that adapt to the available (CUDA) memory

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Toma

Deep Learning Boot Camp
A community run, 5-day PyTorch Deep Learning Bootcamp
Stars: ✭ 1,270 (+813.67%)
Mutual labels:  data-science, gpu
Atlas
An Open Source, Self-Hosted Platform For Applied Deep Learning Development
Stars: ✭ 259 (+86.33%)
Mutual labels:  data-science, gpu
Ml Workspace
🛠 All-in-one web-based IDE specialized for machine learning and data science.
Stars: ✭ 2,337 (+1581.29%)
Mutual labels:  data-science, gpu
Pycaret
An open-source, low-code machine learning library in Python
Stars: ✭ 4,594 (+3205.04%)
Mutual labels:  data-science, gpu
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+3969.06%)
Mutual labels:  data-science, gpu
Caer
High-performance Vision library in Python. Scale your research, not boilerplate.
Stars: ✭ 452 (+225.18%)
Mutual labels:  data-science, gpu
Awesome Distributed Deep Learning
A curated list of awesome Distributed Deep Learning resources.
Stars: ✭ 277 (+99.28%)
Mutual labels:  data-science, gpu
Hyperlearn
50% faster, 50% less RAM Machine Learning. Numba rewritten Sklearn. SVD, NNMF, PCA, LinearReg, RidgeReg, Randomized, Truncated SVD/PCA, CSR Matrices all 50+% faster
Stars: ✭ 1,204 (+766.19%)
Mutual labels:  data-science, gpu
Blazingsql
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
Stars: ✭ 1,652 (+1088.49%)
Mutual labels:  data-science, gpu
Webgl Fluid Simulation
Play with fluids in your browser (works even on mobile)
Stars: ✭ 11,621 (+8260.43%)
Mutual labels:  gpu
Youtube Like Predictor
YouTube Like Count Predictions using Machine Learning
Stars: ✭ 137 (-1.44%)
Mutual labels:  data-science
Beyond Jupyter
🐍💻📊 All material from the PyCon.DE 2018 Talk "Beyond Jupyter Notebooks - Building your own data science platform with Python & Docker" (incl. Slides, Video, Udemy MOOC & other References)
Stars: ✭ 135 (-2.88%)
Mutual labels:  data-science
Pandasschema
A validation library for Pandas data frames using user-friendly schemas
Stars: ✭ 135 (-2.88%)
Mutual labels:  data-science
Kelpnet
Pure C# machine learning framework
Stars: ✭ 136 (-2.16%)
Mutual labels:  gpu
Qlik Py Tools
Data Science algorithms for Qlik implemented as a Python Server Side Extension (SSE).
Stars: ✭ 135 (-2.88%)
Mutual labels:  data-science
Traffic
A toolbox for processing and analysing air traffic data
Stars: ✭ 138 (-0.72%)
Mutual labels:  data-science
Hermione
ML made simple
Stars: ✭ 135 (-2.88%)
Mutual labels:  data-science
Blockchain2graph
Blockchain2graph extracts blockchain data (bitcoin) and inserts it into a graph database (neo4j).
Stars: ✭ 134 (-3.6%)
Mutual labels:  data-science
Datasciencecoursera
Data Science Repo and blog for Johns Hopkins Coursera Courses. Please let me know if you have any questions.
Stars: ✭ 1,928 (+1287.05%)
Mutual labels:  data-science
Scilab
Free and Open Source software for numerical computation providing a powerful computing environment for engineering and scientific applications.
Stars: ✭ 138 (-0.72%)
Mutual labels:  data-science

Torch Memory-adaptive Algorithms (TOMA)


A collection of helpers to make it easier to write code that adapts to the available (CUDA) memory. Specifically, it retries code that fails due to OOM (out-of-memory) conditions and lowers batchsizes automatically.

To avoid failing over repeatedly, a simple cache memorizes the last successful batchsize for a given call site and the amount of free memory available at the time.
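
Conceptually, this is the pattern toma automates; a minimal sketch of such a retry loop (not toma's actual implementation) could look like:

import torch

def run_with_adaptive_batchsize(fn, initial_batchsize, *args, **kwargs):
    # Illustrative only: halve the batchsize whenever CUDA reports an
    # out-of-memory error, then retry the call.
    batchsize = initial_batchsize
    while True:
        try:
            return fn(batchsize, *args, **kwargs)
        except RuntimeError as error:
            if "out of memory" not in str(error) or batchsize <= 1:
                raise
            batchsize //= 2
            torch.cuda.empty_cache()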

Installation

To install using pip, use:

pip install toma

To run the tests, use:

python setup.py test

Example

from toma import toma

@toma.batch(initial_batchsize=512)
def run_inference(batchsize, model, dataset):
    # ...

run_inference(model, dataset)

This will try to execute run_inference with batchsize=512. If a memory error is thrown, it will decrease the batchsize until the call succeeds. (The decorator supplies the batchsize, so it is not passed at the call site.)

Note: this batch size can be different from the effective batch size used for optimization, which you can control separately by accumulating gradients and only calling optimizer.step() every few batches, as sketched below.
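
A hedged sketch of that pattern (model, dataset and optimizer are assumed to exist; effective_batchsize is a hypothetical target): the toma batchsize only controls how many samples go through the GPU per forward/backward pass, while the optimization batch size is kept constant by accumulating gradients.

import torch
import torch.nn.functional as F
from torch.utils.data import DataLoader

from toma import toma

@toma.batch(initial_batchsize=256)
def train_epoch(batchsize, model, dataset, optimizer, effective_batchsize=1024):
    # Accumulate gradients over several toma-sized batches so that each
    # optimizer step still corresponds to `effective_batchsize` samples.
    accumulate_every = max(1, effective_batchsize // batchsize)
    optimizer.zero_grad()
    for i, (inputs, targets) in enumerate(DataLoader(dataset, batch_size=batchsize)):
        loss = F.cross_entropy(model(inputs), targets) / accumulate_every
        loss.backward()
        if (i + 1) % accumulate_every == 0:
            optimizer.step()
            optimizer.zero_grad()

# Called without batchsize; the decorator supplies and adapts it.
train_epoch(model, dataset, optimizer)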

To make it easier to loop over ranges, there are also toma.range and toma.chunked:

@toma.chunked(initial_step=512)
def compute_result(out: torch.Tensor, start: int, end: int):
    # ...

result = torch.empty((8192, ...))
compute_result(result)

This will chunk result and pass the chunks to compute_result one by one. Again, if it fails due to OOM, the step will be halved and so on. Compared to toma.batch, this allows the step size to be reduced while looping over the chunks, which can save computation.
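
For instance, a hedged sketch with a hypothetical model and output shape, assuming each chunk is a view of result so that in-place writes fill the full tensor:

@toma.chunked(initial_step=512)
def compute_features(out: torch.Tensor, start: int, end: int):
    # `out` is the chunk result[start:end]; writing into it in place
    # stores the features for that slice of the (hypothetical) inputs.
    out.copy_(model(inputs[start:end]))

result = torch.empty((8192, 128))
compute_features(result)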

@toma.range(initial_step=32)
def reduce_data(start: int, end: int, out: torch.Tensor, dataA: torch.Tensor, dataB: torch.Tensor):
    # ...

reduce_data(0, 1024, result, dataA, dataB)

toma.range iterates over range(start, end, step) with step=initial_step. If it fails due to OOM, it will lower the step size and continue.

toma.execute

To make it easier to execute a block without first extracting it into a function and then calling it, we also provide toma.execute.batch, toma.execute.range and toma.execute.chunked. These are somewhat unorthodox in that they call the function passed to them right away (mainly because Python has no support for anonymous functions beyond lambda expressions).

def function():
    # ... other code

    @toma.execute.chunked(batched_data, initial_step=128)
    def compute(chunk, start, end):
        # ...
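
For example, a hedged end-to-end sketch using toma.execute.chunked (the tensor and the in-place normalization are hypothetical):

import torch
from toma import toma

def normalize_in_place(batched_data: torch.Tensor):
    mean, std = batched_data.mean(), batched_data.std()

    # The decorated block runs immediately, chunk by chunk over batched_data;
    # if OOM occurs, the remaining chunks are processed with a smaller step.
    @toma.execute.chunked(batched_data, initial_step=128)
    def normalize(chunk, start, end):
        chunk.sub_(mean).div_(std)

normalize_in_place(torch.randn(4096, 64))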

Cache

There are 3 cache types available at the moment. The cache type can be changed either by setting toma.DEFAULT_CACHE_TYPE or by passing cache_type to the individual calls.

For example:

@toma.batch(initial_batchsize=512, cache_type=toma.GlobalBatchsizeCache)

or

toma.explicit.batch(..., toma_cache_type=toma.GlobalBatchsizeCache)

StacktraceMemoryBatchsizeCache: Stacktrace & Available Memory (the default)

This memorizes the successful batchsizes for a given call trace and the memory that was available at that point. For most machine learning code, this is sufficient to remember the right batchsize without having to look at the actual arguments or understand more of their semantics.

The implicit assumption is that, after a few iterations, a stable state will be reached with regard to GPU and CPU memory usage.
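
For example, in the hedged sketch below (hypothetical call sites), each call site of the same decorated function remembers its own last successful batchsize, because the stacktrace differs:

from toma import toma

@toma.batch(initial_batchsize=1024)
def evaluate(batchsize, model, dataset):
    ...

def validation_loop(model, val_set):
    # This call site caches its own last successful batchsize ...
    evaluate(model, val_set)

def test_loop(model, test_set):
    # ... independently of this one, even though both call `evaluate`.
    evaluate(model, test_set)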

To limit the CPU memory of the process, toma provides:

import toma.cpu_memory

toma.cpu_memory.set_cpu_memory_limit(8)

This can also be useful to avoid accidental swap thrashing.

GlobalBatchsizeCache: Global per Function

This reuses the last successful batchsize independently from where the call happened.

NoBatchsizeCache: No Caching

Always starts with the suggested batchsize and fails over if necessary.

Benchmark/Overhead

There is some overhead involved, so toma should only be used for operations that are themselves time- or memory-consuming.

---------------------------------------------------------------------------------- benchmark: 5 tests ----------------------------------------------------------------------------------
Name (time in ms)          Min                Max               Mean            StdDev             Median                IQR            Outliers       OPS            Rounds  Iterations
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_native             2.1455 (1.0)       3.7733 (1.0)       2.3037 (1.0)      0.1103 (1.0)       2.2935 (1.0)       0.1302 (1.0)          81;5  434.0822 (1.0)         448           1
test_simple            17.4657 (8.14)     27.0049 (7.16)     21.0453 (9.14)     2.6233 (23.79)    20.4881 (8.93)      3.4384 (26.42)        13;0   47.5165 (0.11)         39           1
test_toma_no_cache     31.4380 (14.65)    40.8567 (10.83)    33.2749 (14.44)    2.2530 (20.43)    32.2698 (14.07)     2.8210 (21.67)         4;1   30.0527 (0.07)         25           1
test_explicit          33.0759 (15.42)    52.1866 (13.83)    39.6956 (17.23)    6.9620 (63.14)    38.4929 (16.78)    11.2344 (86.31)         4;0   25.1917 (0.06)         20           1
test_toma              36.9633 (17.23)    57.0220 (15.11)    43.5201 (18.89)    6.7318 (61.05)    41.6034 (18.14)     7.2173 (55.45)         2;2   22.9779 (0.05)         13           1
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

Thanks

Thanks to @y0ast for feedback and discussion.
