
NegatioN / Warp Pytorch

License: MIT
WARP loss for PyTorch, as described in the paper WSABIE: Scaling Up To Large Vocabulary Image Annotation

Projects that are alternatives of or similar to Warp Pytorch

Restaurant success model
A model that predicts if a restaurant is going to fail within the next 4 years.
Stars: ✭ 34 (-2.86%)
Mutual labels:  jupyter-notebook
Misc
Machine Learning / Randomized Algorithm and more
Stars: ✭ 34 (-2.86%)
Mutual labels:  jupyter-notebook
Stratx
stratx is a library for A Stratification Approach to Partial Dependence for Codependent Variables
Stars: ✭ 35 (+0%)
Mutual labels:  jupyter-notebook
Ds502 Ai Engineer
DS502 AI Engineer fast-track course projects, containing the learning and practice materials for each session of the course; students can post their questions in the issues and discuss with one another.
Stars: ✭ 34 (-2.86%)
Mutual labels:  jupyter-notebook
Kepler.gl
Kepler.gl is a powerful open source geospatial analysis tool for large-scale data sets.
Stars: ✭ 8,231 (+23417.14%)
Mutual labels:  jupyter-notebook
Healthcheck
Health Check ✔ is a Machine Learning Web Application made using Flask that can predict mainly three diseases i.e. Diabetes, Heart Disease, and Cancer.
Stars: ✭ 35 (+0%)
Mutual labels:  jupyter-notebook
Trainee
Stars: ✭ 34 (-2.86%)
Mutual labels:  jupyter-notebook
Chinese Stock Prediction Using Weibo Baidu News Sentiment
📈 A neural network regression model trained to predict the mean stock price percentage change every day using financial factors like previous close price, actual previous close price, open price, market capitalization, total market value, price-to-earnings ratio and price-to-book ratio, along with corresponding Sina Weibo and Baidu News sentiment scores.
Stars: ✭ 35 (+0%)
Mutual labels:  jupyter-notebook
Pyannote Core
Advanced data structures for handling temporal segments with attached labels.
Stars: ✭ 34 (-2.86%)
Mutual labels:  jupyter-notebook
Graph Isomorphism Networks
A Tensorflow 2.0 implementation of Graph Isomorphism Networks.
Stars: ✭ 35 (+0%)
Mutual labels:  jupyter-notebook
Pandas basics
basic pandas tutorials
Stars: ✭ 34 (-2.86%)
Mutual labels:  jupyter-notebook
Welcome tutorials
Various tutorials given for welcoming new students at MILA.
Stars: ✭ 975 (+2685.71%)
Mutual labels:  jupyter-notebook
Presentations
Stars: ✭ 35 (+0%)
Mutual labels:  jupyter-notebook
Teacher Student Training
This repository stores the files used for my summer internship's work on "teacher-student learning", an experimental method for training deep neural networks using a trained teacher model.
Stars: ✭ 34 (-2.86%)
Mutual labels:  jupyter-notebook
Nlpnotebook
A continuously evolving set of NLP study notes.
Stars: ✭ 35 (+0%)
Mutual labels:  jupyter-notebook
Learn Data Analysis W Python
Source code for 'Learn Data Analysis with Python' by A.J. Henley and Dave Wolf
Stars: ✭ 35 (+0%)
Mutual labels:  jupyter-notebook
Satellite imagery analysis
Implementation of different techniques to find insights from the satellite data using Python.
Stars: ✭ 31 (-11.43%)
Mutual labels:  jupyter-notebook
Picanet
Stars: ✭ 35 (+0%)
Mutual labels:  jupyter-notebook
Drugs Recommendation Using Reviews
Analyzing drug descriptions, conditions, and reviews, and then recommending drugs for each health condition of a patient using deep learning models.
Stars: ✭ 35 (+0%)
Mutual labels:  jupyter-notebook
Mlwpy code
Code from the Pearson Addison-Wesley book Machine Learning with Python for Everyone
Stars: ✭ 34 (-2.86%)
Mutual labels:  jupyter-notebook

WARP-Pytorch

An implementation of WARP loss which uses matrices and stays on the GPU in PyTorch.

This means that instead of using a for-loop to find the first offending negative sample (one that ranks above our positive), we compute all of them at once. Only afterwards do we find which sample is the first offender and compute the loss with respect to that sample.

The advantage is that it can use the speedups that come with GPU usage.
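
As a minimal illustration of that idea (a sketch only, not this package's actual code, and assuming the margin of 1 used in the WSABIE formulation), all violations can be flagged with one broadcasted comparison and the first offender per row picked out with a cumulative sum rather than a Python loop:

import torch

# Sketch only: flag every "offending" negative (one whose score comes within the
# margin of the positive score) for the whole batch in a single broadcasted
# comparison, then pick the first offender per row without a Python loop.
pos_scores = torch.tensor([[0.9], [0.2]])                # shape (batch, 1)
neg_scores = torch.tensor([[-0.5, 0.3, 0.4],
                           [-1.2, -0.9, 0.1]])           # shape (batch, num_negatives)

margin = 1.0                                             # margin used in the WSABIE formulation
offending = (neg_scores + margin) > pos_scores           # all violations, computed at once
first_offender = offending & (offending.long().cumsum(dim=1) == 1)
print(first_offender)
# tensor([[False,  True, False],
#         [False, False,  True]])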

When is WARP loss advantageous?

If you're ranking items or building recommendation models, it's often advantageous to let your loss function optimize directly for that objective. WARP loss pits one explicit positive item against the implicit negative items that a user never interacted with, and lets us adjust the network's weights accordingly.
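
For context, the WSABIE paper weights each violation by the positive item's estimated rank: if a violating negative is found after num_trials random draws out of num_labels items, the rank is approximated as floor((num_labels - 1) / num_trials) and weighted by the harmonic sum Phi(k) = 1 + 1/2 + ... + 1/k. The helper below is only an illustrative sketch of that weighting, not a function from this package:

def warp_rank_weight(num_labels, num_trials):
    # Illustrative sketch of the WSABIE rank weighting (not part of this package):
    # the positive item's rank is approximated as floor((num_labels - 1) / num_trials),
    # where num_trials is how many random negatives were drawn before one violated
    # the margin, and the weight is Phi(rank) = 1 + 1/2 + ... + 1/rank.
    approx_rank = (num_labels - 1) // num_trials
    return sum(1.0 / i for i in range(1, approx_rank + 1))

# A violation on the first draw suggests many items outscore the positive, so the
# update gets a large weight; one found only after many draws gets a weight near 1.
print(warp_rank_weight(num_labels=100, num_trials=1))   # ~5.18
print(warp_rank_weight(num_labels=100, num_trials=50))  # 1.0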

Install

pip install warp_loss

How to use

The loss function requires scores for both positive and negative examples to be supplied, as in the example below.

from torch import nn
import torch

# This import path is an assumption based on the package name above; adjust it
# if the installed module exposes warp_loss differently.
from warp_loss import warp_loss

class OurModel(nn.Module):
    def __init__(self, num_labels, emb_dim=10):
        super(OurModel, self).__init__()
        self.emb = nn.Embedding(num_labels, emb_dim)
        self.user_embs = nn.Embedding(1, emb_dim)

    def forward(self, pos, neg):
        batch_size = neg.size(0)
        one_user_vector = self.user_embs(torch.zeros(1).long())
        repeated_user_vector = one_user_vector.repeat((batch_size, 1)).view(batch_size, -1, 1)
        pos_res = torch.bmm(self.emb(pos), repeated_user_vector).squeeze(2)
        neg_res = torch.bmm(self.emb(neg), repeated_user_vector).squeeze(2)

        return pos_res, neg_res
        
num_labels = 100
model = OurModel(num_labels)
pos_labels = torch.randint(high=num_labels, size=(3,1)) # three positive labels, one per row
neg_labels = torch.randint(high=num_labels, size=(3,2)) # a few random negatives per positive

pos_res, neg_res = model(pos_labels, neg_labels)
print('Positive Labels:', pos_labels)
print('Negative Labels:', neg_labels)
print('Model positive scores:', pos_res)
print('Model negative scores:', neg_res)
loss = warp_loss(pos_res, neg_res, num_labels=num_labels, device=torch.device('cpu'))
print('Loss:', loss)
loss.backward()

Output:

Positive Labels: tensor([[21],
        [71],
        [26]])
Negative Labels: tensor([[47, 10],
        [56, 78],
        [44, 55]])
Model positive scores: tensor([[-4.9562],
        [-1.6886],
        [ 3.3984]], grad_fn=<SqueezeBackward1>)
Model negative scores: tensor([[ 1.0491,  4.9357],
        [-2.1289,  0.4496],
        [ 3.4541,  0.0931]], grad_fn=<SqueezeBackward1>)
Loss: tensor(39.6134, grad_fn=<SumBackward0>)

print('We can also see that the gradient is only active for 2x the number of positive labels:', (model.emb.weight.grad.sum(1) != 0).sum().item())
print('Meaning we correctly discard the gradients for all other than the offending negative label.')

Output:

We can also see that the gradient is only active for 2x the number of positive labels: 6
Meaning we correctly discard the gradients for all other than the offending negative label.

Assumptions

The loss function assumes you have already sampled your negatives randomly.

As an example this could be done in your dataloader:

  1. Assume we have a total dataset of 100 items
  2. Select a positive sample with index 8
  3. Your negatives should be a random selection from 0-99, excluding 8.

Example input to the loss function: model scores for pos: [8], neg: [88, 3, 99, 7]
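
A hypothetical sampler along those lines (the function name and sizes are illustrative, not part of this package) could look like:

import torch

def sample_negatives(positive_idx, num_items, num_negatives):
    # Hypothetical helper, not part of this package: draw item indices uniformly
    # at random and resample any draw that collides with the positive index.
    candidates = torch.randint(high=num_items, size=(num_negatives,))
    while (candidates == positive_idx).any():
        collisions = candidates == positive_idx
        candidates[collisions] = torch.randint(high=num_items, size=(int(collisions.sum()),))
    return candidates

print(sample_negatives(positive_idx=8, num_items=100, num_negatives=4))  # e.g. tensor([88,  3, 99,  7])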

Should work on all PyTorch versions from 0.4 and up.

References

WSABIE: Scaling Up To Large Vocabulary Image Annotation (Weston, Bengio, and Usunier)