
lucidrains / nystrom-attention

License: MIT
Implementation of Nyström Self-attention, from the paper Nyströmformer

Programming Languages

Python
139,335 projects - #7 most used programming language

Projects that are alternatives to or similar to nystrom-attention

AttentionGatedVNet3D
Attention Gated VNet3D Model for KiTS19 (2019 Kidney Tumor Segmentation Challenge)
Stars: ✭ 35 (-57.83%)
Mutual labels:  attention-mechanism
SANET
Arbitrary Style Transfer with Style-Attentional Networks
Stars: ✭ 105 (+26.51%)
Mutual labels:  attention-mechanism
ntua-slp-semeval2018
Deep-learning models of the NTUA-SLP team submitted to SemEval 2018 tasks 1, 2 and 3.
Stars: ✭ 79 (-4.82%)
Mutual labels:  attention-mechanism
abcnn pytorch
Implementation of ABCNN (Attention-Based Convolutional Neural Network) in PyTorch
Stars: ✭ 35 (-57.83%)
Mutual labels:  attention-mechanism
egfr-att
Drug effect prediction using neural networks
Stars: ✭ 17 (-79.52%)
Mutual labels:  attention-mechanism
visualization
A collection of visualization functions
Stars: ✭ 189 (+127.71%)
Mutual labels:  attention-mechanism
minimal-nmt
A minimal NMT example to serve as a seq2seq + attention reference.
Stars: ✭ 36 (-56.63%)
Mutual labels:  attention-mechanism
Patient2Vec
Patient2Vec: A Personalized Interpretable Deep Representation of the Longitudinal Electronic Health Record
Stars: ✭ 85 (+2.41%)
Mutual labels:  attention-mechanism
halonet-pytorch
Implementation of the 😇 Attention layer from the paper, Scaling Local Self-Attention For Parameter Efficient Visual Backbones
Stars: ✭ 181 (+118.07%)
Mutual labels:  attention-mechanism
automatic-personality-prediction
[AAAI 2020] Modeling Personality with Attentive Networks and Contextual Embeddings
Stars: ✭ 43 (-48.19%)
Mutual labels:  attention-mechanism
domain-attention
Code for the paper "Domain Attention Model for Multi-Domain Sentiment Classification"
Stars: ✭ 22 (-73.49%)
Mutual labels:  attention-mechanism
Multigrid-Neural-Architectures
Multigrid Neural Architecture
Stars: ✭ 28 (-66.27%)
Mutual labels:  attention-mechanism
OverlapPredator
[CVPR 2021, Oral] PREDATOR: Registration of 3D Point Clouds with Low Overlap.
Stars: ✭ 293 (+253.01%)
Mutual labels:  attention-mechanism
A-Persona-Based-Neural-Conversation-Model
No description or website provided.
Stars: ✭ 22 (-73.49%)
Mutual labels:  attention-mechanism
AoA-pytorch
A PyTorch implementation of the Attention on Attention module (both self and guided variants), for Visual Question Answering
Stars: ✭ 33 (-60.24%)
Mutual labels:  attention-mechanism
extkeras
Playground for implementing custom layers and other components compatible with Keras, with the purpose of learning the framework better and perhaps, in the future, offering some utilities to others.
Stars: ✭ 18 (-78.31%)
Mutual labels:  attention-mechanism
dcsp segmentation
No description or website provided.
Stars: ✭ 34 (-59.04%)
Mutual labels:  attention-mechanism
NTUA-slp-nlp
💻Speech and Natural Language Processing (SLP & NLP) Lab Assignments for ECE NTUA
Stars: ✭ 19 (-77.11%)
Mutual labels:  attention-mechanism
keras-deep-learning
Various implementations and projects involving CNNs, RNNs, LSTMs, GANs, etc.
Stars: ✭ 22 (-73.49%)
Mutual labels:  attention-mechanism
DAF3D
Deep Attentive Features for Prostate Segmentation in 3D Transrectal Ultrasound
Stars: ✭ 60 (-27.71%)
Mutual labels:  attention-mechanism

Nyström Attention

Implementation of Nyström Self-attention, from the paper Nyströmformer.

Yannic Kilcher video

Install

$ pip install nystrom-attention

Usage

import torch
from nystrom_attention import NystromAttention

attn = NystromAttention(
    dim = 512,
    dim_head = 64,
    heads = 8,
    num_landmarks = 256,    # number of landmarks
    pinv_iterations = 6,    # number of Moore-Penrose iterations for approximating the pseudoinverse. 6 was recommended by the paper
    residual = True         # whether to add an extra residual of the values. Supposedly faster convergence if turned on
)

x = torch.randn(1, 16384, 512)
mask = torch.ones(1, 16384).bool()

attn(x, mask = mask) # (1, 16384, 512)
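
For intuition, the sketch below shows, in simplified form, how the landmark-based approximation and the iterative Moore-Penrose pseudoinverse (controlled by pinv_iterations) fit together, following the scheme described in the paper. It assumes a single head and a sequence length divisible by the number of landmarks, and it omits masking and the optional value residual, so it is not the package's actual implementation; the names iterative_pinv and nystrom_attention_sketch are illustrative only.

import torch
import torch.nn.functional as F

def iterative_pinv(A, iters = 6):
    # approximate the Moore-Penrose pseudoinverse of A with the cubic
    # iteration described in the paper (illustrative sketch)
    I = torch.eye(A.shape[-1], device = A.device, dtype = A.dtype)

    # initialization: Z_0 = A^T / (||A||_1 * ||A||_inf), per matrix in the batch
    max_col_sum = A.abs().sum(dim = -2).amax(dim = -1, keepdim = True).unsqueeze(-1)
    max_row_sum = A.abs().sum(dim = -1).amax(dim = -1, keepdim = True).unsqueeze(-1)
    Z = A.transpose(-1, -2) / (max_col_sum * max_row_sum)

    for _ in range(iters):
        AZ = A @ Z
        Z = 0.25 * Z @ (13 * I - AZ @ (15 * I - AZ @ (7 * I - AZ)))
    return Z

def nystrom_attention_sketch(q, k, v, num_landmarks = 64, pinv_iterations = 6):
    # simplified single-head Nystrom attention: landmarks are segment means
    # of the (scaled) queries and keys
    b, n, d = q.shape
    q = q * d ** -0.5

    q_land = q.reshape(b, num_landmarks, n // num_landmarks, d).mean(dim = -2)
    k_land = k.reshape(b, num_landmarks, n // num_landmarks, d).mean(dim = -2)

    # the three smaller softmax kernels from the paper
    kernel_1 = F.softmax(q @ k_land.transpose(-1, -2), dim = -1)       # (b, n, m)
    kernel_2 = F.softmax(q_land @ k_land.transpose(-1, -2), dim = -1)  # (b, m, m)
    kernel_3 = F.softmax(q_land @ k.transpose(-1, -2), dim = -1)       # (b, m, n)

    # softmax(QK^T)V is approximated by kernel_1 @ pinv(kernel_2) @ (kernel_3 @ V)
    return kernel_1 @ iterative_pinv(kernel_2, pinv_iterations) @ (kernel_3 @ v)

q = k = v = torch.randn(1, 1024, 64)
out = nystrom_attention_sketch(q, k, v) # (1, 1024, 64)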

Nyströmformer, layers of Nyström attention

import torch
from nystrom_attention import Nystromformer

model = Nystromformer(
    dim = 512,
    dim_head = 64,
    heads = 8,
    depth = 6,
    num_landmarks = 256,
    pinv_iterations = 6
)

x = torch.randn(1, 16384, 512)
mask = torch.ones(1, 16384).bool()

model(x, mask = mask) # (1, 16384, 512)
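
Since Nystromformer consumes already-embedded inputs of shape (batch, seq, dim) and returns features of the same shape, it can be used as a drop-in encoder inside a larger model. The wrapper below is a hypothetical example only: the embedding layer, masked mean pooling, and classification head are illustrative additions and not part of this package.

import torch
from torch import nn
from nystrom_attention import Nystromformer

class SequenceClassifier(nn.Module):
    # hypothetical wrapper: token embedding -> Nystromformer encoder -> masked mean pool -> linear head
    def __init__(self, num_tokens, dim = 512, depth = 6, num_classes = 2):
        super().__init__()
        self.embed = nn.Embedding(num_tokens, dim)
        self.encoder = Nystromformer(dim = dim, dim_head = 64, heads = 8, depth = depth)
        self.to_logits = nn.Linear(dim, num_classes)

    def forward(self, token_ids, mask = None):
        x = self.embed(token_ids)
        x = self.encoder(x, mask = mask)

        if mask is not None:
            mask = mask.float().unsqueeze(-1)
            x = (x * mask).sum(dim = 1) / mask.sum(dim = 1)
        else:
            x = x.mean(dim = 1)

        return self.to_logits(x)

model = SequenceClassifier(num_tokens = 20000)
tokens = torch.randint(0, 20000, (1, 16384))
mask = torch.ones(1, 16384).bool()

logits = model(tokens, mask = mask) # (1, 2)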

You can also import it as Nyströmer if you wish.

from nystrom_attention import Nystromer

Citations

@misc{xiong2021nystromformer,
    title   = {Nyströmformer: A Nyström-Based Algorithm for Approximating Self-Attention},
    author  = {Yunyang Xiong and Zhanpeng Zeng and Rudrasis Chakraborty and Mingxing Tan and Glenn Fung and Yin Li and Vikas Singh},
    year    = {2021},
    eprint  = {2102.03902},
    archivePrefix = {arXiv},
    primaryClass = {cs.CL}
}