wenguanwang / Dhf1k

Revisiting Video Saliency: A Large-scale Benchmark and a New Model (CVPR18, PAMI19)

Programming Languages

matlab
3953 projects

Projects that are alternatives of or similar to Dhf1k

Guided Attention Inference Network
Contains an implementation of the Guided Attention Inference Network (GAIN) presented in "Tell Me Where to Look" (CVPR 2018). This repository aims to apply GAIN to the FCN-8 architecture used for segmentation.
Stars: ✭ 204 (+112.5%)
Mutual labels:  attention-mechanism, cvpr2018
Ylg
[CVPR 2020] Official Implementation: "Your Local GAN: Designing Two Dimensional Local Attention Mechanisms for Generative Models".
Stars: ✭ 109 (+13.54%)
Mutual labels:  attention-mechanism, cvpr
Prm
Weakly Supervised Instance Segmentation using Class Peak Response, in CVPR 2018 (Spotlight)
Stars: ✭ 322 (+235.42%)
Mutual labels:  cvpr, cvpr2018
Attentive Gan Derainnet
Unofficial TensorFlow implementation of the "Attentive Generative Adversarial Network for Raindrop Removal from a Single Image" (CVPR 2018) model: https://maybeshewill-cv.github.io/attentive-gan-derainnet/
Stars: ✭ 184 (+91.67%)
Mutual labels:  attention-mechanism, cvpr2018
Awesome Cvpr Paper
A collection of CVPR papers, including but not limited to papers from 2021, 2020, 2019, 2018, and 2017.
Stars: ✭ 493 (+413.54%)
Mutual labels:  cvpr, cvpr2018
Sarcasm Detection
Detecting sarcasm on Twitter using both traditional machine learning and deep learning techniques.
Stars: ✭ 73 (-23.96%)
Mutual labels:  attention-mechanism
Attend infer repeat
A TensorFlow implementation of Attend, Infer, Repeat.
Stars: ✭ 82 (-14.58%)
Mutual labels:  attention-mechanism
Se3 Transformer Pytorch
Implementation of SE3-Transformers for Equivariant Self-Attention, in PyTorch. This specific repository is geared towards integration with an eventual AlphaFold2 replication.
Stars: ✭ 73 (-23.96%)
Mutual labels:  attention-mechanism
Pytorch Attention Guided Cyclegan
PyTorch implementation of Unsupervised Attention-guided Image-to-Image Translation.
Stars: ✭ 67 (-30.21%)
Mutual labels:  attention-mechanism
Eqtransformer
EQTransformer, a Python package for earthquake signal detection and phase picking using AI.
Stars: ✭ 95 (-1.04%)
Mutual labels:  attention-mechanism
Tracknpred
This is the code base for our ACM CSCS 2019 paper: "RobustTP: End-to-End Trajectory Prediction for Heterogeneous Road-Agents in Dense Traffic with Noisy Sensor Inputs". This codebase contains implementations for several trajectory prediction methods including Social-GAN and TraPHic.
Stars: ✭ 88 (-8.33%)
Mutual labels:  cvpr
Simplednn
SimpleDNN is a lightweight, open-source machine learning library written in Kotlin, designed to support relevant neural network architectures in natural language processing tasks.
Stars: ✭ 81 (-15.62%)
Mutual labels:  attention-mechanism
Fake news detection deep learning
Fake News Detection using deep learning models in TensorFlow.
Stars: ✭ 74 (-22.92%)
Mutual labels:  attention-mechanism
Surfacenetworks
Source code for CVPR 2018 Oral paper "Surface Networks"
Stars: ✭ 83 (-13.54%)
Mutual labels:  cvpr2018
Super Slowmo
An attempt at a PyTorch implementation of "Super SloMo: High Quality Estimation of Multiple Intermediate Frames for Video Interpolation".
Stars: ✭ 73 (-23.96%)
Mutual labels:  cvpr2018
Competitive Inner Imaging Senet
Source code of paper: (not available now)
Stars: ✭ 89 (-7.29%)
Mutual labels:  attention-mechanism
Group Level Emotion Recognition
Model submitted for the ICMI 2018 EmotiW Group-Level Emotion Recognition Challenge
Stars: ✭ 70 (-27.08%)
Mutual labels:  attention-mechanism
Dispnet Flownet Docker
Dockerfile and runscripts for DispNet and FlowNet1 (estimation of disparity and optical flow)
Stars: ✭ 78 (-18.75%)
Mutual labels:  cvpr
Attention unet
A raw implementation of the attention-gated U-Net in Keras.
Stars: ✭ 85 (-11.46%)
Mutual labels:  attention-mechanism
Deepaffinity
Protein-compound affinity prediction through unified RNN-CNN
Stars: ✭ 75 (-21.87%)
Mutual labels:  attention-mechanism

DHF1K

===========================================================================

Wenguan Wang, J. Shen, M.-M. Cheng, and A. Borji,

Revisiting Video Saliency: A Large-scale Benchmark and a New Model,

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018 and

IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2019

===========================================================================

The code (ACLNet) and dataset (DHF1K with raw gaze records; UCF sports is newly added!) can be downloaded from:

Google Drive: https://drive.google.com/open?id=1sW0tf9RQMO4RR7SyKhU8Kmbm4jwkFGpQ

Baidu pan: https://pan.baidu.com/s/110NIlwRIiEOTyqRwYdDnVg

The Hollywood-2 dataset (74.6 GB, including attention maps) can be downloaded from:

Google Drive: https://drive.google.com/file/d/1vfRKJloNSIczYEOVjB4zMK8r0k4VJuWk/view?usp=sharing

Baidu pan: https://pan.baidu.com/s/16BIAuaGEDDbbjylJ8zziuA (extraction code: bt3x)

Since so many people are interested in the training code, I have decided to upload it to the web disks above. Enjoy!

===========================================================================

Files:

'video': 1000 videos (videoname.AVI)

'annotation/videoname/maps': continuous saliency maps in '.png' format

'annotation/videoname/fixation': binary eye fixation maps in '.png' format

'annotation/videoname/fixation/maps': binary eye fixation maps stored in '.mat' format

'generate_frame.m': used for extracting the frame images from AVI videos.

Please note that the raw gaze data of individual viewers are stored in 'exportdata_train.rar'.

Please do not change the frame naming scheme.
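
As a quick illustration, here is a minimal MATLAB sketch for loading the annotations of one frame, assuming the layout above (the zero-padded frame name is a hypothetical example; use the names produced by 'generate_frame.m'):

videoName = '001';                                                           % one of the 1000 videos
frameName = '0001.png';                                                      % hypothetical frame name; keep generate_frame.m's naming
salMap = imread(fullfile('annotation', videoName, 'maps', frameName));       % continuous saliency map
fixMap = imread(fullfile('annotation', videoName, 'fixation', frameName));   % binary eye fixation map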

===========================================================================

Dataset splitting:

Training set: first 600 videos (001.AVI-600.AVI)

Validation set: 100 videos (601.AVI-700.AVI)

Testing set: 300 videos (701.AVI-1000.AVI)

The annotations for the training and validation sets are released, but the annotations for the testing set are held out for benchmarking.
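
For convenience, a small MATLAB sketch that enumerates the three splits, assuming the numbering above:

trainNames = arrayfun(@(i) sprintf('%03d.AVI', i), 1:600,    'UniformOutput', false);   % training set
valNames   = arrayfun(@(i) sprintf('%03d.AVI', i), 601:700,  'UniformOutput', false);   % validation set
testNames  = arrayfun(@(i) sprintf('%03d.AVI', i), 701:1000, 'UniformOutput', false);   % testing set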

===========================================================================

We have corrected some statistics of our results (baseline training setting (iii)) on the UCF sports dataset. Please see the newest version on arXiv.

===========================================================================

Note that for the Hollywood-2 dataset, we used the split videos (each video contains only one shot) instead of the full videos.

===========================================================================

The raw gaze records ('exportdata_train.rar') have been uploaded.

===========================================================================

For the DHF1K dataset, we use the following functions to generate the continuous saliency maps:

[x, y] = find(fixations);   % x: row indices, y: column indices of the fixation points
densityMap = make_gauss_masks(y, x, [video_res_y, video_res_x]);

make_gauss_masks.m has been uploaded.
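
For readers who cannot access the upload, a plausible sketch of such a Gaussian-mask density map is given below; the function body and the sigma parameter are assumptions for illustration only, and the released make_gauss_masks.m remains the reference:

function densityMap = gauss_density_sketch(y, x, res, sigma)
% Sketch only, not the released implementation. y: column (horizontal) and
% x: row (vertical) fixation coordinates, matching the call above;
% res = [video_res_y, video_res_x]; sigma: assumed blob width in pixels.
[X, Y] = meshgrid(1:res(2), 1:res(1));            % pixel coordinate grids
densityMap = zeros(res);
for k = 1:numel(y)                                % add one Gaussian per fixation
    densityMap = densityMap + exp(-((X - y(k)).^2 + (Y - x(k)).^2) / (2 * sigma^2));
end
densityMap = densityMap / max(densityMap(:));     % normalize to [0, 1]
end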

For UCF sports and Hollywood-2, I directly use the following function:

densityMap = imfilter(fixations, fspecial('gaussian', 150, 20), 'replicate');   % 150x150 Gaussian kernel, sigma = 20
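
A minimal end-to-end sketch around this call (the file names are hypothetical, and rescaling before saving is an assumption):

fixations  = double(imread('fixation_0001.png') > 0);                           % binary fixation map
densityMap = imfilter(fixations, fspecial('gaussian', 150, 20), 'replicate');   % smooth fixations into a density
densityMap = densityMap / max(densityMap(:));                                   % rescale to [0, 1]
imwrite(densityMap, 'density_0001.png');                                        % continuous saliency map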

===========================================================================

Results submission.

Please organize your results in the following format:

yourmethod/videoname/framename.png

Note that the frames and frame names should be generated by 'generate_frame.m'.
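
Before sending, a quick MATLAB sanity check (a sketch; 'yourmethod' is the placeholder folder from the format above) that every video folder contains PNG result frames:

d = dir('yourmethod');
vids = d([d.isdir] & ~ismember({d.name}, {'.', '..'}));        % one sub-folder per video
for v = 1:numel(vids)
    frames = dir(fullfile('yourmethod', vids(v).name, '*.png'));
    fprintf('%s: %d result frames\n', vids(v).name, numel(frames));
end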

Then send your results to '[email protected]'.

You can only submit ONCE within one week.

Please first test your model on the validation set or on other video saliency datasets.

The response may take more than one week.

If you want your results listed on our website, please send your name, model name, paper title, a short description of your method, and a link to your project page (if you have one).

===========================================================================

We use Keras 2.2.2 and TensorFlow 1.10.0 to implement our model.

===========================================================================

Citation:

@InProceedings{Wang_2018_CVPR,
author = {Wang, Wenguan and Shen, Jianbing and Guo, Fang and Cheng, Ming-Ming and Borji, Ali},
title = {Revisiting Video Saliency: A Large-Scale Benchmark and a New Model},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition},
year = {2018}
}

@ARTICLE{Wang_2019_revisitingVS, 
author={W. {Wang} and J. {Shen} and J. {Xie} and M. {Cheng} and H. {Ling} and A. {Borji}}, 
journal={IEEE Transactions on Pattern Analysis and Machine Intelligence}, 
title={Revisiting Video Saliency Prediction in the Deep Learning Era}, 
year={2019}, 
}

If you find our dataset useful, please cite the above papers.

===========================================================================

Code (ACLNet):

You can find the code on Google Drive: https://drive.google.com/open?id=1sW0tf9RQMO4RR7SyKhU8Kmbm4jwkFGpQ

===========================================================================

Terms of use:

The dataset and code are licensed under a Creative Commons Attribution 4.0 License.

===========================================================================

Contact Information: [email protected]

