lartpang / Minet

License: MIT
CVPR2020, Multi-scale Interactive Network for Salient Object Detection

Multi-scale Interactive Network for Salient Object Detection

CVPR 2020.

NOTE: For subsequent updates of the paper, please see the arXiv version.

Changelog

The code and experimental results have been released 😄.

  • 2021/1/18:
  • 2020/12/08:
    • Simplify the code structure.
    • Add a simplified version to the simpler branch. I have not had time to test it, so please create an issue if you find any bugs.
    • Add the pre-commit hook to format the code before the commit.
  • 2020/7/23:
    • [X] Improve and simplify the configuration of the project.
    • [X] Add support for AMP (Automatic Mixed Precision). (Quick Start)
    • [X] Confirm that saving and resuming checkpoints works correctly in these cases:
      • [X] Resuming after training was interrupted.
      • [X] Training from scratch.
      • [X] Test only.
    • [X] Confirm the correctness of the function of exporting xlsx.
  • 2020/7/16:
    • Fix the mistake in our FPS testing code pointed out by @sjf18. The paper will be corrected in a later version.
  • 2020/7/14:
    • Point out the interpolation method that is actually used.
    • Change the F.interpolate call in SIM to specify size instead of scale_factor.
  • 2020/6/21: Some useful modifications.
    • Optimized the testing process. See code/readme.md for details.
    • Added some backbone models using ws+gn (weight standardization with group normalization), which can further improve the generalization performance of the model. If needed, I can provide the pretrained parameter file of the corresponding improved version of MINet. :)
    • The directory structure has been adjusted.
  • 2020/5/10: Some meaningful changes.
    • Tried several methods to improve the performance of the model in specific operating environments and on specific devices. The experimental results are in the "More Experiments" section of ./code/readme.md.
    • Use black as the repository's code formatter, configured in ./code/pyproject.toml.
  • 2020/5/6: Add some new attempts and improvements.
    • Changed how the model is imported: instead of being set directly in config.py, the model class is now selected and instantiated automatically according to the model alias (exp_name in config.py).
    • Added a lighter setting for AIM and MINet.
    • Added an attempt at using the checkpoint feature of PyTorch.
    • Added a learning rate decay strategy with learning rate warm-up. However, the parameter settings are not yet flexible and need to be improved.
    • Added a new optimization strategy that imitates the settings from F3Net.
    • Upload the MIT LICENSE file.
  • 2020/4/16: Modify some misleading descriptions in the readme.md file.
  • 2020/4/7: Simplify the structure of the repository.
  • 2020/3/29: Update the method of evaluating results. (See the readme.md for more details.)
  • 2020/3/28: Update our code, results, pretrained parameters and some documents.
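
On the 2020/7/14 entry about F.interpolate: the changelog does not say why size replaced scale_factor. One plausible reason (an assumption here, not stated by the authors) is that when only scale_factor is given, PyTorch computes each output dimension as floor(input_size * scale_factor), so a down-then-up round trip on an odd-sized feature map may not return to the original size. The pure-Python sketch below reproduces only that size arithmetic; the helper names and the 45-pixel example are hypothetical.

```python
import math


def size_from_scale(in_size: int, scale_factor: float) -> int:
    """Output length when only scale_factor is given: floor(in_size * scale_factor)."""
    return math.floor(in_size * scale_factor)


def round_trip(in_size: int) -> int:
    """Downsample by 0.5, then upsample by 2.0, tracking only the spatial size."""
    return size_from_scale(size_from_scale(in_size, 0.5), 2.0)


# Hypothetical odd feature-map height.
h = 45
after_round_trip = round_trip(h)      # one pixel is lost along the way
sizes_match = after_round_trip == h   # False for odd sizes

# Passing the target size explicitly (as with size=... in F.interpolate)
# sidesteps the rounding: the output is whatever size the caller asks for.
explicit_target = h
```

With even sizes the two approaches agree, so the difference only shows up for inputs whose dimensions are not divisible by the sampling rate.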

Repository Details

  • code: Complete training and testing code about our method. The readme.md file describes how to use the code.
  • docs: GitHub page about our paper, with some paper details.
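
The 2020/5/6 changelog entry says the model class is selected and instantiated from an alias (exp_name in config.py). A minimal sketch of that general pattern follows. It is not the repository's actual code: the registry, the decorator, the build_model helper and both toy classes are hypothetical names introduced for illustration.

```python
# Hypothetical registry mapping a model alias to its class.
MODEL_REGISTRY = {}


def register(cls):
    """Record a class under its own name so it can be looked up by alias."""
    MODEL_REGISTRY[cls.__name__] = cls
    return cls


@register
class ToyMINet:  # stand-in for a full model
    def __init__(self, channels=64):
        self.channels = channels


@register
class ToyLightMINet:  # stand-in for a lighter setting
    def __init__(self, channels=32):
        self.channels = channels


def build_model(exp_name, **kwargs):
    """Instantiate the class whose alias equals exp_name."""
    if exp_name not in MODEL_REGISTRY:
        known = ", ".join(sorted(MODEL_REGISTRY))
        raise ValueError(f"Unknown model alias {exp_name!r}; known aliases: {known}")
    return MODEL_REGISTRY[exp_name](**kwargs)


# A config dictionary standing in for config.py.
config = {"exp_name": "ToyLightMINet"}
model = build_model(config["exp_name"])
```

Switching experiments then only requires changing exp_name in the configuration, with no edits to the import statements.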

Related Links

Paper Details

Abstract

Deep-learning based salient object detection methods achieve great progress. However, the variable scale and unknown category of salient objects are great challenges all the time. These are closely related to the utilization of multi-level and multi-scale features. In this paper, we propose the aggregate interaction modules to integrate the features from adjacent levels, in which less noise is introduced because of only using small up-/down-sampling rates. To obtain more efficient multi-scale features from the integrated features, the self-interaction modules are embedded in each decoder unit. Besides, the class imbalance issue caused by the scale variation weakens the effect of the binary cross entropy loss and results in the spatial inconsistency of the predictions. Therefore, we exploit the consistency-enhanced loss to highlight the fore-/back-ground difference and preserve the intra-class consistency. Experimental results on five benchmark datasets demonstrate that the proposed method without any post-processing performs favorably against 23 state-of-the-art approaches. The source code will be publicly available at https://github.com/lartpang/MINet.
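
The abstract names a consistency-enhanced loss but does not give its formula. The sketch below assumes a region-level form, (|P| + |G| - 2|P∩G|) / (|P| + |G|), where P is the predicted saliency map in [0, 1] and G is the binary ground truth. Treat this form as an assumption and check the paper and the code directory for the exact definition. Plain Python lists stand in for flattened images, and the function name and eps value are hypothetical.

```python
def consistency_enhanced_loss(pred, target, eps=1e-6):
    """Region-level loss sketch: 0 for a perfect match, close to 1 for no overlap.

    pred   -- flattened predicted saliency values in [0, 1]
    target -- flattened binary ground-truth values (0 or 1)
    """
    intersection = sum(p * g for p, g in zip(pred, target))
    pred_area = sum(pred)
    target_area = sum(target)
    # Predicted-but-wrong area plus missed ground-truth area.
    numerator = (pred_area - intersection) + (target_area - intersection)
    denominator = pred_area + target_area
    return numerator / (denominator + eps)


ground_truth = [1, 0, 1, 0]
perfect = consistency_enhanced_loss([1, 0, 1, 0], ground_truth)
inverted = consistency_enhanced_loss([0, 1, 0, 1], ground_truth)
```

Because the ratio is computed over whole regions rather than averaged per pixel, a small salient object contributes as much as a large one, which is one way to read the abstract's remark about class imbalance caused by scale variation.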

Architecture

Comparison

BibTeX

@inproceedings{MINet-CVPR2020,
    author = {Pang, Youwei and Zhao, Xiaoqi and Zhang, Lihe and Lu, Huchuan},
    title = {Multi-Scale Interactive Network for Salient Object Detection},
    booktitle = {CVPR},
    month = {June},
    year = {2020}
}