All Projects → youngfly11 → LCMCG-PyTorch

youngfly11 / LCMCG-PyTorch

Licence: other
AAAI2020-The official implementation of "Learning Cross-modal Context Graph for Visual Grounding"

Programming Languages

Jupyter Notebook
11667 projects
python
139335 projects - #7 most used programming language
Cuda
1817 projects
C++
36643 projects - #6 most used programming language
cython
566 projects
shell
77523 projects

Projects that are alternatives of or similar to LCMCG-PyTorch

LR-GCCF
Revisiting Graph based Collaborative Filtering: A Linear Residual Graph Convolutional Network Approach, AAAI2020
Stars: ✭ 99 (+86.79%)
Mutual labels:  aaai2020
symmetrical-synthesis
Official Tensorflow implementation of "Symmetrical Synthesis for Deep Metric Learning" (AAAI 2020)
Stars: ✭ 67 (+26.42%)
Mutual labels:  aaai2020
WeFEND-AAAI20
Dataset for paper "Weak Supervision for Fake News Detection via Reinforcement Learning" published in AAAI'2020.
Stars: ✭ 67 (+26.42%)
Mutual labels:  aaai2020
ACT AAAI20
code for AAAI 2020 paper "ACT"
Stars: ✭ 105 (+98.11%)
Mutual labels:  aaai2020
TSP-PRL
Tree-Structured Policy based Progressive Reinforcement Learning for Temporally Language Grounding in Video (AAAI2020)
Stars: ✭ 39 (-26.42%)
Mutual labels:  aaai2020
recurrent-decoding-cell
[AAAI'20] Segmenting Medical MRI via Recurrent Decoding Cell (Spotlight)
Stars: ✭ 14 (-73.58%)
Mutual labels:  aaai2020
Dyamic Graph Representation
Official Dynamic Graph Representation PyTorch implement for iris/face recognition
Stars: ✭ 22 (-58.49%)
Mutual labels:  aaai2020
InteractE
AAAI 2020 - InteractE: Improving Convolution-based Knowledge Graph Embeddings by Increasing Feature Interactions
Stars: ✭ 69 (+30.19%)
Mutual labels:  aaai2020
FISR
Official repository of FISR (AAAI 2020).
Stars: ✭ 72 (+35.85%)
Mutual labels:  aaai2020
SSNM-Coseg
[AAAI20] Deep Object Co-segmentation via Spatial-Semantic Network Modulation(Oral paper)
Stars: ✭ 21 (-60.38%)
Mutual labels:  aaai2020

LCMCG.Pytorch

This repo is the official implementation of "Learning Cross-Modal Context Graph for Visual Grounding" (AAAI2020)

Installation

Check INSTALL.md for installation instructions.

pre-requirements

  1. Download the flickr30k dataset in this link
  2. Pre-computed bounding boxes are extracted by using FasterRCNN
    We use the config "e2e_faster_rcnn_R_50_C4_1x.yaml" to train the object detector on MSCOCO dataset and extract the feature map at C4 layer.
  3. Language graph extraction by using SceneGraphParser. I have uploaded the sg_anno.json into Google drive. You can download it now.
  4. Some pre-processing data, like sentence annotations, box annotations.
  5. You need to create the './flickr_datasets' folder and put all annotation in it. I would highly recommend you to figure all the data path out in this project. You can refer this two file "maskrcnn_benchmark/config/paths_catalog.py" and "maskrcnn_benchmark/data/flickr.py" for details.

The pretrained object detector weights and annotations can be found here at baidu-disk (link:https://pan.baidu.com/s/1bYbGUsHcZJQHele87MzcMg password:5ie6) or google drive

training

  1. You can train our model by running the scripts
sh scripts/train.sh

""

citation

If you are interested in our paper, please cite it.

@inproceedings{liu2019learning,
  title={Learning Cross-modal Context Graph for Visual Grounding},
  author={Liu, Yongfei and Wan, Bo and Zhu, Xiaodan and He, Xuming},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligenc}
  year={2020}
}
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].