ChenglongChen / Tensorflow Dsmm

Licence: MIT
Tensorflow implementations of various Deep Semantic Matching Models (DSMM).

Programming Languages

python

tensorflow-DSMM

An ongoing project implementing various Deep Semantic Matching Models (DSMM). DSMMs are widely used for:

  • duplicate detection
  • sentence similarity
  • question answering
  • search relevance
  • ...

Quickstart

Data

This project was developed around the data format provided in the 3rd 魔镜杯 (Magic Mirror Cup) competition.

See /data/DATA.md for a description of the data format and prepare your data accordingly. Place your data in the data directory. The current data directory also holds a toy dataset.

To run a quick demo, download the data from the competition link above. Downloading requires registration.

Demo

python src/main.py

Supported Models

Representation based methods

  • DSSM style models
    • DSSM: use FastText as encoder
    • CDSSM: use TextCNN as encoder
    • RDSSM: use TextRNN/TextBiRNN as encoder
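
The representation based idea above can be illustrated with a minimal sketch: each sentence is encoded independently into one vector, and the two vectors are then compared. This is not the project's TensorFlow code. The toy `EMBEDDINGS` table, the averaging `encode` function (a simplified FastText-style encoder without n-gram features), and the choice of cosine similarity as the final score are all assumptions made for illustration.

```python
import math

# Hypothetical toy embedding table; a real model would learn these vectors.
EMBEDDINGS = {
    "how": [0.1, 0.3, -0.2],
    "do": [0.0, 0.2, 0.1],
    "i": [0.4, -0.1, 0.0],
    "reset": [-0.3, 0.5, 0.2],
    "change": [-0.2, 0.4, 0.3],
    "my": [0.2, 0.0, 0.1],
    "password": [0.5, 0.1, -0.4],
}
DIM = 3


def encode(tokens):
    """FastText-style encoder (simplified): average the embeddings of known tokens."""
    vectors = [EMBEDDINGS[t] for t in tokens if t in EMBEDDINGS]
    if not vectors:
        return [0.0] * DIM
    return [sum(col) / len(vectors) for col in zip(*vectors)]


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def match_score(tokens_a, tokens_b):
    """Encode each sentence independently, then compare the two vectors."""
    return cosine(encode(tokens_a), encode(tokens_b))
```

Swapping `encode` for a convolutional or recurrent encoder would correspond to the CDSSM and RDSSM variants, while the comparison step stays the same.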

Interaction based methods

  • MatchPyramid style models
    • MatchPyramid: use identity/cosine similarity/dot product as match matrix
    • General MatchPyramid: use match matrices based on various embeddings and various match scores
      • word embeddings
        • original word embedding
        • compressed word embedding
        • contextual word embedding (use an encoder to encode contextual information)
      • match score
        • identity
        • cosine similarity/dot product
        • element-wise product
        • element-wise concatenation
  • BCNN style models
    • BCNN
    • ABCNN1
    • ABCNN2
    • ABCNN3
  • ESIM
  • DecAtt (Decomposable Attention)
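
As a rough illustration of the match matrices listed for the MatchPyramid style models, the sketch below builds a word-by-word interaction matrix between two sentences. It is written in plain Python rather than TensorFlow, and the function names (`identity_match`, `embedding_match`, `element_product`) are hypothetical, not taken from the project.

```python
import math


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    norm = math.sqrt(dot(u, u)) * math.sqrt(dot(v, v))
    return dot(u, v) / norm if norm > 0.0 else 0.0


def element_product(u, v):
    """Vector-valued score: each cell then holds the element-wise product."""
    return [a * b for a, b in zip(u, v)]


def identity_match(tokens_a, tokens_b):
    """M[i][j] = 1.0 if token i of A and token j of B are the same word, else 0.0."""
    return [[1.0 if a == b else 0.0 for b in tokens_b] for a in tokens_a]


def embedding_match(vectors_a, vectors_b, score=cosine):
    """M[i][j] = score(embedding of word i in A, embedding of word j in B)."""
    return [[score(u, v) for v in vectors_b] for u in vectors_a]
```

With a scalar score such as `cosine` or `dot` the result is a 2-D matrix. With `element_product` each cell is itself a vector, so the result is a 3-D tensor. In either case the interaction model would then apply convolution and pooling layers on top of it.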

Building Blocks

Encoder layers

  • FastText
  • TimeDistributed Dense Projection
  • TextCNN (Gated CNN and also Residual Gated CNN)
  • TextRNN/TextBiRNN with GRU and LSTM cell
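
To make the gated and residual gated CNN encoders more concrete, here is a small pure-Python sketch of a gated 1-D convolution. It assumes the gated linear unit formulation, in which the output of one convolution is multiplied by the sigmoid of a second convolution. Whether the project's TextCNN uses exactly this form is an assumption, and all names and parameter layouts below are illustrative.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def conv1d_same(seq, kernels, bias):
    """1-D convolution over time with zero padding (output length == input length).

    seq:     list of T input vectors, each of length d_in
    kernels: list of d_out filters, each a list of k vectors of length d_in
    bias:    list of d_out floats
    """
    k = len(kernels[0])
    d_in = len(seq[0])
    left = (k - 1) // 2
    zero = [0.0] * d_in
    padded = [zero] * left + seq + [zero] * (k - 1 - left)
    out = []
    for t in range(len(seq)):
        window = padded[t:t + k]
        out.append([
            b + sum(w * x
                    for w_row, x_row in zip(f, window)
                    for w, x in zip(w_row, x_row))
            for f, b in zip(kernels, bias)
        ])
    return out


def gated_conv(seq, lin_kernels, lin_bias, gate_kernels, gate_bias, residual=False):
    """Gated convolution: conv_a(x) * sigmoid(conv_g(x)), optionally plus x.

    The residual connection assumes d_out == d_in.
    """
    linear = conv1d_same(seq, lin_kernels, lin_bias)
    gate = conv1d_same(seq, gate_kernels, gate_bias)
    out = [[a * sigmoid(g) for a, g in zip(a_t, g_t)]
           for a_t, g_t in zip(linear, gate)]
    if residual:
        out = [[o + x for o, x in zip(o_t, x_t)] for o_t, x_t in zip(out, seq)]
    return out
```

The gate lets the network scale each feature at each position between 0 and 1, and the residual variant adds the input back so that stacked layers remain easier to train.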

Attention layers

  • mean/max/min pooling
  • scalar-based and vector-based attention
  • self and context attention
  • multi-head attention
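
The pooling and attention layers above all reduce a sequence of per-token vectors to a single vector. The sketch below shows mean/max/min pooling and one reading of scalar-based attention, in which each time step receives a single softmax weight. The query vector, the dot-product scoring, and the function names are assumptions made for illustration. Vector-based attention would instead assign a separate weight to each dimension of each time step.

```python
import math


def mean_pool(states):
    """Average over time, per dimension."""
    return [sum(col) / len(states) for col in zip(*states)]


def max_pool(states):
    """Maximum over time, per dimension."""
    return [max(col) for col in zip(*states)]


def min_pool(states):
    """Minimum over time, per dimension."""
    return [min(col) for col in zip(*states)]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    shift = max(scores)
    exps = [math.exp(s - shift) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def scalar_attention_pool(states, query):
    """Weight each time step by softmax(state . query), then sum over time."""
    scores = [sum(h * q for h, q in zip(state, query)) for state in states]
    weights = softmax(scores)
    pooled = [sum(w * h for w, h in zip(weights, col)) for col in zip(*states)]
    return pooled, weights
```

Under this reading, self attention would derive `query` from parameters or from the sentence itself, while context attention would derive it from the other sentence in the pair.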

Acknowledgments

This project draws inspiration from the following projects:
