Instructions for converting a BERT TensorFlow model to work with HuggingFace's pytorch-transformers and spaCy. This walk-through uses DeepPavlov's RuBERT as an example.

Stars: ✭ 26 (-10.34%)

Mutual labels: spacy, bert

anonymisation

Anonymization of legal cases (Fr) based on Flair embeddings

Stars: ✭ 85 (+193.1%)

Mutual labels: spacy, bert

KitanaQA

KitanaQA: Adversarial training and data augmentation for neural question-answering models

Stars: ✭ 58 (+100%)

Mutual labels: question-answering, bert

contextualSpellCheck

✔️Contextual word checker for better suggestions

Stars: ✭ 274 (+844.83%)

Mutual labels: spacy, bert

backprop

Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models.

Stars: ✭ 229 (+689.66%)

Mutual labels: question-answering, bert

Medi-CoQA

Conversational Question Answering on Clinical Text

Stars: ✭ 22 (-24.14%)

Mutual labels: question-answering, bert

hf-experiments

Experiments with Hugging Face 🔬 🤗

Stars: ✭ 37 (+27.59%)

Mutual labels: question-answering, huggingface

mcQA

🔮 Answering multiple choice questions with Language Models.

Stars: ✭ 23 (-20.69%)

Mutual labels: question-answering, bert

FinBERT-QA

Financial Domain Question Answering with pre-trained BERT Language Model

Stars: ✭ 70 (+141.38%)

Mutual labels: question-answering, bert

cmrc2019

A Sentence Cloze Dataset for Chinese Machine Reading Comprehension (CMRC 2019)

Stars: ✭ 118 (+306.9%)

Mutual labels: question-answering, bert

Haystack

🔍 Haystack is an open source NLP framework that leverages Transformer models. It enables developers to implement production-ready neural search, question answering, semantic document search and summarization for a wide range of applications.

Stars: ✭ 3,409 (+11655.17%)

Mutual labels: question-answering, bert

converse

Conversational text Analysis using various NLP techniques

Stars: ✭ 147 (+406.9%)

Mutual labels: spacy, huggingface


DrFAQ

DrFAQ is a plug-and-play question answering chatbot that can be applied to any organisation's text corpora.
Its NLP question answering architecture is built on spaCy, huggingface's BERT language model, ElasticSearch, and the Telegram Bot API, and is hosted on Heroku.

News

4 Mar 2021 - Transfer learning of language models, alongside an evaluation study, is currently in progress.
13 Dec 2019 - Implementation of the 4-step question-answering methodology completed.

Objective

Given an organisation's corpus of documents, generate a chatbot to enable natural question-answering capabilities.

Methodology

When a question is asked, the following processes are performed:

1. FAQ Question Matching using spaCy's Similarity - /match
   - From a given list of Frequently Asked Questions (FAQs), the chatbot measures the similarity of each FAQ to the question asked and returns the answer of the best match.
2. NLP Question Answering using huggingface's BERT - /nlp
   - If the question asked is dissimilar to all existing FAQs, perform question answering on the knowledge base and return the answer if it is sufficiently confident.
3. Answer Search using ElasticSearch - /search
   - If the answer is not sufficiently confident, perform a search on the document corpus and return the search results.
4. Human Intervention
   - If the search results are still not relevant, prompt a human to add the question-answer pair to the existing list of FAQs, or direct the user to speak to a human.
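
The steps above form a fallback cascade: each stage runs only when the previous one fails to produce a good enough result. The following is a minimal sketch of that control flow, not DrFAQ's actual implementation. The names `Reply`, `answer_question`, `match_faq`, `extract_answer`, `search_corpus`, and both threshold values are hypothetical, and "search results not relevant" is simplified here to "search returned nothing". The three callables stand in for spaCy similarity, a BERT question-answering model, and an ElasticSearch query respectively.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Reply:
    stage: str  # which stage answered: "match", "nlp", "search", or "human"
    text: str


def answer_question(
    question: str,
    match_faq: Callable[[str], Tuple[str, float]],       # -> (FAQ answer, similarity)
    extract_answer: Callable[[str], Tuple[str, float]],  # -> (answer span, confidence)
    search_corpus: Callable[[str], List[str]],           # -> list of search hits
    match_threshold: float = 0.8,  # assumed value, not taken from the project
    qa_threshold: float = 0.5,     # assumed value, not taken from the project
) -> Reply:
    # Step 1: reuse an existing FAQ answer if the question is similar enough.
    faq_answer, similarity = match_faq(question)
    if similarity >= match_threshold:
        return Reply("match", faq_answer)

    # Step 2: run question answering over the knowledge base.
    span, confidence = extract_answer(question)
    if confidence >= qa_threshold:
        return Reply("nlp", span)

    # Step 3: fall back to a search over the document corpus.
    hits = search_corpus(question)
    if hits:
        return Reply("search", "\n".join(hits))

    # Step 4: nothing worked, so hand over to a human.
    return Reply("human", "No answer found. A human will follow up on this question.")
```

Passing the three stages in as callables keeps the cascade independent of any particular library, so each stage can be swapped or stubbed out when testing.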

Research

A benchmark study of transfer learning with language models shows that:
- If a large and clean QA dataset is available, RoBERTa is the best language model.
- If only a small and unclean generated QA dataset is available, MobileBERT is the best language model.
- If the QA dataset contains many 'Who' questions, RoBERTa should be considered.

Future Work

Release DrFAQ as a pip package.
Make an interactive demo available.
Integrate abstractive question-answering into the methodology.
Leverage databases and cloud services.

References

explosion/spaCy - Industrial-strength Natural Language Processing (NLP) with Python and Cython
huggingface/transformers - Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch
elastic/elasticsearch-py - Official Python low-level client for Elasticsearch
python-telegram-bot/python-telegram-bot - Python Wrapper for Telegram Bots
google-research/bert - TensorFlow code and pre-trained models for BERT
BERT - Pre-training of Deep Bidirectional Transformers for Language Understanding


jetnew / DrFAQ
