380 open source projects that are alternatives to or similar to beir

policy-data-analyzer

Building a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US and India. Bringing NLP to the world of policy analysis through an extensible framework that includes scraping, preprocessing, active learning and text analysis pipelines.

Stars: ✭ 22 (-97.02%)

Mutual labels: bert, sentence-transformers, sbert

🔍 Haystack is an open source NLP framework that leverages Transformer models. It enables developers to implement production-ready neural search, question answering, semantic document search and summarization for a wide range of applications.

Stars: ✭ 3,409 (+361.92%)

Mutual labels: information-retrieval, bert, dpr

Text2Text: Cross-lingual natural language processing and generation toolkit

Stars: ✭ 188 (-74.53%)

Mutual labels: information-retrieval, bert, question-generation

question generator

An NLP system for generating reading comprehension questions

Stars: ✭ 188 (-74.53%)

Mutual labels: bert, question-generation

Code and resources for the paper "BERT-QE: Contextualized Query Expansion for Document Re-ranking".

Stars: ✭ 43 (-94.17%)

Mutual labels: information-retrieval, bert

Tianchi2020ChineseMedicineQuestionGeneration

2020 Alibaba Cloud Tianchi Big Data Competition: Traditional Chinese Medicine Literature Question Generation Challenge

Stars: ✭ 20 (-97.29%)

Mutual labels: bert, question-generation

Search Git commits in natural language

Stars: ✭ 50 (-93.22%)

Mutual labels: information-retrieval, sentence-transformers

spacy-sentence-bert

Sentence transformers models for SpaCy

Stars: ✭ 88 (-88.08%)

Mutual labels: bert, sentence-transformers

Financial Domain Question Answering with pre-trained BERT Language Model

Stars: ✭ 70 (-90.51%)

Mutual labels: information-retrieval, bert

An Open-Source Package for Chinese Open-domain Conversational Chatbot (Chinese chit-chat dialogue system with one-click deployment of a WeChat chit-chat bot)

Stars: ✭ 94 (-87.26%)

Mutual labels: retrieval, bert

Transformer-QG-on-SQuAD

Implement Question Generator with SOTA pre-trained Language Models (RoBERTa, BERT, GPT, BART, T5, etc.)

Stars: ✭ 28 (-96.21%)

Mutual labels: bert, question-generation

Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577

Stars: ✭ 216 (-70.73%)

Mutual labels: information-retrieval, bert

Fielded Sequential Dependence Model (code and runs)

Stars: ✭ 32 (-95.66%)

Mutual labels: information-retrieval, retrieval

AnnA Anki neuronal Appendix

Using machine learning on your Anki collection to enhance scheduling via semantic clustering and semantic similarity

Stars: ✭ 39 (-94.72%)

Mutual labels: bert, sbert

📑 Neural Search

Stars: ✭ 196 (-73.44%)

Mutual labels: information-retrieval, retrieval

⛔ [NOT MAINTAINED] A web interface for cdQA and other question answering systems.

Stars: ✭ 19 (-97.43%)

Mutual labels: information-retrieval, bert

question-generation

Neural Models for Key Phrase Detection and Question Generation

Stars: ✭ 29 (-96.07%)

Mutual labels: question-generation

🇷🇺 Punctuation restoration production-ready model for Russian language 🇷🇺

Stars: ✭ 46 (-93.77%)

Mutual labels: bert

[EMNLP 2019] Mixture Content Selection for Diverse Sequence Generation (Question Generation / Abstractive Summarization)

Stars: ✭ 109 (-85.23%)

Mutual labels: question-generation

A well-structured summarization dataset for the Persian language!

Stars: ✭ 29 (-96.07%)

Mutual labels: bert

SENet-for-Weakly-Supervised-Relation-Extraction

No description or website provided.

Stars: ✭ 39 (-94.72%)

Mutual labels: information-retrieval

Palladian is a Java-based toolkit with functionality for text processing, classification, information extraction, and data retrieval from the Web.

Stars: ✭ 32 (-95.66%)

Mutual labels: retrieval

DrFAQ is a plug-and-play question answering NLP chatbot that can be generally applied to any organisation's text corpora.

Stars: ✭ 29 (-96.07%)

Mutual labels: bert

Bert-model-code-interpretation

A walkthrough of the data flow in modeling.py from the TensorFlow version of BERT

Stars: ✭ 19 (-97.43%)

Mutual labels: bert

A Sentence Cloze Dataset for Chinese Machine Reading Comprehension (CMRC 2019)

Stars: ✭ 118 (-84.01%)

Mutual labels: bert

AiSpace: Better practices for deep learning model development and deployment for TensorFlow 2.0

Stars: ✭ 28 (-96.21%)

Mutual labels: bert

unsupervised-qa

Template-Based Question Generation from Retrieved Sentences for Improved Unsupervised Question Answering

Stars: ✭ 47 (-93.63%)

Mutual labels: question-generation

Content-Based Image Retrieval Techniques (e.g. kNN, SVM, using a MATLAB GUI)

Stars: ✭ 51 (-93.09%)

Mutual labels: information-retrieval

⚡ A fast embedded library for approximate nearest neighbor search

Stars: ✭ 141 (-80.89%)

Mutual labels: information-retrieval

ERNIE-text-classification-pytorch

This repo contains a PyTorch implementation of a pretrained ERNIE model for text classification.

Stars: ✭ 49 (-93.36%)

Mutual labels: bert

Implementation of "Trade the Event: Corporate Events Detection for News-Based Event-Driven Trading," in Findings of ACL 2021

Stars: ✭ 64 (-91.33%)

Mutual labels: bert

PyTorch implementation of the TwinBert paper

Stars: ✭ 36 (-95.12%)

Mutual labels: bert

sigir19-neural-ir

Source code for: On the Effect of Low-Frequency Terms on Neural-IR Models, SIGIR'19

Stars: ✭ 44 (-94.04%)

Mutual labels: information-retrieval

A Pre-trained BERT on StackOverflow Corpus

Stars: ✭ 40 (-94.58%)

Mutual labels: bert

Automated coding using machine-learning and remapping the U.S. nonprofit sector: A guide and benchmark

Stars: ✭ 18 (-97.56%)

Mutual labels: bert

A BERT-based reverse dictionary of Korean proverbs

Stars: ✭ 95 (-87.13%)

Mutual labels: bert

Solutions of the various test exams of the Information Retrieval course

Stars: ✭ 28 (-96.21%)

Mutual labels: information-retrieval

les-military-mrc-rank7

LES Cup: 2nd National "Military Intelligent Machine Reading" Challenge - Rank 7 solution

Stars: ✭ 37 (-94.99%)

Mutual labels: bert

query-wellformedness

25,100 queries from the Paralex corpus (Fader et al., 2013) annotated with human ratings of whether they are well-formed natural language questions.

Stars: ✭ 80 (-89.16%)

Mutual labels: information-retrieval

LAMB Optimizer TF

LAMB Optimizer for Large Batch Training (TensorFlow version)

Stars: ✭ 119 (-83.88%)

Mutual labels: bert

Code repo for SIGIR 2021 paper "Few-Shot Conversational Dense Retrieval"

Stars: ✭ 36 (-95.12%)

Mutual labels: information-retrieval

OpenUE is a lightweight knowledge graph extraction toolkit (An Open Toolkit for Universal Extraction from Text published at EMNLP2020: https://aclanthology.org/2020.emnlp-demos.1.pdf)

Stars: ✭ 274 (-62.87%)

Mutual labels: bert

Some Cool NLP and CV Repositories and Solutions (a collection of open-source solutions, datasets, tools, and learning materials for common NLP tasks)

Stars: ✭ 143 (-80.62%)

Mutual labels: bert

GLUE-bert4keras

GLUE benchmark code based on bert4keras

Stars: ✭ 59 (-92.01%)

Mutual labels: bert

Apache Solr open-source search software

Stars: ✭ 651 (-11.79%)

Mutual labels: information-retrieval

Official code for WACV 2021 paper - Compositional Learning of Image-Text Query for Image Retrieval

Stars: ✭ 49 (-93.36%)

Mutual labels: information-retrieval

(CVPR2021) Kaleido-BERT: Vision-Language Pre-training on Fashion Domain.

Stars: ✭ 252 (-65.85%)

Mutual labels: bert

protonet-bert-text-classification

Fine-tune BERT for small-dataset text classification in a few-shot learning manner using ProtoNet

Stars: ✭ 28 (-96.21%)

Mutual labels: bert

Conceptualsearch

Train a Word2Vec model or LSA model, and implement Conceptual Search/Semantic Search in Solr/Lucene - Simon Hughes Dice.com, Dice Tech Jobs

Stars: ✭ 245 (-66.8%)

Mutual labels: information-retrieval

SImple SenTence EmbeddeR

Stars: ✭ 66 (-91.06%)

Mutual labels: bert

All about Chinese NER

Stars: ✭ 241 (-67.34%)

Mutual labels: bert

Trinity IR Infrastructure

Stars: ✭ 227 (-69.24%)

Mutual labels: information-retrieval

Accelerated deep learning R&D

Stars: ✭ 2,804 (+279.95%)

Mutual labels: information-retrieval

Text Classification TF

Implementations of various text classification models in TensorFlow, wrapped in a RESTful interface and ready for production use

Stars: ✭ 32 (-95.66%)

Mutual labels: bert

Drop in solution for Decentralized Neural Information Retrieval. Index latent vectors along with JSON metadata and do efficient k-NN search.

Stars: ✭ 222 (-69.92%)

Mutual labels: information-retrieval

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab

Stars: ✭ 1,479 (+100.41%)

Mutual labels: bert

NLPDataAugmentation

Chinese NLP Data Augmentation, BERT Contextual Augmentation

Stars: ✭ 94 (-87.26%)

Mutual labels: bert

My (slightly modified) Keras implementation of RankNet and PyTorch implementation of LambdaRank.

Stars: ✭ 211 (-71.41%)

Mutual labels: information-retrieval

Burp Extender plugin that generates a sitemap of a website using Wayback Machine

Stars: ✭ 203 (-72.49%)

Mutual labels: information-retrieval

HDLTex: Hierarchical Deep Learning for Text Classification

Stars: ✭ 191 (-74.12%)

Mutual labels: information-retrieval
