Code and Dataset for the Bhola et al. (2020) Retrieving Skills from Job Descriptions: A Language Model Based Extreme Multi-label Classification Framework

Stars: ✭ 33 (-82.45%)

Mutual labels: bert

linguistic-style-transfer-pytorch

Implementation of "Disentangled Representation Learning for Non-Parallel Text Style Transfer(ACL 2019)" in Pytorch

Stars: ✭ 55 (-70.74%)

Mutual labels: natural-language-generation

DRhard

SIGIR'21: Optimizing DR with hard negatives and achieving SOTA first-stage retrieval performance on TREC DL Track.

Stars: ✭ 93 (-50.53%)

Mutual labels: information-retrieval

DeepNER

An Easy-to-use, Modular and Prolongable package of deep-learning based Named Entity Recognition Models.

Stars: ✭ 9 (-95.21%)

Mutual labels: bert

pair2vec

pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference

Stars: ✭ 62 (-67.02%)

Mutual labels: question-answering

webdataset

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Stars: ✭ 816 (+334.04%)

Mutual labels: data-augmentation

BM25Transformer

(Python) transform a document-term matrix to an Okapi/BM25 representation

Stars: ✭ 50 (-73.4%)

Mutual labels: information-retrieval

ODSQA

ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET

Stars: ✭ 43 (-77.13%)

Mutual labels: question-answering

easse

Easier Automatic Sentence Simplification Evaluation

Stars: ✭ 109 (-42.02%)

Mutual labels: natural-language-generation

Keywords-Abstract-TFIDF-TextRank4ZH

使用tf-idf, TextRank4ZH等不同方式从中文文本中提取关键字，从中文文本中提取摘要和关键词

Stars: ✭ 26 (-86.17%)

Mutual labels: tf-idf

tutorials

A tutorial series by Preferred.AI

Stars: ✭ 136 (-27.66%)

Mutual labels: information-retrieval

2021-dialogue-summary-competition

[2021 훈민정음 한국어 음성•자연어 인공지능 경진대회] 대화요약 부문 알라꿍달라꿍 팀의 대화요약 학습 및 추론 코드를 공유하기 위한 레포입니다.

Stars: ✭ 86 (-54.26%)

Mutual labels: summarization

ilmulti

Tooling to play around with multilingual machine translation for Indian Languages.

Stars: ✭ 19 (-89.89%)

Mutual labels: tokenizer

CODER

CODER: Knowledge infused cross-lingual medical term embedding for term normalization. [JBI, ACL-BioNLP 2022]

Stars: ✭ 24 (-87.23%)

Mutual labels: embeddings

mtdata

A tool that locates, downloads, and extracts machine translation corpora

Stars: ✭ 95 (-49.47%)

Mutual labels: natural-language-generation

src

tools for fast reading of docs

Stars: ✭ 40 (-78.72%)

Mutual labels: information-retrieval

tika-similarity

Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.

Stars: ✭ 92 (-51.06%)

Mutual labels: information-retrieval

DeepLTranslator

The DeepL Translator is an API written in Java that translates via the DeepL website sentences. Without API key.

Stars: ✭ 45 (-76.06%)

Mutual labels: translator

psr2r-sniffer

A PSR-2-R code sniffer and code-style auto-correction-tool - including many useful additions

Stars: ✭ 32 (-82.98%)

Mutual labels: tokenizer

mnist-challenge

My solution to TUM's Machine Learning MNIST challenge 2016-2017 [winner]

Stars: ✭ 68 (-63.83%)

Mutual labels: data-augmentation

Unets

Implemenation of UNets for Lung Segmentation

Stars: ✭ 18 (-90.43%)

Mutual labels: data-augmentation

semantic-parsing-dual

Source code and data for ACL 2019 Long Paper ``Semantic Parsing with Dual Learning".

Stars: ✭ 17 (-90.96%)

Mutual labels: data-augmentation

audio degrader

Audio degradation toolbox in python, with a command-line tool. It is useful to apply controlled degradations to audio: e.g. data augmentation, evaluation in noisy conditions, etc.

Stars: ✭ 40 (-78.72%)

Mutual labels: data-augmentation

syntaxmaker

The NLG tool for Finnish

Stars: ✭ 19 (-89.89%)

Mutual labels: natural-language-generation

TCE

This repository contains the code implementation used in the paper Temporally Coherent Embeddings for Self-Supervised Video Representation Learning (TCE).

Stars: ✭ 51 (-72.87%)

Mutual labels: embeddings

deepfrog

An NLP-suite powered by deep learning

Stars: ✭ 16 (-91.49%)

Mutual labels: transformers

spellchecker-wasm

SpellcheckerWasm is an extrememly fast spellchecker for WebAssembly based on SymSpell

Stars: ✭ 46 (-75.53%)

Mutual labels: levenshtein-distance

WSDM-Cup-2019

[ACM-WSDM] 3rd place solution at WSDM Cup 2019, Fake News Classification on Kaggle.

Stars: ✭ 62 (-67.02%)

Mutual labels: bert

GNN-Recommender-Systems

An index of recommendation algorithms that are based on Graph Neural Networks.

Stars: ✭ 505 (+168.62%)

Mutual labels: information-retrieval

Quality-Estimation2

机器翻译子任务-翻译质量评价-在BERT模型后面加上Bi-LSTM进行fine-tuning

Stars: ✭ 31 (-83.51%)

Mutual labels: bert

factedit

🧐 Code & Data for Fact-based Text Editing (Iso et al; ACL 2020)

Stars: ✭ 16 (-91.49%)

Mutual labels: natural-language-generation

transformers-interpret

Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.

Stars: ✭ 861 (+357.98%)

Mutual labels: transformers

language-planner

Official Code for "Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents"

Stars: ✭ 84 (-55.32%)

Mutual labels: transformers

elastic transformers

Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers

Stars: ✭ 153 (-18.62%)

Mutual labels: transformers

PororoQA

PororoQA, https://arxiv.org/abs/1707.00836

Stars: ✭ 26 (-86.17%)

Mutual labels: question-answering

WikiTableQuestions

A dataset of complex questions on semi-structured Wikipedia tables

Stars: ✭ 81 (-56.91%)

Mutual labels: question-answering

sentiment-analysis-of-tweets-in-russian

Sentiment analysis of tweets in Russian using Convolutional Neural Networks (CNN) with Word2Vec embeddings.

Stars: ✭ 51 (-72.87%)

Mutual labels: embeddings

tf-idf-python

Term frequency–inverse document frequency for Chinese novel/documents implemented in python.

Stars: ✭ 98 (-47.87%)

Mutual labels: tf-idf

contextualSpellCheck

✔️Contextual word checker for better suggestions

Stars: ✭ 274 (+45.74%)

Mutual labels: bert

text-classification-baseline

Pipeline for fast building text classification TF-IDF + LogReg baselines.

Stars: ✭ 55 (-70.74%)

Mutual labels: tf-idf

bredon

A modern CSS value compiler in JavaScript

Stars: ✭ 39 (-79.26%)

Mutual labels: tokenizer

ark-nlp

A private nlp coding package, which quickly implements the SOTA solutions.

Stars: ✭ 232 (+23.4%)

Mutual labels: bert

liblex

C library for Lexical Analysis

Stars: ✭ 25 (-86.7%)

Mutual labels: tokenizer

XORQA

This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering".

Stars: ✭ 61 (-67.55%)

Mutual labels: question-answering

consistency-adversarial

Consistency Regularization for Adversarial Robustness (AAAI 2022)

Stars: ✭ 37 (-80.32%)

Mutual labels: data-augmentation

relation-network

Tensorflow Implementation of Relation Networks for the bAbI QA Task, detailed in "A Simple Neural Network Module for Relational Reasoning," [https://arxiv.org/abs/1706.01427] by Santoro et. al.

Stars: ✭ 45 (-76.06%)

Mutual labels: embeddings

gnn-lspe

Source code for GNN-LSPE (Graph Neural Networks with Learnable Structural and Positional Representations), ICLR 2022

Stars: ✭ 165 (-12.23%)

Mutual labels: transformers

denspi

Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index (DenSPI)

Stars: ✭ 188 (+0%)

Mutual labels: question-answering

301-360 of 1196 similar projects