TapasEnd-to-end neural table-text understanding models.
Stars: ✭ 583 (+398.29%)
Lingua👄 The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
Stars: ✭ 341 (+191.45%)
Sdtm mapperAI SDTM mapping (R for ML, Python, TensorFlow for DL)
Stars: ✭ 27 (-76.92%)
fairseq-tagginga Fairseq fork for sequence tagging/labeling tasks
Stars: ✭ 26 (-77.78%)
Wiki SplitOne million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits.
Stars: ✭ 95 (-18.8%)
Gec PseudodataRepository of "An Empirical Study of Incorporating Pseudo Data into Grammatical Error Correction" (EMNLP-IJCNLP 2019)
Stars: ✭ 49 (-58.12%)
TalismaneNLP framework: sentence detector, tokeniser, pos-tagger and dependency parser
Stars: ✭ 38 (-67.52%)
sent2vecHow to encode sentences in a high-dimensional vector space, a.k.a., sentence embedding.
Stars: ✭ 99 (-15.38%)
SummarusModels for automatic abstractive summarization
Stars: ✭ 83 (-29.06%)
Rasa UiRasa UI is a frontend for the Rasa Framework
Stars: ✭ 796 (+580.34%)
Question GenerationGiven a sentence automatically generate reading comprehension style factual questions from that sentence, such that the sentence contains answers to those questions.
Stars: ✭ 100 (-14.53%)
Nlp base自然语言基础模型
Stars: ✭ 524 (+347.86%)
Text mining resourcesResources for learning about Text Mining and Natural Language Processing
Stars: ✭ 358 (+205.98%)
Textaugmentation Gpt2Fine-tuned pre-trained GPT2 for custom topic specific text generation. Such system can be used for Text Augmentation.
Stars: ✭ 104 (-11.11%)
DabData Augmentation by Backtranslation (DAB) ヽ( •_-)ᕗ
Stars: ✭ 294 (+151.28%)
Customer satisfaction analysis基于在线民宿 UGC 数据的意见挖掘项目,包含数据挖掘和NLP 相关的处理,负责数据采集、主题抽取、情感分析等任务。目的是克服用户打分和评论不一致,实时对在线民宿的满意度评测,包含在线评论采集和情感可视化分析。搭建了百度地图POI查询入口,可以进行自动化的批量查询 POI 信息的功能;构建了基于在线民宿语料的 LDA 自动主题聚类模型,利用主题中心词能找出对应的主题属性字典;以用户打分作为标注,然后 litNlp 自带的字符级 TextCNN 进行情感分析,将情感分类概率分布作为情感趋势,最后通过 POI 热力图的方式对不同地域的民宿满意度进行展示。软件版本请见链接。
Stars: ✭ 262 (+123.93%)
DatascienceIt consists of examples, assignments discussed in data science course taken at algorithmica.
Stars: ✭ 92 (-21.37%)
nlp-qrmine🔦 Qualitative Research support tools in Python
Stars: ✭ 28 (-76.07%)
CodesearchnetDatasets, tools, and benchmarks for representation learning of code.
Stars: ✭ 1,378 (+1077.78%)
Click2analyze AndroiddevchallengeAn app to analyze the text and fixing the anomaly of the message that deviates from what is standard, normal, or expected. #AndroidDevChallenge
Stars: ✭ 20 (-82.91%)
Russian news corpusRussian mass media stemmed texts corpus / Корпус лемматизированных (морфологически нормализованных) текстов российских СМИ
Stars: ✭ 76 (-35.04%)
DeeppavlovAn open source library for deep learning end-to-end dialog systems and chatbots.
Stars: ✭ 5,525 (+4622.22%)
LemminflectA python module for English lemmatization and inflection.
Stars: ✭ 105 (-10.26%)
Nlp Paper自然语言处理领域下的对话语音领域,整理相关论文(附阅读笔记),复现模型以及数据处理等(代码含TensorFlow和PyTorch两版本)
Stars: ✭ 67 (-42.74%)
BabyaiBabyAI platform. A testbed for training agents to understand and execute language commands.
Stars: ✭ 490 (+318.8%)
Monkeylearn⛔️ ARCHIVED ⛔️ 🐒 R package for text analysis with Monkeylearn 🐒
Stars: ✭ 95 (-18.8%)
Aiops platformAn Artificial Intelligence Platform for IT Operations.
Stars: ✭ 63 (-46.15%)
Contextualized Topic ModelsA python package to run contextualized topic modeling. CTMs combine BERT with topic models to get coherent topics. Also supports multilingual tasks. Cross-lingual Zero-shot model published at EACL 2021.
Stars: ✭ 318 (+171.79%)
NerNamed Entity Recognition
Stars: ✭ 288 (+146.15%)
Data Science HacksData Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
Stars: ✭ 273 (+133.33%)
Repo 2016R, Python and Mathematica Codes in Machine Learning, Deep Learning, Artificial Intelligence, NLP and Geolocation
Stars: ✭ 103 (-11.97%)
News push projectReal Time News Scraping and Recommendation System - React | Tensorflow | NLP | News Scrapers
Stars: ✭ 44 (-62.39%)
NLPnoteGitbook Address: https://app.gitbook.com/@nlpgroup/s/nlpnote/
Stars: ✭ 101 (-13.68%)
Doc2vec📓 Long(er) text representation and classification using Doc2Vec embeddings
Stars: ✭ 92 (-21.37%)
Lingopackage lingo provides the data structures and algorithms required for natural language processing
Stars: ✭ 113 (-3.42%)
AtnreAdversarial Training for Neural Relation Extraction
Stars: ✭ 108 (-7.69%)
Mrc book《机器阅读理解:算法与实践》代码
Stars: ✭ 102 (-12.82%)
Lda Topic ModelingA PureScript, browser-based implementation of LDA topic modeling.
Stars: ✭ 91 (-22.22%)
Tika PythonTika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Stars: ✭ 997 (+752.14%)