sent2vecHow to encode sentences in a high-dimensional vector space, a.k.a., sentence embedding.
Stars: ✭ 99 (+5.32%)
Sdtm mapperAI SDTM mapping (R for ML, Python, TensorFlow for DL)
Stars: ✭ 27 (-71.28%)
Lingua👄 The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
Stars: ✭ 341 (+262.77%)
easyNLPDo NLP without coding!
Stars: ✭ 19 (-79.79%)
Tika PythonTika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Stars: ✭ 997 (+960.64%)
fairseq-tagginga Fairseq fork for sequence tagging/labeling tasks
Stars: ✭ 26 (-72.34%)
nlp newsletterNatural language processing (NLP) newsletter right on GitHub
Stars: ✭ 57 (-39.36%)
TapasEnd-to-end neural table-text understanding models.
Stars: ✭ 583 (+520.21%)
Text mining resourcesResources for learning about Text Mining and Natural Language Processing
Stars: ✭ 358 (+280.85%)
NTUA-slp-nlp💻Speech and Natural Language Processing (SLP & NLP) Lab Assignments for ECE NTUA
Stars: ✭ 19 (-79.79%)
DabData Augmentation by Backtranslation (DAB) ヽ( •_-)ᕗ
Stars: ✭ 294 (+212.77%)
Customer satisfaction analysis基于在线民宿 UGC 数据的意见挖掘项目,包含数据挖掘和NLP 相关的处理,负责数据采集、主题抽取、情感分析等任务。目的是克服用户打分和评论不一致,实时对在线民宿的满意度评测,包含在线评论采集和情感可视化分析。搭建了百度地图POI查询入口,可以进行自动化的批量查询 POI 信息的功能;构建了基于在线民宿语料的 LDA 自动主题聚类模型,利用主题中心词能找出对应的主题属性字典;以用户打分作为标注,然后 litNlp 自带的字符级 TextCNN 进行情感分析,将情感分类概率分布作为情感趋势,最后通过 POI 热力图的方式对不同地域的民宿满意度进行展示。软件版本请见链接。
Stars: ✭ 262 (+178.72%)
TalismaneNLP framework: sentence detector, tokeniser, pos-tagger and dependency parser
Stars: ✭ 38 (-59.57%)
nlp-qrmine🔦 Qualitative Research support tools in Python
Stars: ✭ 28 (-70.21%)
SummarusModels for automatic abstractive summarization
Stars: ✭ 83 (-11.7%)
Natural-Language-ProcessingContains various architectures and novel paper implementations for Natural Language Processing tasks like Sequence Modelling and Neural Machine Translation.
Stars: ✭ 48 (-48.94%)
Rasa UiRasa UI is a frontend for the Rasa Framework
Stars: ✭ 796 (+746.81%)
ChatbotA Deep-Learning multi-purpose chatbot made using Python3
Stars: ✭ 36 (-61.7%)
Nlp base自然语言基础模型
Stars: ✭ 524 (+457.45%)
News push projectReal Time News Scraping and Recommendation System - React | Tensorflow | NLP | News Scrapers
Stars: ✭ 44 (-53.19%)
Nlp Paper自然语言处理领域下的对话语音领域,整理相关论文(附阅读笔记),复现模型以及数据处理等(代码含TensorFlow和PyTorch两版本)
Stars: ✭ 67 (-28.72%)
Contextualized Topic ModelsA python package to run contextualized topic modeling. CTMs combine BERT with topic models to get coherent topics. Also supports multilingual tasks. Cross-lingual Zero-shot model published at EACL 2021.
Stars: ✭ 318 (+238.3%)
NerNamed Entity Recognition
Stars: ✭ 288 (+206.38%)
Data Science HacksData Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
Stars: ✭ 273 (+190.43%)
Aiops platformAn Artificial Intelligence Platform for IT Operations.
Stars: ✭ 63 (-32.98%)
NLPnoteGitbook Address: https://app.gitbook.com/@nlpgroup/s/nlpnote/
Stars: ✭ 101 (+7.45%)
Doc2vec📓 Long(er) text representation and classification using Doc2Vec embeddings
Stars: ✭ 92 (-2.13%)
Click2analyze AndroiddevchallengeAn app to analyze the text and fixing the anomaly of the message that deviates from what is standard, normal, or expected. #AndroidDevChallenge
Stars: ✭ 20 (-78.72%)
sensimSentence Similarity Estimator (SenSim)
Stars: ✭ 15 (-84.04%)
DeeppavlovAn open source library for deep learning end-to-end dialog systems and chatbots.
Stars: ✭ 5,525 (+5777.66%)
use-cases-of-bertUse-cases of Hugging Face's BERT (e.g. paraphrase generation, unsupervised extractive summarization).
Stars: ✭ 18 (-80.85%)
Russian news corpusRussian mass media stemmed texts corpus / Корпус лемматизированных (морфологически нормализованных) текстов российских СМИ
Stars: ✭ 76 (-19.15%)
phd-resourcesInternet Delivered Treatment using Adaptive Technology
Stars: ✭ 37 (-60.64%)
SentimentAnalysisSentiment Analysis: Deep Bi-LSTM+attention model
Stars: ✭ 32 (-65.96%)
BabyaiBabyAI platform. A testbed for training agents to understand and execute language commands.
Stars: ✭ 490 (+421.28%)
DatascienceIt consists of examples, assignments discussed in data science course taken at algorithmica.
Stars: ✭ 92 (-2.13%)
Lda Topic ModelingA PureScript, browser-based implementation of LDA topic modeling.
Stars: ✭ 91 (-3.19%)
Gec PseudodataRepository of "An Empirical Study of Incorporating Pseudo Data into Grammatical Error Correction" (EMNLP-IJCNLP 2019)
Stars: ✭ 49 (-47.87%)