All Projects → Talismane → Similar Projects or Alternatives

257 Open source projects that are alternatives of or similar to Talismane

Sacremoses
Python port of Moses tokenizer, truecaser and normalizer
Stars: ✭ 293 (+671.05%)
Mutual labels:  tokenizer
ConveRT-pytorch
ConveRT Paper Pytorch Implementation
Stars: ✭ 49 (+28.95%)
Mutual labels:  nlp-machine-learning
Awesome Sentiment Analysis
Repository with all what is necessary for sentiment analysis and related areas
Stars: ✭ 459 (+1107.89%)
Mutual labels:  nlp-machine-learning
Contextualized Topic Models
A python package to run contextualized topic modeling. CTMs combine BERT with topic models to get coherent topics. Also supports multilingual tasks. Cross-lingual Zero-shot model published at EACL 2021.
Stars: ✭ 318 (+736.84%)
Mutual labels:  nlp-machine-learning
adversarial-relation-classification
Unsupervised domain adaptation method for relation extraction
Stars: ✭ 18 (-52.63%)
Mutual labels:  nlp-machine-learning
Chinese models for spacy
SpaCy 中文模型 | Models for SpaCy that support Chinese
Stars: ✭ 543 (+1328.95%)
Mutual labels:  nlp-machine-learning
Customer satisfaction analysis
基于在线民宿 UGC 数据的意见挖掘项目,包含数据挖掘和NLP 相关的处理,负责数据采集、主题抽取、情感分析等任务。目的是克服用户打分和评论不一致,实时对在线民宿的满意度评测,包含在线评论采集和情感可视化分析。搭建了百度地图POI查询入口,可以进行自动化的批量查询 POI 信息的功能;构建了基于在线民宿语料的 LDA 自动主题聚类模型,利用主题中心词能找出对应的主题属性字典;以用户打分作为标注,然后 litNlp 自带的字符级 TextCNN 进行情感分析,将情感分类概率分布作为情感趋势,最后通过 POI 热力图的方式对不同地域的民宿满意度进行展示。软件版本请见链接。
Stars: ✭ 262 (+589.47%)
Mutual labels:  nlp-machine-learning
Snl Compiler
SNL(Small Nested Language) Compiler. Maven jUnit Tokenizer Lexer Syntax Parser. 编译原理 词法分析 语法分析
Stars: ✭ 19 (-50%)
Mutual labels:  tokenizer
cang-jie
Chinese tokenizer for tantivy, based on jieba-rs
Stars: ✭ 48 (+26.32%)
Mutual labels:  tokenizer
Moo
Optimised tokenizer/lexer generator! 🐄 Uses /y for performance. Moo.
Stars: ✭ 434 (+1042.11%)
Mutual labels:  tokenizer
Lingua
👄 The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
Stars: ✭ 341 (+797.37%)
Mutual labels:  nlp-machine-learning
sensim
Sentence Similarity Estimator (SenSim)
Stars: ✭ 15 (-60.53%)
Mutual labels:  nlp-machine-learning
Tapas
End-to-end neural table-text understanding models.
Stars: ✭ 583 (+1434.21%)
Mutual labels:  nlp-machine-learning
Sentences
A multilingual command line sentence tokenizer in Golang
Stars: ✭ 293 (+671.05%)
Mutual labels:  tokenizer
Lisp Esque Language
💠The Lel programming language
Stars: ✭ 24 (-36.84%)
Mutual labels:  tokenizer
Dstc8 Schema Guided Dialogue
The Schema-Guided Dialogue Dataset
Stars: ✭ 277 (+628.95%)
Mutual labels:  nlp-machine-learning
Tokenizer
A small library for converting tokenized PHP source code into XML (and potentially other formats)
Stars: ✭ 4,770 (+12452.63%)
Mutual labels:  tokenizer
pascal-interpreter
A simple interpreter for a large subset of Pascal language written for educational purposes
Stars: ✭ 21 (-44.74%)
Mutual labels:  tokenizer
Lfuzzer
Fuzzing Parsers with Tokens
Stars: ✭ 28 (-26.32%)
Mutual labels:  tokenizer
Hebrew-Tokenizer
A very simple python tokenizer for Hebrew text.
Stars: ✭ 16 (-57.89%)
Mutual labels:  tokenizer
Smoothnlp
专注于可解释的NLP技术 An NLP Toolset With A Focus on Explainable Inference
Stars: ✭ 435 (+1044.74%)
Mutual labels:  tokenizer
NLPnote
Gitbook Address: https://app.gitbook.com/@nlpgroup/s/nlpnote/
Stars: ✭ 101 (+165.79%)
Mutual labels:  nlp-machine-learning
Natasha
Solves basic Russian NLP tasks, API for lower level Natasha projects
Stars: ✭ 788 (+1973.68%)
Mutual labels:  tokenizer
text2text
Text2Text: Cross-lingual natural language processing and generation toolkit
Stars: ✭ 188 (+394.74%)
Mutual labels:  tokenizer
Php Parser
🌿 NodeJS PHP Parser - extract AST or tokens (PHP5 and PHP7)
Stars: ✭ 400 (+952.63%)
Mutual labels:  tokenizer
Nlp Conference Compendium
Compendium of the resources available from top NLP conferences.
Stars: ✭ 349 (+818.42%)
Mutual labels:  nlp-machine-learning
nlp newsletter
Natural language processing (NLP) newsletter right on GitHub
Stars: ✭ 57 (+50%)
Mutual labels:  nlp-machine-learning
Deeppavlov
An open source library for deep learning end-to-end dialog systems and chatbots.
Stars: ✭ 5,525 (+14439.47%)
Mutual labels:  nlp-machine-learning
Lexmachine
Lex machinary for go.
Stars: ✭ 335 (+781.58%)
Mutual labels:  tokenizer
React Input Tags
React component for tagging inputs.
Stars: ✭ 10 (-73.68%)
Mutual labels:  tokenizer
Friso
High performance Chinese tokenizer with both GBK and UTF-8 charset support based on MMSEG algorithm developed by ANSI C. Completely based on modular implementation and can be easily embedded in other programs, like: MySQL, PostgreSQL, PHP, etc.
Stars: ✭ 313 (+723.68%)
Mutual labels:  tokenizer
Kagome
Self-contained Japanese Morphological Analyzer written in pure Go
Stars: ✭ 554 (+1357.89%)
Mutual labels:  tokenizer
Dab
Data Augmentation by Backtranslation (DAB) ヽ( •_-)ᕗ
Stars: ✭ 294 (+673.68%)
Mutual labels:  nlp-machine-learning
Omnicat Bayes
Naive Bayes text classification implementation as an OmniCat classifier strategy. (#ruby #naivebayes)
Stars: ✭ 30 (-21.05%)
Mutual labels:  tokenizer
Ner
Named Entity Recognition
Stars: ✭ 288 (+657.89%)
Mutual labels:  nlp-machine-learning
Nlp base
自然语言基础模型
Stars: ✭ 524 (+1278.95%)
Mutual labels:  nlp-machine-learning
Data Science Hacks
Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
Stars: ✭ 273 (+618.42%)
Mutual labels:  nlp-machine-learning
Click2analyze Androiddevchallenge
An app to analyze the text and fixing the anomaly of the message that deviates from what is standard, normal, or expected. #AndroidDevChallenge
Stars: ✭ 20 (-47.37%)
Mutual labels:  nlp-machine-learning
Jumanpp
Juman++ (a Morphological Analyzer Toolkit)
Stars: ✭ 254 (+568.42%)
Mutual labels:  tokenizer
Babyai
BabyAI platform. A testbed for training agents to understand and execute language commands.
Stars: ✭ 490 (+1189.47%)
Mutual labels:  nlp-machine-learning
ArabicProcessingCog
A Python package that do stemming, tokenization, sentence breaking, segmentation, normalization, POS tagging for Arabic language.
Stars: ✭ 19 (-50%)
Mutual labels:  tokenizer
Letslearnai.github.io
Lets Learn AI
Stars: ✭ 33 (-13.16%)
Mutual labels:  nlp-machine-learning
fairseq-tagging
a Fairseq fork for sequence tagging/labeling tasks
Stars: ✭ 26 (-31.58%)
Mutual labels:  nlp-machine-learning
Open Korean Text
Open Korean Text Processor - An Open-source Korean Text Processor
Stars: ✭ 438 (+1052.63%)
Mutual labels:  tokenizer
knowledge-extraction-recipes-forms
Knowledge Extraction For Forms Accelerators & Examples
Stars: ✭ 144 (+278.95%)
Mutual labels:  nlp-machine-learning
Rasa Ui
Rasa UI is a frontend for the Rasa Framework
Stars: ✭ 796 (+1994.74%)
Mutual labels:  nlp-machine-learning
nlp-qrmine
🔦 Qualitative Research support tools in Python
Stars: ✭ 28 (-26.32%)
Mutual labels:  nlp-machine-learning
Ekphrasis
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
Stars: ✭ 433 (+1039.47%)
Mutual labels:  tokenizer
sent2vec
How to encode sentences in a high-dimensional vector space, a.k.a., sentence embedding.
Stars: ✭ 99 (+160.53%)
Mutual labels:  nlp-machine-learning
Sdtm mapper
AI SDTM mapping (R for ML, Python, TensorFlow for DL)
Stars: ✭ 27 (-28.95%)
Mutual labels:  nlp-machine-learning
PaddleTokenizer
使用 PaddlePaddle 实现基于深度神经网络的中文分词引擎 | A DNN Chinese Tokenizer by Using PaddlePaddle
Stars: ✭ 14 (-63.16%)
Mutual labels:  tokenizer
Hands On Nltk Tutorial
The hands-on NLTK tutorial for NLP in Python
Stars: ✭ 419 (+1002.63%)
Mutual labels:  nlp-machine-learning
Natural-Language-Processing
Contains various architectures and novel paper implementations for Natural Language Processing tasks like Sequence Modelling and Neural Machine Translation.
Stars: ✭ 48 (+26.32%)
Mutual labels:  nlp-machine-learning
Mustard
🌭 Mustard is a Swift library for tokenizing strings when splitting by whitespace doesn't cut it.
Stars: ✭ 689 (+1713.16%)
Mutual labels:  tokenizer
Jflex
The fast scanner generator for Java™ with full Unicode support
Stars: ✭ 380 (+900%)
Mutual labels:  tokenizer
Sharpmath
A small .NET math library.
Stars: ✭ 36 (-5.26%)
Mutual labels:  tokenizer
Nlp Js Tools French
POS Tagger, lemmatizer and stemmer for french language in javascript
Stars: ✭ 32 (-15.79%)
Mutual labels:  tokenizer
Laravel Token
Laravel token management
Stars: ✭ 10 (-73.68%)
Mutual labels:  tokenizer
Soynlp
한국어 자연어처리를 위한 파이썬 라이브러리입니다. 단어 추출/ 토크나이저 / 품사판별/ 전처리의 기능을 제공합니다.
Stars: ✭ 613 (+1513.16%)
Mutual labels:  tokenizer
Text mining resources
Resources for learning about Text Mining and Natural Language Processing
Stars: ✭ 358 (+842.11%)
Mutual labels:  nlp-machine-learning
1-60 of 257 similar projects