thomasthiebaud / spacy-fastlang

Licence: MIT license
Language detection using spaCy and fastText

Projects that are alternatives of or similar to spacy-fastlang

nlpbuddy
A text analysis application for performing common NLP tasks through a web dashboard interface and an API
Stars: ✭ 115 (+238.24%)
Mutual labels:  spacy, fasttext
spaczz
Fuzzy matching and more functionality for spaCy.
Stars: ✭ 215 (+532.35%)
Mutual labels:  spacy, spacy-extensions
Fasttext.js
FastText for Node.js
Stars: ✭ 127 (+273.53%)
Mutual labels:  language-detection, fasttext
spacy-langdetect
A fully customisable language detection pipeline for spaCy
Stars: ✭ 86 (+152.94%)
Mutual labels:  language-detection, spacy
Whatthelang
Lightning Fast Language Prediction 🚀
Stars: ✭ 130 (+282.35%)
Mutual labels:  language-detection, fasttext
TRUNAJOD2.0
An easy-to-use library to extract indices from texts.
Stars: ✭ 18 (-47.06%)
Mutual labels:  spacy, spacy-extensions
FastText.NetWrapper
.NET Standard wrapper for fastText library. Now works on Windows, Linux and MacOs!
Stars: ✭ 57 (+67.65%)
Mutual labels:  fasttext
pynsett
A programmable relation extraction tool
Stars: ✭ 25 (-26.47%)
Mutual labels:  spacy
airy
💬 Open source conversational platform to power conversations with an open source Live Chat, Messengers like Facebook Messenger, WhatsApp and more - 💎 UI from Inbox to dashboards - 🤖 Integrations to Conversational AI / NLP tools and standard enterprise software - ⚡ APIs, WebSocket, Webhook - 🔧 Create any conversational experience
Stars: ✭ 299 (+779.41%)
Mutual labels:  spacy
spacy-server
🦜 Containerized HTTP API for industrial-strength NLP via spaCy and sense2vec
Stars: ✭ 58 (+70.59%)
Mutual labels:  spacy
contextualSpellCheck
✔️Contextual word checker for better suggestions
Stars: ✭ 274 (+705.88%)
Mutual labels:  spacy
lingua-go
👄 The most accurate natural language detection library for Go, suitable for long and short text alike
Stars: ✭ 684 (+1911.76%)
Mutual labels:  language-detection
spacy hunspell
✏️ Hunspell extension for spaCy 2.0.
Stars: ✭ 94 (+176.47%)
Mutual labels:  spacy
agile
🌌 Global State and Logic Library for JavaScript/Typescript applications
Stars: ✭ 90 (+164.71%)
Mutual labels:  spacy
ml-datasets
🌊 Machine learning dataset loaders for testing and example scripts
Stars: ✭ 40 (+17.65%)
Mutual labels:  spacy
SwiftUIMLKitTranslator
SwiftUI MLKit Language Identification & Translator
Stars: ✭ 23 (-32.35%)
Mutual labels:  language-detection
goclassy
An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.
Stars: ✭ 81 (+138.24%)
Mutual labels:  fasttext
jstarcraft-nlp
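Focused on solving several core problems in natural language processing: lexical analysis, syntactic analysis, semantic analysis, language detection, information extraction, text clustering, and text classification. Provides developers in related fields with a complete general-purpose design and reference implementation. Covers a variety of NLP algorithms and adapts to multiple NLP frameworks. Compatible with Lucene/Solr/ElasticSearch plugins.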
Stars: ✭ 92 (+170.59%)
Mutual labels:  language-detection
converse
Conversational text Analysis using various NLP techniques
Stars: ✭ 147 (+332.35%)
Mutual labels:  spacy
spacy-french-models
French models for spacy
Stars: ✭ 22 (-35.29%)
Mutual labels:  spacy

spacy_fastlang

Install

Assuming you have a working Python environment, install the package with:

pip install spacy_fastlang

Usage

The library exports a pipeline component called language_detector that sets two spaCy extensions:

  • doc._.language = ISO code of the detected language, or xx as a fallback
  • doc._.language_score = confidence of the prediction

import spacy
import spacy_fastlang  # imported for its side effect: makes language_detector available

nlp = spacy.load("...")
nlp.add_pipe("language_detector")
doc = nlp(en_text)

doc._.language == "..."
doc._.language_score >= ...
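
The fallback to xx described above can be illustrated without spaCy or fastText installed. The following is a minimal sketch, not the library's actual implementation: the helper name pick_language, the 0.5 threshold, and the choice to return the top score even when falling back are all assumptions made for illustration. The only detail taken from fastText itself is its default `__label__` prefix on predicted labels.

```python
from typing import List, Tuple

LABEL_PREFIX = "__label__"  # fastText's default prefix on predicted labels
FALLBACK_LANGUAGE = "xx"    # fallback code mentioned in the extension list


def pick_language(
    labels: List[str],
    scores: List[float],
    threshold: float = 0.5,  # assumed value, chosen only for this sketch
) -> Tuple[str, float]:
    """Hypothetical helper: map raw predictions to (language, score).

    Returns the fallback code when there are no predictions or when the
    best score is below the threshold.
    """
    if not labels or not scores:
        return FALLBACK_LANGUAGE, 0.0

    # Pick the prediction with the highest confidence.
    best_label, best_score = max(zip(labels, scores), key=lambda pair: pair[1])

    if best_score < threshold:
        # Assumption: keep the low score so callers can inspect it.
        return FALLBACK_LANGUAGE, best_score

    # Strip the fastText prefix to leave the bare language code.
    if best_label.startswith(LABEL_PREFIX):
        best_label = best_label[len(LABEL_PREFIX):]
    return best_label, best_score


if __name__ == "__main__":
    print(pick_language(["__label__en", "__label__fr"], [0.97, 0.02]))
    print(pick_language(["__label__de"], [0.3]))
```

In a real pipeline the two returned values would correspond to doc._.language and doc._.language_score. How the library itself chooses its threshold is not stated here, so check the tests before relying on any particular value.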

Options

See the tests for more examples and the available options.

License

Everything is under the MIT license except the default model, which is distributed by Facebook under the Creative Commons Attribution-Share-Alike License 3.0.