604 open source projects that are alternatives to or similar to Natasha

nerus
Large silver standard Russian corpus with NER, morphology and syntax markup
Stars: ✭ 47 (-94.04%)
Mutual labels:  syntax, russian, ner
neuro-comma
🇷🇺 Production-ready punctuation restoration model for the Russian language 🇷🇺
Stars: ✭ 46 (-94.16%)
Mutual labels:  russian, ner
navec
Compact, high-quality word embeddings for the Russian language
Stars: ✭ 118 (-85.03%)
Mutual labels:  embeddings, russian
text2text
Text2Text: Cross-lingual natural language processing and generation toolkit
Stars: ✭ 188 (-76.14%)
Mutual labels:  tokenizer, embeddings
Xmnlp
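xmnlp: provides Chinese word segmentation, part-of-speech tagging, named entity recognition, sentiment analysis, text correction, text-to-pinyin conversion, text summarization, radical extraction, and other features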
Stars: ✭ 591 (-25%)
Mutual labels:  ner
Fasthan
fastHan is a Chinese natural language processing tool built on fastNLP and PyTorch, as convenient to call as spaCy.
Stars: ✭ 449 (-43.02%)
Mutual labels:  ner
Open Korean Text
Open Korean Text Processor - An Open-source Korean Text Processor
Stars: ✭ 438 (-44.42%)
Mutual labels:  tokenizer
Multi Class Text Classification Cnn
Classify Kaggle Consumer Finance Complaints into 11 classes. Build the model with CNN (Convolutional Neural Network) and Word Embeddings on Tensorflow.
Stars: ✭ 410 (-47.97%)
Mutual labels:  embeddings
Wikipedia2vec
A tool for learning vector representations of words and entities from Wikipedia
Stars: ✭ 655 (-16.88%)
Mutual labels:  embeddings
Multi Class Text Classification Cnn Rnn
Classify Kaggle San Francisco Crime Description into 39 classes. Build the model with CNN, RNN (GRU and LSTM) and Word Embeddings on Tensorflow.
Stars: ✭ 570 (-27.66%)
Mutual labels:  embeddings
Dynamic Dark Mode
The smart, automatic Dark Mode toggle for macOS Mojave+
Stars: ✭ 397 (-49.62%)
Mutual labels:  russian
Jionlp
A preprocessing toolkit for Chinese NLP tasks: accurate, efficient, and with zero barrier to entry
Stars: ✭ 449 (-43.02%)
Mutual labels:  ner
Soynlp
A Python library for Korean natural language processing. It provides word extraction, tokenization, part-of-speech tagging, and preprocessing.
Stars: ✭ 613 (-22.21%)
Mutual labels:  tokenizer
Ru.reactjs.org
React documentation website in Russian / Official Russian version of the React website
Stars: ✭ 444 (-43.65%)
Mutual labels:  russian
Qa bible
The QA Bible is almost 300 pages of a continually updated mix of answers to questions from real QA interviews, useful resources and articles, translations of interesting content from foreign sources, and aggregated material from domestic ones.
Stars: ✭ 657 (-16.62%)
Mutual labels:  russian
Ekphrasis
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (English Wikipedia, Twitter - 330mil English tweets).
Stars: ✭ 433 (-45.05%)
Mutual labels:  tokenizer
Open stt
Open STT
Stars: ✭ 584 (-25.89%)
Mutual labels:  russian
Php Parser
🌿 NodeJS PHP Parser - extract AST or tokens (PHP5 and PHP7)
Stars: ✭ 400 (-49.24%)
Mutual labels:  tokenizer
Nord Visual Studio Code
An arctic, north-bluish clean and elegant Visual Studio Code theme.
Stars: ✭ 749 (-4.95%)
Mutual labels:  syntax
Naomi
Sublime Text enhanced syntax highlighting for JavaScript ES6/ES7/ES2015/ES2016/ES2017+, Babel, FlowType, React JSX, Styled Components, HTML5, SCSS3, PHP 7, phpDoc, PHPUnit, MQL4. Basic: Git config files.
Stars: ✭ 544 (-30.96%)
Mutual labels:  syntax
Fast sentence embeddings
Compute Sentence Embeddings Fast!
Stars: ✭ 384 (-51.27%)
Mutual labels:  embeddings
Bert Multitask Learning
BERT for Multitask Learning
Stars: ✭ 380 (-51.78%)
Mutual labels:  ner
Go101
An online book focusing on Go syntax/semantics and runtime related things
Stars: ✭ 4,128 (+423.86%)
Mutual labels:  syntax
Phplint
🐛 A tool that can speed up linting of php files by running several lint processes at once.
Stars: ✭ 646 (-18.02%)
Mutual labels:  syntax
Ner Lstm
Named Entity Recognition using multilayered bidirectional LSTM
Stars: ✭ 532 (-32.49%)
Mutual labels:  embeddings
Autoner
Learning Named Entity Tagger from Domain-Specific Dictionary
Stars: ✭ 357 (-54.7%)
Mutual labels:  ner
Lightkg
A knowledge graph deep learning framework based on PyTorch and torchtext.
Stars: ✭ 452 (-42.64%)
Mutual labels:  ner
Speedtorch
Library for faster pinned CPU <-> GPU transfer in Pytorch
Stars: ✭ 615 (-21.95%)
Mutual labels:  embeddings
Gogrep
Search for Go code using syntax trees
Stars: ✭ 450 (-42.89%)
Mutual labels:  syntax
Cluener2020
CLUENER2020: Chinese Fine-Grained Named Entity Recognition
Stars: ✭ 689 (-12.56%)
Mutual labels:  ner
Lightly
A python library for self-supervised learning on images.
Stars: ✭ 439 (-44.29%)
Mutual labels:  embeddings
Code Surfer
Rad code slides <🏄/>
Stars: ✭ 5,477 (+595.05%)
Mutual labels:  syntax
Nimfa
Nimfa: Nonnegative matrix factorization in Python
Stars: ✭ 440 (-44.16%)
Mutual labels:  embeddings
Rhvoice
a free and open source speech synthesizer for Russian and other languages
Stars: ✭ 750 (-4.82%)
Mutual labels:  russian
Smoothnlp
Focused on explainable NLP techniques / An NLP Toolset With A Focus on Explainable Inference
Stars: ✭ 435 (-44.8%)
Mutual labels:  tokenizer
Proposal Pipeline Operator
A proposal for adding a useful pipe operator to JavaScript.
Stars: ✭ 5,899 (+648.6%)
Mutual labels:  syntax
Moo
Optimised tokenizer/lexer generator! 🐄 Uses /y for performance. Moo.
Stars: ✭ 434 (-44.92%)
Mutual labels:  tokenizer
Bert Ner Pytorch
Chinese NER(Named Entity Recognition) using BERT(Softmax, CRF, Span)
Stars: ✭ 654 (-17.01%)
Mutual labels:  ner
Lmdb Embeddings
Fast word vectors with little memory usage in Python
Stars: ✭ 404 (-48.73%)
Mutual labels:  embeddings
Sequence Labeling Bilstm Crf
The classical BiLSTM-CRF model implemented in Tensorflow, for sequence labeling tasks. In Vex version, everything is configurable.
Stars: ✭ 579 (-26.52%)
Mutual labels:  ner
Javascript Videos Ru 2018
A collection of video recordings of talks about JavaScript | 2018
Stars: ✭ 401 (-49.11%)
Mutual labels:  russian
Lm Lstm Crf
Empower Sequence Labeling with Task-Aware Language Model
Stars: ✭ 778 (-1.27%)
Mutual labels:  ner
Petrovich Ruby
Petrovich, an inflector for Russian anthroponyms.
Stars: ✭ 396 (-49.75%)
Mutual labels:  russian
Kagome
Self-contained Japanese Morphological Analyzer written in pure Go
Stars: ✭ 554 (-29.7%)
Mutual labels:  tokenizer
Jflex
The fast scanner generator for Java™ with full Unicode support
Stars: ✭ 380 (-51.78%)
Mutual labels:  tokenizer
Node2vec
Implementation of the node2vec algorithm.
Stars: ✭ 654 (-17.01%)
Mutual labels:  embeddings
Nord Emacs
An arctic, north-bluish clean and elegant Emacs theme.
Stars: ✭ 379 (-51.9%)
Mutual labels:  syntax
Python intro
Jupyter notebooks in Russian. Introduction to Python, basic algorithms and data structures
Stars: ✭ 538 (-31.73%)
Mutual labels:  russian
Spacy Streamlit
👑 spaCy building blocks and visualizers for Streamlit apps
Stars: ✭ 360 (-54.31%)
Mutual labels:  ner
Yedda
YEDDA: A Lightweight Collaborative Text Span Annotation Tool. Code for ACL 2018 Best Demo Paper Nomination.
Stars: ✭ 704 (-10.66%)
Mutual labels:  ner
Rust Bert
Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)
Stars: ✭ 510 (-35.28%)
Mutual labels:  ner
Ruby Style Guide
📘 Russian Version: A community-driven Ruby coding style guide.
Stars: ✭ 358 (-54.57%)
Mutual labels:  russian
Nlp Cube
Natural Language Processing Pipeline - Sentence Splitting, Tokenization, Lemmatization, Part-of-speech Tagging and Dependency Parsing
Stars: ✭ 353 (-55.2%)
Mutual labels:  embeddings
Vncorenlp
A Vietnamese natural language processing toolkit (NAACL 2018)
Stars: ✭ 354 (-55.08%)
Mutual labels:  ner
Sublime Markdown Extended
Top 100 Sublime Text plugin! Markdown syntax highlighter for Sublime Text, with extended support for GFM fenced code blocks, with language-specific syntax highlighting. YAML Front Matter. Works with ST2/ST3. Goes great with Assemble.
Stars: ✭ 645 (-18.15%)
Mutual labels:  syntax
Tokenizer
A small library for converting tokenized PHP source code into XML (and potentially other formats)
Stars: ✭ 4,770 (+505.33%)
Mutual labels:  tokenizer
Bert Bilstm Crf Ner
TensorFlow solution for the NER task using a BiLSTM-CRF model with Google BERT fine-tuning, plus private server services
Stars: ✭ 3,838 (+387.06%)
Mutual labels:  ner
Snips Nlu
Snips Python library to extract meaning from text
Stars: ✭ 3,583 (+354.7%)
Mutual labels:  ner
Awesome Persian Nlp Ir
Curated List of Persian Natural Language Processing and Information Retrieval Tools and Resources
Stars: ✭ 460 (-41.62%)
Mutual labels:  embeddings
Awesome 2vec
Curated list of 2vec-type embedding models
Stars: ✭ 784 (-0.51%)
Mutual labels:  embeddings