222 open source projects that are alternatives to or similar to Fugashi

Yomichan
Japanese pop-up dictionary extension for Chrome and Firefox.
Stars: ✭ 464 (+271.2%)
Mutual labels:  japanese
Toiro
A comparison tool of Japanese tokenizers
Stars: ✭ 95 (-24%)
Mutual labels:  japanese
Open Korean Text
Open Korean Text Processor - An Open-source Korean Text Processor
Stars: ✭ 438 (+250.4%)
Mutual labels:  tokenizer
String Calc
PHP calculator library for mathematical terms (expressions) passed as strings
Stars: ✭ 60 (-52%)
Mutual labels:  tokenizer
Ekphrasis
Ekphrasis is a text processing tool geared towards text from social networks such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags), and spell correction, using word statistics from two large corpora (English Wikipedia and Twitter, 330 million English tweets).
Stars: ✭ 433 (+246.4%)
Mutual labels:  tokenizer
Textlint Rule Preset Jtf Style
JTF Japanese Standard Style Guide for textlint.
Stars: ✭ 112 (-10.4%)
Mutual labels:  japanese
Php Parser
🌿 NodeJS PHP Parser - extract AST or tokens (PHP5 and PHP7)
Stars: ✭ 400 (+220%)
Mutual labels:  tokenizer
Oxygennotincluded Japanese
Japanese localization of Oxygen Not Included
Stars: ✭ 54 (-56.8%)
Mutual labels:  japanese
Jflex
The fast scanner generator for Java™ with full Unicode support
Stars: ✭ 380 (+204%)
Mutual labels:  tokenizer
Jconv
Pure-JavaScript converter for Japanese character encodings.
Stars: ✭ 91 (-27.2%)
Mutual labels:  japanese
Lexmachine
Lex machinery for Go.
Stars: ✭ 335 (+168%)
Mutual labels:  tokenizer
Vanilla Autokana
A Vanilla-JavaScript library to complete furigana automatically.
Stars: ✭ 48 (-61.6%)
Mutual labels:  japanese
Sentences
A multilingual command line sentence tokenizer in Golang
Stars: ✭ 293 (+134.4%)
Mutual labels:  tokenizer
Gse
Efficient multilingual NLP and text segmentation in Go; supports English, Chinese, Japanese, and other languages.
Stars: ✭ 1,695 (+1256%)
Mutual labels:  japanese
Yakuhanjp
Yakumono-Hankaku Only Web Fonts
Stars: ✭ 288 (+130.4%)
Mutual labels:  japanese
Py Nltools
A collection of basic python modules for spoken natural language processing
Stars: ✭ 46 (-63.2%)
Mutual labels:  tokenizer
Cheatsheet Of Ui With Fuzzy Behaviors
Cheat sheet of user interfaces whose behavior or specifications are ambiguous
Stars: ✭ 89 (-28.8%)
Mutual labels:  japanese
pascal-interpreter
A simple interpreter for a large subset of the Pascal language, written for educational purposes
Stars: ✭ 21 (-83.2%)
Mutual labels:  tokenizer
Owasp Masvs
The Mobile Application Security Verification Standard (MASVS) is a standard for mobile app security.
Stars: ✭ 1,030 (+724%)
Mutual labels:  japanese
textlint-rule-ja-no-abusage
textlint rule that checks for common misuses of Japanese
Stars: ✭ 21 (-83.2%)
Mutual labels:  japanese
Topokanji
Topologically ordered lists of kanji for effective learning
Stars: ✭ 108 (-13.6%)
Mutual labels:  japanese
ArabicProcessingCog
A Python package that does stemming, tokenization, sentence breaking, segmentation, normalization, and POS tagging for the Arabic language.
Stars: ✭ 19 (-84.8%)
Mutual labels:  tokenizer
Adobe Japan1
The Adobe-Japan1-7 Character Collection
Stars: ✭ 38 (-69.6%)
Mutual labels:  japanese
Zipangu
A library for Japan-specific compatibility.
Stars: ✭ 27 (-78.4%)
Mutual labels:  japanese
Djurl
Simple yet helpful library for writing Django URLs in an easy, short, and intuitive way.
Stars: ✭ 85 (-32%)
Mutual labels:  tokenizer
sembei
🍘 Word embeddings without word segmentation 🍘
Stars: ✭ 14 (-88.8%)
Mutual labels:  japanese
Nlp Js Tools French
POS tagger, lemmatizer, and stemmer for the French language in JavaScript
Stars: ✭ 32 (-74.4%)
Mutual labels:  tokenizer
unidic-py
UniDic packaged for installation via pip.
Stars: ✭ 17 (-86.4%)
Mutual labels:  japanese
Chevrotain
Parser Building Toolkit for JavaScript
Stars: ✭ 1,795 (+1336%)
Mutual labels:  tokenizer
japanese-pitch-accent-resources
An attempt to consolidate Japanese phonetic resources, in particular pitch accent resources, into one list
Stars: ✭ 64 (-48.8%)
Mutual labels:  japanese
Omnicat Bayes
Naive Bayes text classification implementation as an OmniCat classifier strategy. (#ruby #naivebayes)
Stars: ✭ 30 (-76%)
Mutual labels:  tokenizer
sample-ui-react
Material-UI + React.js + Redux [ Pug / Scss / Babel ]
Stars: ✭ 15 (-88%)
Mutual labels:  japanese
Sentence Splitter
Text-to-sentence splitter using the heuristic algorithm by Philipp Koehn and Josh Schroeder.
Stars: ✭ 82 (-34.4%)
Mutual labels:  tokenizer
PaddleTokenizer
A Chinese tokenizer based on deep neural networks, implemented with PaddlePaddle
Stars: ✭ 14 (-88.8%)
Mutual labels:  tokenizer
Lfuzzer
Fuzzing Parsers with Tokens
Stars: ✭ 28 (-77.6%)
Mutual labels:  tokenizer
activitypub
Unofficial personal Japanese translation of ActivityPub
Stars: ✭ 23 (-81.6%)
Mutual labels:  japanese
Languagepod101 Scraper
Python scraper for Language Pods such as Japanesepod101.com 👹 🗾 🍣 Compatible with Japanese, Chinese, French, German, Italian, Korean, Portuguese, Russian, Spanish and many more! ✨
Stars: ✭ 104 (-16.8%)
Mutual labels:  japanese
wana kana rust
Utility library for checking and converting between Japanese characters - Hiragana, Katakana - and Romaji
Stars: ✭ 46 (-63.2%)
Mutual labels:  japanese
Emby.plugins.javscraper
A Japanese movie scraper plugin for Emby/Jellyfin that can fetch film information from certain websites.
Stars: ✭ 864 (+591.2%)
Mutual labels:  japanese
kanji-web-app
Angular.js kanji web application
Stars: ✭ 45 (-64%)
Mutual labels:  japanese
Momdo.github.io
Japanese translation of the W3C/WHATWG specification(s).
Stars: ✭ 81 (-35.2%)
Mutual labels:  japanese
bredon
A modern CSS value compiler in JavaScript
Stars: ✭ 39 (-68.8%)
Mutual labels:  tokenizer
React Input Tags
React component for tagging inputs.
Stars: ✭ 10 (-92%)
Mutual labels:  tokenizer
KWDLC
Kyoto University Web Document Leads Corpus
Stars: ✭ 64 (-48.8%)
Mutual labels:  japanese
Japanesetokenizers
Aims to make using JapaneseTokenizer as easy as possible
Stars: ✭ 120 (-4%)
Mutual labels:  tokenizer
mystem-scala
Morphological analyzer `mystem` (Russian language) wrapper for JVM languages
Stars: ✭ 21 (-83.2%)
Mutual labels:  tokenizer
Snl Compiler
SNL (Small Nested Language) compiler. Maven, jUnit, tokenizer, lexer, syntax parser. Compiler principles, lexical analysis, syntax analysis.
Stars: ✭ 19 (-84.8%)
Mutual labels:  tokenizer
kanji
Haskell suite for determining what 級 (level) of the 漢字検定 (national Kanji exam) a given Kanji belongs to.
Stars: ✭ 19 (-84.8%)
Mutual labels:  japanese
Risingstars2016
A complete overview of the JavaScript landscape in 2016: trends about front-end and node.js frameworks, tooling... Available in English, Japanese and Chinese.
Stars: ✭ 75 (-40%)
Mutual labels:  japanese
Hibi
[No Active Development] An Android app for learning Japanese by keeping a journal.
Stars: ✭ 37 (-70.4%)
Mutual labels:  japanese
Natasha
Solves basic Russian NLP tasks, API for lower level Natasha projects
Stars: ✭ 788 (+530.4%)
Mutual labels:  tokenizer
ilmulti
Tooling to play around with multilingual machine translation for Indian Languages.
Stars: ✭ 19 (-84.8%)
Mutual labels:  tokenizer
Source Han Code Jp
Source Han Code JP | 源ノ角ゴシック Code
Stars: ✭ 1,362 (+989.6%)
Mutual labels:  japanese
Mustard
🌭 Mustard is a Swift library for tokenizing strings when splitting by whitespace doesn't cut it.
Stars: ✭ 689 (+451.2%)
Mutual labels:  tokenizer
Cutlet
Japanese to romaji converter in Python
Stars: ✭ 124 (-0.8%)
Mutual labels:  japanese
Syntok
Text tokenization and sentence segmentation (segtok v2)
Stars: ✭ 123 (-1.6%)
Mutual labels:  tokenizer
Tokenizer
Source code tokenizer
Stars: ✭ 119 (-4.8%)
Mutual labels:  tokenizer
Nodejs Ja
Node.js Japanese localization
Stars: ✭ 98 (-21.6%)
Mutual labels:  japanese
Cols Agent Tasks
Colin's ALM Corner Custom Build Tasks
Stars: ✭ 70 (-44%)
Mutual labels:  tokenizer
Soynlp
A Python library for Korean natural language processing. Provides word extraction, tokenization, part-of-speech tagging, and preprocessing.
Stars: ✭ 613 (+390.4%)
Mutual labels:  tokenizer