All Projects → tshatrov → Ichiran

tshatrov / Ichiran

Licence: mit
Linguistic tools for texts in Japanese language

Programming Languages

language
365 projects
grammar
57 projects

Projects that are alternatives of or similar to Ichiran

unofficial-jisho-api
Encapsulates the official Jisho.org API and also provides kanji, example, and stroke diagram search.
Stars: ✭ 88 (-26.67%)
Mutual labels:  dictionary, japanese, japanese-language
jmdict-simplified
JMdict, JMnedict, Kanjidic, KRADFILE/RADKFILE in JSON format
Stars: ✭ 96 (-20%)
Mutual labels:  dictionary, japanese, japanese-language
Yomichan
Japanese pop-up dictionary extension for Chrome and Firefox.
Stars: ✭ 464 (+286.67%)
Mutual labels:  japanese, japanese-language, dictionary
kanji-web-app
Angular.js kanji web application
Stars: ✭ 45 (-62.5%)
Mutual labels:  japanese, japanese-language
Jotoba
A free online, self-hostable, multilang Japanese dictionary.
Stars: ✭ 87 (-27.5%)
Mutual labels:  dictionary, japanese
jiten
jiten - japanese android/cli/web dictionary based on jmdict/kanjidic — 日本語 辞典 和英辞典 漢英字典 和独辞典 和蘭辞典
Stars: ✭ 64 (-46.67%)
Mutual labels:  dictionary, japanese
ra-language-japanese
Japanese messages for react-admin
Stars: ✭ 22 (-81.67%)
Mutual labels:  japanese, japanese-language
japanese-pitch-accent-resources
Trying to consolidate japanese phonetic, and in particular pitch accent resources into one list
Stars: ✭ 64 (-46.67%)
Mutual labels:  japanese, japanese-language
KanjiRecognitionDictionary
Perfect for those who forgets kanji pronunciation
Stars: ✭ 14 (-88.33%)
Mutual labels:  japanese, japanese-language
Topokanji
Topologically ordered lists of kanji for effective learning
Stars: ✭ 108 (-10%)
Mutual labels:  japanese, japanese-language
Google Ime Dictionary
日英変換・英語略語展開のための IME 追加辞書 📙 日本語から英語への和英変換や英語略語の展開を Google 日本語入力や ATOK などで可能にする IME 拡張辞書です
Stars: ✭ 30 (-75%)
Mutual labels:  japanese, dictionary
limelight
A php Japanese language text analyzer and parser.
Stars: ✭ 76 (-36.67%)
Mutual labels:  japanese, japanese-language
Domino-English-Translation
🌏 Let's translate Domino, a Japanese MIDI editor!
Stars: ✭ 29 (-75.83%)
Mutual labels:  japanese, japanese-language
Hibi
[No Active Development] An Android app for learning Japanese by keeping a journal.
Stars: ✭ 37 (-69.17%)
Mutual labels:  japanese, japanese-language
kanji-frequency
Kanji usage frequency data collected from various sources
Stars: ✭ 92 (-23.33%)
Mutual labels:  japanese, japanese-language
Kagome
Self-contained Japanese Morphological Analyzer written in pure Go
Stars: ✭ 554 (+361.67%)
Mutual labels:  japanese, japanese-language
Memorize
🚀 Japanese-English-Mongolian dictionary. It lets you find words, kanji and more quickly and easily
Stars: ✭ 72 (-40%)
Mutual labels:  japanese, dictionary
google-news-scraper
Google News Scraper for languages like Japanese, Chinese... [VPN Support]
Stars: ✭ 88 (-26.67%)
Mutual labels:  japanese, japanese-language
kotoba
A Discord bot for helping with learning Japanese.
Stars: ✭ 118 (-1.67%)
Mutual labels:  japanese, japanese-language
The Tab Of Words
A minimal Chrome / Firefox extension to help you learn Japanese words in each new tab.
Stars: ✭ 94 (-21.67%)
Mutual labels:  japanese, japanese-language

Ichiran

Ichiran is a collection of tools for working with text in Japanese language. It contains experimental segmenting and romanization algorithms and uses open source JMdictDB dictionary database to display meanings of words.

The web interface is under development right now. You can try it at ichi.moe.

Installation

!!!NEW!!! There's now a blog post which contains detailed instructions how to get Ichiran running on Linux and Windows. It also describes how to use the new ichiran-cli command line interface!

  1. Download JMDict data from here. If you want to initialize database from scratch download JMDict, and optionally kanjidic2.xml to use ichiran/kanji functionality.
  2. Create a settings.lisp file based on the provided settings.lisp.template file with the correct paths to the abovementioned files and the database connection parameters.
  3. The code can be loaded as a regular ASDF system. Use quicklisp to easily install all the dependencies.
    • Easy mode: Use database dump from the release page to create a suitable database. Make sure settings.lisp contains the correct connection parameters. Use (ichiran/maintenance:add-errata) to make database up to date.
    • Hard mode: Use (ichiran/maintenance:full-init) to completely initialize the database. Use (ichiran/maintenance:load-jmdict) followed by (ichiran/maintenance:load-best-readings) to initialize only ichiran/dict and not ichiran/kanji. Either way, this will take a few hours or so.
  4. Use (ichiran/test:run-all-tests) to check that the installation satisfies the tests.
  5. Before using any word segmenting functionality, run (ichiran/dict:init-suffixes t) to create a suffix cache, which will improve the quality of segmentation.

Documentation

There is no documentation yet. Any API is considered unstable at this point.

The basic functionality is (ichiran:romanize "一覧は最高だぞ" :with-info t), but feel free to explore further.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].