All Projects → open2ch-dialogue-corpus → Similar Projects or Alternatives

492 Open source projects that are alternatives of or similar to open2ch-dialogue-corpus

Dialogue-Corpus
No description or website provided.
Stars: ✭ 27 (-58.46%)
Mutual labels:  dialogue, corpus
dialogue-datasets
collect the open dialog corpus and some useful data processing utils.
Stars: ✭ 24 (-63.08%)
Mutual labels:  dialogue, corpus
Chinese Nlp Corpus
Collections of Chinese NLP corpus
Stars: ✭ 438 (+573.85%)
Mutual labels:  corpus, datasets
TV4Dialog
No description or website provided.
Stars: ✭ 33 (-49.23%)
Mutual labels:  dialogue, corpus
BSD
The Business Scene Dialogue corpus
Stars: ✭ 51 (-21.54%)
Mutual labels:  japanese, corpus
kanji-frequency
Kanji usage frequency data collected from various sources
Stars: ✭ 92 (+41.54%)
Mutual labels:  japanese, corpus
Cluecorpus2020
Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料
Stars: ✭ 278 (+327.69%)
Mutual labels:  corpus, datasets
Cluedatasetsearch
搜索所有中文NLP数据集,附常用英文NLP数据集
Stars: ✭ 2,112 (+3149.23%)
Mutual labels:  corpus, datasets
Probabilistic-RNN-DA-Classifier
Probabilistic Dialogue Act Classification for the Switchboard Corpus using an LSTM model
Stars: ✭ 22 (-66.15%)
Mutual labels:  dialogue, corpus
KWDLC
Kyoto University Web Document Leads Corpus
Stars: ✭ 64 (-1.54%)
Mutual labels:  japanese, corpus
Chatbot-Training-Corpus
总结了一些可以用作聊天机器人训练实作的文字语聊,包含中英文不同语言
Stars: ✭ 117 (+80%)
Mutual labels:  dialogue, corpus
compact-wine
No description or website provided.
Stars: ✭ 87 (+33.85%)
Mutual labels:  japanese
frostpunk mod
Frostpunk / Mod Tools / 非公式日本語化MODツール
Stars: ✭ 17 (-73.85%)
Mutual labels:  japanese
sample-ui-vue-pages
Bootstrap + Vue.js [ Scss / Babel ] (Multi-Page/SSR Model)
Stars: ✭ 20 (-69.23%)
Mutual labels:  japanese
geodaData
Data package for accessing GeoDa datasets using R
Stars: ✭ 15 (-76.92%)
Mutual labels:  datasets
nytwit
New York Times Word Innovation Types dataset
Stars: ✭ 21 (-67.69%)
Mutual labels:  corpus
mlx
Machine Learning eXchange (MLX). Data and AI Assets Catalog and Execution Engine
Stars: ✭ 132 (+103.08%)
Mutual labels:  datasets
Speech-Corpus-Collection
A Collection of Speech Corpus for ASR and TTS
Stars: ✭ 113 (+73.85%)
Mutual labels:  corpus
proiel-treebank
Official releases of the PROIEL treebank of ancient Indo-European languages
Stars: ✭ 30 (-53.85%)
Mutual labels:  corpus
rasa ch faq
用 rasa 实现 rasa demo 机器人,有一些惊奇的功能,faq,图谱,多轮等
Stars: ✭ 156 (+140%)
Mutual labels:  dialogue
delitos-caba
🚓 Crime dataset for the City of Buenos Aires, Argentina
Stars: ✭ 44 (-32.31%)
Mutual labels:  datasets
industrial-ml-datasets
A curated list of datasets, publically available for machine learning research in the area of manufacturing
Stars: ✭ 45 (-30.77%)
Mutual labels:  datasets
gum
Repository for the Georgetown University Multilayer Corpus (GUM)
Stars: ✭ 71 (+9.23%)
Mutual labels:  corpus
jmdict-kindle
Japanese - English dictionary for Kindle based on the JMdict / EDICT database
Stars: ✭ 151 (+132.31%)
Mutual labels:  japanese
OpenConvert
Text conversion tool (from e.g. Word, HTML, txt) to corpus formats TEI or FoLiA)
Stars: ✭ 20 (-69.23%)
Mutual labels:  corpus
DANeS
DANeS is an open-source E-newspaper dataset by collaboration between DATASET JSC (dataset.vn) and AIV Group (aivgroup.vn)
Stars: ✭ 64 (-1.54%)
Mutual labels:  corpus
tvsub
TVsub: DCU-Tencent Chinese-English Dialogue Corpus
Stars: ✭ 40 (-38.46%)
Mutual labels:  corpus
zkanji
Japanese language study suite and dictionary
Stars: ✭ 55 (-15.38%)
Mutual labels:  japanese
humanflow2
Official repository of Learning Multi-Human Optical Flow (IJCV 2019)
Stars: ✭ 37 (-43.08%)
Mutual labels:  datasets
Thirukkural-Tamil-Dataset
திருக்குறள் by திருவள்ளுவர்.
Stars: ✭ 44 (-32.31%)
Mutual labels:  datasets
CompBioDatasetsForMachineLearning
A Curated List of Computational Biology Datasets Suitable for Machine Learning
Stars: ✭ 90 (+38.46%)
Mutual labels:  datasets
biomechanics dataset
Information of public available data sets for biomechanics.
Stars: ✭ 31 (-52.31%)
Mutual labels:  datasets
data.world-py
Python package for data.world
Stars: ✭ 98 (+50.77%)
Mutual labels:  datasets
torchgeo
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
Stars: ✭ 1,125 (+1630.77%)
Mutual labels:  datasets
Kawazu
A C# library for converting Japanese sentence to Hiragana, Katakana or Romaji with furigana and okurigana modes supported. Inspired by project Kuroshiro.
Stars: ✭ 33 (-49.23%)
Mutual labels:  japanese
CHR
SIXray : A Large-scale Security Inspection X-ray Benchmark in CVPR 2019
Stars: ✭ 78 (+20%)
Mutual labels:  datasets
next-qrcode
React hooks for generating QRCode for your next React apps.
Stars: ✭ 87 (+33.85%)
Mutual labels:  japanese
Google-Playstore-Dataset
Google PlayStore App dataset. (2.3 million App Data) and 24 attributes
Stars: ✭ 27 (-58.46%)
Mutual labels:  datasets
clothing-detection-ecommerce-dataset
Clothing detection dataset
Stars: ✭ 43 (-33.85%)
Mutual labels:  datasets
japanese-word-handler
Better Japanese word handling on Visual Studio Code.
Stars: ✭ 32 (-50.77%)
Mutual labels:  japanese
lang-ja
Manage Japanese language files which distributed with vim.
Stars: ✭ 20 (-69.23%)
Mutual labels:  japanese
bugrepo
A collection of publicly available bug reports
Stars: ✭ 93 (+43.08%)
Mutual labels:  datasets
opensource-voice-tools
A repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-67.69%)
Mutual labels:  corpus
firestore-to-bigquery-export
NPM package for copying and converting Cloud Firestore data to BigQuery.
Stars: ✭ 26 (-60%)
Mutual labels:  datasets
git-rdm
A research data management plugin for the Git version control system.
Stars: ✭ 34 (-47.69%)
Mutual labels:  datasets
sample-boot-scala
Spring Boot + Scala + Skinny ORM
Stars: ✭ 14 (-78.46%)
Mutual labels:  japanese
trafilatura
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Stars: ✭ 711 (+993.85%)
Mutual labels:  corpus
kanjigrid
A web-app displaying the 2200 kanji characters taught in James Heisig's "Remembering the Kanji", 6th edition.
Stars: ✭ 37 (-43.08%)
Mutual labels:  japanese
rclc
Rich Context leaderboard competition, including the corpus and current SOTA for required tasks.
Stars: ✭ 20 (-69.23%)
Mutual labels:  corpus
dic-nico-intersection-pixiv
ニコニコ大百科とピクシブ百科事典の共通部分のIME辞書
Stars: ✭ 49 (-24.62%)
Mutual labels:  japanese
german-nouns
A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the data and parse compound words.
Stars: ✭ 101 (+55.38%)
Mutual labels:  corpus
ubuntu-desktop-jp
日本人向けのUbuntuデスクトップ環境のDockerイメージです。
Stars: ✭ 62 (-4.62%)
Mutual labels:  japanese
DiscEval
Discourse Based Evaluation of Language Understanding
Stars: ✭ 18 (-72.31%)
Mutual labels:  datasets
ocr2text
Convert a PDF via OCR to a TXT file in UTF-8 encoding
Stars: ✭ 90 (+38.46%)
Mutual labels:  corpus
kanjigrid
Fork of the Kanji Grid addon for Anki
Stars: ✭ 21 (-67.69%)
Mutual labels:  japanese
Kaku
画 - Japanese OCR Dictionary
Stars: ✭ 160 (+146.15%)
Mutual labels:  japanese
kanji-handwriting-swift
Kanji handwriting recognition for iOS using Zinnia.
Stars: ✭ 27 (-58.46%)
Mutual labels:  japanese
dagpi
Dagpi is a powerful and fast api that does image manipulation as well as serves datasets. It is fast and written in rust and python. Perfect for discord bots, social media apps, camera apps and more.
Stars: ✭ 25 (-61.54%)
Mutual labels:  datasets
scrapeOP
A python package for scraping oddsportal.com
Stars: ✭ 99 (+52.31%)
Mutual labels:  datasets
morghulis
No description or website provided.
Stars: ✭ 18 (-72.31%)
Mutual labels:  datasets
1-60 of 492 similar projects