thaigov-corpusโครงการเก็บรวบรวมข่าวสารจากเว็บไซต์รัฐบาลไทย
Stars: ✭ 19 (-5%)
Mutual labels: corpus, thai-language
mev-corpusMEV Data Corpus
Stars: ✭ 77 (+285%)
Mutual labels: corpus
egret-wenda-corpusA Public Corpus for Machine Learning
Stars: ✭ 41 (+105%)
Mutual labels: corpus
kanji-frequencyKanji usage frequency data collected from various sources
Stars: ✭ 92 (+360%)
Mutual labels: corpus
pdf-corpusPython script to quickly create hand-crafted PDF files
Stars: ✭ 17 (-15%)
Mutual labels: corpus
When-in-RomeA meta-corpus of functional harmonic analysis.
Stars: ✭ 35 (+75%)
Mutual labels: corpus
cljs-corpusA greppable archive of ClojureScript code
Stars: ✭ 37 (+85%)
Mutual labels: corpus
PoetryCorpusПоэтический корпус русского языка
Stars: ✭ 40 (+100%)
Mutual labels: corpus
toSkoyเเอปเเปลงพ๊ษ๊ไธญเป็นภ๊ษ๊สก๊อบ์ย (รุ่นใหฒ่ล่๊ษุฎ) (Plain English : One-way encryption algorithm for Thai language, which only Thai people could understand)
Stars: ✭ 52 (+160%)
Mutual labels: thai-language
jrte-corpusJapanese Realistic Textual Entailment Corpus (NLP 2020, LREC 2020)
Stars: ✭ 66 (+230%)
Mutual labels: corpus
lucene-geo-gazetteerUses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them.
Stars: ✭ 34 (+70%)
Mutual labels: opennlp
TV4DialogNo description or website provided.
Stars: ✭ 33 (+65%)
Mutual labels: corpus
bible-corpusA multilingual parallel corpus created from translations of the Bible.
Stars: ✭ 115 (+475%)
Mutual labels: corpus
BSDThe Business Scene Dialogue corpus
Stars: ✭ 51 (+155%)
Mutual labels: corpus
CBLUE中文医疗信息处理基准CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark
Stars: ✭ 379 (+1795%)
Mutual labels: corpus
KWDLCKyoto University Web Document Leads Corpus
Stars: ✭ 64 (+220%)
Mutual labels: corpus
turing✨ 🧬 Turing AI - Semantic Navigation, Chatbot using Search Engine and Many NLP Vendors.
Stars: ✭ 30 (+50%)
Mutual labels: opennlp