All Projects → thaigov-corpus → Similar Projects or Alternatives

127 Open source projects that are alternatives of or similar to thaigov-corpus

🇹🇭 Thai address input for Vue.

Stars: ✭ 44 (+131.58%)

Mutual labels: thailand, thai

เเอปเเปลงพ๊ษ๊ไธญเป็นภ๊ษ๊สก๊อบ์ย (รุ่นใหฒ่ล่๊ษุฎ) (Plain English : One-way encryption algorithm for Thai language, which only Thai people could understand)

Stars: ✭ 52 (+173.68%)

Mutual labels: thai-language, thai

vue-thailand-address-autocomplete

🇹🇭 Autocomplete ที่อยู่ในประเทศไทย

Stars: ✭ 49 (+157.89%)

Mutual labels: thailand, thai

Awesome-Thai-Library

แหล่งรวม library ไทยๆ เกี่ยวกับ "ประเทศไทย" และ "ภาษาไทย" - Delightful Thai packages and resources

Stars: ✭ 37 (+94.74%)

Mutual labels: thailand, thai

thai-language

computer tools for thai language

Stars: ✭ 20 (+5.26%)

Mutual labels: corpus, thai-language

Weibo terminater

Final Weibo Crawler Scrap Anything From Weibo, comments, weibo contents, followers, anything. The Terminator

Stars: ✭ 2,295 (+11978.95%)

Mutual labels: corpus

Chatbot-Training-Corpus

总结了一些可以用作聊天机器人训练实作的文字语聊，包含中英文不同语言

Stars: ✭ 117 (+515.79%)

Mutual labels: corpus

Indonesian Nlp Resources

data resource untuk NLP bahasa indonesia

Stars: ✭ 143 (+652.63%)

Mutual labels: corpus

Gossiping Chinese Corpus

PTT 八卦版問答中文語料

Stars: ✭ 137 (+621.05%)

Mutual labels: corpus

nytwit

New York Times Word Innovation Types dataset

Stars: ✭ 21 (+10.53%)

Mutual labels: corpus

proiel-treebank

Official releases of the PROIEL treebank of ancient Indo-European languages

Stars: ✭ 30 (+57.89%)

Mutual labels: corpus

Cluedatasetsearch

搜索所有中文NLP数据集，附常用英文NLP数据集

Stars: ✭ 2,112 (+11015.79%)

Mutual labels: corpus

Chinese Names Corpus

中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。

Stars: ✭ 3,053 (+15968.42%)

Mutual labels: corpus

trafilatura

Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments

Stars: ✭ 711 (+3642.11%)

Mutual labels: corpus

Efaqa Corpus Zh

❤️Emotional First Aid Dataset, 心理咨询问答、聊天机器人语料库

Stars: ✭ 170 (+794.74%)

Mutual labels: corpus

gum

Repository for the Georgetown University Multilayer Corpus (GUM)

Stars: ✭ 71 (+273.68%)

Mutual labels: corpus

Clue

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Stars: ✭ 2,425 (+12663.16%)

Mutual labels: corpus

thai-date

Display date in Thai use same PHP date() and strftime() function attributes.

Stars: ✭ 14 (-26.32%)

Mutual labels: thai

Awesome Chatbot

Awesome Chatbot Projects,Corpus,Papers,Tutorials.Chinese Chatbot =>:

Stars: ✭ 1,785 (+9294.74%)

Mutual labels: corpus

textbox

Text collections made available by the CLiGS group.

Stars: ✭ 19 (+0%)

Mutual labels: corpus

Awesome Hungarian Nlp

A curated list of NLP resources for Hungarian

Stars: ✭ 121 (+536.84%)

Mutual labels: corpus

Probabilistic-RNN-DA-Classifier

Probabilistic Dialogue Act Classification for the Switchboard Corpus using an LSTM model

Stars: ✭ 22 (+15.79%)

Mutual labels: corpus

Colibri Core

Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` whi ch allows you to build, view, manipulate and query pattern models.

Stars: ✭ 112 (+489.47%)

Mutual labels: corpus

Ua Gec

UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language

Stars: ✭ 108 (+468.42%)

Mutual labels: corpus

thai-data

รวมข้อมูล ตำบล อำเภอ และ จังหวัด ในประเทศไทย (77 จังหวัด) อ้างอิงตาม รหัสไปรษณีย์ไทย โดยที่ไม่ใช้ Server side ได้รับแรงบันดาลใจจาก เราไม่ทิ้งกัน.com

Stars: ✭ 20 (+5.26%)

Mutual labels: thailand

rclc

Rich Context leaderboard competition, including the corpus and current SOTA for required tasks.

Stars: ✭ 20 (+5.26%)

Mutual labels: corpus

Pubmed Rct

PubMed 200k RCT dataset: a large dataset for sequential sentence classification.

Stars: ✭ 101 (+431.58%)

Mutual labels: corpus

Dialogue-Corpus

No description or website provided.

Stars: ✭ 27 (+42.11%)

Mutual labels: corpus

Awesome Deeplearning Resources

Deep Learning and deep reinforcement learning research papers and some codes

Stars: ✭ 2,483 (+12968.42%)

Mutual labels: corpus

howlonguntilprayuthleaves.com

นับเวลาถอยหลังถึงวันที่พลเอกประยุทธ์ จันทร์โอชา หมดวาระการเป็นนายกรัฐมนตรี

Stars: ✭ 29 (+52.63%)

Mutual labels: thailand

Nlvr

Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with the truth-value of the sentence.

Stars: ✭ 192 (+910.53%)

Mutual labels: corpus

guide-to-becoming

แหล่งรวบรวมข้อมูลสำหรับคนที่อยากจะพัฒนาตัวเองในด้านต่างๆจากผู้เริ่มต้นสู่ระดับเทพ

Stars: ✭ 23 (+21.05%)

Mutual labels: thailand

Nlp bahasa resources

A Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia

Stars: ✭ 158 (+731.58%)

Mutual labels: corpus

torpleng

การต่อเพลงไทยที่ยาวที่สุดในประวัติศาสตร์

Stars: ✭ 39 (+105.26%)

Mutual labels: thai

Wp2txt

WP2TXT extracts plain text data from Wikipedia dump file (encoded in XML/compressed with Bzip2) stripping all the MediaWiki markups and other metadata.

Stars: ✭ 145 (+663.16%)

Mutual labels: corpus

tvsub

TVsub: DCU-Tencent Chinese-English Dialogue Corpus

Stars: ✭ 40 (+110.53%)

Mutual labels: corpus

Prosody

Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text

Stars: ✭ 139 (+631.58%)

Mutual labels: corpus

.dev

รวมความรู้ด้าน Coding เป็นภาษาไทย

Stars: ✭ 20 (+5.26%)

Mutual labels: thai

Code Docstring Corpus

Preprocessed Python functions and docstrings for automated code documentation (code2doc) and automated code generation (doc2code) tasks.

Stars: ✭ 137 (+621.05%)

Mutual labels: corpus

Speech-Corpus-Collection

A Collection of Speech Corpus for ASR and TTS

Stars: ✭ 113 (+494.74%)

Mutual labels: corpus

Khcoder

KH Coder: for Quantitative Content Analysis or Text Mining

Stars: ✭ 126 (+563.16%)

Mutual labels: corpus

BSD

The Business Scene Dialogue corpus

Stars: ✭ 51 (+168.42%)

Mutual labels: corpus

Dialog corpus

用于训练中英文对话系统的语料库 Datasets for Training Chatbot System

Stars: ✭ 1,662 (+8647.37%)

Mutual labels: corpus

DANeS

DANeS is an open-source E-newspaper dataset by collaboration between DATASET JSC (dataset.vn) and AIV Group (aivgroup.vn)

Stars: ✭ 64 (+236.84%)

Mutual labels: corpus

Sejong Corpus

Korean sejong corpus download and simple analysis

Stars: ✭ 116 (+510.53%)

Mutual labels: corpus

ocr2text

Convert a PDF via OCR to a TXT file in UTF-8 encoding

Stars: ✭ 90 (+373.68%)

Mutual labels: corpus

Datasets

Poetry-related datasets developed by THUAIPoet (Jiuge) group.

Stars: ✭ 111 (+484.21%)

Mutual labels: corpus

Chi Corpus

迟先生语料库

Stars: ✭ 96 (+405.26%)

Mutual labels: corpus

Pansori

Tools for ASR Corpus Generation from Online Video

Stars: ✭ 106 (+457.89%)

Mutual labels: corpus

malay-dataset

Text corpus for Bahasa Malaysia, https://malaya.readthedocs.io/en/latest/Dataset.html

Stars: ✭ 189 (+894.74%)

Mutual labels: corpus

Lexicon Thai

คลังศัพท์ภาษาไทย

Stars: ✭ 96 (+405.26%)

Mutual labels: corpus

german-nouns

A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the data and parse compound words.

Stars: ✭ 101 (+431.58%)

Mutual labels: corpus

OpenConvert

Text conversion tool (from e.g. Word, HTML, txt) to corpus formats TEI or FoLiA)

Stars: ✭ 20 (+5.26%)

Mutual labels: corpus

Pyclue

Python toolkit for Chinese Language Understanding(CLUE) Evaluation benchmark

Stars: ✭ 91 (+378.95%)

Mutual labels: corpus

covidthailand

Thailand Covid testing and case data gathered and combined from various sources for others to download or view