GitPlanet
Top 7 corpora open source projects
Open-korean-corpora
Open Korean NLP Dataset Curation for the Users All Around the Globe
✭ 82
nlp
open-source
dataset
korean
corpora
curation
spanish-corpora
Unannotated Spanish 3 Billion Words Corpora
✭ 61
python
nlp
natural-language-processing
linguistics
spanish
corpora
spanish-language
huner
Named Entity Recognition for biomedical entities
✭ 44
python
shell
perl
Dockerfile
named-entity-recognition
neural-networks
corpora
ner
bionlp
CorpusLoaders.jl
A variety of loaders for various NLP corpora.
✭ 28
julia
nlp
corpora
kontext
An advanced, extensible web front-end for the Manatee-open corpus search engine
✭ 50
typescript
python
HTML
javascript
shell
PEG.js
user-interface
corpora
corpus-linguistics
corpus-tools
parallel-corpora-tools
Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.
✭ 35
PHP
shell
nlp
data-science
natural-language-processing
translation
machine
machine-translation
natural-language
neural-machine-translation
corpora
nmt
filtering
data-processing
neural
language-processing
cleaning
corpus-tools
CrossNER
CrossNER: Evaluating Cross-Domain Named Entity Recognition (AAAI-2021)
✭ 87
python
shell
dataset
named-entity-recognition
corpora
multi-domain
ner
cross-domain
sequence-labeling
domain-adaptation
low-resource
multi-domain-adaptation