679 open source projects that are alternatives to or similar to Chinese Names Corpus

Company Names Corpus
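Company names corpus. Organization names corpus. Company short names, abbreviations, brand terms, and enterprise names. Usable for Chinese word segmentation and organization named-entity recognition.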
Stars: ✭ 868 (-71.57%)
Mutual labels:  dict, dataset, corpus, ner
Medical-Names-Corpus
Medical corpus. Medical institution names corpus. Standard drug codes (药品本位码).
Stars: ✭ 26 (-99.15%)
Mutual labels:  corpus, dataset, dict
Species-Names-Corpus
Species names corpus. Plant names and animal names.
Stars: ✭ 23 (-99.25%)
Mutual labels:  corpus, dataset, dict
Cluedatasetsearch
Search all Chinese NLP datasets, with commonly used English NLP datasets included
Stars: ✭ 2,112 (-30.82%)
Mutual labels:  corpus, ner
Nlp chinese corpus
Large Scale Chinese Corpus for NLP
Stars: ✭ 6,656 (+118.02%)
Mutual labels:  dataset, corpus
Dialog corpus
Corpora for training Chinese and English dialogue systems (Datasets for Training Chatbot System)
Stars: ✭ 1,662 (-45.56%)
Mutual labels:  dataset, corpus
Indonesian Nlp Resources
Data resources for Indonesian-language NLP
Stars: ✭ 143 (-95.32%)
Mutual labels:  dataset, corpus
Ua Gec
UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language
Stars: ✭ 108 (-96.46%)
Mutual labels:  dataset, corpus
Awesome Hungarian Nlp
A curated list of NLP resources for Hungarian
Stars: ✭ 121 (-96.04%)
Mutual labels:  dataset, corpus
Cluener2020
CLUENER2020: Chinese Fine-Grained Named Entity Recognition
Stars: ✭ 689 (-77.43%)
Mutual labels:  dataset, ner
Bond
BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision
Stars: ✭ 96 (-96.86%)
Mutual labels:  dataset, ner
Fakenewscorpus
A dataset of millions of news articles scraped from a curated list of data sources.
Stars: ✭ 255 (-91.65%)
Mutual labels:  dataset, corpus
Gossiping Chinese Corpus
Chinese question-answer corpus from the PTT Gossiping board
Stars: ✭ 137 (-95.51%)
Mutual labels:  dataset, corpus
Insuranceqa Corpus Zh
🚁 Insurance industry corpus; chatbot
Stars: ✭ 821 (-73.11%)
Mutual labels:  dataset, corpus
Coarij
Corpus of Annual Reports in Japan
Stars: ✭ 55 (-98.2%)
Mutual labels:  dataset, corpus
Prosody
Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
Stars: ✭ 139 (-95.45%)
Mutual labels:  dataset, corpus
Cluepretrainedmodels
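A collection of high-quality Chinese pre-trained models: state-of-the-art large models, fastest small models, and specialized similarity models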
Stars: ✭ 493 (-83.85%)
Mutual labels:  dataset, corpus
Dataset List
Lists of text corpora and more (mainly Japanese)
Stars: ✭ 84 (-97.25%)
Mutual labels:  dataset, corpus
Clue
Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Stars: ✭ 2,425 (-20.57%)
Mutual labels:  dataset, corpus
Nlp bahasa resources
A Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia
Stars: ✭ 158 (-94.82%)
Mutual labels:  dataset, corpus
Collection
Collection Data for Cooper Hewitt, Smithsonian Design Museum
Stars: ✭ 214 (-92.99%)
Mutual labels:  dataset
Covid Chestxray Dataset
We are building an open database of COVID-19 cases with chest X-ray or CT images.
Stars: ✭ 2,759 (-9.63%)
Mutual labels:  dataset
Datatable
An in-memory table for Go
Stars: ✭ 215 (-92.96%)
Mutual labels:  dataset
Dialogrpt
EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"
Stars: ✭ 216 (-92.92%)
Mutual labels:  dataset
Dict
Chinese and English translation tools for the command line
Stars: ✭ 243 (-92.04%)
Mutual labels:  dict
University1652 Baseline
ACM Multimedia 2020. University-1652: A Multi-view Multi-source Benchmark for Drone-based Geo-localization 🚁 annotates 1652 buildings in 72 universities around the world.
Stars: ✭ 232 (-92.4%)
Mutual labels:  dataset
Ava downloader
⏬ Download AVA dataset (A Large-Scale Database for Aesthetic Visual Analysis)
Stars: ✭ 214 (-92.99%)
Mutual labels:  dataset
Ner Datasets
Datasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English)
Stars: ✭ 220 (-92.79%)
Mutual labels:  ner
Covid 19 Repo Data
Data archive of identifiable COVID-19 related public projects on GitHub
Stars: ✭ 236 (-92.27%)
Mutual labels:  dataset
Bccd dataset
BCCD (Blood Cell Count and Detection) Dataset is a small-scale dataset for blood cells detection.
Stars: ✭ 216 (-92.92%)
Mutual labels:  dataset
Cocostuff10k
The official homepage of the (outdated) COCO-Stuff 10K dataset.
Stars: ✭ 248 (-91.88%)
Mutual labels:  dataset
Dataset Serialize
JSON to DataSet and DataSet to JSON converter for Delphi and Lazarus (FPC)
Stars: ✭ 213 (-93.02%)
Mutual labels:  dataset
Img2poem
Stars: ✭ 238 (-92.2%)
Mutual labels:  dataset
Short Jokes Dataset
Python scripts for building 'Short Jokes' dataset, featured on Kaggle
Stars: ✭ 215 (-92.96%)
Mutual labels:  dataset
Text
Data loaders and abstractions for text and NLP
Stars: ✭ 2,915 (-4.52%)
Mutual labels:  dataset
Awesome Deeplearning Resources
Deep learning and deep reinforcement learning research papers and some code
Stars: ✭ 2,483 (-18.67%)
Mutual labels:  corpus
Datalad
Keep code, data, containers under control with git and git-annex
Stars: ✭ 234 (-92.34%)
Mutual labels:  dataset
Spacy Lookup
Named Entity Recognition based on dictionaries
Stars: ✭ 212 (-93.06%)
Mutual labels:  ner
Pottery
Redis for humans. 🌎🌍🌏
Stars: ✭ 204 (-93.32%)
Mutual labels:  dict
Taco
🌮 Trash Annotations in Context Dataset Toolkit
Stars: ✭ 243 (-92.04%)
Mutual labels:  dataset
Datasets
source{d} datasets ("big code") for source code analysis and machine learning on source code
Stars: ✭ 231 (-92.43%)
Mutual labels:  dataset
Pynasa
Stars: ✭ 212 (-93.06%)
Mutual labels:  dataset
Omnianomaly
KDD 2019: Robust Anomaly Detection for Multivariate Time Series through Stochastic Recurrent Neural Network
Stars: ✭ 208 (-93.19%)
Mutual labels:  dataset
Structured3d
[ECCV'20] Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling
Stars: ✭ 224 (-92.66%)
Mutual labels:  dataset
Dynamic Training Bench
Simplify the training and tuning of Tensorflow models
Stars: ✭ 210 (-93.12%)
Mutual labels:  dataset
Charlatan
Create fake data in R
Stars: ✭ 209 (-93.15%)
Mutual labels:  dataset
Cities.json
Cities of the world in JSON, based on GeoNames Gazetteer
Stars: ✭ 251 (-91.78%)
Mutual labels:  dataset
Recommendersystem Dataset
This repository contains some datasets that I have collected for recommender systems.
Stars: ✭ 249 (-91.84%)
Mutual labels:  dataset
Retriever
Quickly download, clean up, and install public datasets into a database management system
Stars: ✭ 241 (-92.11%)
Mutual labels:  dataset
Webstruct
NER toolkit for HTML data
Stars: ✭ 230 (-92.47%)
Mutual labels:  ner
Mini Imagenet Tools
Tools for generating mini-ImageNet dataset and processing batches
Stars: ✭ 209 (-93.15%)
Mutual labels:  dataset
Python Benedict
dict subclass with keylist/keypath support, I/O shortcuts (base64, csv, json, pickle, plist, query-string, toml, xml, yaml) and many utilities. 📘
Stars: ✭ 204 (-93.32%)
Mutual labels:  dict
Weatherbench
A benchmark dataset for data-driven weather forecasting
Stars: ✭ 227 (-92.56%)
Mutual labels:  dataset
Computervisiondatasets
Stars: ✭ 207 (-93.22%)
Mutual labels:  dataset
Covid19za
Coronavirus COVID-19 (2019-nCoV) Data Repository and Dashboard for South Africa
Stars: ✭ 208 (-93.19%)
Mutual labels:  dataset
Malaya
Natural Language Toolkit for Bahasa Malaysia, https://malaya.readthedocs.io/
Stars: ✭ 239 (-92.17%)
Mutual labels:  ner
Stocknet Dataset
A comprehensive dataset for stock movement prediction from tweets and historical stock prices.
Stars: ✭ 228 (-92.53%)
Mutual labels:  dataset
Split Folders
🗂 Split folders with files (i.e. images) into training, validation and test (dataset) folders
Stars: ✭ 203 (-93.35%)
Mutual labels:  dataset
Tech.ml.dataset
A Clojure high performance data processing system
Stars: ✭ 205 (-93.29%)
Mutual labels:  dataset
Vehicle reid Collection
🚗 A collection of vehicle re-ID papers and datasets. 🚗
Stars: ✭ 225 (-92.63%)
Mutual labels:  dataset