All Projects → Company Names Corpus → Similar Projects or Alternatives

674 Open source projects that are alternatives of or similar to Company Names Corpus

Chinese Names Corpus
中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。
Stars: ✭ 3,053 (+251.73%)
Mutual labels:  dict, dataset, corpus, ner
Species-Names-Corpus
物种名称语料库。植物名,动物名。
Stars: ✭ 23 (-97.35%)
Mutual labels:  corpus, dataset, dict
Medical-Names-Corpus
医疗语料库。医疗机构名语料库。药品本位码。
Stars: ✭ 26 (-97%)
Mutual labels:  corpus, dataset, dict
Prosody
Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
Stars: ✭ 139 (-83.99%)
Mutual labels:  dataset, corpus
Bond
BOND: BERT-Assisted Open-Domain Name Entity Recognition with Distant Supervision
Stars: ✭ 96 (-88.94%)
Mutual labels:  dataset, ner
Dataset List
lists of text corpus and more (mainly Japanese)
Stars: ✭ 84 (-90.32%)
Mutual labels:  dataset, corpus
Insuranceqa Corpus Zh
🚁 保险行业语料库,聊天机器人
Stars: ✭ 821 (-5.41%)
Mutual labels:  dataset, corpus
Cluedatasetsearch
搜索所有中文NLP数据集,附常用英文NLP数据集
Stars: ✭ 2,112 (+143.32%)
Mutual labels:  corpus, ner
Awesome Hungarian Nlp
A curated list of NLP resources for Hungarian
Stars: ✭ 121 (-86.06%)
Mutual labels:  dataset, corpus
Clue
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Stars: ✭ 2,425 (+179.38%)
Mutual labels:  dataset, corpus
Ua Gec
UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language
Stars: ✭ 108 (-87.56%)
Mutual labels:  dataset, corpus
Dialog corpus
用于训练中英文对话系统的语料库 Datasets for Training Chatbot System
Stars: ✭ 1,662 (+91.47%)
Mutual labels:  dataset, corpus
Coarij
Corpus of Annual Reports in Japan
Stars: ✭ 55 (-93.66%)
Mutual labels:  dataset, corpus
Indonesian Nlp Resources
data resource untuk NLP bahasa indonesia
Stars: ✭ 143 (-83.53%)
Mutual labels:  dataset, corpus
Nlp chinese corpus
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
Stars: ✭ 6,656 (+666.82%)
Mutual labels:  dataset, corpus
Cluener2020
CLUENER2020 中文细粒度命名实体识别 Fine Grained Named Entity Recognition
Stars: ✭ 689 (-20.62%)
Mutual labels:  dataset, ner
Nlp bahasa resources
A Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia
Stars: ✭ 158 (-81.8%)
Mutual labels:  dataset, corpus
Gossiping Chinese Corpus
PTT 八卦版問答中文語料
Stars: ✭ 137 (-84.22%)
Mutual labels:  dataset, corpus
Fakenewscorpus
A dataset of millions of news articles scraped from a curated list of data sources.
Stars: ✭ 255 (-70.62%)
Mutual labels:  dataset, corpus
Cluepretrainedmodels
高质量中文预训练模型集合:最先进大模型、最快小模型、相似度专门模型
Stars: ✭ 493 (-43.2%)
Mutual labels:  dataset, corpus
Chatito
🎯🗯 Generate datasets for AI chatbots, NLP tasks, named entity recognition or text classification models using a simple DSL!
Stars: ✭ 678 (-21.89%)
Mutual labels:  dataset
Wilayah Administratif Indonesia
Data Provinsi, Kota/Kabupaten, Kecamatan, dan Kelurahan/Desa di Indonesia
Stars: ✭ 667 (-23.16%)
Mutual labels:  dataset
Person search
Joint Detection and Identification Feature Learning for Person Search
Stars: ✭ 666 (-23.27%)
Mutual labels:  dataset
Awesome Project Ideas
Curated list of Machine Learning, NLP, Vision, Recommender Systems Project Ideas
Stars: ✭ 6,114 (+604.38%)
Mutual labels:  dataset
Cophy
"CoPhy: Counterfactual Learning of Physical Dynamics", F. Baradel, N. Neverova, J. Mille, G. Mori, C. Wolf, ICLR'2020
Stars: ✭ 24 (-97.24%)
Mutual labels:  dataset
Datastream.io
An open-source framework for real-time anomaly detection using Python, ElasticSearch and Kibana
Stars: ✭ 814 (-6.22%)
Mutual labels:  dataset
Quanteda
An R package for the Quantitative Analysis of Textual Data
Stars: ✭ 647 (-25.46%)
Mutual labels:  corpus
Naive Bayes Classifier
Naive Bayes classifier is classification algorithm. It uses Naive based Bernoulli and Multinomial equation to classify documents(Text) as ham or spam.
Stars: ✭ 6 (-99.31%)
Mutual labels:  corpus
Mobius
C# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (+7.03%)
Mutual labels:  dataset
Uhttbarcodereference
Universe-HTT barcode reference
Stars: ✭ 634 (-26.96%)
Mutual labels:  dataset
Proteinnet
Standardized data set for machine learning of protein structure
Stars: ✭ 664 (-23.5%)
Mutual labels:  dataset
Covid Ct
COVID-CT-Dataset: A CT Scan Dataset about COVID-19
Stars: ✭ 820 (-5.53%)
Mutual labels:  dataset
Bert Ner Pytorch
Chinese NER(Named Entity Recognition) using BERT(Softmax, CRF, Span)
Stars: ✭ 654 (-24.65%)
Mutual labels:  ner
Facerank
FaceRank - Rank Face by CNN Model based on TensorFlow (add keras version). FaceRank-人脸打分基于 TensorFlow (新增 Keras 版本) 的 CNN 模型(QQ群:167122861)。技术支持:http://tensorflow123.com
Stars: ✭ 841 (-3.11%)
Mutual labels:  dataset
Devblogs
+2600 developer-related blogs and publications.
Stars: ✭ 637 (-26.61%)
Mutual labels:  dataset
Osint collection
Maintained collection of OSINT related resources. (All Free & Actionable)
Stars: ✭ 809 (-6.8%)
Mutual labels:  dataset
Imagenetscraper
👁 Bulk-download all thumbnails from an ImageNet synset, with optional rescaling
Stars: ✭ 24 (-97.24%)
Mutual labels:  dataset
Esc 50
ESC-50: Dataset for Environmental Sound Classification
Stars: ✭ 631 (-27.3%)
Mutual labels:  dataset
Safety Helmet Wearing Dataset
Safety helmet wearing detect dataset, with pretrained model
Stars: ✭ 802 (-7.6%)
Mutual labels:  dataset
Awesome chinese medical nlp
中文医学NLP公开资源整理:术语集/语料库/词向量/预训练模型/知识图谱/命名实体识别/QA/信息抽取/模型/论文/etc
Stars: ✭ 623 (-28.23%)
Mutual labels:  dataset
Gensim Data
Data repository for pretrained NLP models and NLP corpora.
Stars: ✭ 622 (-28.34%)
Mutual labels:  dataset
Chatbot cn
基于金融-司法领域(兼有闲聊性质)的聊天机器人,其中的主要模块有信息抽取、NLU、NLG、知识图谱等,并且利用Django整合了前端展示,目前已经封装了nlp和kg的restful接口
Stars: ✭ 791 (-8.87%)
Mutual labels:  ner
Label Studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Stars: ✭ 7,264 (+736.87%)
Mutual labels:  dataset
Dict build
自动构建中文词库:http://www.matrix67.com/blog/archives/5044
Stars: ✭ 599 (-30.99%)
Mutual labels:  dict
Khayyam
106 Omar Khayyam quatrains in YAML format.
Stars: ✭ 8 (-99.08%)
Mutual labels:  dataset
Chinesener
中文命名实体识别,实体抽取,tensorflow,pytorch,BiLSTM+CRF
Stars: ✭ 938 (+8.06%)
Mutual labels:  ner
Rdhs
API Client and Data Munging for the Demographic and Health Survey Data
Stars: ✭ 22 (-97.47%)
Mutual labels:  dataset
Natasha
Solves basic Russian NLP tasks, API for lower level Natasha projects
Stars: ✭ 788 (-9.22%)
Mutual labels:  ner
Xmnlp
xmnlp:提供中文分词, 词性标注, 命名体识别,情感分析,文本纠错,文本转拼音,文本摘要,偏旁部首等功能
Stars: ✭ 591 (-31.91%)
Mutual labels:  ner
Couplet Dataset
Dataset for couplets. 70万条对联数据库。
Stars: ✭ 589 (-32.14%)
Mutual labels:  dataset
Lm Lstm Crf
Empower Sequence Labeling with Task-Aware Language Model
Stars: ✭ 778 (-10.37%)
Mutual labels:  ner
Cvat
Powerful and efficient Computer Vision Annotation Tool (CVAT)
Stars: ✭ 6,557 (+655.41%)
Mutual labels:  dataset
Open stt
Open STT
Stars: ✭ 584 (-32.72%)
Mutual labels:  dataset
Sohu baseline
基于BERT的中文命名实体识别(pytorch)
Stars: ✭ 19 (-97.81%)
Mutual labels:  ner
Seq2seq Chatbot
Chatbot in 200 lines of code using TensorLayer
Stars: ✭ 777 (-10.48%)
Mutual labels:  corpus
Total Text Dataset
Total Text Dataset. It consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one of a kind.
Stars: ✭ 580 (-33.18%)
Mutual labels:  dataset
Sequence Labeling Bilstm Crf
The classical BiLSTM-CRF model implemented in Tensorflow, for sequence labeling tasks. In Vex version, everything is configurable.
Stars: ✭ 579 (-33.29%)
Mutual labels:  ner
Bert Chinese Ner
使用预训练语言模型BERT做中文NER
Stars: ✭ 758 (-12.67%)
Mutual labels:  ner
Hate Speech And Offensive Language
Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017
Stars: ✭ 543 (-37.44%)
Mutual labels:  dataset
Nas Bench 201
NAS-Bench-201 API and Instruction
Stars: ✭ 537 (-38.13%)
Mutual labels:  dataset
1-60 of 674 similar projects