Entity Recognition Datasets
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
Stars: ✭ 891 (+879.12%)
Ner Datasets
Datasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English)
Stars: ✭ 220 (+141.76%)
Dataframes.jl
In-memory tabular data in Julia
Stars: ✭ 951 (+945.05%)
Corenlp
Stanford CoreNLP: A Java suite of core NLP tools.
Stars: ✭ 8,248 (+8963.74%)
Bert Ner
Pytorch-Named-Entity-Recognition-with-BERT
Stars: ✭ 829 (+810.99%)
Healthcheck
Health Check ✔ is a Machine Learning Web Application made using Flask that can predict mainly three diseases i.e. Diabetes, Heart Disease, and Cancer.
Stars: ✭ 35 (-61.54%)
Open Semantic Search Apps
Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations and named entities) and data import (ETL like text extraction, OCR and crawling filesystems or websites)
Stars: ✭ 55 (-39.56%)
Chinesener
Chinese named entity recognition and entity extraction; TensorFlow, PyTorch, BiLSTM+CRF
Stars: ✭ 938 (+930.77%)
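Seq2annotation
A general-purpose sequence labeling library based on TensorFlow & PaddlePaddle (currently includes BiLSTM+CRF, Stacked-BiLSTM+CRF, and IDCNN+CRF, with more algorithms being added) that implements sequence labeling tasks such as Chinese word segmentation (tokenization), part-of-speech (POS) tagging, and named entity recognition (NER).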
Stars: ✭ 70 (-23.08%)
Easypr
An easy, flexible, and accurate plate recognition project for Chinese licenses in unconstrained situations.
Stars: ✭ 6,046 (+6543.96%)
Pytorch Cpp
C++ Implementation of PyTorch Tutorials for Everyone
Stars: ✭ 1,014 (+1014.29%)
Label Studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Stars: ✭ 7,264 (+7882.42%)
Commons
⛲️ Commons Marketplace client & server to explore, download, and publish open data sets in the Ocean Protocol Network.
Stars: ✭ 34 (-62.64%)
Pydataset
Instant access to many datasets in Python.
Stars: ✭ 880 (+867.03%)
Iob2corpus
Japanese IOB2 tagged corpus for Named Entity Recognition.
Stars: ✭ 51 (-43.96%)
Gigabert
Zero-shot Transfer Learning from English to Arabic
Stars: ✭ 23 (-74.73%)
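Gopup
Data interfaces: Baidu, Google, Toutiao, and Weibo indices, macroeconomic data, interest rate data, currency exchange rates, high-potential ("qianlima") and unicorn companies, Xinwen Lianbo news broadcast transcripts, film and TV box office data, university lists, epidemic data…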
Stars: ✭ 1,229 (+1250.55%)
Audino
Open source audio annotation tool for humans™
Stars: ✭ 740 (+713.19%)
Cluener2020
CLUENER2020: Chinese Fine-Grained Named Entity Recognition
Stars: ✭ 689 (+657.14%)
Colour
Colour Science for Python
Stars: ✭ 1,131 (+1142.86%)
Deeppavlov
An open source library for deep learning end-to-end dialog systems and chatbots.
Stars: ✭ 5,525 (+5971.43%)
Jointre
End-to-end neural relation extraction using deep biaffine attention (ECIR 2019)
Stars: ✭ 41 (-54.95%)
Torchcrf
An Implementation of CRF (Conditional Random Fields) in PyTorch 1.0
Stars: ✭ 58 (-36.26%)
Nlp Experiments In Pytorch
PyTorch repository for text categorization and NER experiments in Turkish and English.
Stars: ✭ 35 (-61.54%)
Deepnlp
A natural language processing library based on deep learning
Stars: ✭ 34 (-62.64%)
Phonlp
PhoNLP: A BERT-based multi-task learning toolkit for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)
Stars: ✭ 56 (-38.46%)
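Harvesttext
A text mining and preprocessing toolkit (text cleaning, new word discovery, sentiment analysis, entity recognition and linking, keyword extraction, knowledge extraction, syntactic parsing, etc.) using unsupervised or weakly supervised methods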
Stars: ✭ 956 (+950.55%)
Openml R
R package to interface with OpenML
Stars: ✭ 81 (-10.99%)
Tner
Language model finetuning on NER with an easy interface, and cross-domain evaluation. We released NER models finetuned on various domains via the Hugging Face model hub.
Stars: ✭ 54 (-40.66%)
Tf ner
Simple and Efficient Tensorflow implementations of NER models with tf.estimator and tf.data
Stars: ✭ 876 (+862.64%)
Ner blstm Crf
LSTM-CRF for NER with the CoNLL-2002 dataset
Stars: ✭ 51 (-43.96%)
Turkish Bert Nlp Pipeline
Bert-base NLP pipeline for Turkish: NER, Sentiment Analysis, Question Answering, etc.
Stars: ✭ 85 (-6.59%)
Ogb
Benchmark datasets, data loaders, and evaluators for graph machine learning
Stars: ✭ 799 (+778.02%)
Personas
Datasets for Deep learning Personas
Stars: ✭ 49 (-46.15%)
Awesome Transit
Community list of transit APIs, apps, datasets, research, and software 🚌🌟🚋🌟🚂
Stars: ✭ 713 (+683.52%)
Coco Annotator
✏️ Web-based image segmentation tool for object detection, localization, and keypoints
Stars: ✭ 1,138 (+1150.55%)
Yedda
YEDDA: A Lightweight Collaborative Text Span Annotation Tool. Code for ACL 2018 Best Demo Paper Nomination.
Stars: ✭ 704 (+673.63%)
Awesome Earth Artificial Intelligence
A curated list of Earth Science's Artificial Intelligence (AI) tutorials, notebooks, software, datasets, courses, books, video lectures and papers. Contributions most welcome.
Stars: ✭ 44 (-51.65%)
Chatito
🎯🗯 Generate datasets for AI chatbots, NLP tasks, named entity recognition or text classification models using a simple DSL!
Stars: ✭ 678 (+645.05%)
Atis dataset
The ATIS (Airline Travel Information System) Dataset
Stars: ✭ 81 (-10.99%)
Stanza
Official Stanford NLP Python Library for Many Human Languages
Stars: ✭ 5,887 (+6369.23%)
Datasets For Recommender Systems
This is a repository of topic-centric, high-quality public data sources for Recommender Systems (RS)
Stars: ✭ 564 (+519.78%)
Wikipedia ner
📖 Labeled examples from wiki dumps in Python
Stars: ✭ 61 (-32.97%)
Loghub
A large collection of system log datasets for AI-powered log analytics
Stars: ✭ 551 (+505.49%)
Dareblopy
Data Reading Blocks for Python
Stars: ✭ 82 (-9.89%)
Tf Lstm Crf Batch
Tensorflow-LSTM-CRF tool for Named Entity Recognizer
Stars: ✭ 59 (-35.16%)