TorchBlocksA PyTorch-based toolkit for natural language processing
Stars: ✭ 85 (-61.36%)
Crf Layer On The Top Of BilstmThe CRF Layer was implemented by using Chainer 2.0. Please see more details here: https://createmomo.github.io/2017/09/12/CRF_Layer_on_the_Top_of_BiLSTM_1/
Stars: ✭ 148 (-32.73%)
Seq2annotation基于 TensorFlow & PaddlePaddle 的通用序列标注算法库(目前包含 BiLSTM+CRF, Stacked-BiLSTM+CRF 和 IDCNN+CRF,更多算法正在持续添加中)实现中文分词(Tokenizer / segmentation)、词性标注(Part Of Speech, POS)和命名实体识别(Named Entity Recognition, NER)等序列标注任务。
Stars: ✭ 70 (-68.18%)
Slot filling and intent detection of sluslot filling, intent detection, joint training, ATIS & SNIPS datasets, the Facebook’s multilingual dataset, MIT corpus, E-commerce Shopping Assistant (ECSA) dataset, CoNLL2003 NER, ELMo, BERT, XLNet
Stars: ✭ 298 (+35.45%)
PydatasetInstant access to many datasets in Python.
Stars: ✭ 880 (+300%)
Open3d MlAn extension of Open3D to address 3D Machine Learning tasks
Stars: ✭ 284 (+29.09%)
MolaA Modular Optimization framework for Localization and mApping (MOLA)
Stars: ✭ 206 (-6.36%)
PharmacoDBSearch across publicly available datasets to find instances where a drug or cell line of interest has been profiled.
Stars: ✭ 38 (-82.73%)
DatasaurusR Package 📦 Containing the Datasaurus Dozen datasets 📊
Stars: ✭ 193 (-12.27%)
Open Semantic EtlPython based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database
Stars: ✭ 165 (-25%)
Mt DnnMulti-Task Deep Neural Networks for Natural Language Understanding
Stars: ✭ 1,871 (+750.45%)
Farm🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
Stars: ✭ 1,140 (+418.18%)
TdcTherapeutics Data Commons: Machine Learning Datasets and Tasks for Therapeutics
Stars: ✭ 291 (+32.27%)
DanlpDaNLP is a repository for Natural Language Processing resources for the Danish Language.
Stars: ✭ 111 (-49.55%)
wink-nerLanguage agnostic named entity recognizer
Stars: ✭ 32 (-85.45%)
BERT-NERUsing pre-trained BERT models for Chinese and English NER with 🤗Transformers
Stars: ✭ 114 (-48.18%)
OgbBenchmark datasets, data loaders, and evaluators for graph machine learning
Stars: ✭ 799 (+263.18%)
AlpacaTagAlpacaTag: An Active Learning-based Crowd Annotation Framework for Sequence Tagging (ACL 2019 Demo)
Stars: ✭ 126 (-42.73%)
Delfta Deep Learning Framework for Text
Stars: ✭ 289 (+31.36%)
NatashaSolves basic Russian NLP tasks, API for lower level Natasha projects
Stars: ✭ 788 (+258.18%)
let-it-be中国高等教育群体的心理健康状态数据集
Stars: ✭ 28 (-87.27%)
ChineseglueLanguage Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard
Stars: ✭ 1,548 (+603.64%)
DeepNERAn Easy-to-use, Modular and Prolongable package of deep-learning based Named Entity Recognition Models.
Stars: ✭ 9 (-95.91%)
farabio🤖 PyTorch toolkit for biomedical imaging ❤️
Stars: ✭ 48 (-78.18%)
Ner Slot filling中文自然语言的实体抽取和意图识别(Natural Language Understanding),可选Bi-LSTM + CRF 或者 IDCNN + CRF
Stars: ✭ 151 (-31.36%)
11K-HandsTwo-stream CNN for gender classification and biometric identification using a dataset of 11K hand images.
Stars: ✭ 44 (-80%)
Awesome TransitCommunity list of transit APIs, apps, datasets, research, and software 🚌🌟🚋🌟🚂
Stars: ✭ 713 (+224.09%)
covid19-datasetsA list of high quality open datasets for COVID-19 data analysis
Stars: ✭ 56 (-74.55%)
Coco Annotator✏️ Web-based image segmentation tool for object detection, localization, and keypoints
Stars: ✭ 1,138 (+417.27%)
NerNamed Entity Recognition
Stars: ✭ 288 (+30.91%)
Cluecorpus2020Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料
Stars: ✭ 278 (+26.36%)
ColourColour Science for Python
Stars: ✭ 1,131 (+414.09%)
TextpipeTextpipe: clean and extract metadata from text
Stars: ✭ 284 (+29.09%)
trinity-ieInformation extraction pipeline containing coreference resolution, named entity linking, and relationship extraction
Stars: ✭ 59 (-73.18%)
Chatito🎯🗯 Generate datasets for AI chatbots, NLP tasks, named entity recognition or text classification models using a simple DSL!
Stars: ✭ 678 (+208.18%)
Few-Shot-Intent-DetectionFew-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines and results.
Stars: ✭ 63 (-71.36%)
PynlpA pythonic wrapper for Stanford CoreNLP.
Stars: ✭ 103 (-53.18%)
ml-datasets🌊 Machine learning dataset loaders for testing and example scripts
Stars: ✭ 40 (-81.82%)
Label StudioLabel Studio is a multi-type data labeling and annotation tool with standardized output format
Stars: ✭ 7,264 (+3201.82%)
datasetdataset is a command line tool, Go package, shared library and Python package for working with JSON objects as collections
Stars: ✭ 21 (-90.45%)
IdenprofIdenProf dataset is a collection of images of identifiable professionals. It is been collected to enable the development of AI systems that can serve by identifying people and the nature of their job by simply looking at an image, just like humans can do.
Stars: ✭ 149 (-32.27%)
Hscrf PytorchACL 2018: Hybrid semi-Markov CRF for Neural Sequence Labeling (http://aclweb.org/anthology/P18-2038)
Stars: ✭ 284 (+29.09%)
SolrtexttaggerA text tagger based on Lucene / Solr, using FST technology
Stars: ✭ 162 (-26.36%)
Ner AnnotatorNamed Entity Recognition (NER) Annotation tool for SpaCy. Generates Traning Data as a JSON which can be readily used.
Stars: ✭ 127 (-42.27%)
Wikipedia ner📖 Labeled examples from wiki dumps in Python
Stars: ✭ 61 (-72.27%)
MeglassAn eyeglass face dataset collected and cleaned for face recognition evaluation, CCBR 2018.
Stars: ✭ 281 (+27.73%)
Tf Lstm Crf BatchTensorflow-LSTM-CRF tool for Named Entity Recognizer
Stars: ✭ 59 (-73.18%)