Naive Bayes classifier is classification algorithm. It uses Naive based Bernoulli and Multinomial equation to classify documents(Text) as ham or spam.

Stars: ✭ 6 (-99.31%)

Mutual labels: corpus

Mobius

C# and F# language binding and extensions to Apache Spark

Stars: ✭ 929 (+7.03%)

Mutual labels: dataset

Uhttbarcodereference

Universe-HTT barcode reference

Stars: ✭ 634 (-26.96%)

Mutual labels: dataset

Proteinnet

Standardized data set for machine learning of protein structure

Stars: ✭ 664 (-23.5%)

Mutual labels: dataset

Covid Ct

COVID-CT-Dataset: A CT Scan Dataset about COVID-19

Stars: ✭ 820 (-5.53%)

Mutual labels: dataset

Bert Ner Pytorch

Chinese NER(Named Entity Recognition) using BERT(Softmax, CRF, Span)

Stars: ✭ 654 (-24.65%)

Mutual labels: ner

Facerank

FaceRank - Rank Face by CNN Model based on TensorFlow (add keras version). FaceRank-人脸打分基于 TensorFlow (新增 Keras 版本) 的 CNN 模型（QQ群：167122861）。技术支持：http://tensorflow123.com

Stars: ✭ 841 (-3.11%)

Mutual labels: dataset

Devblogs

+2600 developer-related blogs and publications.

Stars: ✭ 637 (-26.61%)

Mutual labels: dataset

Osint collection

Maintained collection of OSINT related resources. (All Free & Actionable)

Stars: ✭ 809 (-6.8%)

Mutual labels: dataset

Imagenetscraper

👁 Bulk-download all thumbnails from an ImageNet synset, with optional rescaling

Stars: ✭ 24 (-97.24%)

Mutual labels: dataset

Esc 50

ESC-50: Dataset for Environmental Sound Classification

Stars: ✭ 631 (-27.3%)

Mutual labels: dataset

Safety Helmet Wearing Dataset

Safety helmet wearing detect dataset, with pretrained model

Stars: ✭ 802 (-7.6%)

Mutual labels: dataset

Awesome chinese medical nlp

中文医学NLP公开资源整理：术语集/语料库/词向量/预训练模型/知识图谱/命名实体识别/QA/信息抽取/模型/论文/etc

Stars: ✭ 623 (-28.23%)

Mutual labels: dataset

Gensim Data

Data repository for pretrained NLP models and NLP corpora.

Stars: ✭ 622 (-28.34%)

Mutual labels: dataset

Chatbot cn

基于金融-司法领域(兼有闲聊性质)的聊天机器人，其中的主要模块有信息抽取、NLU、NLG、知识图谱等，并且利用Django整合了前端展示,目前已经封装了nlp和kg的restful接口

Stars: ✭ 791 (-8.87%)

Mutual labels: ner

Label Studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Stars: ✭ 7,264 (+736.87%)

Mutual labels: dataset

Dict build

自动构建中文词库：http://www.matrix67.com/blog/archives/5044

Stars: ✭ 599 (-30.99%)

Mutual labels: dict

Khayyam

106 Omar Khayyam quatrains in YAML format.

Stars: ✭ 8 (-99.08%)

Mutual labels: dataset

Chinesener

中文命名实体识别，实体抽取，tensorflow，pytorch，BiLSTM+CRF

Stars: ✭ 938 (+8.06%)

Mutual labels: ner

Rdhs

API Client and Data Munging for the Demographic and Health Survey Data

Stars: ✭ 22 (-97.47%)

Mutual labels: dataset

Natasha

Solves basic Russian NLP tasks, API for lower level Natasha projects

Stars: ✭ 788 (-9.22%)

Mutual labels: ner

Xmnlp

xmnlp：提供中文分词, 词性标注, 命名体识别，情感分析，文本纠错，文本转拼音，文本摘要，偏旁部首等功能

Stars: ✭ 591 (-31.91%)

Mutual labels: ner

Couplet Dataset

Dataset for couplets. 70万条对联数据库。

Stars: ✭ 589 (-32.14%)

Mutual labels: dataset

Lm Lstm Crf

Empower Sequence Labeling with Task-Aware Language Model

Stars: ✭ 778 (-10.37%)

Mutual labels: ner

Cvat

Powerful and efficient Computer Vision Annotation Tool (CVAT)

Stars: ✭ 6,557 (+655.41%)

Mutual labels: dataset

Open stt

Open STT

Stars: ✭ 584 (-32.72%)

Mutual labels: dataset

Sohu baseline

基于BERT的中文命名实体识别（pytorch）

Stars: ✭ 19 (-97.81%)

Mutual labels: ner

Seq2seq Chatbot

Chatbot in 200 lines of code using TensorLayer

Stars: ✭ 777 (-10.48%)

Mutual labels: corpus

Total Text Dataset

Total Text Dataset. It consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one of a kind.

Stars: ✭ 580 (-33.18%)

Mutual labels: dataset

Sequence Labeling Bilstm Crf

The classical BiLSTM-CRF model implemented in Tensorflow, for sequence labeling tasks. In Vex version, everything is configurable.

Stars: ✭ 579 (-33.29%)

Mutual labels: ner

Bert Chinese Ner

使用预训练语言模型BERT做中文NER

Stars: ✭ 758 (-12.67%)

Mutual labels: ner

Hate Speech And Offensive Language

Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017

Stars: ✭ 543 (-37.44%)

Mutual labels: dataset

Nas Bench 201

NAS-Bench-201 API and Instruction

Stars: ✭ 537 (-38.13%)

Mutual labels: dataset

1-60 of 674 similar projects

›

next*5