Ai lawall kinds of baseline models for long text classificaiton( text categorization)
Cw2veccw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information
ShallowlearnAn experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
GensimTopic Modelling for Humans
Embedding As ServiceOne-Stop Solution to encode sentence to fixed length vectors from various embedding techniques
Fasttext4j Implementing Facebook's FastText with java
Nlp researchNLP research:基于tensorflow的nlp深度学习项目,支持文本分类/句子匹配/序列标注/文本生成 四大任务
CvtkCVTK, a Computer Vision ToolKit.
MagnitudeA fast, efficient universal vector embedding utility package.
Half SizeCode for "Effective Dimensionality Reduction for Word Embeddings".
Nlp JourneyDocuments, papers and codes related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classificatin, Text Generation, Text Similarity, Machine Translation),etc. All codes are implemented intensorflow 2.0.
TgcontestTelegram Data Clustering contest solution by Mindful Squirrel
Convai Bot 1337NIPS Conversational Intelligence Challenge 2017 Winner System: Skill-based Conversational Agent with Supervised Dialog Manager
Pytorchtext1st Place Solution for Zhihu Machine Learning Challenge . Implementation of various text-classification models.(知乎看山杯第一名解决方案)
EmbeddingsvizVisualize word embeddings of a vocabulary in TensorBoard, including the neighbors
Keras Textclassification中文长文本分类、短句子分类、多标签分类、两句子相似度(Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or short),字词句向量嵌入层(embeddings)和网络层(graph)构建基类,FastText,TextCNN,CharCNN,TextRNN, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, 胶囊网络-CapsuleNet, Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN
Textclassification KerasText classification models implemented in Keras, including: FastText, TextCNN, TextRNN, TextBiRNN, TextAttBiRNN, HAN, RCNN, RCNNVariant, etc.
Mynlp一个生产级、高性能、模块化、可扩展的中文NLP工具包。(中文分词、平均感知机、fastText、拼音、新词发现、分词纠错、BM25、人名识别、命名实体、自定义词典)
BiosentvecBioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
Tensorflow fasttextSimple embedding based text classifier inspired by fastText, implemented in tensorflow
node-fasttextNodejs binding for fasttext representation and classification.
fastText1607Unofficial Implementation of "Bag of Tricks for Efficient Text Classification", 2016, Armand Joulin et al. (https://arxiv.org/pdf/1607.01759.pdf)
word embeddingSample code for training Word2Vec and FastText using wiki corpus and their pretrained word embedding..
extremeTextLibrary for fast text representation and extreme classification.
fasttext-serverFlask web server to serve supervised models trained with FastText.
nlpbuddyA text analysis application for performing common NLP tasks through a web dashboard interface and an API
FastText.NetWrapper.NET Standard wrapper for fastText library. Now works on Windows, Linux and MacOs!
goclassyAn asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.
NLP-paper🎨 🎨NLP 自然语言处理教程 🎨🎨 https://dataxujing.github.io/NLP-paper/
fasttext-serverlessServerless hashtag recommendations using fastText and Python with AWS Lambda
ungoliant🕷️ The pipeline for the OSCAR corpus