goclassy
An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.
Stars: ✭ 81 (+17.39%)
Fasttext4j
An implementation of Facebook's fastText in Java
Stars: ✭ 148 (+114.49%)
Pytorchtext
1st place solution for the Zhihu Machine Learning Challenge (Zhihu Kanshan Cup). Implementation of various text-classification models.
Stars: ✭ 1,022 (+1381.16%)
Bert language understanding
Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN
Stars: ✭ 933 (+1252.17%)
Tgcontest
Telegram Data Clustering contest solution by Mindful Squirrel
Stars: ✭ 74 (+7.25%)
kanji-frequency
Kanji usage frequency data collected from various sources
Stars: ✭ 92 (+33.33%)
Whatthelang
Lightning Fast Language Prediction 🚀
Stars: ✭ 130 (+88.41%)
node-fasttext
Node.js binding for fastText representation and classification.
Stars: ✭ 39 (-43.48%)
Half Size
Code for "Effective Dimensionality Reduction for Word Embeddings".
Stars: ✭ 89 (+28.99%)
Cw2vec
cw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information
Stars: ✭ 224 (+224.64%)
Fasttext Node
Node wrapper around the fastText library
Stars: ✭ 58 (-15.94%)
kontext
An advanced, extensible web front-end for the Manatee-open corpus search engine
Stars: ✭ 50 (-27.54%)
Embeddingsviz
Visualize word embeddings of a vocabulary in TensorBoard, including the neighbors
Stars: ✭ 40 (-42.03%)
Gensim
Topic Modelling for Humans
Stars: ✭ 12,763 (+18397.1%)
Fasttext Tuning
📈 Find your fastText hyperparameters quickly and easily.
Stars: ✭ 13 (-81.16%)
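Mynlp
A production-grade, high-performance, modular, extensible Chinese NLP toolkit (Chinese word segmentation, averaged perceptron, fastText, pinyin, new-word discovery, segmentation correction, BM25, person-name recognition, named entities, custom dictionaries).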
Stars: ✭ 519 (+652.17%)
Text Classification Demos
Neural models for text classification in TensorFlow, such as CNN, DPCNN, fastText, BERT ...
Stars: ✭ 144 (+108.7%)
Biosentvec
BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
Stars: ✭ 308 (+346.38%)
CogNet
CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates
Stars: ✭ 26 (-62.32%)
Cvtk
CVTK, a Computer Vision ToolKit.
Stars: ✭ 119 (+72.46%)
extremeText
Library for fast text representation and extreme classification.
Stars: ✭ 141 (+104.35%)
Fastrtext
R wrapper for fastText
Stars: ✭ 103 (+49.28%)
Pyfasttext
Yet another Python binding for fastText
Stars: ✭ 229 (+231.88%)
Nlp Journey
Documents, papers and code related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classification, Text Generation, Text Similarity, Machine Translation, etc. All code is implemented in TensorFlow 2.0.
Stars: ✭ 1,290 (+1769.57%)
fasttextjs
JavaScript implementation of the fastText prediction algorithm
Stars: ✭ 31 (-55.07%)
Convai Bot 1337
NIPS Conversational Intelligence Challenge 2017 Winner System: Skill-based Conversational Agent with Supervised Dialog Manager
Stars: ✭ 65 (-5.8%)
Shallowlearn
An experiment in re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText), with some additional exclusive features and a nice API. Written in Python and fully compatible with scikit-learn.
Stars: ✭ 196 (+184.06%)
Fasttext.py
A Python interface for Facebook fastText
Stars: ✭ 1,091 (+1481.16%)
fastchess
Predicts the best chess move with 27.5% accuracy by a single matrix multiplication
Stars: ✭ 75 (+8.7%)
Cvpr paper search tool
Automatic paper clustering and search tool using fastText from Facebook Research
Stars: ✭ 43 (-37.68%)
Wordvectors
Pre-trained word vectors of 30+ languages
Stars: ✭ 2,043 (+2860.87%)
Keras Textclassification
Chinese long-text classification, short-sentence classification, multi-label classification, and sentence-pair similarity with Keras; base classes for building character, word, and sentence embedding layers (embeddings) and network layers (graph); FastText, TextCNN, CharCNN, TextRNN, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, CapsuleNet, Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN
Stars: ✭ 914 (+1224.64%)
Embedding As Service
One-stop solution to encode sentences into fixed-length vectors using various embedding techniques
Stars: ✭ 151 (+118.84%)
Text classification
All kinds of text classification models and more with deep learning
Stars: ✭ 7,179 (+10304.35%)
fasttext-serving
Serve your fastText models for text classification and word vectors
Stars: ✭ 21 (-69.57%)
Textclassification Keras
Text classification models implemented in Keras, including: FastText, TextCNN, TextRNN, TextBiRNN, TextAttBiRNN, HAN, RCNN, RCNNVariant, etc.
Stars: ✭ 621 (+800%)
Lmdb Embeddings
Fast word vectors with little memory usage in Python
Stars: ✭ 404 (+485.51%)
nerus
Large silver-standard Russian corpus with NER, morphology and syntax markup
Stars: ✭ 47 (-31.88%)
Nlp research
NLP research: a TensorFlow-based NLP deep learning project supporting four tasks: text classification, sentence matching, sequence labeling, and text generation
Stars: ✭ 141 (+104.35%)
Tensorflow fasttext
Simple embedding-based text classifier inspired by fastText, implemented in TensorFlow
Stars: ✭ 290 (+320.29%)
KeywordAnalysis
Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends
Stars: ✭ 49 (-28.99%)
fastText1607
Unofficial implementation of "Bag of Tricks for Efficient Text Classification", 2016, Armand Joulin et al. (https://arxiv.org/pdf/1607.01759.pdf)
Stars: ✭ 20 (-71.01%)
word embedding
Sample code for training Word2Vec and fastText using a wiki corpus, and their pretrained word embeddings.
Stars: ✭ 21 (-69.57%)
Ai law
All kinds of baseline models for long text classification (text categorization)
Stars: ✭ 243 (+252.17%)
Nlp
By Dou Ge (兜哥): an open-source introductory book on NLP
Stars: ✭ 1,677 (+2330.43%)
actions-suggest-related-links
A GitHub Action to suggest related or similar issues, documents, and links. Based on the power of NLP and fastText.
Stars: ✭ 23 (-66.67%)
Pytorch Sentiment Analysis
Tutorials on getting started with PyTorch and TorchText for sentiment analysis.
Stars: ✭ 3,209 (+4550.72%)
Magnitude
A fast, efficient universal vector embedding utility package.
Stars: ✭ 1,394 (+1920.29%)