ungoliant: 🕷️ The pipeline for the OSCAR corpus
Stars: ✭ 69 (-14.81%)
Tgcontest: Telegram Data Clustering contest solution by Mindful Squirrel
Stars: ✭ 74 (-8.64%)
Magnitude: A fast, efficient universal vector embedding utility package.
Stars: ✭ 1,394 (+1620.99%)
Pytorch Sentiment Analysis: Tutorials on getting started with PyTorch and TorchText for sentiment analysis.
Stars: ✭ 3,209 (+3861.73%)
Pytorchtext: 1st Place Solution for Zhihu Machine Learning Challenge. Implementation of various text-classification models. (First-place solution for the Zhihu Kanshan Cup)
Stars: ✭ 1,022 (+1161.73%)
fasttext-serving: Serve your fastText models for text classification and word vectors
Stars: ✭ 21 (-74.07%)
Fasttext4j: Implementing Facebook's FastText with Java
Stars: ✭ 148 (+82.72%)
Bert language understanding: Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN
Stars: ✭ 933 (+1051.85%)
Cvtk: CVTK, a Computer Vision ToolKit.
Stars: ✭ 119 (+46.91%)
kontext: An advanced, extensible web front-end for the Manatee-open corpus search engine
Stars: ✭ 50 (-38.27%)
Half Size: Code for "Effective Dimensionality Reduction for Word Embeddings".
Stars: ✭ 89 (+9.88%)
Nlp Journey: Documents, papers and code related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classification, Text Generation, Text Similarity, Machine Translation, etc. All code is implemented in TensorFlow 2.0.
Stars: ✭ 1,290 (+1492.59%)
Fasttext Node: Node wrapper around FastText Library
Stars: ✭ 58 (-28.4%)
Cw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information
Stars: ✭ 224 (+176.54%)
Embeddingsviz: Visualize word embeddings of a vocabulary in TensorBoard, including the neighbors
Stars: ✭ 40 (-50.62%)
nerus: Large silver standard Russian corpus with NER, morphology and syntax markup
Stars: ✭ 47 (-41.98%)
Fasttext Tuning: 📈 Find your fasttext hyperparameters quickly and easily.
Stars: ✭ 13 (-83.95%)
Gensim: Topic Modelling for Humans
Stars: ✭ 12,763 (+15656.79%)
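Mynlp: A production-grade, high-performance, modular, extensible Chinese NLP toolkit. (Chinese word segmentation, averaged perceptron, fastText, pinyin, new word discovery, segmentation error correction, BM25, person name recognition, named entities, custom dictionaries)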
Stars: ✭ 519 (+540.74%)
fastchess: Predicts the best chess move with 27.5% accuracy by a single matrix multiplication
Stars: ✭ 75 (-7.41%)
Text Classification Demos: Neural models for Text Classification in Tensorflow, such as cnn, dpcnn, fasttext, bert ...
Stars: ✭ 144 (+77.78%)
Biosentvec: BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
Stars: ✭ 308 (+280.25%)
node-fasttext: Nodejs binding for fasttext representation and classification.
Stars: ✭ 39 (-51.85%)
fasttextjs: JavaScript implementation of the FastText prediction algorithm
Stars: ✭ 31 (-61.73%)
Nlp: By Dou Ge (兜哥), an open-source introductory book on NLP
Stars: ✭ 1,677 (+1970.37%)
CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates
Stars: ✭ 26 (-67.9%)
Fastrtext: R wrapper for fastText
Stars: ✭ 103 (+27.16%)
Ai law: all kinds of baseline models for long text classification (text categorization)
Stars: ✭ 243 (+200%)
NLP-paper: 🎨 🎨 NLP natural language processing tutorial 🎨🎨 https://dataxujing.github.io/NLP-paper/
Stars: ✭ 23 (-71.6%)
fastText1607: Unofficial Implementation of "Bag of Tricks for Efficient Text Classification", 2016, Armand Joulin et al. (https://arxiv.org/pdf/1607.01759.pdf)
Stars: ✭ 20 (-75.31%)
Convai Bot 1337: NIPS Conversational Intelligence Challenge 2017 Winner System: Skill-based Conversational Agent with Supervised Dialog Manager
Stars: ✭ 65 (-19.75%)
Pyfasttext: Yet another Python binding for fastText
Stars: ✭ 229 (+182.72%)
Fasttext.py: A Python interface for Facebook fastText
Stars: ✭ 1,091 (+1246.91%)
Cvpr paper search tool: Automatic paper clustering and search tool using fastText from Facebook Research
Stars: ✭ 43 (-46.91%)
Shallowlearn: An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
Stars: ✭ 196 (+141.98%)
kanji-frequency: Kanji usage frequency data collected from various sources
Stars: ✭ 92 (+13.58%)
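Keras Textclassification: Chinese long-text classification, short-sentence classification, multi-label classification, and two-sentence similarity (Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or short); base classes for building character, word, and sentence embedding layers (embeddings) and network layers (graph); FastText, TextCNN, CharCNN, TextRNN, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, Capsule network (CapsuleNet), Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN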
Stars: ✭ 914 (+1028.4%)
Wordvectors: Pre-trained word vectors of 30+ languages
Stars: ✭ 2,043 (+2422.22%)
Text classification: all kinds of text classification models and more with deep learning
Stars: ✭ 7,179 (+8762.96%)
actions-suggest-related-links: A GitHub Action to suggest related or similar issues, documents, and links. Based on the power of NLP and fastText.
Stars: ✭ 23 (-71.6%)
Textclassification Keras: Text classification models implemented in Keras, including: FastText, TextCNN, TextRNN, TextBiRNN, TextAttBiRNN, HAN, RCNN, RCNNVariant, etc.
Stars: ✭ 621 (+666.67%)
Embedding As Service: One-Stop Solution to encode sentences to fixed-length vectors using various embedding techniques
Stars: ✭ 151 (+86.42%)
Lmdb Embeddings: Fast word vectors with little memory usage in Python
Stars: ✭ 404 (+398.77%)
german-sentiment: A data set and model for German sentiment classification.
Stars: ✭ 37 (-54.32%)
Tensorflow fasttext: Simple embedding based text classifier inspired by fastText, implemented in tensorflow
Stars: ✭ 290 (+258.02%)
Nlp research: A TensorFlow-based NLP deep learning project supporting four tasks: text classification, sentence matching, sequence labeling, and text generation
Stars: ✭ 141 (+74.07%)
compress-fasttext: Tools for shrinking fastText models (in gensim format)
Stars: ✭ 124 (+53.09%)
fasttext-serverless: Serverless hashtag recommendations using fastText and Python with AWS Lambda
Stars: ✭ 20 (-75.31%)
Whatthelang: Lightning Fast Language Prediction 🚀
Stars: ✭ 130 (+60.49%)