Nlp Journey
Documents, papers and code related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classification, Text Generation, Text Similarity, Machine Translation, etc. All code is implemented in TensorFlow 2.0.
Stars: ✭ 1,290 (+743.14%)
Vectorsinsearch
Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Searching with Vectors' talk from Haystack 2019 (US). Builds upon my conceptual search and semantic search work from 2015
Stars: ✭ 71 (-53.59%)
Dna2vec
dna2vec: Consistent vector representations of variable-length k-mers
Stars: ✭ 117 (-23.53%)
Transorthogonal Linguistics
Uses a distributed word representation to find words along the hyperchord of two input words.
Stars: ✭ 93 (-39.22%)
Ml Projects
ML based projects such as Spam Classification, Time Series Analysis, Text Classification using Random Forest, Deep Learning, Bayesian, Xgboost in Python
Stars: ✭ 127 (-16.99%)
Russian news corpus
Russian mass media stemmed texts corpus / Corpus of lemmatized (morphologically normalized) texts from Russian mass media
Stars: ✭ 76 (-50.33%)
Dhooks
A simple python Discord webhook API wrapper
Stars: ✭ 136 (-11.11%)
Deeplearning Nlp Models
A small, interpretable codebase containing the re-implementation of a few "deep" NLP models in PyTorch. Colab notebooks to run with GPUs. Models: word2vec, CNNs, transformer, gpt.
Stars: ✭ 64 (-58.17%)
Awesome Embedding Models
A curated list of awesome embedding models tutorials, projects and communities.
Stars: ✭ 1,486 (+871.24%)
Postgres Word2vec
Utils to use word embeddings such as word2vec vectors in a postgres database
Stars: ✭ 96 (-37.25%)
Tadw
An implementation of "Network Representation Learning with Rich Text Information" (IJCAI '15).
Stars: ✭ 43 (-71.9%)
Scattertext Pydata
Notebooks for the Seattle PyData 2017 talk on Scattertext
Stars: ✭ 132 (-13.73%)
Dict2vec
Dict2vec is a framework to learn word embeddings using lexical dictionaries.
Stars: ✭ 91 (-40.52%)
Media Embed
A PHP library to deal with all those media services around, parsing their URLs and displaying their audios/videos.
Stars: ✭ 143 (-6.54%)
Cw2vec
Character-based training of word vectors
Stars: ✭ 80 (-47.71%)
Scattertext
Beautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (+1025.49%)
Asne
A sparsity aware and memory efficient implementation of "Attributed Social Network Embedding" (TKDE 2018).
Stars: ✭ 73 (-52.29%)
Fasttext4j
Implementing Facebook's FastText with Java
Stars: ✭ 148 (-3.27%)
Embed
Embed player for D.Tube
Stars: ✭ 65 (-57.52%)
Chive
Japanese word embedding with Sudachi and NWJC 🌿
Stars: ✭ 63 (-58.82%)
Ml
sourced.ml is a library and command line tools to build and apply machine learning models on top of Universal Abstract Syntax Trees
Stars: ✭ 136 (-11.11%)
Pujangga
Pujangga - Indonesian Natural Language Processing Tool with REST API, an Interface for InaNLP and Deeplearning4j's Word2Vec
Stars: ✭ 47 (-69.28%)
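Textclf
TextClf: a text classification framework based on PyTorch/Sklearn, including logistic regression, SVM, TextCNN, TextRNN, TextRCNN, DRNN, DPCNN, Bert and other models. Data processing, model training, testing and other steps can be completed through simple configuration.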
Stars: ✭ 105 (-31.37%)
Text Summarizer
Python Framework for Extractive Text Summarization
Stars: ✭ 96 (-37.25%)
Experiments
Some research experiments
Stars: ✭ 95 (-37.91%)
Word2vec
Go library for performing computations in word2vec binary models
Stars: ✭ 143 (-6.54%)
Textfeatures
👷 A simple package for extracting useful features from character objects 👷
Stars: ✭ 148 (-3.27%)
Glove As A Tensorflow Embedding Layer
Taking a pretrained GloVe model, and using it as a TensorFlow embedding weight layer **inside the GPU**. Therefore, you only need to send the index of the words through the GPU data transfer bus, reducing data transfer overhead.
Stars: ✭ 85 (-44.44%)
Ja.text8
Japanese text8 corpus for word embedding.
Stars: ✭ 79 (-48.37%)
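Nlp research
NLP research: a TensorFlow-based NLP deep learning project supporting four major tasks: text classification, sentence matching, sequence labeling, and text generation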
Stars: ✭ 141 (-7.84%)
Musae
The reference implementation of "Multi-scale Attributed Node Embedding".
Stars: ✭ 75 (-50.98%)
Vime
Customizable, extensible, accessible and framework agnostic media player. Modern alternative to Video.js and Plyr. Supports HTML5, HLS, Dash, YouTube, Vimeo, Dailymotion...
Stars: ✭ 1,928 (+1160.13%)
Sense2vec
🦆 Contextually-keyed word vectors
Stars: ✭ 1,184 (+673.86%)
Embedding As Service
One-Stop Solution to encode sentences to fixed-length vectors using various embedding techniques
Stars: ✭ 151 (-1.31%)
Gh Card
GitHub Repository Card for Any Web Site
Stars: ✭ 1,154 (+654.25%)
Kor2vec
Library for Korean morpheme and word vector representation
Stars: ✭ 64 (-58.17%)
Word2vec
A further wrapper around the Word2VEC_java written by ansj, which also implements common word similarity and sentence similarity calculations.
Stars: ✭ 136 (-11.11%)
Repo 2017
Python codes in Machine Learning, NLP, Deep Learning and Reinforcement Learning with Keras and Theano
Stars: ✭ 1,123 (+633.99%)
Nlp
Produced by 兜哥: <an open-source introductory book on NLP>
Stars: ✭ 1,677 (+996.08%)
Prettyembed.js
Prettier embeds for your YouTubes - with nice options like high-res preview images, advanced customization of embed options, and optional FitVids support.
Stars: ✭ 1,078 (+604.58%)
Skip Thoughts.torch
Porting of Skip-Thoughts pretrained models from Theano to PyTorch & Torch7
Stars: ✭ 146 (-4.58%)
Word2vec
Training Chinese word vectors with Word2vec. Word2vec was created by a team of researchers led by Tomas Mikolov at Google.
Stars: ✭ 48 (-68.63%)
Numpy Ml
Machine learning, in numpy
Stars: ✭ 11,100 (+7154.9%)
Magnitude
A fast, efficient universal vector embedding utility package.
Stars: ✭ 1,394 (+811.11%)
Graphwavemachine
A scalable implementation of "Learning Structural Node Embeddings Via Diffusion Wavelets (KDD 2018)".
Stars: ✭ 151 (-1.31%)
Role2vec
A scalable Gensim implementation of "Learning Role-based Graph Embeddings" (IJCAI 2018).
Stars: ✭ 134 (-12.42%)
Repo 2016
R, Python and Mathematica Codes in Machine Learning, Deep Learning, Artificial Intelligence, NLP and Geolocation
Stars: ✭ 103 (-32.68%)