Sensegram: Making sense embeddings out of word embeddings using graph-based word sense induction
Stars: ✭ 209 (+46.15%)
Lmdb Embeddings: Fast word vectors with little memory usage in Python
Stars: ✭ 404 (+182.52%)
Ngram2vec: Four word embedding models implemented in Python, supporting arbitrary context features
Stars: ✭ 703 (+391.61%)
Dict2vec: Dict2vec is a framework to learn word embeddings using lexical dictionaries.
Stars: ✭ 91 (-36.36%)
Experiments: Some research experiments
Stars: ✭ 95 (-33.57%)
Glove As A Tensorflow Embedding Layer: Takes a pretrained GloVe model and uses it as a TensorFlow embedding weight layer inside the GPU. As a result, only the word indices need to be sent over the GPU data transfer bus, reducing data transfer overhead.
Stars: ✭ 85 (-40.56%)
Awesome Embedding Models: A curated list of awesome embedding model tutorials, projects and communities.
Stars: ✭ 1,486 (+939.16%)
Cw2vec: Training word vectors based on characters
Stars: ✭ 80 (-44.06%)
Js Word: ✒️ Word Processing Document Library
Stars: ✭ 1,203 (+741.26%)
Text Summarizer: Python Framework for Extractive Text Summarization
Stars: ✭ 96 (-32.87%)
Dna2vec: Consistent vector representations of variable-length k-mers
Stars: ✭ 117 (-18.18%)
Transorthogonal Linguistics: Uses a distributed word representation to find words along the hyperchord of two input words.
Stars: ✭ 93 (-34.97%)
Scattertext Pydata: Notebooks for the Seattle PyData 2017 talk on Scattertext
Stars: ✭ 132 (-7.69%)
Nlp: An open-source introductory NLP book, produced by Dou Ge (兜哥)
Stars: ✭ 1,677 (+1072.73%)
Weditor: 🍋 A rich text editor that supports multi-user collaboration
Stars: ✭ 82 (-42.66%)
Npoi: A .NET library for reading and writing Microsoft Office binary and OOXML file formats.
Stars: ✭ 1,751 (+1124.48%)
Ja.text8: Japanese text8 corpus for word embedding.
Stars: ✭ 79 (-44.76%)
Etherpad Lite: Etherpad, a modern really-real-time collaborative document editor.
Stars: ✭ 11,937 (+8247.55%)
Russian news corpus: Russian mass media stemmed texts corpus / Corpus of lemmatized (morphologically normalized) texts from Russian mass media
Stars: ✭ 76 (-46.85%)
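Textclf: TextClf, a text classification framework based on PyTorch/Sklearn that includes logistic regression, SVM, TextCNN, TextRNN, TextRCNN, DRNN, DPCNN, BERT and other models. Data processing, model training and testing can all be done through simple configuration.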
Stars: ✭ 105 (-26.57%)
Musae: The reference implementation of "Multi-scale Attributed Node Embedding".
Stars: ✭ 75 (-47.55%)
Sense2vec: 🦆 Contextually-keyed word vectors
Stars: ✭ 1,184 (+727.97%)
Vbasync: Cross-platform tool to synchronize macros from an Office VBA-enabled file with a version-controlled folder
Stars: ✭ 98 (-31.47%)
Scattertext: Beautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (+1104.2%)
Postgres Word2vec: Utilities for using word embeddings such as word2vec vectors in a Postgres database
Stars: ✭ 96 (-32.87%)
Mschart: 📊 Office charts from R
Stars: ✭ 94 (-34.27%)
Docx Embeddedhtml Injection: Word 2016 vulnerability allows injecting HTML/JS code into a docx file's embeddedHTML="" tags.
Stars: ✭ 91 (-36.36%)
Nlp Journey: Documents, papers and code related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classification, Text Generation, Text Similarity, Machine Translation, etc. All code is implemented in TensorFlow 2.0.
Stars: ✭ 1,290 (+802.1%)
Bibword: Microsoft Word and Bibliography Styles extender.
Stars: ✭ 131 (-8.39%)
Pandiff: Prose diffs for any document format supported by Pandoc
Stars: ✭ 110 (-23.08%)
Evilclippy: A cross-platform assistant for creating malicious MS Office documents. Can hide VBA macros, stomp VBA code (via P-Code) and confuse macro analysis tools. Runs on Linux, OSX and Windows.
Stars: ✭ 1,224 (+755.94%)
Word2vec: A further wrapper around the Word2VEC_java library written by ansj, which also implements common word similarity and sentence similarity calculations.
Stars: ✭ 136 (-4.9%)
Numpy Ml: Machine learning, in numpy
Stars: ✭ 11,100 (+7662.24%)
Word To Markdown: A ruby gem to liberate content from Microsoft Word documents
Stars: ✭ 1,216 (+750.35%)
Ml Projects: ML-based projects such as Spam Classification, Time Series Analysis, and Text Classification using Random Forest, Deep Learning, Bayesian methods and Xgboost in Python
Stars: ✭ 127 (-11.19%)
Magicodes.ie: A general import and export library that supports DTO import and export, template export, fancy export and dynamic export, for Excel, CSV, Word, PDF and HTML.
Stars: ✭ 1,198 (+737.76%)
Pdf: Simple HTTP microservice that converts Word documents to PDF
Stars: ✭ 107 (-25.17%)
Random Word: A simple Python package to generate random English words
Stars: ✭ 75 (-47.55%)
Role2vec: A scalable Gensim implementation of "Learning Role-based Graph Embeddings" (IJCAI 2018).
Stars: ✭ 134 (-6.29%)
Asne: A sparsity-aware and memory-efficient implementation of "Attributed Social Network Embedding" (TKDE 2018).
Stars: ✭ 73 (-48.95%)
Magnitude: A fast, efficient universal vector embedding utility package.
Stars: ✭ 1,394 (+874.83%)
Docx: Easily generate .docx files with JS/TS with a nice declarative API. Works for Node and in the browser.
Stars: ✭ 2,150 (+1403.5%)
Dutchembeddings: Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", presented at LREC 2016.
Stars: ✭ 71 (-50.35%)
Repo 2016: R, Python and Mathematica code for Machine Learning, Deep Learning, Artificial Intelligence, NLP and Geolocation
Stars: ✭ 103 (-27.97%)
Vectorsinsearch: Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Searching with Vectors' talk from Haystack 2019 (US). Builds upon my conceptual search and semantic search work from 2015.
Stars: ✭ 71 (-50.35%)
Ttpassgen: Password generation. A flexible and scriptable password dictionary generator that supports brute-force, combination, and complex rule modes, etc.
Stars: ✭ 68 (-52.45%)
Sharpdocx: C#-based template engine for generating Word documents
Stars: ✭ 100 (-30.07%)
Nlp research: A TensorFlow-based NLP deep learning project that supports four tasks: text classification, sentence matching, sequence labeling, and text generation
Stars: ✭ 141 (-1.4%)
Ml: sourced.ml is a library and a set of command-line tools to build and apply machine learning models on top of Universal Abstract Syntax Trees
Stars: ✭ 136 (-4.9%)