All Projects → wordfish-python → Similar Projects or Alternatives

510 Open source projects that are alternatives of or similar to wordfish-python

Documents, papers and codes related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classificatin, Text Generation, Text Similarity, Machine Translation)，etc. All codes are implemented intensorflow 2.0.

Stars: ✭ 1,290 (+6689.47%)

Mutual labels: word2vec, gensim, lda

Musae

The reference implementation of "Multi-scale Attributed Node Embedding".

Stars: ✭ 75 (+294.74%)

Mutual labels: word2vec, gensim

NLP-paper

🎨 🎨NLP 自然语言处理教程 🎨🎨 https://dataxujing.github.io/NLP-paper/

Stars: ✭ 23 (+21.05%)

Mutual labels: word2vec, lda

pydataberlin-2017

Repo for my talk at the PyData Berlin 2017 conference

Stars: ✭ 63 (+231.58%)

Mutual labels: gensim, lda

Magnitude

A fast, efficient universal vector embedding utility package.

Stars: ✭ 1,394 (+7236.84%)

Mutual labels: word2vec, gensim

Gemsec

The TensorFlow reference implementation of 'GEMSEC: Graph Embedding with Self Clustering' (ASONAM 2019).

Stars: ✭ 210 (+1005.26%)

Mutual labels: word2vec, gensim

Nlp In Practice

Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.

Stars: ✭ 790 (+4057.89%)

Mutual labels: word2vec, gensim

Twitter sentiment analysis word2vec convnet

Twitter Sentiment Analysis with Gensim Word2Vec and Keras Convolutional Network

Stars: ✭ 24 (+26.32%)

Mutual labels: word2vec, gensim

Shallowlearn

An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.

Stars: ✭ 196 (+931.58%)

Mutual labels: word2vec, gensim

Webvectors

Web-ify your word2vec: framework to serve distributional semantic models online

Stars: ✭ 154 (+710.53%)

Mutual labels: word2vec, gensim

Aravec

AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding models.

Stars: ✭ 239 (+1157.89%)

Mutual labels: word2vec, gensim

Text-Analysis

Explaining textual analysis tools in Python. Including Preprocessing, Skip Gram (word2vec), and Topic Modelling.

Stars: ✭ 48 (+152.63%)

Mutual labels: word2vec, lda

doc2vec-api

document embedding and machine learning script for beginners

Stars: ✭ 92 (+384.21%)

Mutual labels: word2vec, gensim

Log Anomaly Detector

Log Anomaly Detection - Machine learning to detect abnormal events logs

Stars: ✭ 169 (+789.47%)

Mutual labels: word2vec, gensim

Wordembeddings Elmo Fasttext Word2vec

Using pre trained word embeddings (Fasttext, Word2Vec)

Stars: ✭ 146 (+668.42%)

Mutual labels: word2vec, gensim

NMFADMM

A sparsity aware implementation of "Alternating Direction Method of Multipliers for Non-Negative Matrix Factorization with the Beta-Divergence" (ICASSP 2014).

Stars: ✭ 39 (+105.26%)

Mutual labels: word2vec, lda

Word2vec Tutorial

中文詞向量訓練教學

Stars: ✭ 426 (+2142.11%)

Mutual labels: word2vec, gensim

Sense2vec

🦆 Contextually-keyed word vectors

Stars: ✭ 1,184 (+6131.58%)

Mutual labels: word2vec, gensim

Word2VecAndTsne

Scripts demo-ing how to train a Word2Vec model and reduce its vector space

Stars: ✭ 45 (+136.84%)

Mutual labels: word2vec, gensim

Role2vec

A scalable Gensim implementation of "Learning Role-based Graph Embeddings" (IJCAI 2018).

Stars: ✭ 134 (+605.26%)

Mutual labels: word2vec, gensim

Turkish Word2vec

Pre-trained Word2Vec Model for Turkish

Stars: ✭ 136 (+615.79%)

Mutual labels: word2vec, gensim

Germanwordembeddings

Toolkit to obtain and preprocess german corpora, train models using word2vec (gensim) and evaluate them with generated testsets

Stars: ✭ 189 (+894.74%)

Mutual labels: word2vec, gensim

Gensim

Topic Modelling for Humans

Stars: ✭ 12,763 (+67073.68%)

Mutual labels: word2vec, gensim

biovec

ProtVec can be used in protein interaction predictions, structure prediction, and protein data visualization.

Stars: ✭ 23 (+21.05%)

Mutual labels: word2vec, gensim

text-classification-cn

中文文本分类实践，基于搜狗新闻语料库，采用传统机器学习方法以及预训练模型等方法

Stars: ✭ 81 (+326.32%)

Mutual labels: word2vec, corpus

RolX

An alternative implementation of Recursive Feature and Role Extraction (KDD11 & KDD12)

Stars: ✭ 52 (+173.68%)

Mutual labels: word2vec, gensim

Tadw

An implementation of "Network Representation Learning with Rich Text Information" (IJCAI '15).

Stars: ✭ 43 (+126.32%)

Mutual labels: word2vec, gensim

Word2vec

訓練中文詞向量 Word2vec, Word2vec was created by a team of researchers led by Tomas Mikolov at Google.

Stars: ✭ 48 (+152.63%)

Mutual labels: word2vec, gensim

Ml Projects

ML based projects such as Spam Classification, Time Series Analysis, Text Classification using Random Forest, Deep Learning, Bayesian, Xgboost in Python

Stars: ✭ 127 (+568.42%)

Mutual labels: word2vec, gensim

walklets

A lightweight implementation of Walklets from "Don't Walk Skip! Online Learning of Multi-scale Network Embeddings" (ASONAM 2017).

Stars: ✭ 94 (+394.74%)

Mutual labels: word2vec, gensim

Splitter

A Pytorch implementation of "Splitter: Learning Node Representations that Capture Multiple Social Contexts" (WWW 2019).

Stars: ✭ 177 (+831.58%)

Mutual labels: word2vec, gensim

word2vec-pt-br

Implementação e modelo gerado com o treinamento (trigram) da wikipedia em pt-br

Stars: ✭ 34 (+78.95%)

Mutual labels: word2vec, gensim

Ja.text8

Japanese text8 corpus for word embedding.

Stars: ✭ 79 (+315.79%)

Mutual labels: word2vec, corpus

Nlp chinese corpus

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

Stars: ✭ 6,656 (+34931.58%)

Mutual labels: word2vec, corpus

Lmdb Embeddings

Fast word vectors with little memory usage in Python

Stars: ✭ 404 (+2026.32%)

Mutual labels: word2vec, gensim

Russian news corpus

Russian mass media stemmed texts corpus / Корпус лемматизированных (морфологически нормализованных) текстов российских СМИ

Stars: ✭ 76 (+300%)

Mutual labels: word2vec, corpus

word-embeddings-from-scratch

Creating word embeddings from scratch and visualize them on TensorBoard. Using trained embeddings in Keras.

Stars: ✭ 22 (+15.79%)

Mutual labels: word2vec, gensim

lda2vec

Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019

Stars: ✭ 27 (+42.11%)

Mutual labels: word2vec, lda

Product-Categorization-NLP

Multi-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, Distilbert).

Stars: ✭ 30 (+57.89%)

Mutual labels: word2vec, gensim

spark-word2vec

A parallel implementation of word2vec based on Spark

Stars: ✭ 24 (+26.32%)

Mutual labels: word2vec

tokenizr

String Tokenization Library for JavaScript

Stars: ✭ 70 (+268.42%)

Mutual labels: token

brauzie

Awesome CLI for fetching JWT tokens for OAuth2.0 clients

Stars: ✭ 14 (-26.32%)

Mutual labels: token

auth-flow-react-apollo-saga

Full stack login/register flow with React, Apollo, Redux, Redux-saga and MongoDB.

Stars: ✭ 22 (+15.79%)

Mutual labels: token

NEMPay

Adaptable Android & iOS Mosaic Wallet for NEM Blockchain

Stars: ✭ 36 (+89.47%)

Mutual labels: token

EL1T3

🖤 Ƭ𝘩𝘦 𝘮𝘰𝘴𝘵 𝘱𝘰𝘸𝘦𝘳𝘧𝘶𝘭𝘭 𝘢𝘯𝘥 𝘉𝘦𝘵𝘵𝘦𝘳 𝘵𝘰𝘬𝘦𝘯 𝘴𝘵𝘦𝘢𝘭𝘦𝘳.

Stars: ✭ 41 (+115.79%)

Mutual labels: token

sent2vec

How to encode sentences in a high-dimensional vector space, a.k.a., sentence embedding.

Stars: ✭ 99 (+421.05%)

Mutual labels: word2vec

IoT-Technical-Guide

🐝 IoT Technical Guide --- 从零搭建高性能物联网平台及物联网解决方案和Thingsboard源码分析 ✨ ✨ ✨ (IoT Platform, SaaS, MQTT, CoAP, HTTP, Modbus, OPC, WebSocket, 物模型，Protobuf, PostgreSQL, MongoDB, Spring Security, OAuth2, RuleEngine, Kafka, Docker)

Stars: ✭ 2,565 (+13400%)

Mutual labels: token

DeepSentiPers

Repository for the experiments described in the paper named "DeepSentiPers: Novel Deep Learning Models Trained Over Proposed Augmented Persian Sentiment Corpus"

Stars: ✭ 17 (-10.53%)

Mutual labels: corpus

OpenDialog

An Open-Source Package for Chinese Open-domain Conversational Chatbot (中文闲聊对话系统，一键部署微信闲聊机器人)

Stars: ✭ 94 (+394.74%)

Mutual labels: corpus

nodejs-wechat

基于nodejs开发微信公众号

Stars: ✭ 13 (-31.58%)

Mutual labels: token

node-uid-generator

Generates cryptographically strong pseudo-random UIDs with custom size and base-encoding