All Projects → Bpemb → Similar Projects or Alternatives

879 Open source projects that are alternatives of or similar to Bpemb

Wikipedia2vec
A tool for learning vector representations of words and entities from Wikipedia
Stars: ✭ 655 (-27.94%)
Awesome Embedding Models
A curated list of awesome embedding models tutorials, projects and communities.
Stars: ✭ 1,486 (+63.48%)
Parallax
Tool for interactive embeddings visualization
Stars: ✭ 192 (-78.88%)
Chars2vec
Character-based word embeddings model based on RNN for handling real world texts
Stars: ✭ 130 (-85.7%)
Magnitude
A fast, efficient universal vector embedding utility package.
Stars: ✭ 1,394 (+53.36%)
Catalyst
🚀 Catalyst is a C# Natural Language Processing library built for speed. Inspired by spaCy's design, it brings pre-trained models, out-of-the box support for training word and document embeddings, and flexible entity recognition models.
Stars: ✭ 224 (-75.36%)
Pytorch Nlp
Basic Utilities for PyTorch Natural Language Processing (NLP)
Stars: ✭ 1,996 (+119.58%)
Multilingual Latent Dirichlet Allocation Lda
A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.
Stars: ✭ 64 (-92.96%)
Vec4ir
Word Embeddings for Information Retrieval
Stars: ✭ 188 (-79.32%)
Laserembeddings
LASER multilingual sentence embeddings as a pip package
Stars: ✭ 125 (-86.25%)
Mutual labels:  embeddings, multilingual
Ner Lstm
Named Entity Recognition using multilayered bidirectional LSTM
Stars: ✭ 532 (-41.47%)
Trankit
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
Stars: ✭ 311 (-65.79%)
Awesome Persian Nlp Ir
Curated List of Persian Natural Language Processing and Information Retrieval Tools and Resources
Stars: ✭ 460 (-49.39%)
Speedtorch
Library for faster pinned CPU <-> GPU transfer in Pytorch
Stars: ✭ 615 (-32.34%)
Underthesea
Underthesea - Vietnamese NLP Toolkit
Stars: ✭ 823 (-9.46%)
Ciff
Cornell Instruction Following Framework
Stars: ✭ 23 (-97.47%)
Insuranceqa Corpus Zh
🚁 保险行业语料库,聊天机器人
Stars: ✭ 821 (-9.68%)
Spacy Models
💫 Models for the spaCy Natural Language Processing (NLP) library
Stars: ✭ 796 (-12.43%)
Language
Shared repository for open-sourced projects from the Google AI Language team.
Stars: ✭ 860 (-5.39%)
Covid 19 Bert Researchpapers Semantic Search
BERT semantic search engine for searching literature research papers for coronavirus covid-19 in google colab
Stars: ✭ 23 (-97.47%)
Nlp In Practice
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Stars: ✭ 790 (-13.09%)
Awesome 2vec
Curated list of 2vec-type embedding models
Stars: ✭ 784 (-13.75%)
Mutual labels:  embeddings
Eda nlp
Data augmentation for NLP, presented at EMNLP 2019
Stars: ✭ 902 (-0.77%)
Mutual labels:  embeddings
Jcseg
Jcseg is a light weight NLP framework developed with Java. Provide CJK and English segmentation based on MMSEG algorithm, With also keywords extraction, key sentence extraction, summary extraction implemented based on TEXTRANK algorithm. Jcseg had a build-in http server and search modules for the latest lucene,solr,elasticsearch
Stars: ✭ 754 (-17.05%)
Ecco
Visualize and explore NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2).
Stars: ✭ 723 (-20.46%)
Deep Mihash
Code for papers "Hashing with Mutual Information" (TPAMI 2019) and "Hashing with Binary Matrix Pursuit" (ECCV 2018)
Stars: ✭ 13 (-98.57%)
Mutual labels:  embeddings
Spago
Self-contained Machine Learning and Natural Language Processing library in Go
Stars: ✭ 854 (-6.05%)
Kts linguistics
Spellcheck, phonetics, text processing and more
Stars: ✭ 18 (-98.02%)
Machine learning examples
A collection of machine learning examples and tutorials.
Stars: ✭ 6,466 (+611.33%)
Ciphey
⚡ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes ⚡
Stars: ✭ 9,116 (+902.86%)
Orange3 Imageanalytics
🍊 🎑 Orange3 add-on for dealing with image related tasks
Stars: ✭ 24 (-97.36%)
Mutual labels:  embeddings
Nlg Eval
Evaluation code for various unsupervised automated metrics for Natural Language Generation.
Stars: ✭ 822 (-9.57%)
Node Api.ai
[DEPRECATED] Ultimate Node.JS SDK for api.ai
Stars: ✭ 12 (-98.68%)
Pororo
PORORO: Platform Of neuRal mOdels for natuRal language prOcessing
Stars: ✭ 812 (-10.67%)
Spacy Transformers
🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
Stars: ✭ 919 (+1.1%)
Torchmoji
😇A pyTorch implementation of the DeepMoji model: state-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc
Stars: ✭ 795 (-12.54%)
Nlp tutorials
Overview of NLP tools and techniques in python
Stars: ✭ 14 (-98.46%)
Natasha
Solves basic Russian NLP tasks, API for lower level Natasha projects
Stars: ✭ 788 (-13.31%)
Mutual labels:  embeddings
Nlp With Ruby
Curated List: Practical Natural Language Processing done in Ruby
Stars: ✭ 907 (-0.22%)
Coursera
Quiz & Assignment of Coursera
Stars: ✭ 774 (-14.85%)
Pke
Python Keyphrase Extraction module
Stars: ✭ 855 (-5.94%)
Youtokentome
Unsupervised text tokenizer focused on computational efficiency
Stars: ✭ 728 (-19.91%)
Crosslingual Nlu
Zero-shot Cross-lingual Task-Oriented Dialogue Systems (EMNLP 2019)
Stars: ✭ 20 (-97.8%)
Mutual labels:  multilingual
Text2vec
Fast vectorization, topic modeling, distances and GloVe word embeddings in R.
Stars: ✭ 715 (-21.34%)
Sentence Similarity Based On Semantic Nets And Corpus Statistics
This is an implementation of the paper written by Yuhua Li, David McLean, Zuhair A. Bandar, James D. O’Shea, and Keeley Crockett
Stars: ✭ 20 (-97.8%)
Machine Learning
머신러닝 입문자 혹은 스터디를 준비하시는 분들에게 도움이 되고자 만든 repository입니다. (This repository is intented for helping whom are interested in machine learning study)
Stars: ✭ 705 (-22.44%)
Biolitmap
Code for the paper "BIOLITMAP: a web-based geolocated and temporal visualization of the evolution of bioinformatics publications" in Oxford Bioinformatics.
Stars: ✭ 18 (-98.02%)
Ai Series
📚 [.md & .ipynb] Series of Artificial Intelligence & Deep Learning, including Mathematics Fundamentals, Python Practices, NLP Application, etc. 💫 人工智能与深度学习实战,数理统计篇 | 机器学习篇 | 深度学习篇 | 自然语言处理篇 | 工具实践 Scikit & Tensoflow & PyTorch 篇 | 行业应用 & 课程笔记
Stars: ✭ 702 (-22.77%)
Keras Attention
Visualizing RNNs using the attention mechanism
Stars: ✭ 697 (-23.32%)
Knowledge Graphs
A collection of research on knowledge graphs
Stars: ✭ 845 (-7.04%)
Riceteacatpanda
repo with challenge material for riceteacatpanda (2020)
Stars: ✭ 18 (-98.02%)
Bert
TensorFlow code and pre-trained models for BERT
Stars: ✭ 29,971 (+3197.14%)
Madewithml
Learn how to responsibly deliver value with ML.
Stars: ✭ 29,253 (+3118.15%)
Neuralparser
NeuralParser is a very simple to use dependency parser, based on the Latent Syntactic Structure encoding.
Stars: ✭ 17 (-98.13%)
Bertsearch
Elasticsearch with BERT for advanced document search.
Stars: ✭ 684 (-24.75%)
Ai Job Recommend
国内公司人工智能方向(含机器学习、深度学习、计算机视觉和自然语言处理)岗位的招聘信息(含全职、实习和校招)
Stars: ✭ 679 (-25.3%)
Twitter Bot
👻 Markov chain-based Japanese twitter bot
Stars: ✭ 12 (-98.68%)
Syntree2vec
An algorithm to augment syntactic hierarchy into word embeddings
Stars: ✭ 9 (-99.01%)
Entity Recognition Datasets
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
Stars: ✭ 891 (-1.98%)
Nltk data
NLTK Data
Stars: ✭ 675 (-25.74%)
1-60 of 879 similar projects