All Projects → Ja.text8 → Similar Projects or Alternatives

931 Open source projects that are alternatives of or similar to Ja.text8

Russian mass media stemmed texts corpus / Корпус лемматизированных (морфологически нормализованных) текстов российских СМИ

Stars: ✭ 76 (-3.8%)

Mutual labels: corpus, word2vec

Repo 2017

Python codes in Machine Learning, NLP, Deep Learning and Reinforcement Learning with Keras and Theano

Stars: ✭ 1,123 (+1321.52%)

Mutual labels: natural-language-processing, word2vec

Awesome Embedding Models

A curated list of awesome embedding models tutorials, projects and communities.

Stars: ✭ 1,486 (+1781.01%)

Mutual labels: natural-language-processing, word2vec

Repo 2016

R, Python and Mathematica Codes in Machine Learning, Deep Learning, Artificial Intelligence, NLP and Geolocation

Stars: ✭ 103 (+30.38%)

Mutual labels: natural-language-processing, word2vec

Natural Language Processing

Programming Assignments and Lectures for Stanford's CS 224: Natural Language Processing with Deep Learning

Stars: ✭ 377 (+377.22%)

Mutual labels: natural-language-processing, word2vec

Prosody

Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text

Stars: ✭ 139 (+75.95%)

Mutual labels: corpus, natural-language-processing

Nlp In Practice

Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.

Stars: ✭ 790 (+900%)

Mutual labels: natural-language-processing, word2vec

Weixin public corpus

微信公众号语料库

Stars: ✭ 465 (+488.61%)

Mutual labels: corpus, natural-language-processing

Nlp bahasa resources

A Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia

Stars: ✭ 158 (+100%)

Mutual labels: corpus, natural-language-processing

Fakenewscorpus

A dataset of millions of news articles scraped from a curated list of data sources.

Stars: ✭ 255 (+222.78%)

Mutual labels: corpus, natural-language-processing

Quanteda

An R package for the Quantitative Analysis of Textual Data

Stars: ✭ 647 (+718.99%)

Mutual labels: corpus, natural-language-processing

Insuranceqa Corpus Zh

🚁 保险行业语料库，聊天机器人

Stars: ✭ 821 (+939.24%)

Mutual labels: corpus, natural-language-processing

Nlvr

Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with the truth-value of the sentence.

Stars: ✭ 192 (+143.04%)

Mutual labels: corpus, natural-language-processing

Germanwordembeddings

Toolkit to obtain and preprocess german corpora, train models using word2vec (gensim) and evaluate them with generated testsets

Stars: ✭ 189 (+139.24%)

Mutual labels: natural-language-processing, word2vec

Deep Math Machine Learning.ai

A blog which talks about machine learning, deep learning algorithms and the Math. and Machine learning algorithms written from scratch.

Stars: ✭ 173 (+118.99%)

Mutual labels: natural-language-processing, word2vec

Kor2vec

Library for Korean morpheme and word vector representation

Stars: ✭ 64 (-18.99%)

Mutual labels: natural-language-processing, word2vec

Cs224n

CS224n: Natural Language Processing with Deep Learning Assignments Winter, 2017

Stars: ✭ 656 (+730.38%)

Mutual labels: natural-language-processing, word2vec

Sense2vec

🦆 Contextually-keyed word vectors

Stars: ✭ 1,184 (+1398.73%)

Mutual labels: natural-language-processing, word2vec

text-classification-cn

中文文本分类实践，基于搜狗新闻语料库，采用传统机器学习方法以及预训练模型等方法

Stars: ✭ 81 (+2.53%)

Mutual labels: word2vec, corpus

Ua Gec

UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language

Stars: ✭ 108 (+36.71%)

Mutual labels: corpus, natural-language-processing

Awesome Hungarian Nlp

A curated list of NLP resources for Hungarian

Stars: ✭ 121 (+53.16%)

Mutual labels: corpus, natural-language-processing

wordfish-python

extract relationships from standardized terms from corpus of interest with deep learning 🐟

Stars: ✭ 19 (-75.95%)

Mutual labels: word2vec, corpus

Efaqa Corpus Zh

❤️Emotional First Aid Dataset, 心理咨询问答、聊天机器人语料库

Stars: ✭ 170 (+115.19%)

Mutual labels: corpus, natural-language-processing

Text2vec

Fast vectorization, topic modeling, distances and GloVe word embeddings in R.

Stars: ✭ 715 (+805.06%)

Mutual labels: natural-language-processing, word2vec

Nlp chinese corpus

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

Stars: ✭ 6,656 (+8325.32%)

Mutual labels: corpus, word2vec

Languagecrunch

LanguageCrunch NLP server docker image

Stars: ✭ 281 (+255.7%)

Mutual labels: natural-language-processing, word2vec

Pujangga

Pujangga - Indonesian Natural Language Processing Tool with REST API, an Interface for InaNLP and Deeplearning4j's Word2Vec

Stars: ✭ 47 (-40.51%)

Mutual labels: natural-language-processing, word2vec

Gensim

Topic Modelling for Humans

Stars: ✭ 12,763 (+16055.7%)

Mutual labels: natural-language-processing, word2vec

Magnitude

A fast, efficient universal vector embedding utility package.

Stars: ✭ 1,394 (+1664.56%)

Mutual labels: natural-language-processing, word2vec

Scattertext

Beautiful visualizations of how language differs among document types.

Stars: ✭ 1,722 (+2079.75%)

Mutual labels: natural-language-processing, word2vec

Practical 1

Oxford Deep NLP 2017 course - Practical 1: word2vec

Stars: ✭ 220 (+178.48%)

Mutual labels: natural-language-processing, word2vec

Scattertext Pydata

Notebooks for the Seattle PyData 2017 talk on Scattertext

Stars: ✭ 132 (+67.09%)

Mutual labels: natural-language-processing, word2vec

Awesome Persian Nlp Ir

Curated List of Persian Natural Language Processing and Information Retrieval Tools and Resources

Stars: ✭ 460 (+482.28%)

Mutual labels: corpus, natural-language-processing

Typing Assistant

Typing Assistant provides the ability to autocomplete words and suggests predictions for the next word. This makes typing faster, more intelligent and reduces effort.

Stars: ✭ 32 (-59.49%)

Mutual labels: corpus, natural-language-processing

Coarij

Corpus of Annual Reports in Japan

Stars: ✭ 55 (-30.38%)

Mutual labels: corpus, natural-language-processing

Get started with deep learning for text with allennlp

Getting started with AllenNLP and PyTorch by training a tweet classifier

Stars: ✭ 69 (-12.66%)

Mutual labels: natural-language-processing

Stminsights

A Shiny Application for Inspecting Structural Topic Models

Stars: ✭ 74 (-6.33%)

Mutual labels: natural-language-processing

Ai Writer data2doc

PyTorch Implementation of NBA game summary generator.

Stars: ✭ 69 (-12.66%)

Mutual labels: natural-language-processing

Touchdown

Cornell Touchdown natural language navigation and spatial reasoning dataset.

Stars: ✭ 69 (-12.66%)

Mutual labels: natural-language-processing

Monkeylearn Ruby

Official Ruby client for the MonkeyLearn API. Build and consume machine learning models for language processing from your Ruby apps.

Stars: ✭ 76 (-3.8%)

Mutual labels: natural-language-processing

Course Computational Literary Analysis

Course materials for Introduction to Computational Literary Analysis, taught at UC Berkeley in Summer 2018, 2019, and 2020, and at Columbia University in Fall 2020.

Stars: ✭ 74 (-6.33%)

Mutual labels: natural-language-processing

Hackerrank

This is the Repository where you can find all the solution of the Problems which you solve on competitive platforms mainly HackerRank and HackerEarth

Stars: ✭ 68 (-13.92%)

Mutual labels: natural-language-processing

Intent classifier

Stars: ✭ 67 (-15.19%)

Mutual labels: natural-language-processing

Nlp Tutorial

A list of NLP(Natural Language Processing) tutorials

Stars: ✭ 1,188 (+1403.8%)

Mutual labels: natural-language-processing

Capsnet Nlp

CapsNet for NLP

Stars: ✭ 66 (-16.46%)

Mutual labels: natural-language-processing

Chinese Xlnet

Pre-Trained Chinese XLNet（中文XLNet预训练模型）

Stars: ✭ 1,213 (+1435.44%)

Mutual labels: natural-language-processing

Text Analytics With Python

Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.

Stars: ✭ 1,132 (+1332.91%)

Mutual labels: natural-language-processing

Senta

Baidu's open-source Sentiment Analysis System.

Stars: ✭ 1,187 (+1402.53%)

Mutual labels: natural-language-processing

Convai Bot 1337

NIPS Conversational Intelligence Challenge 2017 Winner System: Skill-based Conversational Agent with Supervised Dialog Manager

Stars: ✭ 65 (-17.72%)

Mutual labels: natural-language-processing

Python nlp tutorial

This repository provides everything to get started with Python for Text Mining / Natural Language Processing (NLP)

Stars: ✭ 72 (-8.86%)

Mutual labels: natural-language-processing

Chicksexer

A Python package for gender classification.

Stars: ✭ 64 (-18.99%)

Mutual labels: natural-language-processing

Multilingual Latent Dirichlet Allocation Lda

A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.

Stars: ✭ 64 (-18.99%)

Mutual labels: natural-language-processing

Nested Ner Tacl2020 Transformers

Implementation of Nested Named Entity Recognition using BERT

Stars: ✭ 76 (-3.8%)

Mutual labels: natural-language-processing

Asne

A sparsity aware and memory efficient implementation of "Attributed Social Network Embedding" (TKDE 2018).

Stars: ✭ 73 (-7.59%)

Mutual labels: word2vec

Gpt2

PyTorch Implementation of OpenAI GPT-2

Stars: ✭ 64 (-18.99%)

Mutual labels: natural-language-processing

Deeplearning Nlp Models

A small, interpretable codebase containing the re-implementation of a few "deep" NLP models in PyTorch. Colab notebooks to run with GPUs. Models: word2vec, CNNs, transformer, gpt.

Stars: ✭ 64 (-18.99%)

Mutual labels: word2vec

Languagetoys

Random fun with statistical language models.

Stars: ✭ 63 (-20.25%)

Mutual labels: natural-language-processing

Practical 3

Oxford Deep NLP 2017 course - Practical 3: Text Classification with RNNs

Stars: ✭ 78 (-1.27%)

Mutual labels: natural-language-processing

Multimodal Toolkit

Multimodal model for text and tabular data with HuggingFace transformers as building block for text data