All Projects → Nlp_bahasa_resources → Similar Projects or Alternatives

2759 Open source projects that are alternatives of or similar to Nlp_bahasa_resources

Awesome Hungarian Nlp
A curated list of NLP resources for Hungarian
Stars: ✭ 121 (-23.42%)
Pytreebank
😡😇 Stanford Sentiment Treebank loader in Python
Stars: ✭ 93 (-41.14%)
Ua Gec
UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language
Stars: ✭ 108 (-31.65%)
Insuranceqa Corpus Zh
🚁 保险行业语料库,聊天机器人
Stars: ✭ 821 (+419.62%)
Ml Classify Text Js
Machine learning based text classification in JavaScript using n-grams and cosine similarity
Stars: ✭ 38 (-75.95%)
Fakenewscorpus
A dataset of millions of news articles scraped from a curated list of data sources.
Stars: ✭ 255 (+61.39%)
Coarij
Corpus of Annual Reports in Japan
Stars: ✭ 55 (-65.19%)
Prosody
Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
Stars: ✭ 139 (-12.03%)
Indonesian Nlp Resources
data resource untuk NLP bahasa indonesia
Stars: ✭ 143 (-9.49%)
Mutual labels:  dataset, corpus, sentiment-analysis
Cluepretrainedmodels
高质量中文预训练模型集合:最先进大模型、最快小模型、相似度专门模型
Stars: ✭ 493 (+212.03%)
Mutual labels:  dataset, corpus
Nlp chinese corpus
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
Stars: ✭ 6,656 (+4112.66%)
Mutual labels:  dataset, corpus
Nlp With Ruby
Curated List: Practical Natural Language Processing done in Ruby
Stars: ✭ 907 (+474.05%)
Awesome Persian Nlp Ir
Curated List of Persian Natural Language Processing and Information Retrieval Tools and Resources
Stars: ✭ 460 (+191.14%)
Weixin public corpus
微信公众号语料库
Stars: ✭ 465 (+194.3%)
Quanteda
An R package for the Quantitative Analysis of Textual Data
Stars: ✭ 647 (+309.49%)
Awesome Twitter Data
A list of Twitter datasets and related resources.
Stars: ✭ 533 (+237.34%)
Mutual labels:  dataset, sentiment-analysis
French Sentiment Analysis Dataset
A collection of over 1.5 Million tweets data translated to French, with their sentiment.
Stars: ✭ 35 (-77.85%)
Mutual labels:  dataset, sentiment-analysis
Stocksight
Stock market analyzer and predictor using Elasticsearch, Twitter, News headlines and Python natural language processing and sentiment analysis
Stars: ✭ 1,037 (+556.33%)
Pattern
Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
Stars: ✭ 8,112 (+5034.18%)
Char Rnn Tensorflow
Multi-layer Recurrent Neural Networks for character-level language models implements by TensorFlow
Stars: ✭ 58 (-63.29%)
Company Names Corpus
公司名语料库。机构名语料库。公司简称,缩写,品牌词,企业名。可用于中文分词、机构名实体识别。
Stars: ✭ 868 (+449.37%)
Mutual labels:  dataset, corpus
Lingua Franca
Mycroft's multilingual text parsing and formatting library
Stars: ✭ 51 (-67.72%)
Textblob Ar
Arabic support for textblob
Stars: ✭ 60 (-62.03%)
Text Analytics With Python
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Stars: ✭ 1,132 (+616.46%)
Dataset List
lists of text corpus and more (mainly Japanese)
Stars: ✭ 84 (-46.84%)
Mutual labels:  dataset, corpus
Turkish Bert Nlp Pipeline
Bert-base NLP pipeline for Turkish, Ner, Sentiment Analysis, Question Answering etc.
Stars: ✭ 85 (-46.2%)
Pytorch Nlp
Basic Utilities for PyTorch Natural Language Processing (NLP)
Stars: ✭ 1,996 (+1163.29%)
Nlp.js
An NLP library for building bots, with entity extraction, sentiment analysis, automatic language identify, and so more
Stars: ✭ 4,670 (+2855.7%)
Pynlpl
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
Stars: ✭ 426 (+169.62%)
Doccano
Open source annotation tool for machine learning practitioners.
Stars: ✭ 5,600 (+3444.3%)
Text mining resources
Resources for learning about Text Mining and Natural Language Processing
Stars: ✭ 358 (+126.58%)
Conv Emotion
This repo contains implementation of different architectures for emotion recognition in conversations.
Stars: ✭ 646 (+308.86%)
Hate Speech And Offensive Language
Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017
Stars: ✭ 543 (+243.67%)
Aspect Based Sentiment Analysis
A paper list for aspect based sentiment analysis.
Stars: ✭ 311 (+96.84%)
Wikisql
A large annotated semantic parsing corpus for developing natural language interfaces.
Stars: ✭ 965 (+510.76%)
Nlp Papers
Papers and Book to look at when starting NLP 📚
Stars: ✭ 111 (-29.75%)
Mtnt
Code for the collection and analysis of the MTNT dataset
Stars: ✭ 48 (-69.62%)
Typing Assistant
Typing Assistant provides the ability to autocomplete words and suggests predictions for the next word. This makes typing faster, more intelligent and reduces effort.
Stars: ✭ 32 (-79.75%)
Bond
BOND: BERT-Assisted Open-Domain Name Entity Recognition with Distant Supervision
Stars: ✭ 96 (-39.24%)
Colibri Core
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` whi ch allows you to build, view, manipulate and query pattern models.
Stars: ✭ 112 (-29.11%)
Mutual labels:  corpus, library
Keita
My personal toolkit for PyTorch development.
Stars: ✭ 124 (-21.52%)
Text Classification Keras
📚 Text classification library with Keras
Stars: ✭ 53 (-66.46%)
Mutual labels:  library, sentiment-analysis
Repo 2017
Python codes in Machine Learning, NLP, Deep Learning and Reinforcement Learning with Keras and Theano
Stars: ✭ 1,123 (+610.76%)
Linusrants
Dataset of Linus Torvalds' rants classified by negativity using sentiment analysis
Stars: ✭ 291 (+84.18%)
Mutual labels:  dataset, sentiment-analysis
Dem.net
Digital Elevation model library in C#. 3D terrain models, line/point Elevations, intervisibility reports
Stars: ✭ 153 (-3.16%)
Mutual labels:  dataset, library
Ja.text8
Japanese text8 corpus for word embedding.
Stars: ✭ 79 (-50%)
Dialogue Understanding
This repository contains PyTorch implementation for the baseline models from the paper Utterance-level Dialogue Understanding: An Empirical Study
Stars: ✭ 77 (-51.27%)
Pynlp
A pythonic wrapper for Stanford CoreNLP.
Stars: ✭ 103 (-34.81%)
Gossiping Chinese Corpus
PTT 八卦版問答中文語料
Stars: ✭ 137 (-13.29%)
Mutual labels:  dataset, corpus
Senta
Baidu's open-source Sentiment Analysis System.
Stars: ✭ 1,187 (+651.27%)
Mams For Absa
A Multi-Aspect Multi-Sentiment Dataset for aspect-based sentiment analysis.
Stars: ✭ 135 (-14.56%)
Awesome Ai Services
An overview of the AI-as-a-service landscape
Stars: ✭ 133 (-15.82%)
Cluedatasetsearch
搜索所有中文NLP数据集,附常用英文NLP数据集
Stars: ✭ 2,112 (+1236.71%)
Mutual labels:  corpus, sentiment-analysis
Clue
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Stars: ✭ 2,425 (+1434.81%)
Mutual labels:  dataset, corpus
Absapapers
Worth-reading papers and related awesome resources on aspect-based sentiment analysis (ABSA). 值得一读的方面级情感分析论文与相关资源集合
Stars: ✭ 142 (-10.13%)
Oie Resources
A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Stars: ✭ 283 (+79.11%)
Text2sql Data
A collection of datasets that pair questions with SQL queries.
Stars: ✭ 287 (+81.65%)
Absa Pytorch
Aspect Based Sentiment Analysis, PyTorch Implementations. 基于方面的情感分析,使用PyTorch实现。
Stars: ✭ 1,181 (+647.47%)
Dialog corpus
用于训练中英文对话系统的语料库 Datasets for Training Chatbot System
Stars: ✭ 1,662 (+951.9%)
Mutual labels:  dataset, corpus
Googlelanguager
R client for the Google Translation API, Google Cloud Natural Language API and Google Cloud Speech API
Stars: ✭ 145 (-8.23%)
1-60 of 2759 similar projects