Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → quanteda → Quanteda

quanteda / Quanteda

Licence: gpl-3.0

An R package for the Quantitative Analysis of Textual Data

Programming Languages

7636 projects

Labels

natural-language-processing corpus

Projects that are alternatives of or similar to Quanteda

Typing Assistant

Typing Assistant provides the ability to autocomplete words and suggests predictions for the next word. This makes typing faster, more intelligent and reduces effort.

Stars: ✭ 32 (-95.05%)

Mutual labels: corpus, natural-language-processing

Awesome Hungarian Nlp

A curated list of NLP resources for Hungarian

Stars: ✭ 121 (-81.3%)

Mutual labels: corpus, natural-language-processing

Corpus of Annual Reports in Japan

Stars: ✭ 55 (-91.5%)

Mutual labels: corpus, natural-language-processing

Insuranceqa Corpus Zh

🚁 保险行业语料库，聊天机器人

Stars: ✭ 821 (+26.89%)

Mutual labels: corpus, natural-language-processing

Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with the truth-value of the sentence.

Stars: ✭ 192 (-70.32%)

Mutual labels: corpus, natural-language-processing

Japanese text8 corpus for word embedding.

Stars: ✭ 79 (-87.79%)

Mutual labels: corpus, natural-language-processing

UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language

Stars: ✭ 108 (-83.31%)

Mutual labels: corpus, natural-language-processing

Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text

Stars: ✭ 139 (-78.52%)

Mutual labels: corpus, natural-language-processing

Efaqa Corpus Zh

❤️Emotional First Aid Dataset, 心理咨询问答、聊天机器人语料库

Stars: ✭ 170 (-73.72%)

Mutual labels: corpus, natural-language-processing

Nlp bahasa resources

A Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia

Stars: ✭ 158 (-75.58%)

Mutual labels: corpus, natural-language-processing

Awesome Persian Nlp Ir

Curated List of Persian Natural Language Processing and Information Retrieval Tools and Resources

Stars: ✭ 460 (-28.9%)

Mutual labels: corpus, natural-language-processing

A dataset of millions of news articles scraped from a curated list of data sources.

Stars: ✭ 255 (-60.59%)

Mutual labels: corpus, natural-language-processing

Weixin public corpus

微信公众号语料库

Stars: ✭ 465 (-28.13%)

Mutual labels: corpus, natural-language-processing

Self Attentive Parser

High-accuracy NLP parser with models for 11 languages.

Stars: ✭ 569 (-12.06%)

Mutual labels: natural-language-processing

Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation

Stars: ✭ 618 (-4.48%)

Mutual labels: natural-language-processing

Awesome Bert Nlp

A curated list of NLP resources focused on BERT, attention mechanism, Transformer networks, and transfer learning.

Stars: ✭ 567 (-12.36%)

Mutual labels: natural-language-processing

BERT score for text generation

Stars: ✭ 568 (-12.21%)

Mutual labels: natural-language-processing

Build a bot that speaks like you!

Stars: ✭ 641 (-0.93%)

Mutual labels: natural-language-processing

Library for faster pinned CPU <-> GPU transfer in Pytorch

Stars: ✭ 615 (-4.95%)

Mutual labels: natural-language-processing

Mycroft Core, the Mycroft Artificial Intelligence platform.

Stars: ✭ 5,489 (+748.38%)

Mutual labels: natural-language-processing

View All Similar Projects ➔

About

An R package for managing and analyzing text, created by Kenneth Benoit. Supported by the European Research Council grant ERC-2011-StG 283794-QUANTESS.

For more details, see https://quanteda.io.

How to Install

The normal way from CRAN, using your R GUI or

install.packages("quanteda")

Or for the latest development version:

# devtools package required to install quanteda from Github 
devtools::install_github("quanteda/quanteda")

Because this compiles some C++ and Fortran source code, you will need to have installed the appropriate compilers.

If you are using a Windows platform, this means you will need also to install the Rtools software available from CRAN.

If you are using macOS, you should install the macOS tools, namely the Clang 6.x compiler and the GNU Fortran compiler (as quanteda requires gfortran to build). If you are still getting errors related to gfortran, follow the fixes here.

The quanteda family of packages

As of v3.0, we have continued our trend of splitting quanteda into modular packages. These are now the following:

quanteda: contains all of the core natural language processing and textual data management functions
quanteda.textmodels: contains all of the text models and supporting functions, namely the textmodel_*() functions. This was split from the main package with the v2 release
quanteda.textstats: statistics for textual data, namely the textstat_*() functions, split with the v3 release
quanteda.textplots: plots for textual data, namely the textplot_*() functions, split with the v3 release

We are working on additional package releases, available in the meantime from our GitHub pages:

quanteda.sentiment: Functions and lexicons for sentiment analysis using dictionaries
quanteda.tidy: Extensions for manipulating document variables in core quanteda objects using your favourite tidyverse functions

and more to come.

How to Use

See the quick start guide to learn how to use quanteda.

How to cite

Benoit, Kenneth, Kohei Watanabe, Haiyan Wang, Paul Nulty, Adam Obeng, Stefan Müller, and Akitaka Matsuo. (2018) “quanteda: An R package for the quantitative analysis of textual data”. Journal of Open Source Software. 3(30), 774. https://doi.org/10.21105/joss.00774.

For a BibTeX entry, use the output from citation(package = "quanteda").

Leaving Feedback

If you like quanteda, please consider leaving feedback or a testimonial here.

Contributing

Contributions in the form of feedback, comments, code, and bug reports are most welcome. How to contribute:

Fork the source code, modify, and issue a pull request through the project GitHub page. See our Contributor Code of Conduct and the all-important quanteda Style Guide.
Issues, bug reports, and wish lists: File a GitHub issue.
Usage questions: Submit a question on the quanteda channel on StackOverflow.
Contact the maintainer by email.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 647

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (54) 🔗