A fast, flexible, and comprehensive framework for
quantitative text analysis in R. Provides functionality for corpus management,
creating and manipulating tokens and n-grams, exploring keywords in context,
forming and manipulating sparse matrices
of documents by features and feature co-occurrences, analyzing keywords, computing feature similarities and
distances, applying content dictionaries, applying supervised and unsupervised machine learning,
visually representing text and text analyses, and more.
Version: |
4.2.0 |
Depends: |
R (≥ 3.5.0), methods |
Imports: |
fastmatch, jsonlite, lifecycle, magrittr, Matrix (≥ 1.5-0), Rcpp (≥ 0.12.12), SnowballC, stopwords, stringi, xml2, yaml |
LinkingTo: |
Rcpp |
Suggests: |
rmarkdown, spelling, testthat, formatR, tm (≥ 0.6), knitr, lsa, rlang, slam |
Enhances: |
dplyr, lda, purrr, spacyr, stm, text2vec, tibble, tidytext, tokenizers, topicmodels |
Published: |
2025-01-08 |
DOI: |
10.32614/CRAN.package.quanteda |
Author: |
Kenneth Benoit
[cre, aut, cph] (<https://orcid.org/0000-0002-0797-564X>),
Kohei Watanabe
[aut] (<https://orcid.org/0000-0001-6519-5265>),
Haiyan Wang [aut]
(<https://orcid.org/0000-0003-4992-4311>),
Paul Nulty [aut]
(<https://orcid.org/0000-0002-7214-4666>),
Adam Obeng [aut]
(<https://orcid.org/0000-0002-2906-4775>),
Stefan Müller
[aut] (<https://orcid.org/0000-0002-6315-4125>),
Akitaka Matsuo
[aut] (<https://orcid.org/0000-0002-3323-6330>),
William Lowe
[aut] (<https://orcid.org/0000-0002-1549-6163>),
Christian Müller [ctb],
Olivier Delmarcelle
[ctb]
(<https://orcid.org/0000-0003-4347-070X>),
European Research Council [fnd] (ERC-2011-StG 283794-QUANTESS) |
Maintainer: |
Kenneth Benoit <kbenoit at lse.ac.uk> |
BugReports: |
https://github.com/quanteda/quanteda/issues |
License: |
GPL-3 |
URL: |
https://quanteda.io |
NeedsCompilation: |
yes |
Language: |
en-GB |
Citation: |
quanteda citation info |
Materials: |
README NEWS |
In views: |
NaturalLanguageProcessing |
CRAN checks: |
quanteda results |