PsychWordVec: Word Embedding Research Framework for Psychological Science
An integrated toolbox of word embedding research that provides:
(1) a collection of 'pre-trained' static word vectors in the '.RData'
compressed format <https://psychbruce.github.io/WordVector_RData.pdf>;
(2) a series of functions to process, analyze, and visualize word vectors;
(3) a range of tests to examine conceptual associations, including
the Word Embedding Association Test <doi:10.1126/science.aal4230>
and the Relative Norm Distance <doi:10.1073/pnas.1720347115>,
with permutation test of significance;
(4) a set of training methods to locally train (static) word vectors
from text corpora, including 'Word2Vec' <arXiv:1301.3781>,
'GloVe' <doi:10.3115/v1/D14-1162>, and 'FastText' <arXiv:1607.04606>;
(5) a group of functions to download 'pre-trained' language models
(e.g., 'GPT', 'BERT'), extract contextualized (dynamic) word vectors
(based on the R package 'text'), and perform language analysis tasks
(e.g., fill in the blank masks).
Version: |
0.3.0 |
Depends: |
R (≥ 4.0.0) |
Imports: |
bruceR, dplyr, stringr, data.table, purrr, vroom, cli, ggplot2, ggrepel, corrplot, psych, Rtsne, rgl, qgraph, rsparse, text2vec, word2vec, fastTextR, text, reticulate |
Suggests: |
wordsalad, sweater |
Published: |
2022-12-15 |
Author: |
Han-Wu-Shuang Bao [aut, cre] |
Maintainer: |
Han-Wu-Shuang Bao <baohws at foxmail.com> |
BugReports: |
https://github.com/psychbruce/PsychWordVec/issues |
License: |
GPL-3 |
URL: |
https://psychbruce.github.io/PsychWordVec/ |
NeedsCompilation: |
no |
Materials: |
README NEWS |
CRAN checks: |
PsychWordVec results |
Documentation:
Downloads:
Linking:
Please use the canonical form
https://CRAN.R-project.org/package=PsychWordVec
to link to this page.