I am just learning about text mining (and using R!) and would like to check that I am assembling a document-term matrix correctly, with just two documents for now. Is this correct?
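library(tm)
# Read each file and collapse its lines into one string, so each file is one document
AAPL <- paste(readLines('AAPL.txt'), collapse = ' ')
JE <- paste(readLines('JaneEyre.txt'), collapse = ' ')
# Preliminary corpus: VectorSource() takes a single vector, one element per document
corpusAAPL <- Corpus(VectorSource(c(AAPL, JE)))
# Create the document-term matrix
tdmAAPL <- DocumentTermMatrix(corpusAAPL)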
At this point I would like R to print at least part of this document-term matrix so I can check it is working correctly.
Thank you very much.
You can use tm::inspect() to view the matrix (page 15 of the documentation).
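As a minimal sketch of what that looks like, here is inspect() applied to a tiny document-term matrix. The two strings are made-up stand-ins for the contents of AAPL.txt and JaneEyre.txt, and the names docs, corpus, dtm and m are only for illustration.

```r
library(tm)

# Two toy documents standing in for the real files (hypothetical text)
docs <- c("apple shares rose as apple sold more phones",
          "jane walked across the moor toward the house")

# One element of the vector becomes one document in the corpus
corpus <- Corpus(VectorSource(docs))
dtm <- DocumentTermMatrix(corpus)

# Print a summary (dimensions, sparsity) and a sample of the counts
inspect(dtm)

# Restrict the view to a block of documents and terms by indexing first
inspect(dtm[1:2, 1:5])

# For a small matrix, converting to an ordinary matrix also works
m <- as.matrix(dtm)
```

Indexing before calling inspect() is useful once the vocabulary is large, since it limits the printed sample to the rows and columns you want to check.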