Computing word frequencies using wordfreq

Description

wordfreq generates a word frequency matrix from text files

Usage

wordfreq filename1.txt [filename2.txt filename3.txt ...]

Notes

wordfreq takes a set of text files as input and produces in memory a set of frequencies for all words that occur in at least one of the input texts. The resulting dataset consists of a text variable word containing a list of the words themselves, followed by a set of frequency variables, one for each text, with the names tfilename1, tfilename2, tfilename3, etc. Each frequency variable will range from 0 to a maximum of the total words associated with its text file.

Currently all input filenames must be in plain text and end in '.txt'

Stata Code

Examine the source.


Previous Up to Table of Contents Next