Textanz is an excellent tool for writers and editors that lets you import text from documents and calculate phrase, word, and word form frequencies.
You can import text from files or website URLs, or paste it from the clipboard. Supported file formats include RTF, HTML, XML, PDF, MS Office, OpenOffice, and more.
Imported text is organized in tabs, so you can have several texts open at the same time. To see all the text in a tab, turn on the line wrap feature or use the scroll bar at the bottom of the text tab.
After importing the text, choose the "Phrase and word frequencies" or "Word form frequencies" option to obtain the results. The program calculates the phrase and word frequencies for all open tabs in a single table. You can use the Filter text box to show the results for a given word. The table of results can be exported to HTML, CSV, or XML format.
The parameters you can adjust include the non-alphabetical word characters to exclude (e.g. @#$%&'0123456789), the minimum number of characters considered for the phrase frequency, the phrase length, and whether to ignore common words.
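To give a rough idea of what these parameters control, here is a minimal Python sketch of word and phrase counting. It is not Textanz's actual code. The function names, the common-word list, and the reading of "minimum characters" as a minimum word length are all assumptions made for illustration.

```python
import re
from collections import Counter

# Hypothetical settings for illustration; not Textanz's real defaults.
EXCLUDED_CHARS = "@#$%&'0123456789"
COMMON_WORDS = {"the", "a", "an", "and", "of", "to", "in", "is"}


def tokenize(text, excluded_chars=EXCLUDED_CHARS):
    """Lowercase the text, delete the excluded characters, and split into words."""
    cleaned = text.lower().translate(str.maketrans("", "", excluded_chars))
    return re.findall(r"[^\W_]+", cleaned)


def word_frequencies(words, min_chars=1, ignore_common=False):
    """Count words, optionally skipping short words and common words."""
    kept = [
        w for w in words
        if len(w) >= min_chars and not (ignore_common and w in COMMON_WORDS)
    ]
    return Counter(kept)


def phrase_frequencies(words, phrase_length=2):
    """Count every run of `phrase_length` consecutive words.

    Note: this simple version lets phrases span sentence boundaries.
    """
    phrases = (
        " ".join(words[i:i + phrase_length])
        for i in range(len(words) - phrase_length + 1)
    )
    return Counter(phrases)


if __name__ == "__main__":
    words = tokenize("The cat sat. The cat ran to the cat house 42 times.")
    print(word_frequencies(words, min_chars=3, ignore_common=True).most_common(3))
    print(phrase_frequencies(words, phrase_length=2).most_common(1))  # [('the cat', 3)]
```

Raising `phrase_length` counts longer phrases, and setting `ignore_common=True` drops words such as "the" from the word table.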