WordStat by Provalis Research

Content analysis and text mining software.

Highly advanced content analysis and text-mining software with unmatched analysis capabilities.


WordStat is flexible and easy-to-use text analysis software, whether you need text mining tools for fast extraction of themes and trends or careful and precise measurement with state-of-the-art quantitative content analysis tools. WordStat's seamless integration with SimStat (our statistical data analysis tool), QDA Miner (our qualitative data analysis software), and Stata (the comprehensive statistical software from StataCorp) gives you unprecedented flexibility for analyzing text and relating its content to structured information, including numerical and categorical data.

Why WordStat?

WordStat can be used by anyone who needs to quickly extract and analyze information from large numbers of documents. Our content analysis and text mining software is used for:

  • Content analysis of open-ended responses, interview or focus group transcripts
  • Business intelligence and competitive website analysis
  • Information extraction and knowledge discovery from incident reports and customer complaints
  • Content analysis of news coverage or scientific literature
  • Automatic tagging and classification of documents
  • Fraud detection, authorship attribution, patent analysis
  • Taxonomy development and validation

Why use WordStat to analyse your unstructured data?


Import Word, Excel, HTML, SPSS, Stata, NVivo, or PDF files. Connect to and directly import from social media, emails, web survey platforms, and reference management tools.

Content analysis

Handle large amounts of unstructured data, processing up to 25 million words per minute and identifying all references to user-defined concepts in categorisation dictionaries.
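
As a rough illustration of what dictionary-based categorisation means in general, the Python sketch below counts how often the terms of each user-defined category occur in a text. It is not WordStat's implementation or file format: the `DICTIONARY` contents, the `count_categories` function, and the sample sentence are all hypothetical names chosen for this example.

```python
import re
from collections import Counter

# Hypothetical categorisation dictionary: category name -> list of terms.
# (Illustrative only; this is not WordStat's dictionary format.)
DICTIONARY = {
    "SERVICE": ["staff", "service", "waiter"],
    "PRICE": ["price", "expensive", "cheap"],
}


def count_categories(text, dictionary):
    """Count whole-word, case-insensitive matches of each category's terms."""
    counts = Counter()
    lowered = text.lower()
    for category, terms in dictionary.items():
        for term in terms:
            # \b anchors keep "price" from matching inside "priceless".
            pattern = r"\b" + re.escape(term.lower()) + r"\b"
            counts[category] += len(re.findall(pattern, lowered))
    return counts


example = count_categories(
    "The staff were friendly, but the price was too expensive.",
    DICTIONARY,
)
print(dict(example))
```

A real content analysis dictionary would usually also handle word stems, phrases, and exclusion rules; this sketch only shows the basic idea of mapping words to categories and counting the matches.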

Visualisation of text

Integrated exploratory text mining and visualisation tools such as topic modeling, clustering, multidimensional scaling, proximity plots, and more, to quickly extract themes and automatically identify patterns.

Salient topics

Get a quick overview of the most salient topics from large text collections by using state-of-the-art automatic topic extraction techniques.

Relate data

Relate unstructured text with structured data such as dates, numbers or categories to identify trends or differences between subgroups.

GIS mapping

The GIS mapping module helps create interactive plots of data points, thematic maps, and heat maps.

WordStat 8: Improved performance and provision, a more flexible approach and enhanced usability.