Quantitative Index Text Analyzer (QUITA) covers the most common indicators, especially those connected with the frequency structure of a text. In addition to computing the indicators, QUITA also provides statistical testing and graphical visualization of the obtained data.
- MATTR (Moving-Average Type-Token Ratio)
- NLPUtils (Boisclair tokenizer and morphological analyzer)
- CPIDR® (Computerized Propositional Idea Density Rater)
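As an illustration of the first item in the list above, the following is a minimal sketch of a moving-average type-token ratio, not the code of the MATTR program itself. The function name `mattr`, the default window of 50 tokens, and the fallback to a plain type-token ratio for texts shorter than the window are assumptions made for this example.

```python
def mattr(tokens, window=50):
    """Average the type-token ratio over every window of `window` consecutive tokens.

    The default window size is an arbitrary choice for this sketch.
    """
    if window <= 0:
        raise ValueError("window must be a positive integer")
    n = len(tokens)
    if n == 0:
        return 0.0
    if n < window:
        # Assumption: fall back to the plain type-token ratio for short texts.
        return len(set(tokens)) / n
    ratios = [
        len(set(tokens[i:i + window])) / window
        for i in range(n - window + 1)
    ]
    return sum(ratios) / len(ratios)


if __name__ == "__main__":
    sample = "the cat sat on the mat and the dog sat on the rug".split()
    print(round(mattr(sample, window=5), 3))
```

Because every window has the same length, the result does not shrink with text length the way a plain type-token ratio does, which is the motivation for the moving average.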
LancsBox is a new software package for the analysis of language data and corpora developed at Lancaster University. It can be used by linguists, language teachers, historians, sociologists, educators and anyone interested in language. It is free to use for non-commercial purposes and works with any operating system.
Several programs developed by Jiří Milička (Charles University, Prague), including calculators of vocabulary richness, thematic concentration, type-token relation, keywords, etc.
Software for finding patterns in text. WordSmith Tools provides Concordance, KeyWords, Word Lists and other utility programs such as Character Profiler, CharGrams, Data Converter, Text Converter and Aligner. WordSmith Tools can handle most languages, including Chinese, Japanese, etc.
A Java-based, modular program for textual analysis, text categorization, and authorship attribution, i.e. stylometry / textometry. JGAAP is intended to tackle two different problems: first, to allow people unfamiliar with machine learning and quantitative analysis to use cutting-edge techniques on their text-based stylometry / textometry problems; and second, to act as a framework for quickly and easily testing and comparing the effectiveness of different analytic techniques on text analysis. JGAAP is developed by the Evaluating Variation in Language (EVL) Lab at Duquesne University.
The Signature computer program provides a number of tools for authorship attribution (word lengths, sentence lengths, paragraph lengths, frequencies of letters and punctuation marks). The program also enables statistical comparison of texts using the chi-square test.
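The kind of comparison described above can be sketched as a Pearson chi-square test of homogeneity on the word-length distributions of two texts. This is a generic illustration, not Signature's actual procedure; the function names and the simple whitespace tokenization are assumptions made for the example, and only the test statistic and degrees of freedom are computed (a p-value would need a chi-square distribution table or a statistics library).

```python
from collections import Counter


def word_length_counts(text):
    """Count how many words of each length occur in `text` (naive tokenization)."""
    words = [w.strip(".,;:!?\"'()") for w in text.split()]
    return Counter(len(w) for w in words if w)


def chi_square_homogeneity(counts_a, counts_b):
    """Return (statistic, degrees of freedom) for a 2 x k contingency table."""
    counts_a, counts_b = Counter(counts_a), Counter(counts_b)
    total_a = sum(counts_a.values())
    total_b = sum(counts_b.values())
    if total_a == 0 or total_b == 0:
        raise ValueError("both texts must contain at least one word")
    grand_total = total_a + total_b
    categories = sorted(set(counts_a) | set(counts_b))
    statistic = 0.0
    for category in categories:
        column_total = counts_a[category] + counts_b[category]
        for observed, row_total in ((counts_a[category], total_a),
                                    (counts_b[category], total_b)):
            expected = row_total * column_total / grand_total
            statistic += (observed - expected) ** 2 / expected
    return statistic, len(categories) - 1


if __name__ == "__main__":
    a = word_length_counts("The quick brown fox jumps over the lazy dog.")
    b = word_length_counts("Extraordinarily complicated sentences occasionally appear.")
    print(chi_square_homogeneity(a, b))
```

With samples this small the expected counts are far too low for the test to be reliable; the same functions apply unchanged to the sentence-length or letter-frequency counts mentioned above.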
Stylo is an R package that provides a number of statistical methods for stylometric research, such as MFW (Most Frequent Word) cluster analysis. Stylo can also be used via a graphical user interface (GUI). Stylo was developed by Maciej Eder (Pedagogical Univ. of Kraków), Jan Rybicki (Jagiellonian University) and Mike Kestemont (University of Antwerp).
The Altmann-Fitter is interactive software for the iterative fitting of univariate discrete probability distributions to frequency data. It uses the Nelder-Mead simplex algorithm. Fitting is automatic, i.e. no initial estimators are necessary, and it improves iteratively. Goodness of fit is assessed by means of the chi-square test. A number of options and configurations enable the user to process data flexibly. Developed by Gabriel Altmann.
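The general idea described above (choose distribution parameters by simplex search so that the chi-square discrepancy from the observed frequencies is as small as possible) can be sketched as follows. This is not the Altmann-Fitter's code: the Poisson distribution, the starting value, the step size, the stopping tolerance, and all function names are assumptions chosen for illustration, and the simplex routine is a bare-bones textbook variant.

```python
import math


def poisson_pmf(k, lam):
    return math.exp(-lam) * lam ** k / math.factorial(k)


def chi_square(observed, lam):
    """Pearson chi-square between counts for k = 0, 1, 2, ... and Poisson(lam)."""
    if lam <= 0:
        return math.inf  # keep the search inside the valid parameter range
    n = sum(observed)
    total = 0.0
    for k, obs in enumerate(observed):
        expected = n * poisson_pmf(k, lam)
        total += (obs - expected) ** 2 / expected
    return total


def nelder_mead(f, start, step=0.5, tol=1e-6, max_iter=500):
    """Minimal Nelder-Mead simplex search (reflect, expand, contract, shrink)."""
    dim = len(start)
    simplex = [list(start)]
    for i in range(dim):
        point = list(start)
        point[i] += step
        simplex.append(point)
    for _ in range(max_iter):
        simplex.sort(key=f)
        best, worst = simplex[0], simplex[-1]
        size = max(abs(a - b) for p in simplex[1:] for a, b in zip(p, best))
        if size < tol:
            break
        centroid = [sum(p[i] for p in simplex[:-1]) / dim for i in range(dim)]
        reflected = [c + (c - w) for c, w in zip(centroid, worst)]
        if f(reflected) < f(best):
            expanded = [c + 2.0 * (c - w) for c, w in zip(centroid, worst)]
            simplex[-1] = expanded if f(expanded) < f(reflected) else reflected
        elif f(reflected) < f(simplex[-2]):
            simplex[-1] = reflected
        else:
            contracted = [c + 0.5 * (w - c) for c, w in zip(centroid, worst)]
            if f(contracted) < f(worst):
                simplex[-1] = contracted
            else:
                # Shrink every other vertex halfway toward the best one.
                simplex = [best] + [
                    [b + 0.5 * (x - b) for x, b in zip(p, best)]
                    for p in simplex[1:]
                ]
    return min(simplex, key=f)


def fit_poisson(observed, start=1.0):
    """Return (fitted lambda, chi-square at the fitted value)."""
    best = nelder_mead(lambda p: chi_square(observed, p[0]), [start])
    return best[0], chi_square(observed, best[0])


if __name__ == "__main__":
    counts = [135, 271, 271, 180, 90, 36, 12, 3, 1]  # made-up frequencies
    print(fit_poisson(counts))
```

The sketch returns only the fitted parameter and the chi-square value; judging the fit would additionally require the degrees of freedom and the corresponding chi-square probability.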
Pajek is a program for analysis and visualization of large networks having some thousands or even millions of vertices. Developed by Andrej Mrvar & Vladimir Batagelj (University of Ljubljana, Slovenia).
Gephi is open-source software for visualizing and analysing large network graphs. Gephi uses a 3D render engine to display graphs in real time and speed up exploration. You can use it to explore, analyse, spatialise, filter, cluster, manipulate and export all types of graphs.
Online statistical multifunctional software.
- Equation Plotter
- Time Series Analysis
- Multiple Regression
- Descriptive Statistics
- Statistical Distributions
- Hypothesis Testing
Software developed by UCREL (University Centre for Computer Corpus Research on Language).
R is an open source programming language and software environment for statistical computing and graphics that is supported by the R Foundation for Statistical Computing. The R language is widely used among statisticians and data miners for developing statistical software and data analysis. Polls, surveys of data miners, and studies of scholarly literature databases show that R's popularity has increased substantially in recent years.