[↑1] N — the total number of candidates who took the test. Read more
[↑2] Range — the difference between the highest and lowest score in the dataset. Read more
[↑3] Resolution — the number of consecutive possible raw scores contained within a unit of spread. Read more
[↑4] Mean — the arithmetic average of all scores in the dataset. Read more
[↑5] Median — the middle score when all scores are ordered from lowest to highest. Read more
[↑6] Mode — the value that occurs most frequently in the dataset. Read more
[↑7] Standard deviation — the square root of variance, indicating typical distance of scores from the mean; most informative for normal distributions. Read more
[↑8] Quartile deviation — a spread measure useful for skewed or non-normal score distributions. Read more
[↑9] Standard error of measurement — the expected variability of a test score due to measurement error. Read more
[↑10] Skewness — a measure of how asymmetric a score distribution is around its mean. Read more
[↑11] Excess kurtosis — measures how much distribution tails differ in heaviness from those of the normal distribution. Read more
[↑12] Item facility — refers to the probability that examinees will answer the item correctly. Read more
[↑13] Sample-dependent hardness — the average proportion of the possible raw score range that candidates fail to achieve. Read more
[↑14] Sample-independent hardness — a difficulty index anchored to a fixed normative ability level instead of the sample mean. Read more
[↑15] Overall test complexity — a sample-independent index of global difficulty across the test’s entire effective measurement range. Read more
[↑16] Cronbach’s alpha — quantifies the extent to which items measure the same underlying construct. Read more
[↑17] Split-half reliability — estimates consistency by correlating scores from two halves of the test, often averaged over multiple random splits. Read more
[↑18] Spearman-Brown correction — scales split-half reliability to reflect the reliability of the entire test. Read more
[↑19] McDonald’s omega — a reliability coefficient estimating internal consistency using a latent factor model. Read more
[↑20] Test dimensionality — the number of distinct latent constructs or abilities measured by the test. Read more
Credits: Paul Cooijmans, Marc-André Nydegger