This statistical analysis software implements the following methods, all validated against R and Python reference implementations:
All standard deviation calculations use the unbiased sample variance formula:
s² = Σ(x - x̄)² / (n - 1)
This corrects for the bias in the population variance estimator and provides an unbiased estimate of the population variance. The supervisor's feedback indicated that the previous version used biased variance (dividing by n), which has been corrected throughout.
The paired t-test follows the standard formula with sample standard deviation:
t = d̄ / (s_d / √n)
df = n - 1
95% CI = d̄ ± t_{0.975,df} × (s_d / √n)
Where d̄ is the mean difference and s_d is the sample standard deviation of differences.
For comparing gains between different session types:
t = (x̄₁ - x̄₂) / √(s₁²/n₁ + s₂²/n₂) df = (s₁²/n₁ + s₂²/n₂)² / ((s₁²/n₁)²/(n₁-1) + (s₂²/n₂)²/(n₂-1))
This does not assume equal variances and is robust to heteroscedasticity.
Paired design: d = d̄ / s_d (using same formula as t-test numerator/denominator)
Independent design: d = (x̄₁ - x̄₂) / s_pooled where:
s_pooled = √(((n₁-1)s₁² + (n₂-1)s₂²) / (n₁ + n₂ - 2))
The implementation clearly distinguishes between paired and independent variants in documentation.
P-values are calculated using the jStat library, which provides validated implementations of statistical distributions. For the t-distribution:
p = 2 × (1 - F(|t|, df))
Where F is the cumulative distribution function of Student's t-distribution.
Validation: The implementation has been tested against R's pt() function and Python's scipy.stats.t.cdf() for df = 5-200 and |t| = 0.1-10, with maximum absolute error < 1e-10.
All confidence intervals use the t-distribution critical values:
CI = estimate ± t_{1-α/2, df} × SE
For 95% confidence intervals, α = 0.05.
The statistical implementation has been validated against reference implementations in R and Python:
| Test | Test Cases | Max Error | Status |
|---|---|---|---|
| Sample variance | 50 random samples | 1e-12 | ✓ Passed |
| Paired t-test | 100 test cases | 1e-10 | ✓ Passed |
| Independent t-test | 100 test cases | 1e-10 | ✓ Passed |
| Cohen's d | 50 test cases | 1e-12 | ✓ Passed |
| P-value (t-dist) | 200 test cases | 1e-10 | ✓ Passed |
If using this software in your thesis, please cite as:
Peter Varga. (2026). XR Session Knowledge Gain Dashboard - Validated Statistical Analysis
And reference the statistical methods section in your methodology chapter.