Verify distribution uniformity/Chi-squared test: Difference between revisions

← Older edit

Verify distribution uniformity/Chi-squared test (view source)

Revision as of 10:29, 16 February 2024

43,112 bytes added , 3 months ago

m

→‎{{header|Wren}}: Minor tidy

PureFox

9,479

edits

Revision as of 19:51, 10 September 2019 (view source) rosettacode>Gerard Schildberger m (added a ;Task: and ;Reference: section header, added whitespace before the table of contents (TOC).) ← Older edit		Latest revision as of 10:29, 16 February 2024 (view source) PureFox (talk \| contribs) m (→‎{{header\|Wren}}: Minor tidy)
(27 intermediate revisions by 16 users not shown)
Line 2: ;Task: Write a function to ~~verify~~determine ~~that~~whether a given ~~distribution~~set of ~~values~~frequency counts could plausibly have iscome from a uniform distribution by using the [[wp:Pearson's chi-square test\|<math>\chi^2</math> test]] ~~to see if the distribution has~~with a ~~likelihood of happening of at least the~~ significance level ~~(conventionally~~of 5%). ~~The~~ ~~function should return a boolean that is true if the distribution is one that a uniform distribution (with appropriate number of degrees of freedom) may be expected to produce.~~ The function should return a boolean that is true if and only if the distribution is one that a uniform distribution (with appropriate number of degrees of freedom) may be expected to produce. Note: normally a two-tailed test would be used for this kind of problem. ;Reference: :*   an entry at the MathWorld website:   [http://mathworld.wolfram.com/Chi-SquaredDistribution.html chi-squared distribution]. ; Related task: :* [[Statistics/Chi-squared_distribution]] <br><br> =={{header\|11l}}== {{trans\|Python}} <syntaxhighlight lang="11l">V a = 12 V k1_factrl = 1.0 [Float] c c.append(sqrt(2.0 * math:pi)) L(k) 1 .< a c.append(exp(a - k) * (a - k) ^ (k - 0.5) / k1_factrl) k1_factrl = -k F gamma_spounge(z) V accm = :c[0] L(k) 1 .< :a accm += :c[k] / (z + k) accm = exp(-(z + :a)) * (z + :a) ^ (z + 0.5) R accm / z F GammaInc_Q(a, x) V a1 = a - 1 V a2 = a - 2 F f0(t) R t ^ @a1 * exp(-t) F df0(t) R (@a1 - t) * t ^ @a2 * exp(-t) V y = a1 L f0(y) * (x - y) > 2.0e-8 & y < x y += 0.3 I y > x y = x V h = 3.0e-4 V n = Int(y / h) h = y / n V hh = 0.5 * h V gamax = h * sum(((n - 1 .< -1).step(-1).map(j -> @h * j)).map(t -> @f0(t) + @hh * @df0(t))) R gamax / gamma_spounge(a) F chi2UniformDistance(dataSet) V expected = sum(dataSet) * 1.0 / dataSet.len V cntrd = (dataSet.map(d -> d - @expected)) R sum(cntrd.map(x -> x * x)) / expected F chi2Probability(dof, distance) R 1.0 - GammaInc_Q(0.5 * dof, 0.5 * distance) F chi2IsUniform(dataSet, significance) V dof = dataSet.len - 1 V dist = chi2UniformDistance(dataSet) R chi2Probability(dof, dist) > significance V dset1 = [199809, 200665, 199607, 200270, 199649] V dset2 = [522573, 244456, 139979, 71531, 21461] L(ds) (dset1, dset2) print(‘Data set: ’ds) V dof = ds.len - 1 V distance = chi2UniformDistance(ds) print(‘dof: #. distance: #.4’.format(dof, distance), end' ‘ ’) V prob = chi2Probability(dof, distance) print(‘probability: #.4’.format(prob), end' ‘ ’) print(‘uniform? ’(I chi2IsUniform(ds, 0.05) {‘Yes’} E ‘No’))</syntaxhighlight> {{out}} <pre> Data set: [199809, 200665, 199607, 200270, 199649] dof: 4 distance: 4.1463 probability: 0.3866 uniform? Yes Data set: [522573, 244456, 139979, 71531, 21461] dof: 4 distance: 790063.2759 probability: -1.5002e-8 uniform? No </pre> =={{header\|Ada}}== First, we specify a simple package to compute the Chi-Square Distance from the uniform distribution: <~~lang~~syntaxhighlight ~~Ada~~lang="ada">package Chi_Square is type Flt is digits 18; Line 18 ⟶ 96: function Distance(Bins: Bins_Type) return Flt; end Chi_Square;</~~lang~~syntaxhighlight> Next, we implement that package: <~~lang~~syntaxhighlight ~~Ada~~lang="ada">package body Chi_Square is function Distance(Bins: Bins_Type) return Flt is Line 48 ⟶ 126: end Distance; end Chi_Square;</~~lang~~syntaxhighlight> Finally, we actually implement the Chi-square test. We do not actually compute the Chi-square probability; rather we hardcode a table of values for 5% significance level, which has been picked from Wikipedia [http://en.wikipedia.org/wiki/Chi-squared_distribution]: <~~lang~~syntaxhighlight ~~Ada~~lang="ada">with Ada.Text_IO, Ada.Command_Line, Chi_Square; use Ada.Text_IO; procedure Test_Chi_Square is Line 78 ⟶ 156: Put_Line("; (deviates significantly from uniform)"); end if; end;</~~lang~~syntaxhighlight> {{out}} Line 89 ⟶ 167: This first sections contains the functions required to compute the Chi-Squared probability. These are not needed if a library containing the necessary function is availabile (e.g. see [[Numerical Integration]], [[Gamma function]]). <~~lang~~syntaxhighlight lang="c">#include <stdlib.h> #include <stdio.h> #include <math.h> Line 156 ⟶ 234: return 1.0 - Simpson3_8( &f0, 0, y, (int)(y/h))/Gamma_Spouge(a); }</~~lang~~syntaxhighlight> This section contains the functions specific to the task. <~~lang~~syntaxhighlight lang="c">double chi2UniformDistance( double ds, int dslen) { double expected = 0.0; Line 185 ⟶ 263: double dist = chi2UniformDistance( dset, dslen); return chi2Probability( dof, dist ) > significance; }</~~lang~~syntaxhighlight> Testing <~~lang~~syntaxhighlight lang="c">int main(int argc, char argv) { double dset1[] = { 199809., 200665., 199607., 200270., 199649. }; Line 211 ⟶ 289: } return 0; }</~~lang~~syntaxhighlight> =={{header\|C#}}== {{trans\|Go}} <syntaxhighlight lang="C#"> using System; class Program { public delegate double Func(double x); public static double Simpson38(Func f, double a, double b, int n) { double h = (b - a) / n; double h1 = h / 3; double sum = f(a) + f(b); for (int j = 3 n - 1; j > 0; j--) { if (j % 3 == 0) { sum += 2 * f(a + h1 * j); } else { sum += 3 * f(a + h1 * j); } } return h * sum / 8; } // Lanczos Approximation for Gamma Function private static double SpecialFunctionGamma(double z) { double[] p = { 676.5203681218851, -1259.1392167224028, 771.32342877765313, -176.61502916214059, 12.507343278686905, -0.13857109526572012, 9.9843695780195716e-6, 1.5056327351493116e-7 }; if (z < 0.5) return Math.PI / (Math.Sin(Math.PI * z) * SpecialFunctionGamma(1 - z)); z -= 1; double x = 0.99999999999980993; for (int i = 0; i < p.Length; i++) { x += p[i] / (z + i + 1); } double t = z + p.Length - 0.5; return Math.Sqrt(2 * Math.PI) * Math.Pow(t, z + 0.5) * Math.Exp(-t) * x; } public static double GammaIncQ(double a, double x) { double aa1 = a - 1; Func f = t => Math.Pow(t, aa1) * Math.Exp(-t); double y = aa1; double h = 1.5e-2; while (f(y) * (x - y) > 2e-8 && y < x) { y += .4; } if (y > x) { y = x; } return 1 - Simpson38(f, 0, y, (int)(y / h / SpecialFunctionGamma(a))); } public static double Chi2Ud(int[] ds) { double sum = 0, expected = 0; foreach (var d in ds) { expected += d; } expected /= ds.Length; foreach (var d in ds) { double x = d - expected; sum += x * x; } return sum / expected; } public static double Chi2P(int dof, double distance) { return GammaIncQ(.5 * dof, .5 * distance); } const double SigLevel = .05; static void Main(string[] args) { int[][] datasets = new int[][] { new int[] { 199809, 200665, 199607, 200270, 199649 }, new int[] { 522573, 244456, 139979, 71531, 21461 }, }; foreach (var dset in datasets) { UTest(dset); } } static void UTest(int[] dset) { Console.WriteLine("Uniform distribution test"); int sum = 0; foreach (var c in dset) { sum += c; } Console.WriteLine($" dataset: {string.Join(", ", dset)}"); Console.WriteLine($" samples: {sum}"); Console.WriteLine($" categories: {dset.Length}"); int dof = dset.Length - 1; Console.WriteLine($" degrees of freedom: {dof}"); double dist = Chi2Ud(dset); Console.WriteLine($" chi square test statistic: {dist}"); double p = Chi2P(dof, dist); Console.WriteLine($" p-value of test statistic: {p}"); bool sig = p < SigLevel; Console.WriteLine($" significant at {SigLevel * 100}% level? {sig}"); Console.WriteLine($" uniform? {!sig}\n"); } } </syntaxhighlight> {{out}} <pre> Uniform distribution test dataset: 199809, 200665, 199607, 200270, 199649 samples: 1000000 categories: 5 degrees of freedom: 4 chi square test statistic: 4.14628 p-value of test statistic: 0.386570833082767 significant at 5% level? False uniform? True Uniform distribution test dataset: 522573, 244456, 139979, 71531, 21461 samples: 1000000 categories: 5 degrees of freedom: 4 chi square test statistic: 790063.27594 p-value of test statistic: 2.35282904270662E-11 significant at 5% level? True uniform? False </pre> =={{header\|C++}}== <syntaxhighlight lang="c++"> #include <iostream> #include <vector> #include <cmath> #include <iomanip> void print_vector(const std::vector<int32_t>& list) { std::cout << "["; for ( uint64_t i = 0; i < list.size() - 1; ++i ) { std::cout << list[i] << ", "; } std::cout << list.back() << "]" << std::endl; } bool is_significant(const double p_value, const double significance_level) { return p_value > significance_level; } // The normalised lower incomplete gamma function. double gamma_cdf(const double aX, const double aK) { double result = 0.0; for ( uint32_t m = 0; m <= 99; ++m ) { result += pow(aX, m) / tgamma(aK + m + 1); } result = pow(aX, aK) exp(-aX); return std::isnan(result) ? 1.0 : result; } // The cumulative probability function of the Chi-squared distribution. double cdf(const double aX, const double aK) { if ( aX > 1'000 && aK < 100 ) { return 1.0; } return ( aX > 0.0 && aK > 0.0 ) ? gamma_cdf(aX / 2, aK / 2) : 0.0; } void chi_squared_test(const std::vector<int32_t>& observed) { double sum = 0.0; for ( uint64_t i = 0; i < observed.size(); ++i ) { sum += observed[i]; } const double expected = sum / observed.size(); const int32_t degree_freedom = observed.size() - 1; double test_statistic = 0.0; for ( uint64_t i = 0; i < observed.size(); ++i ) { test_statistic += pow(observed[i] - expected, 2) / expected; } const double p_value = 1.0 - cdf(test_statistic, degree_freedom); std::cout << "\nUniform distribution test" << std::setprecision(6) << std::endl; std::cout << " observed values : "; print_vector(observed); std::cout << " expected value : " << expected << std::endl; std::cout << " degrees of freedom: " << degree_freedom << std::endl; std::cout << " test statistic : " << test_statistic << std::endl; std::cout.setf(std::ios::fixed); std::cout << " p-value : " << p_value << std::endl; std::cout.unsetf(std::ios::fixed); std::cout << " is 5% significant?: " << std::boolalpha << is_significant(p_value, 0.05) << std::endl; } int main() { const std::vector<std::vector<int32_t>> datasets = { { 199809, 200665, 199607, 200270, 199649 }, { 522573, 244456, 139979, 71531, 21461 } }; for ( std::vector<int32_t> dataset : datasets ) { chi_squared_test(dataset); } } </syntaxhighlight> {{ out }} <pre> Uniform distribution test observed values : [199809, 200665, 199607, 200270, 199649] expected value : 200000 degrees of freedom: 4 test statistic : 4.14628 p-value : 0.386571 is 5% significant?: true Uniform distribution test observed values : [522573, 244456, 139979, 71531, 21461] expected value : 200000 degrees of freedom: 4 test statistic : 790063 p-value : 0.000000 is 5% significant?: false </pre> =={{header\|D}}== <~~lang~~syntaxhighlight lang="d">import std.stdio, std.algorithm, std.mathspecial; real x2Dist(T)(in T[] data) pure nothrow @safe @nogc { Line 243 ⟶ 588: dof, dist, prob, ds.x2IsUniform ? "YES" : "NO", ds); } }</~~lang~~syntaxhighlight> {{out}} <pre> dof distance probability Uniform? dataset Line 251 ⟶ 596: =={{header\|Elixir}}== {{trans\|Ruby}} <~~lang~~syntaxhighlight lang="elixir">defmodule Verify do defp gammaInc_Q(a, x) do a1 = a-1 Line 313 ⟶ 658: :io.fwrite " probability: ~.4f~n", [Verify.chi2Probability(dof, distance)] :io.fwrite " uniform? ~s~n", [(if Verify.chi2IsUniform(ds), do: "Yes", else: "No")] end)</~~lang~~syntaxhighlight> {{out}} Line 334 ⟶ 679: {{libheader\|GNU Scientific Library}} Instead of implementing the ~~incomplete~~chi-squared ~~gamma function~~distribution by ourselves, we bind to GNU Scientific Library; so we need a module to interface to the function we need (<tt>~~gsl_sf_gamma_inc~~gsl_cdf_chisq_Q</tt>) <~~lang~~syntaxhighlight lang="fortran">module ~~GSLMiniBind~~gsl_mini_bind_m ~~implicit none~~ use iso_c_binding ~~interface~~ implicit none ~~real(c_double) function gsl_sf_gamma_inc(a, x) bind(C)~~ private ~~use iso_c_binding~~ ~~real(c_double), value, intent(in) :: a, x~~ public :: p_value ~~end function gsl_sf_gamma_inc~~ ~~end interface~~ interface ~~end module GSLMiniBind</lang>~~ function gsl_cdf_chisq_q(x, nu) bind(c, name='gsl_cdf_chisq_Q') import real(c_double), value :: x real(c_double), value :: nu real(c_double) :: gsl_cdf_chisq_q end function gsl_cdf_chisq_q end interface contains !> Get p-value from chi-square distribution real function p_value(x, df) real, intent(in) :: x integer, intent(in) :: df p_value = real(gsl_cdf_chisq_q(real(x, c_double), real(df, c_double))) end function p_value end module gsl_mini_bind_m</syntaxhighlight> Now we're ready to complete the task. <~~lang~~syntaxhighlight lang="fortran">program ~~ChiTest~~chi2test ~~use GSLMiniBind~~ ~~use iso_c_binding~~ ~~implicit none~~ use gsl_mini_bind_m, only: p_value ~~real, dimension(5) :: dset1 = (/ 199809., 200665., 199607., 200270., 199649. /)~~ implicit none ~~real, dimension(5) :: dset2 = (/ 522573., 244456., 139979., 71531., 21461. /)~~ real :: dset1(5) = [199809., 200665., 199607., 200270., 199649.] ~~real :: dist, prob~~ real :: dset2(5) = [522573., 244456., 139979., 71531., 21461.] ~~integer :: dof~~ real :: dist, prob ~~print , "Dataset 1:"~~ integer :: dof ~~print , dset1~~ ~~dist = chi2UniformDistance(dset1)~~ ~~dof = size(dset1) - 1~~ ~~write(, '(A,I4,A,F12.4)') 'dof: ', dof, ' distance: ', dist~~ ~~prob = chi2Probability(dof, dist)~~ ~~write(, '(A,F9.6)') 'probability: ', prob~~ ~~write(, '(A,L)') 'uniform? ', chiIsUniform(dset1, 0.05)~~ write (, '(A)', advance='no') "Dataset 1:" ~~! Lazy copy/past :\|~~ write (, '(5(F12.4,:,1x))') dset1 ~~print , "Dataset 2:"~~ ~~print , dset2~~ ~~dist = chi2UniformDistance(dset2)~~ ~~dof = size(dset2) - 1~~ ~~write(, '(A,I4,A,F12.4)') 'dof: ', dof, ' distance: ', dist~~ ~~prob = chi2Probability(dof, dist)~~ ~~write(, '(A,F9.6)') 'probability: ', prob~~ ~~write(, '(A,L)') 'uniform? ', chiIsUniform(dset2, 0.05)~~ dist = chisq(dset1) ~~contains~~ dof = size(dset1) - 1 write (, '(A,I4,A,F12.4)') 'dof: ', dof, ' chisq: ', dist prob = p_value(dist, dof) write (, '(A,F12.4)') 'probability: ', prob write (, '(A,L)') 'uniform? ', prob > 0.05 ! Lazy copy/past :\| ~~function chi2Probability(dof, distance)~~ write (, '(/A)', advance='no') "Dataset 2:" ~~real :: chi2Probability~~ write (, '(5(F12.4,:,1x))') dset2 ~~integer, intent(in) :: dof~~ ~~real, intent(in) :: distance~~ dist = chisq(dset2) ~~! in order to make this work, we need linking with GSL library~~ dof = size(dset2) - 1 ~~chi2Probability = gsl_sf_gamma_inc(real(0.5dof, c_double), real(0.5distance, c_double))~~ write (, '(A,I4,A,F12.4)') 'dof: ', dof, ' chisq: ', dist ~~end function chi2Probability~~ prob = p_value(dist, dof) write (, '(A,F12.4)') 'probability: ', prob write (, '(A,L)') 'uniform? ', prob > 0.05 contains ~~function chiIsUniform(dset, significance)~~ ~~logical :: chiIsUniform~~ ~~real, dimension(:), intent(in) :: dset~~ ~~real, intent(in) :: significance~~ !> Get chi-square value for a set of data `ds` ~~integer :: dof~~ real ::function ~~dist~~chisq(ds) real, intent(in) :: ds(:) real :: expected, summa ~~dof = size(dset) - 1~~ ~~dist = chi2UniformDistance(dset)~~ ~~chiIsUniform = chi2Probability(dof, dist) > significance~~ ~~end function chiIsUniform~~ ~~function chi2UniformDistance(ds)~~ ~~real :: chi2UniformDistance~~ ~~real, dimension(:), intent(in) :: ds~~ expected = sum(ds)/size(ds) ~~real :: expected, summa = 0.0~~ summa = sum((ds - expected)2) chisq = summa/expected end function chisq ~~expected = sum(ds) / size(ds)~~ ~~summa = sum( (ds - expected) 2 )~~ ~~chi2UniformDistance = summa / expected~~ ~~end function chi2UniformDistance~~ end program ~~ChiTest~~chi2test</~~lang~~syntaxhighlight> Output: <syntaxhighlight lang="txt">Dataset 1: 199809.0000 200665.0000 199607.0000 200270.0000 199649.0000 dof: 4 chisq: 4.1463 probability: 0.3866 uniform? T Dataset 2: 522573.0000 244456.0000 139979.0000 71531.0000 21461.0000 dof: 4 chisq: 790063.2500 probability: 0.0000 uniform? F</syntaxhighlight> =={{header\|Go}}== {{trans\|C}} Go has a nice gamma function in the library. Otherwise, it's mostly a port from C. Note, this implementation of the incomplete gamma function works for these two test cases, but, I believe, has serious limitations. See talk page. <~~lang~~syntaxhighlight lang="go">package main import ( Line 509 ⟶ 864: fmt.Printf(" significant at %2.0f%% level? %t\n", sigLevel100, sig) fmt.Println(" uniform? ", !sig, "\n") }</~~lang~~syntaxhighlight> Output: <pre> Line 534 ⟶ 889: =={{header\|Hy}}== <~~lang~~syntaxhighlight lang="lisp">(import [scipy.stats [chisquare]] [collections [Counter]]) Line 544 ⟶ 899: size 'alpha'." (<= alpha (second (chisquare (.values (Counter (take repeats (repeatedly f))))))))</~~lang~~syntaxhighlight> Examples of use: <~~lang~~syntaxhighlight lang="lisp">(import [random [randint]]) (for [f [ (fn [] (randint 1 10)) (fn [] (if (randint 0 1) (randint 1 9) (randint 1 10)))]] (print (uniform? f 5000)))</~~lang~~syntaxhighlight> =={{header\|J}}== '''Solution (Tacit):''' <~~lang~~syntaxhighlight lang="j">require 'stats/base' countCats=: #@~. NB. counts the number of unique items Line 569 ⟶ 924: NB. y is: distribution to test NB. x is: optionally specify number of categories possible isUniform=: (countCats $: ]) : (0.95 > calcDf chisqcdf :: 1: calcX2)</~~lang~~syntaxhighlight> '''Solution (Explicit):''' <~~lang~~syntaxhighlight lang="j">require 'stats/base' NB.isUniformX v Tests (5%) whether y is uniformly distributed Line 587 ⟶ 942: degfreedom=. <: x NB. degrees of freedom signif > degfreedom chisqcdf :: 1: X2 )</~~lang~~syntaxhighlight> '''Example Usage:''' <~~lang~~syntaxhighlight lang="j"> FairDistrib=: 1e6 ?@$ 5 UnfairDistrib=: (9.5e5 ?@$ 5) , (5e4 ?@$ 4) isUniformX FairDistrib Line 599 ⟶ 954: 1 4 isUniform 4 4 4 5 5 5 5 5 5 5 NB. not uniform if 4 categories possible 0</~~lang~~syntaxhighlight> =={{header\|Java}}== {{trans\|D}} {{works with\|Java\|8}} <~~lang~~syntaxhighlight lang="java">import static java.lang.Math.pow; import java.util.Arrays; import static java.util.Arrays.stream; Line 641 ⟶ 996: } } }</~~lang~~syntaxhighlight> <pre> dof distance probability Uniform? dataset 4 4,146 0,38657083 YES [199809.0, 200665.0, 199607.0, 200270.0, 199649.0] 4 790063,276 0,00000000 NO [522573.0, 244456.0, 139979.0, 71531.0, 21461.0]</pre> =={{header\|jq}}== {{works with\|jq}} '''Also works with gojq, the Go implementation of jq.''' This entry uses a two-tailed test, as is appropriate for this type of problem as illustrated by the last example. The test is based on the assumption that the sample size is large enough for the χ2 distribution to be used. The implementation of `Chi2_cdf` here uses the recursion relation for the gamma function and should be both fast, accurate and quite robust. For an industrial-strength algorithm, see e.g. https://people.sc.fsu.edu/~jburkardt/c_src/asa239/asa239.c '''Generic Functions''' <syntaxhighlight lang=jq> def round($dec): if type == "string" then . else pow(10;$dec) as $m \| . * $m \| floor / $m end; # sum of squares def ss(s): reduce s as $x (0; . + ($x * $x)); # Cumulative density function of the chi-squared distribution with $k # degrees of freedom # The recursion formula for gamma is used for efficiency and robustness. def Chi2_cdf($x; $k): if $x == 0 then 0 elif $x > (1e3 * $k) then 1 else 1e-15 as $tol # for example \| { s: 0, m: 0, term: (1 / ((($k/2)+1)\|gamma)) } \| until (.term\|length < $tol; # length here is abs .s += .term \| .m += 1 \| .term = (($x/2) / (($k/2) + .m )) ) \| .s ( ((-$x/2) + ($k/2)(($x/2)\|log)) \| exp) end ; </syntaxhighlight> '''Tasks''' <syntaxhighlight lang=jq> # Input: array of frequencies def chi2UniformDistance: (add / length) as $expected \| ss(.[] - $expected) / $expected; # Input: a number # Output: an indication of the probability of observing this value or higher # assuming the value is drawn from a chi-squared distribution with $dof degrees # of freedom def chi2Probability($dof): (1 - Chi2_cdf(.; $dof)) \| if . < 1e-10 then "< 1e-10" else . end; # Input: array of frequencies # Output: result of a two-tailed test based on the chi-squared statistic # assuming the sample size is large enough def chiIsUniform($significance): (length - 1) as $dof \| chi2UniformDistance \| Chi2_cdf(.; $dof) as $cdf \| if $cdf then ($significance/2) as $s \| $cdf > $s and $cdf < (1-$s) else false end; def dsets: [ [199809, 200665, 199607, 200270, 199649], [522573, 244456, 139979, 71531, 21461], [19,14,6,18,7,5,1], # low entropy [9,11,9,10,15,11,5], # high entropy [20,20,20] # made-up ]; def task: dsets[] \| "Dataset: \(.)", ( chi2UniformDistance as $dist \| (length - 1) as $dof \| "DOF: \($dof) D (Distance): \($dist)", " Estimated probability of observing a value >= D: \($dist\|chi2Probability($dof)\|round(2))", " Uniform? \( (select(chiIsUniform(0.05)) \| "Yes") // "No" )\n" ) ; task </syntaxhighlight> <pre> Dataset: [199809,200665,199607,200270,199649] DOF: 4 D (Distance): 4.14628 Estimated probability of observing a value >= D: 0.38 Uniform? Yes Dataset: [522573,244456,139979,71531,21461] DOF: 4 D (Distance): 790063.27594 Estimated probability of observing a value >= D: < 1e-10 Uniform? No Dataset: [19,14,6,18,7,5,1] DOF: 6 D (Distance): 29.2 Estimated probability of observing a value >= D: 0 Uniform? No Dataset: [9,11,9,10,15,11,5] DOF: 6 D (Distance): 5.4 Estimated probability of observing a value >= D: 0.49 Uniform? Yes Dataset: [20,20,20] DOF: 2 D (Distance): 0 Estimated probability of observing a value >= D: 1 Uniform? No </pre> =={{header\|Julia}}== <~~lang~~syntaxhighlight lang="julia"># v0.6 using Distributions Line 665 ⟶ 1,135: println("Data:\n$data") println("Hypothesis test: the original population is ", (eqdist(data) ? "" : "not "), "uniform.\n") end</~~lang~~syntaxhighlight> {{out}} Line 679 ⟶ 1,149: =={{header\|Kotlin}}== This program reuses Kotlin code from the [[Gamma function]] and [[Numerical Integration]] tasks but otherwise is a translation of the C entry for this task. <~~lang~~syntaxhighlight lang="scala">// version 1.1.51 typealias Func = (Double) -> Double Line 755 ⟶ 1,225: println(" Uniform? $uniform\n") } }</~~lang~~syntaxhighlight> {{out}} Line 766 ⟶ 1,236: </pre> =={{header\|Mathematica}}/{{header\|Wolfram Language}}== This code explicity assumes a discrete uniform distribution since the chi square test is a poor test choice for continuous distributions and requires Mathematica version 2 or later <~~lang~~syntaxhighlight ~~Mathematica~~lang="mathematica">discreteUniformDistributionQ[data_, {min_Integer, max_Integer}, confLevel_: .05] := If[$VersionNumber >= 8, confLevel <= PearsonChiSquareTest[data, DiscreteUniformDistribution[{min, max}]], Line 775 ⟶ 1,245: GammaRegularized[k/2, 0, v/2] <= 1 - confLevel]] discreteUniformDistributionQ[data_] :=discreteUniformDistributionQ[data, data[[Ordering[data][[{1, -1}]]]]]</~~lang~~syntaxhighlight> code used to create test data requires Mathematica version 6 or later <~~lang~~syntaxhighlight ~~Mathematica~~lang="mathematica">uniformData = RandomInteger[10, 100]; nonUniformData = Total@RandomInteger[10, {5, 100}];</~~lang~~syntaxhighlight> <syntaxhighlight lang ~~Mathematica~~="mathematica">{discreteUniformDistributionQ[uniformData],discreteUniformDistributionQ[nonUniformData]}</~~lang~~syntaxhighlight> {{out}}<pre>{True,False}</pre> =={{header\|Nim}}== {{trans\|Go}} We use the gamma function from the “math” module. To simplify the code, we use also the “lenientops” module which provides mixed operations between floats ane integers. <syntaxhighlight lang="nim">import lenientops, math, stats, strformat, sugar func simpson38(f: (float) -> float; a, b: float; n: int): float = let h = (b - a) / n let h1 = h / 3 var sum = f(a) + f(b) for i in countdown(3 n - 1, 1): if i mod 3 == 0: sum += 2 * f(a + h1 * i) else: sum += 3 * f(a + h1 * i) result = h * sum / 8 func gammaIncQ(a, x: float): float = let aa1 = a - 1 func f(t: float): float = pow(t, aa1) * exp(-t) var y = aa1 let h = 1.5e-2 while f(y) * (x - y) > 2e-8 and y < x: y += 0.4 if y > x: y = x result = 1 - simpson38(f, 0, y, (y / h / gamma(a)).toInt) func chi2ud(ds: openArray[int]): float = let expected = mean(ds) var s = 0.0 for d in ds: let x = d.toFloat - expected s += x * x result = s / expected func chi2p(dof: int; distance: float): float = gammaIncQ(0.5 * dof, 0.5 * distance) const SigLevel = 0.05 proc utest(dset: openArray[int]) = echo "Uniform distribution test" let s = sum(dset) echo " dataset:", dset echo " samples: ", s echo " categories: ", dset.len let dof = dset.len - 1 echo " degrees of freedom: ", dof let dist = chi2ud(dset) echo " chi square test statistic: ", dist let p = chi2p(dof, dist) echo " p-value of test statistic: ", p let sig = p < SigLevel echo &" significant at {int(SigLevel * 100)}% level? {sig}" echo &" uniform? {not sig}\n" for dset in [[199809, 200665, 199607, 200270, 199649], [522573, 244456, 139979, 71531, 21461]]: utest(dset)</syntaxhighlight> {{out}} <pre>Uniform distribution test dataset:[199809, 200665, 199607, 200270, 199649] samples: 1000000 categories: 5 degrees of freedom: 4 chi square test statistic: 4.14628 p-value of test statistic: 0.3865708330827673 significant at 5% level? false uniform? true Uniform distribution test dataset:[522573, 244456, 139979, 71531, 21461] samples: 1000000 categories: 5 degrees of freedom: 4 chi square test statistic: 790063.27594 p-value of test statistic: 2.34864350190378e-11 significant at 5% level? true uniform? false</pre> =={{header\|OCaml}}== This code needs to be compiled with library [http://oandrieu.nerim.net/ocaml/gsl/ gsl.cma]. <~~lang~~syntaxhighlight lang="ocaml">let sqr x = x . x let chi2UniformDistance distrib = Line 818 ⟶ 1,375: [\| 199809; 200665; 199607; 200270; 199649 \|]; [\| 522573; 244456; 139979; 71531; 21461 \|] ]</~~lang~~syntaxhighlight> Output Line 830 ⟶ 1,387: The sample data for the test was taken from [[#Go\|Go]]. <~~lang~~syntaxhighlight lang="parigp">cumChi2(chi2,dof)={ my(g=gamma(dof/2)); incgam(dof/2,chi2/2,g)/g Line 846 ⟶ 1,403: test([199809, 200665, 199607, 200270, 199649]) test([522573, 244456, 139979, 71531, 21461])</~~lang~~syntaxhighlight> =={{header\|Perl}}== {{trans\|~~Perl 6~~Raku}} <~~lang~~syntaxhighlight lang="perl">use List::Util qw(sum reduce); use constant pi => 3.14159265; Line 892 ⟶ 1,449: for $dataset ([199809, 200665, 199607, 200270, 199649], [522573, 244456, 139979, 71531, 21461]) { printf "C2 = %10.3f, p-value = %.3f, uniform = %s\n", chi_squared_test(@$dataset); }</~~lang~~syntaxhighlight> {{out}} <pre>C2 = 4.146, p-value = 0.387, uniform = True C2 = 790063.276, p-value = 0.000, uniform = False</pre> ~~=={{header\|Perl 6}}==~~ ~~For the incomplete gamma function we use a series expansion related to Kummer's confluent hypergeometric function~~ ~~(see http://en.wikipedia.org/wiki/Incomplete_gamma_function#Evaluation_formulae). The gamma function is calculated~~ ~~in closed form, as we only need its value at integers and half integers.~~ ~~<lang perl6>sub incomplete-γ-series($s, $z) {~~ ~~my \numers = $z X* 1..;~~ ~~my \denoms = [\] $s X+ 1..;~~ ~~my $M = 1 + [+] (numers Z/ denoms) ... < 1e-6;~~ ~~$z*$s / $s exp(-$z) * $M;~~ } ~~sub postfix:<!>(Int $n) { [] 2..$n }~~ ~~sub Γ-of-half(Int $n where > 0) {~~ ~~($n %% 2) ?? (($_-1)! given $n div 2)~~ ~~!! ((2$_)! / (4$_ $_!) * sqrt(pi) given ($n-1) div 2);~~ } ~~# degrees of freedom constrained due to numerical limitations~~ ~~sub chi-squared-cdf(Int $k where 1..200, $x where * >= 0) {~~ ~~my $f = $k < 20 ?? 20 !! 10;~~ ~~given $x {~~ ~~when 0 { 0.0 }~~ ~~when * < $k + $fsqrt($k) { incomplete-γ-series($k/2, $x/2) / Γ-of-half($k) }~~ ~~default { 1.0 }~~ } } ~~sub chi-squared-test(@bins, :$significance = 0.05) {~~ ~~my $n = +@bins;~~ ~~my $N = [+] @bins;~~ ~~my $expected = $N / $n;~~ ~~my $chi-squared = [+] @bins.map: { ($^bin - $expected)2 / $expected }~~ ~~my $p-value = 1 - chi-squared-cdf($n-1, $chi-squared);~~ ~~return (:$chi-squared, :$p-value, :uniform($p-value > $significance));~~ } ~~for [< 199809 200665 199607 200270 199649 >],~~ ~~[< 522573 244456 139979 71531 21461 >]~~ ~~-> $dataset~~ { ~~my %t = chi-squared-test($dataset);~~ ~~say 'data: ', $dataset;~~ ~~say "χ² = {%t<chi-squared>}, p-value = {%t<p-value>.fmt('%.4f')}, uniform = {%t<uniform>}";~~ ~~}</lang>~~ ~~{{out}}~~ ~~<pre>data: 199809 200665 199607 200270 199649~~ ~~χ² = 4.14628, p-value = 0.3866, uniform = True~~ ~~data: 522573 244456 139979 71531 21461~~ ~~χ² = 790063.27594, p-value = 0.0000, uniform = False</pre>~~ =={{header\|Phix}}== {{trans\|Go}} <!--<syntaxhighlight lang="phix">(phixonline)--> ~~using gamma() from [[Gamma_function#Phix]]~~ <span style="color: #008080;">with</span> <span style="color: #008080;">javascript_semantics</span> ~~<lang Phix>function f(atom aa1, t)~~ <span style="color: #008080;">function</span> <span style="color: #000000;">f</span><span style="color: #0000FF;">(</span><span style="color: #004080;">atom</span> <span style="color: #000000;">aa1</span><span style="color: #0000FF;">,</span> <span style="color: #000000;">t</span><span style="color: #0000FF;">)</span> ~~return power(t, aa1) exp(-t)~~ <span style="color: #008080;">return</span> <span style="color: #7060A8;">power</span><span style="color: #0000FF;">(</span><span style="color: #000000;">t</span><span style="color: #0000FF;">,</span> <span style="color: #000000;">aa1</span><span style="color: #0000FF;">)</span> <span style="color: #0000FF;"></span> <span style="color: #7060A8;">exp</span><span style="color: #0000FF;">(-</span><span style="color: #000000;">t</span><span style="color: #0000FF;">)</span> ~~end function~~ <span style="color: #008080;">end</span> <span style="color: #008080;">function</span> ~~function simpson38(atom aa1, a, b, integer n)~~ <span style="color: #008080;">function</span> <span style="color: #000000;">simpson38</span><span style="color: #0000FF;">(</span><span style="color: #004080;">atom</span> <span style="color: #000000;">aa1</span><span style="color: #0000FF;">,</span> <span style="color: #000000;">a</span><span style="color: #0000FF;">,</span> <span style="color: #000000;">b</span><span style="color: #0000FF;">,</span> <span style="color: #004080;">integer</span> <span style="color: #000000;">n</span><span style="color: #0000FF;">)</span> ~~atom h := (b - a) / n,~~ <span style="color: #004080;">atom</span> <span style="color: #000000;">h</span> <span style="color: #0000FF;">:=</span> <span style="color: #0000FF;">(</span><span style="color: #000000;">b</span><span style="color: #0000FF;">-</span><span style="color: #000000;">a</span><span style="color: #0000FF;">)/</span><span style="color: #000000;">n</span><span style="color: #0000FF;">,</span> ~~h1 := h / 3,~~ <span style="color: #000000;">h1</span> <span style="color: #0000FF;">:=</span> <span style="color: #000000;">h</span><span style="color: #0000FF;">/</span><span style="color: #000000;">3</span><span style="color: #0000FF;">,</span> ~~tot := f(aa1,a) + f(aa1,b)~~ <span style="color: #000000;">tot</span> <span style="color: #0000FF;">:=</span> <span style="color: #000000;">f</span><span style="color: #0000FF;">(</span><span style="color: #000000;">aa1</span><span style="color: #0000FF;">,</span><span style="color: #000000;">a</span><span style="color: #0000FF;">)</span> <span style="color: #0000FF;">+</span> <span style="color: #000000;">f</span><span style="color: #0000FF;">(</span><span style="color: #000000;">aa1</span><span style="color: #0000FF;">,</span><span style="color: #000000;">b</span><span style="color: #0000FF;">)</span> ~~for j=3n-1 to 1 by -1 do~~ <span style="color: #008080;">for</span> <span style="color: #000000;">j</span><span style="color: #0000FF;">=</span><span style="color: #000000;">3</span><span style="color: #0000FF;"></span><span style="color: #000000;">n</span><span style="color: #0000FF;">-</span><span style="color: #000000;">1</span> <span style="color: #008080;">to</span> <span style="color: #000000;">1</span> <span style="color: #008080;">by</span> <span style="color: #0000FF;">-</span><span style="color: #000000;">1</span> <span style="color: #008080;">do</span> ~~tot += (3-(mod(j,3)=0)) f(aa1,a+h1j)~~ <span style="color: #000000;">tot</span> <span style="color: #0000FF;">+=</span> <span style="color: #0000FF;">(</span><span style="color: #000000;">3</span><span style="color: #0000FF;">-(</span><span style="color: #7060A8;">mod</span><span style="color: #0000FF;">(</span><span style="color: #000000;">j</span><span style="color: #0000FF;">,</span><span style="color: #000000;">3</span><span style="color: #0000FF;">)=</span><span style="color: #000000;">0</span><span style="color: #0000FF;">))</span> <span style="color: #0000FF;"></span> <span style="color: #000000;">f</span><span style="color: #0000FF;">(</span><span style="color: #000000;">aa1</span><span style="color: #0000FF;">,</span><span style="color: #000000;">a</span><span style="color: #0000FF;">+</span><span style="color: #000000;">h1</span><span style="color: #0000FF;"></span><span style="color: #000000;">j</span><span style="color: #0000FF;">)</span> ~~end for~~ <span style="color: #008080;">end</span> <span style="color: #008080;">for</span> ~~return h tot / 8~~ <span style="color: #008080;">return</span> <span style="color: #000000;">h</span><span style="color: #0000FF;"></span><span style="color: #000000;">tot</span><span style="color: #0000FF;">/</span><span style="color: #000000;">8</span> ~~end function~~ <span style="color: #008080;">end</span> <span style="color: #008080;">function</span> <span style="color: #000080;font-style:italic;">--<copy of gamma from [[Gamma_function#Phix]]></span> <span style="color: #004080;">sequence</span> <span style="color: #000000;">c</span> <span style="color: #0000FF;">=</span> <span style="color: #7060A8;">repeat</span><span style="color: #0000FF;">(</span><span style="color: #000000;">0</span><span style="color: #0000FF;">,</span><span style="color: #000000;">12</span><span style="color: #0000FF;">)</span> <span style="color: #008080;">function</span> <span style="color: #000000;">gamma</span><span style="color: #0000FF;">(</span><span style="color: #004080;">atom</span> <span style="color: #000000;">z</span><span style="color: #0000FF;">)</span> <span style="color: #004080;">atom</span> <span style="color: #000000;">accm</span> <span style="color: #0000FF;">=</span> <span style="color: #000000;">c</span><span style="color: #0000FF;">[</span><span style="color: #000000;">1</span><span style="color: #0000FF;">]</span> <span style="color: #008080;">if</span> <span style="color: #000000;">accm</span><span style="color: #0000FF;">=</span><span style="color: #000000;">0</span> <span style="color: #008080;">then</span> <span style="color: #000000;">accm</span> <span style="color: #0000FF;">=</span> <span style="color: #7060A8;">sqrt</span><span style="color: #0000FF;">(</span><span style="color: #000000;">2</span><span style="color: #0000FF;"></span><span style="color: #004600;">PI</span><span style="color: #0000FF;">)</span> <span style="color: #000000;">c</span><span style="color: #0000FF;">[</span><span style="color: #000000;">1</span><span style="color: #0000FF;">]</span> <span style="color: #0000FF;">=</span> <span style="color: #000000;">accm</span> <span style="color: #004080;">atom</span> <span style="color: #000000;">k1_factrl</span> <span style="color: #0000FF;">=</span> <span style="color: #000000;">1</span> <span style="color: #000080;font-style:italic;">-- (k - 1)!(-1)^k with 0!==1</span> <span style="color: #008080;">for</span> <span style="color: #000000;">k</span><span style="color: #0000FF;">=</span><span style="color: #000000;">2</span> <span style="color: #008080;">to</span> <span style="color: #000000;">12</span> <span style="color: #008080;">do</span> <span style="color: #000000;">c</span><span style="color: #0000FF;">[</span><span style="color: #000000;">k</span><span style="color: #0000FF;">]</span> <span style="color: #0000FF;">=</span> <span style="color: #7060A8;">exp</span><span style="color: #0000FF;">(</span><span style="color: #000000;">13</span><span style="color: #0000FF;">-</span><span style="color: #000000;">k</span><span style="color: #0000FF;">)</span><span style="color: #7060A8;">power</span><span style="color: #0000FF;">(</span><span style="color: #000000;">13</span><span style="color: #0000FF;">-</span><span style="color: #000000;">k</span><span style="color: #0000FF;">,</span><span style="color: #000000;">k</span><span style="color: #0000FF;">-</span><span style="color: #000000;">1.5</span><span style="color: #0000FF;">)/</span><span style="color: #000000;">k1_factrl</span> <span style="color: #000000;">k1_factrl</span> <span style="color: #0000FF;">=</span> <span style="color: #0000FF;">-(</span><span style="color: #000000;">k</span><span style="color: #0000FF;">-</span><span style="color: #000000;">1</span><span style="color: #0000FF;">)</span> <span style="color: #008080;">end</span> <span style="color: #008080;">for</span> <span style="color: #008080;">end</span> <span style="color: #008080;">if</span> <span style="color: #008080;">for</span> <span style="color: #000000;">k</span><span style="color: #0000FF;">=</span><span style="color: #000000;">2</span> <span style="color: #008080;">to</span> <span style="color: #000000;">12</span> <span style="color: #008080;">do</span> <span style="color: #000000;">accm</span> <span style="color: #0000FF;">+=</span> <span style="color: #000000;">c</span><span style="color: #0000FF;">[</span><span style="color: #000000;">k</span><span style="color: #0000FF;">]/(</span><span style="color: #000000;">z</span><span style="color: #0000FF;">+</span><span style="color: #000000;">k</span><span style="color: #0000FF;">-</span><span style="color: #000000;">1</span><span style="color: #0000FF;">)</span> <span style="color: #008080;">end</span> <span style="color: #008080;">for</span> <span style="color: #000000;">accm</span> <span style="color: #0000FF;">=</span> <span style="color: #7060A8;">exp</span><span style="color: #0000FF;">(-(</span><span style="color: #000000;">z</span><span style="color: #0000FF;">+</span><span style="color: #000000;">12</span><span style="color: #0000FF;">))</span><span style="color: #7060A8;">power</span><span style="color: #0000FF;">(</span><span style="color: #000000;">z</span><span style="color: #0000FF;">+</span><span style="color: #000000;">12</span><span style="color: #0000FF;">,</span><span style="color: #000000;">z</span><span style="color: #0000FF;">+</span><span style="color: #000000;">0.5</span><span style="color: #0000FF;">)</span> <span style="color: #000080;font-style:italic;">-- Gamma(z+1)</span> <span style="color: #008080;">return</span> <span style="color: #000000;">accm</span><span style="color: #0000FF;">/</span><span style="color: #000000;">z</span> <span style="color: #008080;">end</span> <span style="color: #008080;">function</span> <span style="color: #000080;font-style:italic;">--</copy of gamma></span> <span style="color: #008080;">function</span> <span style="color: #000000;">gammaIncQ</span><span style="color: #0000FF;">(</span><span style="color: #004080;">atom</span> <span style="color: #000000;">a</span><span style="color: #0000FF;">,</span> <span style="color: #000000;">x</span><span style="color: #0000FF;">)</span> ~~function gammaIncQ(atom a, x)~~ <span style="color: #004080;">atom</span> <span style="color: #000000;">aa1</span> <span style="color: #0000FF;">:=</span> <span style="color: #000000;">a</span><span style="color: #0000FF;">-</span><span style="color: #000000;">1</span><span style="color: #0000FF;">,</span> ~~atom aa1 := a - 1,~~ <span style="color: #000000;">y</span> <span style="color: #0000FF;">:=</span> <span style="color: #000000;">aa1</span><span style="color: #0000FF;">,</span> ~~y := aa1,~~ <span style="color: #000000;">h</span> <span style="color: #0000FF;">:=</span> <span style="color: #000000;">1.5e-2</span> ~~h := 1.5e-2~~ <span style="color: #008080;">while</span> <span style="color: #000000;">f</span><span style="color: #0000FF;">(</span><span style="color: #000000;">aa1</span><span style="color: #0000FF;">,</span><span style="color: #000000;">y</span><span style="color: #0000FF;">)(</span><span style="color: #000000;">x</span><span style="color: #0000FF;">-</span><span style="color: #000000;">y</span><span style="color: #0000FF;">)</span> <span style="color: #0000FF;">></span> <span style="color: #000000;">2e-8</span> <span style="color: #008080;">and</span> <span style="color: #000000;">y</span> <span style="color: #0000FF;"><</span> <span style="color: #000000;">x</span> <span style="color: #008080;">do</span> ~~while f(aa1,y)(x-y) > 2e-8 and y < x do~~ <span style="color: #000000;">y</span> <span style="color: #0000FF;">+=</span> <span style="color: #000000;">0.4</span> ~~y += 0.4~~ <span style="color: #008080;">end</span> <span style="color: #008080;">while</span> ~~end while~~ <span style="color: #008080;">if</span> <span style="color: #000000;">y</span> <span style="color: #0000FF;">></span> <span style="color: #000000;">x</span> <span style="color: #008080;">then</span> <span style="color: #000000;">y</span> <span style="color: #0000FF;">=</span> <span style="color: #000000;">x</span> <span style="color: #008080;">end</span> <span style="color: #008080;">if</span> ~~if y > x then y = x end if~~ <span style="color: #008080;">return</span> <span style="color: #000000;">1</span> <span style="color: #0000FF;">-</span> <span style="color: #000000;">simpson38</span><span style="color: #0000FF;">(</span><span style="color: #000000;">aa1</span><span style="color: #0000FF;">,</span><span style="color: #000000;">0</span><span style="color: #0000FF;">,</span><span style="color: #000000;">y</span><span style="color: #0000FF;">,</span><span style="color: #7060A8;">floor</span><span style="color: #0000FF;">(</span><span style="color: #000000;">y</span><span style="color: #0000FF;">/</span><span style="color: #000000;">h</span><span style="color: #0000FF;">/</span><span style="color: #000000;">gamma</span><span style="color: #0000FF;">(</span><span style="color: #000000;">a</span><span style="color: #0000FF;">)))</span> ~~return 1 - simpson38(aa1, 0, y, floor(y/h/gamma(a)))~~ <span style="color: #008080;">end</span> <span style="color: #008080;">function</span> ~~end function~~ <span style="color: #008080;">function</span> <span style="color: #000000;">chi2ud</span><span style="color: #0000FF;">(</span><span style="color: #004080;">sequence</span> <span style="color: #000000;">ds</span><span style="color: #0000FF;">)</span> ~~function chi2ud(sequence ds)~~ <span style="color: #004080;">atom</span> <span style="color: #000000;">expected</span> <span style="color: #0000FF;">=</span> <span style="color: #7060A8;">sum</span><span style="color: #0000FF;">(</span><span style="color: #000000;">ds</span><span style="color: #0000FF;">)/</span><span style="color: #7060A8;">length</span><span style="color: #0000FF;">(</span><span style="color: #000000;">ds</span><span style="color: #0000FF;">),</span> ~~atom expected = sum(ds)/length(ds),~~ <span style="color: #000000;">tot</span> <span style="color: #0000FF;">=</span> <span style="color: #7060A8;">sum</span><span style="color: #0000FF;">(</span><span style="color: #7060A8;">sq_power</span><span style="color: #0000FF;">(</span><span style="color: #7060A8;">sq_sub</span><span style="color: #0000FF;">(</span><span style="color: #000000;">ds</span><span style="color: #0000FF;">,</span><span style="color: #000000;">expected</span><span style="color: #0000FF;">),</span><span style="color: #000000;">2</span><span style="color: #0000FF;">))</span> ~~tot = sum(sq_power(sq_sub(ds,expected),2))~~ <span style="color: #008080;">return</span> <span style="color: #000000;">tot</span><span style="color: #0000FF;">/</span><span style="color: #000000;">expected</span> ~~return tot / expected~~ <span style="color: #008080;">end</span> <span style="color: #008080;">function</span> ~~end function~~ <span style="color: #008080;">function</span> <span style="color: #000000;">chi2p</span><span style="color: #0000FF;">(</span><span style="color: #004080;">integer</span> <span style="color: #000000;">dof</span><span style="color: #0000FF;">,</span> <span style="color: #004080;">atom</span> <span style="color: #000000;">distance</span><span style="color: #0000FF;">)</span> ~~function chi2p(integer dof, atom distance)~~ <span style="color: #008080;">return</span> <span style="color: #000000;">gammaIncQ</span><span style="color: #0000FF;">(</span><span style="color: #000000;">0.5</span><span style="color: #0000FF;"></span><span style="color: #000000;">dof</span><span style="color: #0000FF;">,</span><span style="color: #000000;">0.5</span><span style="color: #0000FF;"></span><span style="color: #000000;">distance</span><span style="color: #0000FF;">)</span> ~~return gammaIncQ(0.5dof, 0.5distance)~~ <span style="color: #008080;">end</span> <span style="color: #008080;">function</span> ~~end function~~ <span style="color: #008080;">constant</span> <span style="color: #000000;">sigLevel</span> <span style="color: #0000FF;">=</span> <span style="color: #000000;">0.05</span> ~~constant sigLevel = 0.05~~ ~~constant tf = {"true","false"}~~ <span style="color: #008080;">procedure</span> <span style="color: #000000;">utest</span><span style="color: #0000FF;">(</span><span style="color: #004080;">sequence</span> <span style="color: #000000;">dset</span><span style="color: #0000FF;">)</span> <span style="color: #7060A8;">printf</span><span style="color: #0000FF;">(</span><span style="color: #000000;">1</span><span style="color: #0000FF;">,</span><span style="color: #008000;">"Uniform distribution test\n"</span><span style="color: #0000FF;">)</span> ~~procedure utest(sequence dset)~~ <span style="color: #004080;">integer</span> <span style="color: #000000;">tot</span> <span style="color: #0000FF;">=</span> <span style="color: #7060A8;">sum</span><span style="color: #0000FF;">(</span><span style="color: #000000;">dset</span><span style="color: #0000FF;">),</span> ~~printf(1,"Uniform distribution test\n")~~ <span style="color: #000000;">dof</span> <span style="color: #0000FF;">:=</span> <span style="color: #7060A8;">length</span><span style="color: #0000FF;">(</span><span style="color: #000000;">dset</span><span style="color: #0000FF;">)-</span><span style="color: #000000;">1</span> ~~integer tot = sum(dset)~~ <span style="color: #004080;">atom</span> <span style="color: #000000;">dist</span> <span style="color: #0000FF;">:=</span> <span style="color: #000000;">chi2ud</span><span style="color: #0000FF;">(</span><span style="color: #000000;">dset</span><span style="color: #0000FF;">),</span> ~~printf(1," dataset:%s\n",{sprint(dset)})~~ <span style="color: #000000;">p</span> <span style="color: #0000FF;">:=</span> <span style="color: #000000;">chi2p</span><span style="color: #0000FF;">(</span><span style="color: #000000;">dof</span><span style="color: #0000FF;">,</span> <span style="color: #000000;">dist</span><span style="color: #0000FF;">)</span> ~~printf(1," samples: %d\n", tot)~~ <span style="color: #004080;">bool</span> <span style="color: #000000;">sig</span> <span style="color: #0000FF;">:=</span> <span style="color: #000000;">p</span> <span style="color: #0000FF;"><</span> <span style="color: #000000;">sigLevel</span> ~~printf(1," categories: %d\n", length(dset))~~ <span style="color: #7060A8;">printf</span><span style="color: #0000FF;">(</span><span style="color: #000000;">1</span><span style="color: #0000FF;">,</span><span style="color: #008000;">" dataset: %v\n"</span><span style="color: #0000FF;">,{</span><span style="color: #000000;">dset</span><span style="color: #0000FF;">})</span> <span style="color: #7060A8;">printf</span><span style="color: #0000FF;">(</span><span style="color: #000000;">1</span><span style="color: #0000FF;">,</span><span style="color: #008000;">" samples: %d\n"</span><span style="color: #0000FF;">,</span> <span style="color: #000000;">tot</span><span style="color: #0000FF;">)</span> ~~integer dof := length(dset) - 1~~ <span style="color: #7060A8;">printf</span><span style="color: #0000FF;">(</span><span style="color: #000000;">1</span><span style="color: #0000FF;">,</span><span style="color: #008000;">" categories: %d\n"</span><span style="color: #0000FF;">,</span> <span style="color: #7060A8;">length</span><span style="color: #0000FF;">(</span><span style="color: #000000;">dset</span><span style="color: #0000FF;">))</span> ~~printf(1," degrees of freedom: %d\n", dof)~~ <span style="color: #7060A8;">printf</span><span style="color: #0000FF;">(</span><span style="color: #000000;">1</span><span style="color: #0000FF;">,</span><span style="color: #008000;">" degrees of freedom: %d\n"</span><span style="color: #0000FF;">,</span> <span style="color: #000000;">dof</span><span style="color: #0000FF;">)</span> <span style="color: #7060A8;">printf</span><span style="color: #0000FF;">(</span><span style="color: #000000;">1</span><span style="color: #0000FF;">,</span><span style="color: #008000;">" chi square test statistic: %g\n"</span><span style="color: #0000FF;">,</span> <span style="color: #000000;">dist</span><span style="color: #0000FF;">)</span> ~~atom dist := chi2ud(dset)~~ <span style="color: #7060A8;">printf</span><span style="color: #0000FF;">(</span><span style="color: #000000;">1</span><span style="color: #0000FF;">,</span><span style="color: #008000;">" p-value of test statistic: %g\n"</span><span style="color: #0000FF;">,</span> <span style="color: #000000;">p</span><span style="color: #0000FF;">)</span> ~~printf(1," chi square test statistic: %g\n", dist)~~ <span style="color: #7060A8;">printf</span><span style="color: #0000FF;">(</span><span style="color: #000000;">1</span><span style="color: #0000FF;">,</span><span style="color: #008000;">" significant at %.0f%% level? %t\n"</span><span style="color: #0000FF;">,</span> <span style="color: #0000FF;">{</span><span style="color: #000000;">sigLevel</span><span style="color: #0000FF;"></span><span style="color: #000000;">100</span><span style="color: #0000FF;">,</span> <span style="color: #000000;">sig</span><span style="color: #0000FF;">})</span> <span style="color: #7060A8;">printf</span><span style="color: #0000FF;">(</span><span style="color: #000000;">1</span><span style="color: #0000FF;">,</span><span style="color: #008000;">" uniform? %t\n"</span><span style="color: #0000FF;">,</span><span style="color: #008080;">not</span> <span style="color: #000000;">sig</span><span style="color: #0000FF;">)</span> ~~atom p := chi2p(dof, dist)~~ <span style="color: #008080;">end</span> <span style="color: #008080;">procedure</span> ~~printf(1," p-value of test statistic: %g\n", p)~~ <span style="color: #000000;">utest</span><span style="color: #0000FF;">({</span><span style="color: #000000;">199809</span><span style="color: #0000FF;">,</span> <span style="color: #000000;">200665</span><span style="color: #0000FF;">,</span> <span style="color: #000000;">199607</span><span style="color: #0000FF;">,</span> <span style="color: #000000;">200270</span><span style="color: #0000FF;">,</span> <span style="color: #000000;">199649</span><span style="color: #0000FF;">})</span> ~~bool sig := p < sigLevel~~ <span style="color: #000000;">utest</span><span style="color: #0000FF;">({</span><span style="color: #000000;">522573</span><span style="color: #0000FF;">,</span> <span style="color: #000000;">244456</span><span style="color: #0000FF;">,</span> <span style="color: #000000;">139979</span><span style="color: #0000FF;">,</span> <span style="color: #000000;">71531</span><span style="color: #0000FF;">,</span> <span style="color: #000000;">21461</span><span style="color: #0000FF;">})</span> ~~printf(1," significant at %2.0f%% level? %s\n", {sigLevel100, tf[2-sig]})~~ <!--</syntaxhighlight>--> ~~printf(1," uniform? %s\n",{tf[sig+1]})~~ ~~end procedure~~ ~~utest({199809, 200665, 199607, 200270, 199649})~~ ~~utest({522573, 244456, 139979, 71531, 21461})</lang>~~ {{out}} <pre> Uniform distribution test dataset: {199809,200665,199607,200270,199649} samples: 1000000 categories: 5 degrees of freedom: 4 chi square test statistic: 4.14628 p-value of test statistic: 0.386571 significant at 5% level? false uniform? true Uniform distribution test dataset: {522573,244456,139979,71531,21461} samples: 1000000 categories: 5 degrees of freedom: 4 chi square test statistic: 790063 p-value of test statistic: 2.35282e-11 significant at 5% level? true uniform? false </pre> =={{header\|Python}}== ===Python: Using only standard libraries=== Implements the Chi Square Probability function with an integration. I'm sure there are better ways to do this. Compare to OCaml implementation. <~~lang~~syntaxhighlight lang="python">import math import random Line 1,105 ⟶ 1,630: prob = chi2Probability( dof, distance) print "probability: %.4f"%prob, print "uniform? ", "Yes"if chi2IsUniform(ds,0.05) else "No"</~~lang~~syntaxhighlight> Output: <pre>Data set: [199809, 200665, 199607, 200270, 199649] Line 1,111 ⟶ 1,636: Data set: [522573, 244456, 139979, 71531, 21461] dof: 4 distance: 790063.275940 probability: 0.0000 uniform? No</pre> ===Python: Using scipy=== This uses the library routine [https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.chisquare.html scipy.stats.chisquare]. <syntaxhighlight lang="python">from scipy.stats import chisquare if __name__ == '__main__': dataSets = [[199809, 200665, 199607, 200270, 199649], [522573, 244456, 139979, 71531, 21461]] print(f"{'Distance':^12} {'pvalue':^12} {'Uniform?':^8} {'Dataset'}") for ds in dataSets: dist, pvalue = chisquare(ds) uni = 'YES' if pvalue > 0.05 else 'NO' print(f"{dist:12.3f} {pvalue:12.8f} {uni:^8} {ds}")</syntaxhighlight> {{out}} <pre> Distance pvalue Uniform? Dataset 4.146 0.38657083 YES [199809, 200665, 199607, 200270, 199649] 790063.276 0.00000000 NO [522573, 244456, 139979, 71531, 21461]</pre> =={{header\|R}}== R being a statistical computating language, the chi-squared test is built in with the function "chisq.test" <~~lang~~syntaxhighlight lang="tcl"> dset1=c(199809,200665,199607,200270,199649) dset2=c(522573,244456,139979,71531,21461) Line 1,127 ⟶ 1,672: print(paste("uniform?",chi2IsUniform(ds))) } </syntaxhighlight> ~~</lang>~~ Output: Line 1,150 ⟶ 1,695: =={{header\|Racket}}== <~~lang~~syntaxhighlight lang="racket"> #lang racket (require Line 1,191 ⟶ 1,736: ; Test whether the constant generator fails: (is-uniform? (λ(_) 5) 1000 0.05) </syntaxhighlight> ~~</lang>~~ Output: <~~lang~~syntaxhighlight lang="racket"> #t #f </syntaxhighlight> ~~</lang>~~ =={{header\|Raku}}== (formerly Perl 6) For the incomplete gamma function we use a series expansion related to Kummer's confluent hypergeometric function (see http://en.wikipedia.org/wiki/Incomplete_gamma_function#Evaluation_formulae). The gamma function is calculated in closed form, as we only need its value at integers and half integers. <syntaxhighlight lang="raku" line>sub incomplete-γ-series($s, $z) { my \numers = $z X* 1..; my \denoms = [\] $s X+ 1..; my $M = 1 + [+] (numers Z/ denoms) ... < 1e-6; $z*$s / $s exp(-$z) * $M; } sub postfix:<!>(Int $n) { [] 2..$n } sub Γ-of-half(Int $n where > 0) { ($n %% 2) ?? (($_-1)! given $n div 2) !! ((2$_)! / (4$_ $_!) * sqrt(pi) given ($n-1) div 2); } # degrees of freedom constrained due to numerical limitations sub chi-squared-cdf(Int $k where 1..200, $x where * >= 0) { my $f = $k < 20 ?? 20 !! 10; given $x { when 0 { 0.0 } when * < $k + $fsqrt($k) { incomplete-γ-series($k/2, $x/2) / Γ-of-half($k) } default { 1.0 } } } sub chi-squared-test(@bins, :$significance = 0.05) { my $n = +@bins; my $N = [+] @bins; my $expected = $N / $n; my $chi-squared = [+] @bins.map: { ($^bin - $expected)2 / $expected } my $p-value = 1 - chi-squared-cdf($n-1, $chi-squared); return (:$chi-squared, :$p-value, :uniform($p-value > $significance)); } for [< 199809 200665 199607 200270 199649 >], [< 522573 244456 139979 71531 21461 >] -> $dataset { my %t = chi-squared-test($dataset); say 'data: ', $dataset; say "χ² = {%t<chi-squared>}, p-value = {%t<p-value>.fmt('%.4f')}, uniform = {%t<uniform>}"; }</syntaxhighlight> {{out}} <pre>data: 199809 200665 199607 200270 199649 χ² = 4.14628, p-value = 0.3866, uniform = True data: 522573 244456 139979 71531 21461 χ² = 790063.27594, p-value = 0.0000, uniform = False</pre> =={{header\|REXX}}== Line 1,210 ⟶ 1,809: either an integer,   or a number which is a multiple of   <big>'''<sup>1</sup>/<sub>2</sub>'''</big>,   both of these cases can be calculated with <br>a straight─forward calculation. <~~lang~~syntaxhighlight lang="rexx">/REXX program performs a chi─squared test to verify a given distribution is uniform. / numeric digits length( pi() ) - length(.) /enough decimal digs for calculations./ @.=; @.1= 199809 200665 199607 200270 199649 Line 1,280 ⟶ 1,879: say pad "significant at " sigPC'% level? ' word('no yes', sig + 1) say pad " is the dataset uniform? " word('no yes', (\(sig))+ 1) return</~~lang~~syntaxhighlight> {{out\|output\|text=  when using the default inputs:}} <pre> Line 1,302 ⟶ 1,901: =={{header\|Ruby}}== {{trans\|Python}} <~~lang~~syntaxhighlight lang="ruby">def gammaInc_Q(a, x) a1, a2 = a-1, a-2 f0 = lambda {\|t\| ta1 Math.exp(-t)} Line 1,362 ⟶ 1,961: puts " probability: %.4f" % chi2Probability(dof, distance) puts " uniform? %s" % (chi2IsUniform(ds) ? "Yes" : "No") end</~~lang~~syntaxhighlight> {{out}} Line 1,376 ⟶ 1,975: probability: -0.0000 uniform? No </pre> =={{header\|Rust}}== <syntaxhighlight lang="rust"> use statrs::function::gamma::gamma_li; fn chi_distance(dataset: &[u32]) -> f64 { let expected = f64::from(dataset.iter().sum::<u32>()) / dataset.len() as f64; dataset .iter() .fold(0., \|acc, &elt\| acc + (elt as f64 - expected).powf(2.)) / expected } fn chi2_probability(dof: f64, distance: f64) -> f64 { 1. - gamma_li(dof * 0.5, distance * 0.5) } fn chi2_uniform(dataset: &[u32], significance: f64) -> bool { let d = chi_distance(&dataset); chi2_probability(dataset.len() as f64 - 1., d) > significance } fn main() { let dsets = vec![ vec![199809, 200665, 199607, 200270, 199649], vec![522573, 244456, 139979, 71531, 21461], ]; for ds in dsets { println!("Data set: {:?}", ds); let d = chi_distance(&ds); print!("Distance: {:.6} ", d); print!( "Chi2 probability: {:.6} ", chi2_probability(ds.len() as f64 - 1., d) ); print!("Uniform? {}\n", chi2_uniform(&ds, 0.05)); } } </syntaxhighlight> {{out}} <pre> Data set: [199809, 200665, 199607, 200270, 199649] Distance: 4.146280 Chi2 probability: 0.386571 Uniform? true Data set: [522573, 244456, 139979, 71531, 21461] Distance: 790063.275940 Chi2 probability: 0.000000 Uniform? false </pre> Line 1,383 ⟶ 2,030: {{libheader\|Scastie qualified}} {{works with\|Scala\|2.13}} <~~lang~~syntaxhighlight ~~Scala~~lang="scala">import org.apache.commons.math3.special.Gamma.regularizedGammaQ object ChiSquare extends App { Line 1,412 ⟶ 2,059: dof, dist, χ2Prob(dof.toDouble, dist), if (χ2IsUniform(ds, 0.05)) "YES" else "NO", ds.mkString(", ")) } }</~~lang~~syntaxhighlight> =={{header\|Sidef}}== <~~lang~~syntaxhighlight lang="ruby"># Confluent hypergeometric function of the first kind F_1(a;b;z) func F1(a, b, z, limit=100) { sum(0..limit, {\|k\| Line 1,455 ⟶ 2,103: say "data: #{dataset}" say "χ² = #{r[0]}, p-value = #{r[1].round(-4)}, uniform = #{r[2]}\n" }</~~lang~~syntaxhighlight> {{out}} <pre> Line 1,468 ⟶ 2,116: {{works with\|Tcl\|8.5}} {{tcllib\|math::statistics}} <~~lang~~syntaxhighlight lang="tcl">package require Tcl 8.5 package require math::statistics Line 1,482 ⟶ 2,130: [expr {$degreesOfFreedom / 2.0}] [expr {$X2 / 2.0}]] expr {$likelihoodOfRandom > $significance} }</~~lang~~syntaxhighlight> Testing: <~~lang~~syntaxhighlight lang="tcl">proc makeDistribution {operation {count 1000000}} { for {set i 0} {$i<$count} {incr i} {incr distribution([uplevel 1 $operation])} return [array get distribution] Line 1,492 ⟶ 2,140: puts "distribution \"$distFair\" assessed as [expr [isUniform $distFair]?{fair}:{unfair}]" set distUnfair [makeDistribution {expr int(rand()rand()5)}] puts "distribution \"$distUnfair\" assessed as [expr [isUniform $distUnfair]?{fair}:{unfair}]"</~~lang~~syntaxhighlight> Output: <pre>distribution "0 199809 4 199649 1 200665 2 199607 3 200270" assessed as fair Line 1,499 ⟶ 2,147: =={{header\|VBA}}== The built in worksheetfunction ChiSq_Dist of Excel VBA is used. Output formatted like R. <~~lang~~syntaxhighlight lang="vb">Private Function Test4DiscreteUniformDistribution(ObservationFrequencies() As Variant, Significance As Single) As Boolean 'Returns true if the observed frequencies pass the Pearson Chi-squared test at the required significance level. Dim Total As Long, Ei As Long, i As Integer Line 1,528 ⟶ 2,176: O = [{522573,244456,139979,71531,21461}] Debug.Print "[1] ""Uniform? "; Test4DiscreteUniformDistribution(O, 0.05); """" End Sub</~~lang~~syntaxhighlight> {{out}<pre>[1] "Data set:" 199809 200665 199607 200270 199649 Chi-squared test for given frequencies Line 1,537 ⟶ 2,185: X-squared = 790063.27594 , df = 4 , p-value = 0.0000 [1] "Uniform? False"</pre> =={{header\|V (Vlang)}}== {{trans\|Go}} <syntaxhighlight lang="v (vlang)">import math type Ifctn = fn(f64) f64 fn simpson38(f Ifctn, a f64, b f64, n int) f64 { h := (b - a) / f64(n) h1 := h / 3 mut sum := f(a) + f(b) for j := 3n - 1; j > 0; j-- { if j%3 == 0 { sum += 2 f(a+h1f64(j)) } else { sum += 3 f(a+h1f64(j)) } } return h sum / 8 } fn gamma_inc_q(a f64, x f64) f64 { aa1 := a - 1 f := Ifctn(fn[aa1](t f64) f64 { return math.pow(t, aa1) * math.exp(-t) }) mut y := aa1 h := 1.5e-2 for f(y)(x-y) > 2e-8 && y < x { y += .4 } if y > x { y = x } return 1 - simpson38(f, 0, y, int(y/h/math.gamma(a))) } fn chi2ud(ds []int) f64 { mut sum, mut expected := 0.0,0.0 for d in ds { expected += f64(d) } expected /= f64(ds.len) for d in ds { x := f64(d) - expected sum += x x } return sum / expected } fn chi2p(dof int, distance f64) f64 { return gamma_inc_q(.5f64(dof), .5distance) } const sig_level = .05 fn main() { for dset in [ [199809, 200665, 199607, 200270, 199649], [522573, 244456, 139979, 71531, 21461], ] { utest(dset) } } fn utest(dset []int) { println("Uniform distribution test") mut sum := 0 for c in dset { sum += c } println(" dataset: $dset") println(" samples: $sum") println(" categories: $dset.len") dof := dset.len - 1 println(" degrees of freedom: $dof") dist := chi2ud(dset) println(" chi square test statistic: $dist") p := chi2p(dof, dist) println(" p-value of test statistic: $p") sig := p < sig_level println(" significant at ${sig_level100:2.0f}% level? $sig") println(" uniform? ${!sig}\n") }</syntaxhighlight> {{out}} <pre> Uniform distribution test dataset: [199809 200665 199607 200270 199649] samples: 1000000 categories: 5 degrees of freedom: 4 chi square test statistic: 4.14628 p-value of test statistic: 0.3865708330827673 significant at 5% level? false uniform? true Uniform distribution test dataset: [522573 244456 139979 71531 21461] samples: 1000000 categories: 5 degrees of freedom: 4 chi square test statistic: 790063.27594 p-value of test statistic: 2.3528290427066167e-11 significant at 5% level? true uniform? false </pre> =={{header\|Wren}}== {{trans\|Kotlin}} {{libheader\|Wren-math}} {{libheader\|Wren-fmt}} <syntaxhighlight lang="wren">import "./math" for Math, Nums import "./fmt" for Fmt var integrate = Fn.new { \|a, b, n, f\| var h = (b - a) / n var sum = 0 for (i in 0...n) { var x = a + ih sum = sum + (f.call(x) + 4 * f.call(x + h/2) + f.call(x + h)) / 6 } return sum * h } var gammaIncomplete = Fn.new { \|a, x\| var am1 = a - 1 var f0 = Fn.new { \|t\| t.pow(am1) * (-t).exp } var h = 1.5e-2 var y = am1 while ((f0.call(y) * (x - y) > 2e-8) && y < x) y = y + 0.4 if (y > x) y = x return 1 - integrate.call(0, y, (y/h).truncate, f0) / Math.gamma(a) } var chi2UniformDistance = Fn.new { \|ds\| var expected = Nums.mean(ds) var sum = Nums.sum(ds.map { \|d\| (d - expected).pow(2) }.toList) return sum / expected } var chi2Probability = Fn.new { \|dof, dist\| gammaIncomplete.call(0.5dof, 0.5dist) } var chiIsUniform = Fn.new { \|ds, significance\| var dof = ds.count - 1 var dist = chi2UniformDistance.call(ds) return chi2Probability.call(dof, dist) > significance } var dsets = [ [199809, 200665, 199607, 200270, 199649], [522573, 244456, 139979, 71531, 21461] ] for (ds in dsets) { System.print("Dataset: %(ds)") var dist = chi2UniformDistance.call(ds) var dof = ds.count - 1 Fmt.write("DOF: $d Distance: $.4f", dof, dist) var prob = chi2Probability.call(dof, dist) Fmt.write(" Probability: $.6f", prob) var uniform = chiIsUniform.call(ds, 0.05) ? "Yes" : "No" System.print(" Uniform? %(uniform)\n") }</syntaxhighlight> {{out}} <pre> Dataset: [199809, 200665, 199607, 200270, 199649] DOF: 4 Distance: 4.1463 Probability: 0.386571 Uniform? Yes Dataset: [522573, 244456, 139979, 71531, 21461] DOF: 4 Distance: 790063.2759 Probability: 0.000000 Uniform? No </pre> =={{header\|zkl}}== {{trans\|C}} {{trans\|D}} <~~lang~~syntaxhighlight lang="zkl">/* Numerical integration method */ fcn Simpson3_8(f,a,b,N){ // fcn,double,double,Int --> double h,h1:=(b - a)/N, h/3.0; Line 1,576 ⟶ 2,400: if(y>x) y=x; 1.0 - Simpson3_8(f,0.0,y,(y/h).toInt())/Gamma_Spouge(a); }</~~lang~~syntaxhighlight> <~~lang~~syntaxhighlight lang="zkl">fcn chi2UniformDistance(ds){ // --> double dslen :=ds.len(); expected:=dslen.reduce('wrap(sum,k){ sum + ds[k] },0.0)/dslen; Line 1,588 ⟶ 2,412: fcn chiIsUniform(dset,significance=0.05){ significance < chi2Probability(-1.0 + dset.len(),chi2UniformDistance(dset)) }</~~lang~~syntaxhighlight> <~~lang~~syntaxhighlight lang="zkl">datasets:=T( T(199809.0, 200665.0, 199607.0, 200270.0, 199649.0), T(522573.0, 244456.0, 139979.0, 71531.0, 21461.0) ); println(" %4s %12s %12s %8s %s".fmt( Line 1,600 ⟶ 2,424: dof, dist, prob, chiIsUniform(ds) and "YES" or "NO", ds.concat(","))); }</~~lang~~syntaxhighlight> {{out}} <pre>