Statistical testing of Non-random assignment

Chapter 4 Testing non-random assignment:

4.4 Evidence regarding non-random assignment

4.4.2 Statistical testing of Non-random assignment

Even though the graphical evidence supports the assumption that pupils are randomly assigned to classrooms in the Chilean school system, we test statistically how consistent this hypothesis is for our two selected groups of schools. As in the graphical approach, we compare real classes with the two types of counterfactual classes, based on the previous year's school marks. If students are randomly assigned to classes, there should be no statistically significant differences between real classes and random (RDM) counterfactual classes, while we expect to find statistically significant differences between real classes and perfectly sorted (SRT) counterfactuals.
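To make the two counterfactuals concrete, the following is a minimal sketch of how RDM and SRT classes could be built for one grade from previous-year marks. It is an illustration under stated assumptions, not the procedure used in this chapter: the function name `make_counterfactuals`, the choice to keep the real class sizes, the descending sort order, and the toy marks are all hypothetical.

```python
import random


def make_counterfactuals(marks_by_class, seed=0):
    """Build RDM and SRT counterfactual classes for one grade.

    marks_by_class maps a class label (e.g. "4A") to a list of
    previous-year marks. Both counterfactuals keep the real class
    sizes, which is an assumption of this sketch.
    """
    labels = sorted(marks_by_class)
    sizes = [len(marks_by_class[label]) for label in labels]
    pooled = [m for label in labels for m in marks_by_class[label]]

    # RDM: shuffle the pooled grade cohort, then cut it into classes.
    shuffled = pooled[:]
    random.Random(seed).shuffle(shuffled)

    # SRT: rank the cohort by prior marks (highest first), then cut.
    ranked = sorted(pooled, reverse=True)

    def cut(values):
        classes, start = {}, 0
        for label, size in zip(labels, sizes):
            classes[label] = values[start:start + size]
            start += size
        return classes

    return cut(shuffled), cut(ranked)


# Toy marks (hypothetical data, not taken from the study).
grade4 = {"4A": [5.1, 6.2, 4.8, 6.5], "4B": [5.9, 4.5, 6.0, 5.5]}
rdm, srt = make_counterfactuals(grade4)
print(srt)
```

In the sorted counterfactual, the first class receives the highest-ranked pupils and the second class the remainder, whereas the random counterfactual only permutes the same cohort.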

The comparisons between real classes and their counterfactuals are carried out with two statistical tests: (i) a t-test, which measures the mean difference between real classrooms and counterfactuals; and (ii) a Kolmogorov-Smirnov (KS) test, which assesses the statistical difference between the distributions.5 We test for “Sorting Evidence” (SoE) within each school for every possible comparison, given the type of counterfactual, the type of statistical test, and the previous school performance measure used to create the RDM and SRT counterfactuals. In total, we have 12 independent non-random assignment measures per school, represented by the Non-Random Assignment (Non-RA) Indexes (1) to (12), as described in Table 4.16.
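A single comparison between a real class and a counterfactual class can be sketched as follows, using only the Python standard library. Two simplifications are assumed here and are not part of the original analysis: the mean-difference test uses a normal approximation to the Welch statistic instead of Student's t distribution, and the KS p-value uses the asymptotic Kolmogorov series. The function names are hypothetical; in practice a statistics package would normally be used.

```python
import math
from bisect import bisect_right
from statistics import NormalDist, mean, variance


def mean_diff_pvalue(x, y):
    """Two-sided p-value for H0: mean difference = 0.

    Welch statistic with a normal approximation (a simplification of
    the t-test; assumes at least one sample has non-zero variance).
    """
    se = math.sqrt(variance(x) / len(x) + variance(y) / len(y))
    z = (mean(x) - mean(y)) / se
    return 2 * (1 - NormalDist().cdf(abs(z)))


def ks_pvalue(x, y, terms=100):
    """Asymptotic two-sided p-value for the two-sample KS test."""
    xs, ys = sorted(x), sorted(y)
    n, m = len(xs), len(ys)
    # D is the largest gap between the two empirical CDFs.
    d = max(
        abs(bisect_right(xs, v) / n - bisect_right(ys, v) / m)
        for v in xs + ys
    )
    lam = math.sqrt(n * m / (n + m)) * d
    if lam == 0:
        return 1.0
    series = sum(
        (-1) ** (j - 1) * math.exp(-2 * j * j * lam * lam)
        for j in range(1, terms + 1)
    )
    return min(1.0, max(0.0, 2 * series))


def sorting_evidence(real, counterfactual, test="t", alpha=0.05):
    """Return 1 if H0 of no difference is rejected at level alpha, else 0."""
    pvalue = mean_diff_pvalue if test == "t" else ks_pvalue
    return int(pvalue(real, counterfactual) < alpha)


# Toy marks (hypothetical data).
real = [5.0, 5.2, 5.4, 5.6, 5.8, 6.0, 6.2, 6.4]
shifted = [v - 2 for v in real]
print(sorting_evidence(real, shifted, "t"), sorting_evidence(real, shifted, "ks"))
print(sorting_evidence(real, real, "t"), sorting_evidence(real, real, "ks"))
```

Each call returns the 0/1 contribution of one comparison to the SoE; a class compared with an identical copy of itself yields 0 under both tests.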

Depending on the SoE observed in each school for every type of comparison, the Non-RA Index classifies schools into five levels of non-random assignment: None, Low, Medium, Med-High, and High. We then count the number of schools in each category and analyse their distribution for both groups of schools.
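The classification and tallying step can be sketched as below. The cut-offs that separate the five levels are not stated in this passage, so the values used here (shares of the maximum SoE of 0.25, 0.50 and 0.75) are purely illustrative assumptions, as are the function name `non_ra_level` and the example scores.

```python
from collections import Counter

LEVELS = ["None", "Low", "Medium", "Med-High", "High"]


def non_ra_level(soe, max_soe, cutoffs=(0.25, 0.50, 0.75)):
    """Map a school's SoE score to one of the five levels.

    The cutoffs are hypothetical shares of the maximum SoE score;
    they are not the thresholds used in the source.
    """
    if soe == 0:
        return "None"
    share = soe / max_soe
    if share <= cutoffs[0]:
        return "Low"
    if share <= cutoffs[1]:
        return "Medium"
    if share <= cutoffs[2]:
        return "Med-High"
    return "High"


# Hypothetical SoE scores for six Group 1 schools (maximum score 8).
scores = [0, 0, 1, 4, 8, 6]
tally = Counter(non_ra_level(s, max_soe=8) for s in scores)
for level in LEVELS:
    print(level, tally[level])
```

Passing `max_soe=12` instead would apply the same illustrative rule to Group 2 schools.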

We compute the SoE to construct the Non-RA Indexes for all schools in Group 1 and Group 2. Table 4.17 shows how the SoE is estimated for Non-RA Indexes (1) to (6) in Group 1. Here, the counterfactual class used is the RDM, which is tested with either the t-test or the KS-test. For every Index, the maximum possible SoE is 8, meaning that there is evidence of non-random assignment in all eight possible comparisons for this group of schools. Similarly, Table 4.18 presents the maximum SoE that can be reached in Group 2 when we compare real classes with RDM counterfactuals. Here, there are four more possible comparisons, so the maximum SoE score is 12.

5 The Kolmogorov-Smirnov test is considered the most appropriate test for comparing distributions (Gibbons and Chakraborti (2011)). The comparison criterion is stricter than that of a t-test, as we now compare the whole distribution instead of just the mean.

Table 4.16: Non-random assignment (Non-RA) Indexes per school

Type of Counterfactual Class   Type of Statistical Test   Performance measure   Non-RA Index
Random (RDM)                   T-test                     GPA                   Non-RA Index (1)
                                                          Language              Non-RA Index (2)
                                                          Maths                 Non-RA Index (3)
                               KS-test                    GPA                   Non-RA Index (4)
                                                          Language              Non-RA Index (5)
                                                          Maths                 Non-RA Index (6)
Perfectly Sorted (SRT)         T-test                     GPA                   Non-RA Index (7)
                                                          Language              Non-RA Index (8)
                                                          Maths                 Non-RA Index (9)
                               KS-test                    GPA                   Non-RA Index (10)
                                                          Language              Non-RA Index (11)
                                                          Maths                 Non-RA Index (12)

Note: (i) In total, we have 12 independent Non-Random Assignment measures (from 12 Non-RA Indexes) per group of schools (Group 1, Group 2). (ii) There are two categories of artificially created counterfactual classes: Random (RDM) and Perfectly Sorted (SRT) counterfactuals. (iii) SRT counterfactuals can be created based on GPA, Language, or Maths school marks. (iv) To compare real classes with counterfactual classes, we apply two statistical tests: the T-test and the KS-test.
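The numbering of the 12 indexes follows from crossing the two counterfactual types, the two tests, and the three performance measures. A short sketch, with variable names chosen only for illustration, reproduces that enumeration:

```python
from itertools import product

counterfactuals = ["RDM", "SRT"]
tests = ["T-test", "KS-test"]
measures = ["GPA", "Language", "Maths"]

# product() varies the last factor fastest, which matches an ordering
# where indexes (1) to (6) use RDM and indexes (7) to (12) use SRT.
non_ra_indexes = dict(
    enumerate(product(counterfactuals, tests, measures), start=1)
)

for number, (counterfactual, test, measure) in non_ra_indexes.items():
    print(f"Non-RA Index ({number}): {counterfactual}, {test}, {measure}")
```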

Table 4.17: Potential cases of non-random assignment tested with RDM counterfactuals

Group 1

Real Class   Counterfactual Class   Hypothesis test                   Sorting Evidence (SoE)   Condition
4A           4A-RDM                 Ho: Mean difference = 0           1                        If t-test (ks-test) rejects Ho
                                    (Ho: Diff. in distribution = 0)   0                        If t-test (ks-test) does not reject Ho
             4B-RDM                 Ho: Mean difference = 0           1                        If t-test (ks-test) rejects Ho
                                    (Ho: Diff. in distribution = 0)   0                        If t-test (ks-test) does not reject Ho
4B           4A-RDM                 Ho: Mean difference = 0           1                        If t-test (ks-test) rejects Ho
                                    (Ho: Diff. in distribution = 0)   0                        If t-test (ks-test) does not reject Ho
             4B-RDM                 Ho: Mean difference = 0           1                        If t-test (ks-test) rejects Ho
                                    (Ho: Diff. in distribution = 0)   0                        If t-test (ks-test) does not reject Ho
5A           5A-RDM                 Ho: Mean difference = 0           1                        If t-test (ks-test) rejects Ho
                                    (Ho: Diff. in distribution = 0)   0                        If t-test (ks-test) does not reject Ho
             5B-RDM                 Ho: Mean difference = 0           1                        If t-test (ks-test) rejects Ho
                                    (Ho: Diff. in distribution = 0)   0                        If t-test (ks-test) does not reject Ho
5B           5A-RDM                 Ho: Mean difference = 0           1                        If t-test (ks-test) rejects Ho
                                    (Ho: Diff. in distribution = 0)   0                        If t-test (ks-test) does not reject Ho
             5B-RDM                 Ho: Mean difference = 0           1                        If t-test (ks-test) rejects Ho
                                    (Ho: Diff. in distribution = 0)   0                        If t-test (ks-test) does not reject Ho

Maximum SoE Score: 8

Non-RA Indexes (1) - (6)

Notes: (i) For schools in Group 1, the potential maximum evidence of non-random assignment is 8. (ii) Every real class within a school is compared with the two random counterfactual classes per grade (A-RDM, B-RDM). (iii) We use two statistical tests: the T-test to compare means, and the KS-test to compare distributions. (iv) We claim there is sorting evidence (SoE) in a particular comparison when the Null Hypothesis (Ho) of no difference between the classes is rejected at the 5% significance level (for both the t-test and the ks-test). (v) In the Hypothesis test column, the Ho in brackets refers to the ks-test.
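The SoE score of a Group 1 school for one index is the number of rejections across the eight comparisons. The sketch below enumerates those comparisons and sums the rejections at the 5% level; the p-values are invented for illustration and the names `soe_score` and `comparisons` are hypothetical.

```python
from itertools import product

ALPHA = 0.05  # significance level used for both tests


def soe_score(pvalues, alpha=ALPHA):
    """Count the comparisons in which H0 of no difference is rejected."""
    return sum(int(p < alpha) for p in pvalues.values())


# Each real class (A, B) in grades 4 and 5 is compared with both
# random counterfactual classes of the same grade: 2 x 2 x 2 = 8.
comparisons = [
    (f"{grade}{real}", f"{grade}{cf}-RDM")
    for grade, real, cf in product((4, 5), "AB", "AB")
]

# Hypothetical p-values, one per comparison (not results from the study).
pvalues = dict(
    zip(comparisons, [0.40, 0.03, 0.62, 0.01, 0.55, 0.20, 0.04, 0.71])
)

score = soe_score(pvalues)
print(f"SoE score: {score} out of {len(comparisons)}")
```

A school for which every comparison rejects Ho would reach the maximum score of 8.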

Table 4.18: Potential cases of non-random assignment tested with RDM counterfactuals

Group 2

Real Class   Counterfactual Class   Hypothesis test                   Sorting Evidence (SoE)   Condition