IEEE Signal Processing - July 2018 - 134

y = [y 1, y 2, f, y N ] T ! R N # 1, where N
is the number of images in the database
and supposing that the DMOS for the
images in the database is denoted as
z = [z 1, z 2, f, z N ] T ! R N # 1, then the
two residuals between the DMOS and
the two nonlinearly mapped scores
are d x = z - x and d y = z - y.
Sheikh et al. [7] assumed that d x =
[d x1, d x2, f, d xN ] T ! R N # 1 a nd d y =
[d y1, d y2, f, d yN ] T ! R N # 1 are the samples drawn from Gaussian distributions
with zero means. Therefore, testing
whether d x and d y are from the same
Gaussian distribution becomes testing
whether their population variances v 2x
and v 2y are the same.
The F-test is adopted in [7] to test the
equality of variances. The null hypothesis H 0 is v 2x v 2y = 1, and the alternative
hypothesis H 1 is v 2x v 2y ! 1. Given the
two sets of samples d x and d y, the F-test
statistic is calculated as
F=

s 2x
,
s 2y

(1)

where s 2x = (1/ (N -1)) R iN= 1 (d xi - drx) 2 and
s 2y = (1/ (N -1)) R iN= 1 (d yi - dry)2 a r e t h e
sample variances, and drx = (1/N ) R iN= 1 d xi
and dr y = (1/N ) R iN= 1 d yi are the sample
means. The F-test statistic has an F distribution with N - 1 and N - 1 degrees
of freedom. Therefore, the test conclusion
is drawn by comparing the value of F in
(1) with the critical values of an F distribution with N - 1 and N - 1 degrees of
freedom. Given the significance level a
(which usually takes values of 1, 5, or
10%), the null hypothesis is rejected if
F 2 F1 - a/2, N - 1, N - 1 or F 1 Fa/2, N - 1, N - 1,
where F1 - a/2, N - 1, N - 1 and Fa/2, N - 1, N - 1
are the critical values. Then, we conclude that d x and d y are from different Gaussian distributions. Otherwise, if
Fa/2, N - 1, N - 1 # F # F1 - a/2, N - 1, N - 1, we
do not reject the null hypothesis, and the
conclusion is that d x and d y are from the
same Gaussian distribution.

Why is the F-test unsuitable for
comparing IQA metrics?
The F-test assumes that d x and d y are
two independent samples from Gaussian populations. However, d x and d y
may be correlated because the samples
134

are paired; one sample d xi in d x is
uniquely paired with one sample d yi in
d y because they are calculated on the
same ith image. Such samples are called
paired samples in statistics. If the two
IQA metrics are both well designed,
their scores both decrease as the degree
of degradation increases in the same
image. Such correlations between scores
may also render the residuals d x and d y
as correlated. Empirical evidence of the
correlation between residuals is provided later in experimental results. Therefore, the conclusion drawn from the
F-test of whether d x and d y are from
the same distribution can be unreliable.

The Pitman test as a solution
In statistics, a hypothesis test for paired
samples is usually different from that for
independent samples. For example, the
t-test is used to test the equality of means
for independent samples, whereas the
paired t-test is used for paired samples.
For evaluating the equality of variances,
the Pitman test is designed for correlated
samples [9], [11], [12].
Here, we introduce the Pitman test to
examine the equality of variances for the
residuals of two IQA metrics. The null
hypothesis H 0 is v 2x = v 2y, and the alternative hypothesis H 1 is v 2x ! v 2y . The
Pitman test statistic is calculated as
t=

` 1 - s 2x s 2y j N - 2
4 (1 - r 2) ` s 2x s 2y j

,

(2)

where
N

r=

/ ^d xi - drxh^d yi - dryh

i=1
N

N

/ ^d xi - drxh2 / ^d yi - dryh2

i=1

i=1

(3)

is the Pearson correlation coefficient
between the two sets of samples d x and
d y . It is clear that in (2) the correlation
r is considered in the test statistic. The
Pitman test statistic exhibits a Student's
t distribution with N - 2 degrees of
freedom.
Similar to that in the F-test, the test
conclusion is drawn by comparing the
value of t in (2) with the critical values
of a t distribution with N - 2 degrees
of freedom.
IEEE Signal Processing Magazine

|

July 2018

|

The F-test versus the Pitman test
In Figure 1, we illustrate the use of the
F-test and the Pitman test in comparing
IQA metrics. Two IQA metrics M x and
M y are applied to the same IQA database, providing two residuals d x and d y,
respectively. In the comparison of d x and
d y by using the F-test, two assumptions
are applied: 1) independence between d x
and d y and 2) normality of d x and d y, as
shown in Figure 1(a). By contrast, when
the Pitman test is used, the only assumption is the normality of d x and d y, as
shown in Figure 1(b). Given that d x and
d y are paired and correlated, the Pitman
test is more appropriate to test the equality of variances than the F-test.

Experimental results
In the following experiments, we aim to
test whether d x and d y are from Gaussian distributions with the same variances
on the LIVE database. We show that different conclusions can be drawn from the
F-test and the Pitman test.
Following the experiments in [7],
all experiments are performed on five
types of degradations (JPEG2000, JPEG,
Gaussian noise, Gaussian blur, and fastfading wireless) separately and then on
the overall database.
We compare the scores of the following seven IQA metrics: FSIM [5], GSM
[6], most apparent distortion (MAD) [13],
MS-SSIM [3], NQM [1], peak signal-tonoise ratio (PSNR), and SSIM [2]. All
scores and their nonlinearly mapped
scores are obtained from http://sse.tongji
.edu.cn/linzhang/IQA/IQA.htm. The significance levels of the F-test and the Pitman test are both set to 5%.

Are two conclusions different?
The results show that, for all types of
degradations and the overall database,
the F-test does not always produce the
same conclusion as that by the Pitman
test regarding whether two IQA metrics are statistically significantly different. Here, we show two examples on
the overall database and the Gaussian
noises in Table 1.
A total of 21 pairs of IQA metrics are
compared in the experiments. For the
overall database, we obtain 19 same conclusions and two different conclusions


http://sse.tongji.edu.cn/linzhang/IQA/IQA.htm http://sse.tongji.edu.cn/linzhang/IQA/IQA.htm

Table of Contents for the Digital Edition of IEEE Signal Processing - July 2018

Contents
IEEE Signal Processing - July 2018 - Cover1
IEEE Signal Processing - July 2018 - Cover2
IEEE Signal Processing - July 2018 - Contents
IEEE Signal Processing - July 2018 - 2
IEEE Signal Processing - July 2018 - 3
IEEE Signal Processing - July 2018 - 4
IEEE Signal Processing - July 2018 - 5
IEEE Signal Processing - July 2018 - 6
IEEE Signal Processing - July 2018 - 7
IEEE Signal Processing - July 2018 - 8
IEEE Signal Processing - July 2018 - 9
IEEE Signal Processing - July 2018 - 10
IEEE Signal Processing - July 2018 - 11
IEEE Signal Processing - July 2018 - 12
IEEE Signal Processing - July 2018 - 13
IEEE Signal Processing - July 2018 - 14
IEEE Signal Processing - July 2018 - 15
IEEE Signal Processing - July 2018 - 16
IEEE Signal Processing - July 2018 - 17
IEEE Signal Processing - July 2018 - 18
IEEE Signal Processing - July 2018 - 19
IEEE Signal Processing - July 2018 - 20
IEEE Signal Processing - July 2018 - 21
IEEE Signal Processing - July 2018 - 22
IEEE Signal Processing - July 2018 - 23
IEEE Signal Processing - July 2018 - 24
IEEE Signal Processing - July 2018 - 25
IEEE Signal Processing - July 2018 - 26
IEEE Signal Processing - July 2018 - 27
IEEE Signal Processing - July 2018 - 28
IEEE Signal Processing - July 2018 - 29
IEEE Signal Processing - July 2018 - 30
IEEE Signal Processing - July 2018 - 31
IEEE Signal Processing - July 2018 - 32
IEEE Signal Processing - July 2018 - 33
IEEE Signal Processing - July 2018 - 34
IEEE Signal Processing - July 2018 - 35
IEEE Signal Processing - July 2018 - 36
IEEE Signal Processing - July 2018 - 37
IEEE Signal Processing - July 2018 - 38
IEEE Signal Processing - July 2018 - 39
IEEE Signal Processing - July 2018 - 40
IEEE Signal Processing - July 2018 - 41
IEEE Signal Processing - July 2018 - 42
IEEE Signal Processing - July 2018 - 43
IEEE Signal Processing - July 2018 - 44
IEEE Signal Processing - July 2018 - 45
IEEE Signal Processing - July 2018 - 46
IEEE Signal Processing - July 2018 - 47
IEEE Signal Processing - July 2018 - 48
IEEE Signal Processing - July 2018 - 49
IEEE Signal Processing - July 2018 - 50
IEEE Signal Processing - July 2018 - 51
IEEE Signal Processing - July 2018 - 52
IEEE Signal Processing - July 2018 - 53
IEEE Signal Processing - July 2018 - 54
IEEE Signal Processing - July 2018 - 55
IEEE Signal Processing - July 2018 - 56
IEEE Signal Processing - July 2018 - 57
IEEE Signal Processing - July 2018 - 58
IEEE Signal Processing - July 2018 - 59
IEEE Signal Processing - July 2018 - 60
IEEE Signal Processing - July 2018 - 61
IEEE Signal Processing - July 2018 - 62
IEEE Signal Processing - July 2018 - 63
IEEE Signal Processing - July 2018 - 64
IEEE Signal Processing - July 2018 - 65
IEEE Signal Processing - July 2018 - 66
IEEE Signal Processing - July 2018 - 67
IEEE Signal Processing - July 2018 - 68
IEEE Signal Processing - July 2018 - 69
IEEE Signal Processing - July 2018 - 70
IEEE Signal Processing - July 2018 - 71
IEEE Signal Processing - July 2018 - 72
IEEE Signal Processing - July 2018 - 73
IEEE Signal Processing - July 2018 - 74
IEEE Signal Processing - July 2018 - 75
IEEE Signal Processing - July 2018 - 76
IEEE Signal Processing - July 2018 - 77
IEEE Signal Processing - July 2018 - 78
IEEE Signal Processing - July 2018 - 79
IEEE Signal Processing - July 2018 - 80
IEEE Signal Processing - July 2018 - 81
IEEE Signal Processing - July 2018 - 82
IEEE Signal Processing - July 2018 - 83
IEEE Signal Processing - July 2018 - 84
IEEE Signal Processing - July 2018 - 85
IEEE Signal Processing - July 2018 - 86
IEEE Signal Processing - July 2018 - 87
IEEE Signal Processing - July 2018 - 88
IEEE Signal Processing - July 2018 - 89
IEEE Signal Processing - July 2018 - 90
IEEE Signal Processing - July 2018 - 91
IEEE Signal Processing - July 2018 - 92
IEEE Signal Processing - July 2018 - 93
IEEE Signal Processing - July 2018 - 94
IEEE Signal Processing - July 2018 - 95
IEEE Signal Processing - July 2018 - 96
IEEE Signal Processing - July 2018 - 97
IEEE Signal Processing - July 2018 - 98
IEEE Signal Processing - July 2018 - 99
IEEE Signal Processing - July 2018 - 100
IEEE Signal Processing - July 2018 - 101
IEEE Signal Processing - July 2018 - 102
IEEE Signal Processing - July 2018 - 103
IEEE Signal Processing - July 2018 - 104
IEEE Signal Processing - July 2018 - 105
IEEE Signal Processing - July 2018 - 106
IEEE Signal Processing - July 2018 - 107
IEEE Signal Processing - July 2018 - 108
IEEE Signal Processing - July 2018 - 109
IEEE Signal Processing - July 2018 - 110
IEEE Signal Processing - July 2018 - 111
IEEE Signal Processing - July 2018 - 112
IEEE Signal Processing - July 2018 - 113
IEEE Signal Processing - July 2018 - 114
IEEE Signal Processing - July 2018 - 115
IEEE Signal Processing - July 2018 - 116
IEEE Signal Processing - July 2018 - 117
IEEE Signal Processing - July 2018 - 118
IEEE Signal Processing - July 2018 - 119
IEEE Signal Processing - July 2018 - 120
IEEE Signal Processing - July 2018 - 121
IEEE Signal Processing - July 2018 - 122
IEEE Signal Processing - July 2018 - 123
IEEE Signal Processing - July 2018 - 124
IEEE Signal Processing - July 2018 - 125
IEEE Signal Processing - July 2018 - 126
IEEE Signal Processing - July 2018 - 127
IEEE Signal Processing - July 2018 - 128
IEEE Signal Processing - July 2018 - 129
IEEE Signal Processing - July 2018 - 130
IEEE Signal Processing - July 2018 - 131
IEEE Signal Processing - July 2018 - 132
IEEE Signal Processing - July 2018 - 133
IEEE Signal Processing - July 2018 - 134
IEEE Signal Processing - July 2018 - 135
IEEE Signal Processing - July 2018 - 136
IEEE Signal Processing - July 2018 - 137
IEEE Signal Processing - July 2018 - 138
IEEE Signal Processing - July 2018 - 139
IEEE Signal Processing - July 2018 - 140
IEEE Signal Processing - July 2018 - Cover3
IEEE Signal Processing - July 2018 - Cover4
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_201809
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_201807
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_201805
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_201803
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_201801
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_1117
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0917
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0717
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0517
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0317
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0117
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_1116
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0916
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0716
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0516
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0316
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0116
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_1115
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0915
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0715
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0515
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0315
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0115
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_1114
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0914
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0714
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0514
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0314
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0114
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_1113
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0913
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0713
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0513
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0313
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0113
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_1112
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0912
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0712
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0512
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0312
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0112
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_1111
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0911
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0711
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0511
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0311
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0111
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_1110
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0910
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0710
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0510
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0310
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0110
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_1109
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0909
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0709
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0509
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0309
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0109
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_1108
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0908
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0708
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0508
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0308
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0108
https://www.nxtbookmedia.com