Evaluation of Multiscale-SSIM (MS-SSIM) Lin Zhang, Dept. Computing, The Hong Kong Polytechnic University |
Introduction
Multiscale-SSIM (MS-SSIM) index is an extension of SSIM. It performs much better than its single-scale counterpart. It is presented by Dr. Zhou Wang et al. on 2004 [1].
Source Code
There are different implementations for MS-SSIM. In our previous papers, we used the version provided in the package MeTriX MuX. MeTriX MuX was also adopted to evaluate IQA metrics by the TID2008 creators. LIVE team also provides their version. However, these two versions are both slightly different from Dr. Zhou Wang's implementation. To make the evaluations consistent with Dr. Wang, we now use his implementation which can be downloaded here https://ece.uwaterloo.ca/~z70wang/research/iwssim/msssim.zip.
Usage Notes
1. Do not change the default parameter settings.
2. MS-SSIM can only deal with gray-scale images and the luminance range is [0, 255]. So, for color images, before calling MS-SSIM, you need to convert it to [0, 255] gray-scale version. Usually, this can be accomplished by the Matlab routine rgb2gray.
Evaluation Results
The results (in Matlab .mat format) are provided here. Each result file contains a n by 2 matrix, where n denotes the number of distorted images in the database. The first column is the MS-SSIM values, and the second column is the mos/dmos values provided by the database. For example, you can use the following matlab code to calculate the SROCC and KROCC values for MS-SSIM values obtained on the TID2008 database:
%%%%%%%%%%%%%%%
matData
= load('MSSSIMOnTID.mat');
MSSSIMOnTID = matData.MSSSIMOnTID;
MSSSIM_TID_SROCC = corr(MSSSIMOnTID(:,1), MSSSIMOnTID(:,2), 'type', 'spearman');
MSSSIM_TID_KROCC = corr(MSSSIMOnTID(:,1), MSSSIMOnTID(:,2), 'type', 'kendall');
%%%%%%%%%%%%%%%
The source codes to calculate the PLCC and RMSE are also provided for each database. This needs a nonlinear regression procedure which is dependant on the initialization of the parameters. We try to adjust the parameters to get a high PLCC value. For different databases, the parameter initialization may be different. The nonlinear fitting function is of the form as described in [2].
Evaluation results of MS-SSIM on seven databases are given below. Besides, for each evaluation metric, we present its weighted-average value over all the testing datasets; and the weight for each database is set as the number of distorted images in that dataset.
Database |
Results |
Nonlinear fitting code | SROCC | KROCC | PLCC | RMSE |
TID2013 |
NonlinearFittingTID2013 | 0.7859 | 0.6047 | 0.8329 | 0.6861 | |
TID2008 |
NonlinearFittingTID | 0.8542 | 0.6568 | 0.8451 | 0.7173 | |
CSIQ |
NonlinearFittingCSIQ | 0.9133 | 0.7393 | 0.8991 | 0.1149 | |
LIVE |
NonlinearFittingLIVE | 0.9513 | 0.8045 | 0.9489 | 8.6188 | |
IVC |
NonlinearFittingIVC | 0.8980 | 0.7203 | 0.9108 | 0.5029 | |
Toyama-MICT |
0.8874 |
0.7029 |
0.8927 |
0.5640 |
||
A57 |
0.8414 |
0.6478 |
0.8603 |
0.1253 |
||
WIQ |
0.7495 |
0.5740 |
0.8095 |
13.4486 |
||
Weighted-Average |
|
Reference
[1] Z. Wang, E.P. Simoncelli, and A. C. Bovik, "Multi-scale structural similarity for image quality assessment", IEEE Asilomar Conf. Signals, Systems, and Computers, pp. 1398-1402, 2003.
[2] H.R. Sheikh, M.F. Sabir, and A.C. Bovik, "A statistical evaluation of recent full reference image quality assessment algorithms", IEEE Trans. on Image Processing, vol. 15, no. 11, pp. 3440-3451, 2006.
Created on: May 08, 2011
Last update: Dec. 02, 2013