nuomin 发表于 2014-3-15 10:52 
既然是测试那么就知道真实值。用预测值和真实值的差得到偏误。再对偏误的方差进行比较,小的那一个当然是好 ...
20个测试函数的真实值都是零,单纯比较算法的平均误差和方差这种,编辑说方法太简单了,不够高上大。所以他要求参考这样的检验方式:
---------------------------------------------------------------------------------------------------------------
Table 5 Results of the Iman-Davenport’s test of MA-SSW-Chains
and the reference algorithms for dimensions 50, 100, 200, and 500
Dimension Iman-Davenport value Critical value Significant differences?
50 5.232034 2.46 Yes
100 3.256133 2.46 Yes
200 2.707457 2.46 Yes
500 2.820250 2.46 Yes
---------------------------------------------------------------------------------------------------------
table5中使用的这个Iman‘s Davenport test是Friedman test的改进。话说这个critical value哪里来的?
-----------------------------------------------------------------------------------------------------------
Table 6 Comparison of DE (control algorithm) with CHC, IPOPCMAES,
and MA-SSW-Chains (Holm’s test)
Dimension Algorithm z pvalue a/i Sig.differences?
50 CHC 4.461 8.17E-06 0.017 Yes
IPOP-CMAES 2.827 4.69E-03 0.025 Yes
MA-SSW-Chains 0.251 8.02E-01 0.050 No
100 CHC 4.837 1.31E-6 0.017 Yes
IPOP-CMAES 3.08 2.08E-3 0.025 Yes
MA-SSW-Chains 0.628 0.53 0.050 No
200 CHC 4.963 6.93E-7 0.017 Yes
IPOP-CMAES 3.079 2.08E-3 0.025 Yes
MA-SSW-Chains 1.005 0.31 0.050 No
500 CHC 5.780 7.46E-9 0.017 Yes
IPOP-CMAES 2.890 3.85E-3 0.025 Yes
MA-SSW-Chains 1.634 0.102 0.050 No
-----------------------------------------------------------------------------------------------------------------------------
table6中使用DE作为控制算法,因为DE的average ranking最好。这里的z难道是转换过来的可以用来查标准正态分布表的z值?那p值查表来的?这精度也太精准了,难道是写代码求出来的不是用spss求的吧?