Coursera|Coursera Inferential Statistics 笔记 CourseraInferentialStatistics笔记

【Coursera|Coursera Inferential Statistics 笔记】上完 Basic Statistics 趁热打铁把 Inferential Statistics 也学习了。和上期一样把一些遇到的问题记录一下。
Inferential Statistics 是在 Coursera - Methods and Statistics in Social Sciences Specialization - University of Amsterdam 里接着 Basic Statistics 的一门课程，这门课我打四星，难度加大也更深入。老师讲得没有 Basic Statistics 细致，有些地方需要自己另外找资料弄明白。而且视频总是字幕和图像对不上，只能当练听力了。
Wilcoxon 秩和检验 - test statistic in R Quiz 6 里有一题是这样的

文章图片
Image from: Coursera Inferential Statistics - Quiz 6 求

sum of ranks in Wayne group
sum of ranks in Astrophysics group
test statistic in Wilcoxon rank sum test.

第一第二个问题可以手算
rank sum Wayne = 19
rank sum Astrophysics = 17
理论上来说当每组样本量相同时， test statistic 为二者中较小的一个，也就是17. 然后我就发现用R计算的W值并非 test statistic.
代码如下：

rating <- c(2.5, 7.4, 7.2, 6.5, 8.0, 5.5, 3.2, 6.2) group <- c('wayne', 'wayne', 'wayne', 'wayne', 'astro', 'astro', 'astro', 'astro') wilcox.test(rating ~ group)

输出结果为

Wilcoxon rank sum test data:rating by group W = 7, p-value = https://www.it610.com/article/0.8857 alternative hypothesis: true location shift is not equal to 0

其中W代表的是什么呢？
四处搜了一下，在这个论坛里找到解释
Wilcoxon秩-和检验结果的解读中遇到问题
这里的W是W统计量：第1组的秩和减去第1组秩和的数学期望值 n * (n+1)/2.

在R环境中，W = R - n * (n+1)/2.

所以本题中 W = 17 - 4 * (4+1)/2 = 7. test statistic = 17 并非7.
当然我觉得这个题目有点tricky, 因为在曼惠特尼U检验中刚刚代码里输出的 W 值就是 U 值（Wilcoxon-Mann-Whitney as an alternative to the t-test）。
这两个检验方式的关系是：
Wilcoxon rank sum test = Mann Whitney U test
在 R 里面 U 检验也是用的 wilcox.test(...) 的语句，和Wilcoxon 符号检验的函数是一样的，只是参数不太一样（Is the W statistic output by wilcox.test() in R the same as the U statistic?）。