如题,在stata14中使用两个功能相似的命令,得到五个变量的均值,但两个命令结果不同,这是怎么回事?
差异主要体现在第一个变量income上面,其余变量结果相同。其中,分类变量income5的产生规则如下:
sort income, stable ; gen income5 = group(5)
. *考虑加权
. table income5 [aw = swgt], contents( n f2004 mean income mean f2004 mean f2019 mean f2020) for
> mat(%10.4f)
note: cellwidth too small, variable name truncated;
to increase cellwidth, specify cellwidth(#)
----------------------------------------------------------------------
income5 | N(f2004) mean(inco) mean(f200) mean(f201) mean(f202)
----------+-----------------------------------------------------------
1 | 455.0000 6422.1641 871.7755 705.9392 410.1433
2 | 455.0000 12501.9678 529.7763 368.6875 125.9335
3 | 455.0000 18381.3379 536.0304 383.8629 181.2390
4 | 455.0000 26965.7910 795.5706 571.8908 355.5436
5 | 455.0000 85581.7266 1537.1392 689.3633 337.6386
----------------------------------------------------------------------
. tabstat income f2004 f2019 f2020 [aw = swgt], by(income5) long format(%10.4f) nototal
income5 stats | income f2004 f2019 f2020
------------------+----------------------------------------
1 mean | 6422.1640 871.7755 705.9392 410.1433
2 mean |12501.9680 529.7763 368.6875 125.9335
3 mean |18381.3385 536.0304 383.8629 181.2390
4 mean |26965.7915 795.5706 571.8908 355.5436
5 mean |85581.7297 1537.1392 689.3633 337.6386
-----------------------------------------------------------