现在有模拟数据:
| sample | x_1 | x_2 |
| 1 | 1 | 50 |
| 1 | 1 | 41 |
| 1 | 3 | 38 |
| 1 | 4 | 60 |
| 1 | 10 | 65 |
| 1 | 3 | 71 |
| 1 | 2 | 21 |
| 1 | 2 | 30 |
| 1 | 1 | 40 |
| 1 | 6 | 66 |
找出sample里面top5(出现次数最多的前5个)x_1的x_2的平均值。如x_1=1,meanx_2=(50+41+40)/3
理想输出结果:
| sample | top5x_1 | meanx_2 |
| 1 | 1 | (50+41+40)/3 |
| 1 | 2 | (21+30)/2 |
| 1 | 3 | (71+38)/2 |
| 1 | 4 | 60 |
| 1 | 6 | 66 |
各位这东西能用stata处理么TT?