| Title  | Dealing with reports of repeated time values within panel |
| Author | Nicholas J. Cox, Durham University, UK, and Michael Mulcahy, University of Connecticut |
| Date   | December 2005 |
Question

I have panel data. I want to exploit the power of tsset (see [TS] tsset), but when I type

. tsset id time

I get the report

repeated time values within panel
r(451);

What should I do next?
Answer

Panel data are defined by an identifier variable and a time variable. Each combination of identifier and time should occur at most once. That is, any such combination might appear either once or not at all, as gaps are allowed in panel data. The report of "repeated time values within panel" is thus serious, as Stata is unable to proceed with any commands that depend upon your data being accepted as panel data.
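For concreteness, here is a minimal sketch of data that tsset accepts; the variable names id and time follow the question above, and the values are invented for illustration. Note that the gap at time 3 in panel 1 is perfectly acceptable:

. clear
. input id time
  1 1
  1 2
  1 4
  2 1
  2 2
  end
. tsset id time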
Two common reactions to this report are to suppose that it cannot be true, as you know you have panel data, or that there must be a bug or at least a misunderstanding here. In our experience, the problem will, on closer inspection, be found embedded in the dataset. Here we discuss various methods for approaching it. The underlying idea is that knowing several ways of going further is much better than knowing none. All the methods discussed are also applicable to other problems.
1. Do identifier and time uniquely identify the data?

Observations in panel data are uniquely identified by the combination of identifier and time. Thus isid may be used to check this, for example,

. isid id time

With isid, no news is good news. However, if the variables specified do not jointly identify the data, an error message will appear.
The logic of isid may be implemented in other ways. At its heart is an operation

. bysort id time: assert _N == 1

asserting that each combination of identifier and time is unique. Again, with assert, no news is good news. If the statement asserted is not true everywhere that it is tested, an error message will ensue.
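A variation on the same idea flags the offending observations instead of stopping at an error, which makes a convenient bridge to the next step. A minimal sketch, again assuming variables named id and time:

. bysort id time: generate byte bad = (_N > 1)
. count if bad
. list id time if bad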
2. Check for duplicates

If you have received confirmation of a problem, the next step is to track it down. With a very small dataset, a list or edit of the data may be sufficient, but even then, a more systematic approach is preferable. Here is what we did in a specific example using the duplicates command, which is a small bundle of tools for investigating possible problems arising from duplicated observations.
The dataset consists of several variables for various cities and years, with identifier id and time variable year. The number of observations is 7,813, large enough for a visual scan of the data to be a poor solution. The subcommand duplicates report quantifies the extent of the problem: 26 observations occur in duplicate, as 13 pairs sharing values of id and year. The subcommand duplicates list shows that they all involve id 467. The subcommand duplicates tag is used to tag the observations to examine more closely. An edit then gives all the details.
. duplicates report id year

Duplicates in terms of id year

--------------------------------------
   copies | observations       surplus
----------+---------------------------
        1 |         7787             0
        2 |           26            13
--------------------------------------
. duplicates list id year

Duplicates in terms of id year

+----------------------------+
| group:   obs:    id   year |
|----------------------------|
|      1   6059   467   1990 |
|      1   6060   467   1990 |
|      2   6061   467   1991 |
|      2   6062   467   1991 |
|      3   6063   467   1992 |
|----------------------------|
|      3   6064   467   1992 |
|      4   6065   467   1993 |
|      4   6066   467   1993 |
|      5   6067   467   1994 |
|      5   6068   467   1994 |
|----------------------------|
|      6   6069   467   1995 |
|      6   6070   467   1995 |
|      7   6071   467   1996 |
|      7   6072   467   1996 |
|      8   6073   467   1997 |
|----------------------------|
|      8   6074   467   1997 |
|      9   6075   467   1998 |
|      9   6076   467   1998 |
|     10   6077   467   1999 |
|     10   6078   467   1999 |
|----------------------------|
|     11   6079   467   2000 |
|     11   6080   467   2000 |
|     12   6081   467   2001 |
|     12   6082   467   2001 |
|     13   6083   467   2002 |
|----------------------------|
|     13   6084   467   2002 |
+----------------------------+
. duplicates tag id year, gen(isdup)
Duplicates in terms of id year
. edit if isdup
. drop isdup
The final edit command reveals the precise problem: two cities, Royal Oak, MI, and Bristol, CT, have been assigned the same identifier. We need to fix that by changing the identifier of one city to something else.
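How that fix looks in practice depends on how the two cities can be told apart. As a purely hypothetical sketch, suppose a string variable city (not part of the dataset as described above) distinguishes them and that 9999 is an unused identifier:

. * give one of the two cities an unused identifier (city is hypothetical)
. replace id = 9999 if id == 467 & city == "Royal Oak, MI"
. tsset id year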
Not all these steps are essential. Some users omit the report. On the other hand, in a large dataset, the list could be lengthy. Either way, duplicates offers various handles for the problem.
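Two of those handles, sketched here as possibilities rather than as steps we took above: duplicates examples condenses a long listing to one example per group of duplicates, and duplicates drop removes surplus copies, which is safe only when the repeated observations are exact copies of entire observations (not the case in our example, where two distinct cities shared an identifier):

. * one example from each group, useful when the full list would be long
. duplicates examples id year
. * drop surplus copies of entire observations; wrong for our example
. duplicates drop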