Standardized vs. unstandardized data

rvohen

2011-05-19

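When running a regression, is there any difference between standardizing the data and not standardizing it? Does it affect the results, for example the intercept? After standardizing, is there no longer an intercept?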

All replies

ltx5151

2011-5-20 02:44:16

Whether to standardize the data depends on the method you use, how you want to interpret the model, and the structure of your data. For example, models that are not invariant to rescaling of the predictors, such as ridge regression and the lasso, are normally fit on standardized data so that the penalty treats all coefficients comparably and the result does not depend on the units of measurement.
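
A minimal sketch of the scaling point, assuming NumPy and synthetic data. The helper names (`ols_fit`, `ridge_fit`, `predict`) and the penalty value 50.0 are illustrative choices, not something from this thread. It changes the units of one predictor and compares how much the fitted values move under OLS and under ridge:

```python
import numpy as np

def ols_fit(X, y):
    """Least-squares coefficients, intercept first."""
    A = np.column_stack([np.ones(len(X)), X])
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)
    return beta

def ridge_fit(X, y, lam):
    """Ridge coefficients, intercept first; the intercept is not penalized."""
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xc, yc = X - x_mean, y - y_mean
    p = X.shape[1]
    coef = np.linalg.solve(Xc.T @ Xc + lam * np.eye(p), Xc.T @ yc)
    intercept = y_mean - x_mean @ coef
    return np.concatenate([[intercept], coef])

def predict(beta, X):
    return beta[0] + X @ beta[1:]

# Synthetic data (an assumption made for illustration only).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
y = 1.0 + 2.0 * X[:, 0] - 1.0 * X[:, 1] + rng.normal(scale=0.5, size=200)

# Express the first predictor in different units (multiply by 1000).
X_rescaled = X * np.array([1000.0, 1.0])

ols_gap = np.max(np.abs(
    predict(ols_fit(X, y), X) - predict(ols_fit(X_rescaled, y), X_rescaled)))
ridge_gap = np.max(np.abs(
    predict(ridge_fit(X, y, 50.0), X)
    - predict(ridge_fit(X_rescaled, y, 50.0), X_rescaled)))

print("OLS: largest change in fitted values  ", ols_gap)
print("Ridge: largest change in fitted values", ridge_gap)
```

The OLS fitted values should agree up to floating-point error, while the ridge fitted values change noticeably, because the same penalty shrinks a coefficient much less once its predictor is expressed in larger numbers.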

On the other hand, in some special cases the different scales of the variables reflect real structural effects that we want the model to incorporate. In such cases we might not standardize them.

In most cases, what really makes a difference is standardizing the predictors rather than the response. In ordinary linear regression with centered predictors, we still have an intercept (equal to the mean of y) if we don't center the response y. In much of the literature, people center y as well, and then the intercept is zero and drops out.
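
A small numerical check of the intercept behaviour, assuming NumPy and synthetic data. The helper `fit_with_intercept` and the true coefficients are made up for illustration. It fits the same OLS model on raw data, on centered predictors, on centered predictors and response, and on fully standardized predictors:

```python
import numpy as np

def fit_with_intercept(X, y):
    """OLS with an explicit intercept column; returns (intercept, slopes)."""
    A = np.column_stack([np.ones(len(X)), X])
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)
    return beta[0], beta[1:]

# Synthetic data (an assumption made for illustration only).
rng = np.random.default_rng(1)
X = rng.normal(loc=5.0, scale=2.0, size=(100, 2))
y = 3.0 + 1.5 * X[:, 0] - 0.5 * X[:, 1] + rng.normal(scale=0.3, size=100)

sd = X.std(axis=0, ddof=1)
Xc = X - X.mean(axis=0)      # centered predictors
Xs = Xc / sd                 # standardized predictors
yc = y - y.mean()            # centered response

b0_raw, slopes_raw = fit_with_intercept(X, y)
b0_xc, slopes_xc = fit_with_intercept(Xc, y)
b0_both, slopes_both = fit_with_intercept(Xc, yc)
b0_std, slopes_std = fit_with_intercept(Xs, y)

print("raw data:            intercept", b0_raw, "slopes", slopes_raw)
print("centered X:          intercept", b0_xc, "mean of y", y.mean())
print("centered X and y:    intercept", b0_both)
print("standardized X:      slopes", slopes_std, "= raw slopes * sd", slopes_raw * sd)
```

Centering leaves the slopes unchanged and only moves the intercept: to the mean of y when the predictors are centered, and to zero when y is centered too. Dividing a predictor by its standard deviation multiplies its slope by that standard deviation, so the fit is the same model in different units.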

trier2006

2011-5-20 09:03:05

Whether the model has an intercept has nothing to do with standardization.

yanbridge

2011-5-20 09:16:16

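Different regression models place different requirements on the data. Models such as OLS, ARMA, and GARCH do not require standardization.
Principal component analysis and cluster analysis do require it.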
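
A rough illustration of why principal component analysis is usually run on standardized data, assuming NumPy and two made-up variables (`income` and `age` are hypothetical and generated independently). The helper `first_pc` is an illustrative name; it computes the leading component from the covariance matrix:

```python
import numpy as np

def first_pc(X):
    """Leading principal direction and its share of the total variance."""
    Xc = X - X.mean(axis=0)
    cov = np.cov(Xc, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)   # eigenvalues in ascending order
    return eigvecs[:, -1], eigvals[-1] / eigvals.sum()

# Two hypothetical variables on very different scales.
rng = np.random.default_rng(2)
n = 500
income = rng.normal(50_000, 10_000, size=n)   # variance around 1e8
age = rng.normal(40, 10, size=n)              # variance around 1e2
X = np.column_stack([income, age])

# Standardize each column to mean 0 and standard deviation 1.
Z = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)

raw_dir, raw_share = first_pc(X)
std_dir, std_share = first_pc(Z)

print("raw data:     first PC direction", raw_dir, "variance share", raw_share)
print("standardized: first PC direction", std_dir, "variance share", std_share)
```

On the raw data the first component is essentially the large-scale variable alone. After standardization both variables contribute, and the first component no longer dominates just because of the units.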

rvohen

2011-5-20 09:19:04

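4# yanbridge So if I standardize the data, extract the principal components, and then run a linear regression on them, does that regression have an intercept?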

rvohen

2011-5-20 09:19:56

2# ltx5151 Thanks!
