jiangbeilu 发表于 2015-12-22 16:17 
你可以两个都试一下,主要是减少工作量,少出错。
两个我都试了,还是不行
> myheader=c("User-Agent"="Mozilla/5.0 (Windows; U; Windows NT 5.1; zh-CN; rv:1.9.1.6) ","Accept"="text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8","Accept-Language"="en-us","Connection"="keep-alive","Accept-Charset"="GB2312,utf-8;q=0.7,*;q=0.7")
> url<-"http://vip.stock.finance.sina.com.cn/corp/go.php/vCI_StockStructure/stockid/000786.phtml"
> temp<-getURL(url=url,httpheader=myheader,encoding="gb2312")
> k<-htmlParse(temp)
> title<-getNodeSet(k,'//title')
#返回的是<title>卤卤D<c2><bd>篓2<c4>(000786)1茅卤<be><bd>谩11_D<c2>脿<cb>2<c6><be>-_D<c2>脿<cb>铆<f8></f8></cb></c2></be></c6></cb></c2></bd></be></c4></bd></c2></title> 
a <- sapply(title,xmlValue)
wp2=iconv(a,"gb2312","UTF-8")
wp2返回的还是乱码