全部版块 我的主页
论坛 计量经济学与统计论坛 五区 计量经济学与统计软件 LATEX论坛
2092 10
2016-06-30

To help you sort the true data scientist from the fake (or misguided) one, we’ve complied a list of 20 interview questions you can ask when interviewing data scientists.

  • Explain what regularization is and why it is useful.
  • Which data scientists do you admire most? which startups?
  • How would you validate a model you created to generate a predictive model of a quantitative outcome variable using multiple regression.
  • Explain what precision and recall are. How do they relate to the ROC curve?
  • How can you prove that one improvement you’ve brought to an algorithm is really an improvement over not doing anything?
  • What is root cause analysis?
  • Are you familiar with pricing optimization, price elasticity, inventory management, competitive intelligence? Give examples.
  • What is statistical power?
  • Explain what resampling methods are and why they are useful. Also explain their limitations.
  • Is it better to have too many false positives, or too many false negatives? Explain.
  • What is selection bias, why is it important and how can you avoid it?
  • Give an example of how you would use experimental design to answer a question about user behavior.
  • What is the difference between “long” and “wide” format data?
  • What method do you use to determine whether the statistics published in an article (e.g. newspaper) are either wrong or presented to support the author’s point of view, rather than correct, comprehensive factual information on a specific subject?
  • Explain Edward Tufte’s concept of “chart junk.”
  • How would you screen for outliers and what should you do if you find one?
  • How would you use either the extreme value theory, monte carlo simulations or mathematical statistics (or anything else) to correctly estimate the chance of a very rare event?
  • What is a recommendation engine? How does it work?
  • Explain what a false positive and a false negative are. Why is it important to differentiate these from each other?
  • Which tools do you use for visualization? What do you think of Tableau? R? SAS? (for graphs). How to efficiently represent 5 dimension in a chart (or in a video)?

“A “real” data scientist knows how to apply mathematics, statistics, how to build and validate models using proper experimental designs. Having IT skills without statistics skills makes you a data scientist as much as it makes you a surgeon to know how to build a scalpel.”


二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

全部回复
2016-6-30 17:47:51
谢谢分享
二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

2016-6-30 19:53:22
发现自己是fake data analyst,题目只会一半……好伤心,滚回去看书了= =
二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

2016-7-1 08:03:05
新东方又看到了商机。
二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

2016-7-1 09:03:46
thanks for sharing
二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

2016-7-1 09:06:35
2009116226 发表于 2016-6-30 19:53
发现自己是fake data analyst,题目只会一半……好伤心,滚回去看书了= =
题目不简单,别当真哈
二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

点击查看更多内容…
相关推荐
栏目导航
热门文章
推荐文章

说点什么

分享

扫码加好友,拉您进群
各岗位、行业、专业交流群