全部版块 我的主页
论坛 计量经济学与统计论坛 五区 计量经济学与统计软件 winbugs及其他软件专版
1369 2
2016-08-04

Statistics can often be the most intimidating aspect of data science for aspiring data scientists to learn. Gain some personal perspective from someone who has traveled the path.

By Jean-Nicholas Hould, JeanNicholasHould.com.

Learning statistics can be a daunting journey for aspiring data scientists that are not coming from a quantitative field. Whether you are a computer science undergrad, a developer in seek of a career change or a MBA graduate, it seems that the statistical part of data science is often the most intimidating one. As a business school graduate, it was for me.

Statistics are a serious discipline, some people spend their live studying them. As an aspiring data scientist, how should you approach learning stats? What do you need to know? What’s the best way to learn about stats? Here’s how you should go about this.

Start Simple


You can get tremendous value from understanding simple statistical concepts. In many data science projects, you don’t need advanced stats knowledge to draw significant conclusions. For this reason, you should focus on learning the basics of statistics, applying them to your work and expanding from there.

The two main branches of statistics that you need know are descriptive statistics and inferential statistics. You can get a ton of value by understanding those properly.

Descriptive Statistics


Descriptive statistics describe quantitatively a collection of information. They summarize the observed data. Contrarily to inferential statistics, they are not deducing facts about the greater population. They are only describing the collected data set.

You have surely interacted with those statistics in the past. Some common measurements in descriptive statistics gauge the central tendency (mean, median, mode…) and others the variability (standard deviation…) of the data set.

Inferential Statistics


Inferential statistics enables us to infer properties about a population based on a sample data set. They use the sample to form conclusions beyond the collected data.

In practical data science, inferential statistics are heavily used when comparing conversion rates, analyzing an experiment such as an A/B test, etc.

Online Courses


For me, online classes worked like a charm to learn the basics:

These classes are interactive, include exercices and videos. I find they are a very good way to get started in this field. They will provide you just enough knowledge so you can start getting more comfortable with statistics.

Books


On a general note, I recommend the book Naked Statistics: Stripping the Dread from the Data. This book by Charles Wheelan covers, amongst others, the topics of descriptive/inferential statistics and provides a good overview of each field. It demystifies statistics through some very concrete and cheerful examples.

Build from there


Remember, the best way to learn these concepts is by applying your knowledge to concrete examples. Once you have started to integrate those concepts in your analyses, I recommend you pick up a statistics manual, such as All of Statistics and deepen your knowledge.



二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

全部回复
2016-8-4 01:47:59
谢谢分享
二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

2016-8-4 05:52:02
oliyiyi 发表于 2016-8-4 00:03
Statistics can often be the most intimidating aspect of data science for aspiring data scientists to ...
谢谢分享
二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

相关推荐
栏目导航
热门文章
推荐文章

说点什么

分享

扫码加好友,拉您进群
各岗位、行业、专业交流群