全部版块 我的主页
论坛 计量经济学与统计论坛 五区 计量经济学与统计软件 HLM专版
2120 1
2014-04-17

Is "deep learning" just another term for multilevel/hierarchical modeling?

I'm much more familiar with the latter than the former, but from what I can tell, the primary difference is not in their definition, but how they are used and evaluated within their application domain.

It looks like the number of nodes in a typical "deep learning" application is larger and uses a generic hierarchical form, whereas applications of multilevel modeling typically uses a hierarchical relationships that mimic the generative process being modeled. Using a generic hierarchy in an applied statistics (hierarchical modeling) domain would be regarded as an "incorrect" model of the phenomena, whereas modeling a domain-specific hierarchy might be regarded as subverting the objective of making a generic deep learning learning machine.

Are these two things really the same machinery under two different names, used in two different ways?


二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

全部回复
2014-4-17 03:58:44
Similarity
Fundamentally both types of algorithms were developed to answer one general question in machine learning applications:

Given predictors (factors) x1,x2,…,xp - how to incorporate the interactions between this factors in order to increase the performance?

One way is to simply introduce new predictors: xp+1=x1x2,xp+2=x1x3,… But this proves to be bad idea due to huge number of parameters and very specific type of interactions.

Both Multilevel modelling and Deep Learning algorithms answer this question by introducing much smarter model of interactions. And from this point of view they are very similar.

Difference
Now let me try to give my understanding on what is the great conceptual difference between them. In order to give some explanation, let's see the assumptions that we make in each of the models:

Multilevel modelling:1 layers that reflect the data structure can be represented as a Bayesian Hierarchical Network. This network is fixed and usually comes from domain applications.

Deep Learning:2 the data were generated by the interactions of many factors. The structure of interactions is not known, but can be represented as a layered factorisation: higher-level interactions are obtained by transforming lower-level representations.

The fundamental difference comes from the phrase "the structure of interactions is not known" in Deep Learning. We can assume some priors on the type of interaction, but yet the algorithm defines all the interactions during the learning procedure. On the other hand, we have to define the structure of interactions for Multilevel modelling (we learn only vary the parameters of the model afterwards).

Examples
For example, let's assume we are given three factors x1,x2,x3 and we define {x1} and {x2,x3} as different layers.

In the Multilevel modelling regression, for example, we will get the interactions x1x2 and x1x3, but we will never get the interaction x2x3. Of course, partly the results will be affected by the correlation of the errors, but this is not that important for the example.

In Deep learning, for example in multilayered Restricted Boltzmann machines (RBM) with two hidden layers and linear activation function, we will have all the possible polinomial interactions with the degree less or equal than three.

Common advantages and disadvantages
Multilevel modelling

  • (-) need to define the structure of interactions
  • (+) results are usually easier to interpret
  • (+) can apply statistics methods (evaluate confidence intervals, check hypotheses)

Deep learning

  • (-) requires huge amount of data to train (and time for training as well)
  • (-) results are usually impossible to interpret (provided as a black box)
  • (+) no expert knowledge required
  • (+) once well-trained, usually outperforms most other general methods (not application specific)

Hope it will help!
Dmitry Laptev
二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

相关推荐
栏目导航
热门文章
推荐文章

说点什么

分享

扫码加好友,拉您进群
各岗位、行业、专业交流群