利用多武装匪徒和

378

收藏 2022-04-26

英文标题：
《Expanding on Repeated Consumer Search Using Multi-Armed Bandits and
Secretaries》
---
作者：
Tung Yu Marco Chan
---
最新提交年份：
2020
---
英文摘要：
We seek to take a different approach in deriving the optimal search policy for the repeated consumer search model found in Fishman and Rob (1995) with the main motivation of dropping the assumption of prior knowledge of the price distribution $F(p)$ in each period. We will do this by incorporating the famous multi-armed bandit problem (MAB). We start by modifying the MAB framework to fit the setting of the repeated consumer search model and formulate the objective as a dynamic optimization problem. Then, given any sequence of exploration, we assign a value to each store in that sequence using Bellman equations. We then proceed to break down the problem into individual optimal stopping problems for each period which incidentally coincides with the framework of the famous secretary problem where we proceed to derive the optimal stopping policy. We will see that implementing the optimal stopping policy in each period solves the original dynamic optimization by `forward induction\' reasoning.
---
中文摘要：
对于Fishman和Rob（1995）中发现的重复消费者搜索模型，我们试图采用不同的方法来推导最优搜索策略，其主要动机是放弃每个时期价格分布$F（p）$的先验知识假设。我们将通过合并著名的多武装土匪问题（MAB）来实现这一点。我们首先修改MAB框架，以适应重复消费者搜索模型的设置，并将目标表述为一个动态优化问题。然后，给定任何探索序列，我们使用贝尔曼方程为该序列中的每个存储分配一个值。然后，我们继续将问题分解为每个时段的单个最优停止问题，这与著名的秘书问题的框架巧合，我们继续推导最优停止策略。我们将看到，在每个阶段实施最优停止策略通过“正向归纳”推理解决了原始的动态优化问题。
---
分类信息：

一级分类：Economics 经济学
二级分类：Theoretical Economics 理论经济学
分类描述：Includes theoretical contributions to Contract Theory, Decision Theory, Game Theory, General Equilibrium, Growth, Learning and Evolution, Macroeconomics, Market and Mechanism Design, and Social Choice.
包括对契约理论、决策理论、博弈论、一般均衡、增长、学习与进化、宏观经济学、市场与机制设计、社会选择的理论贡献。
--

---
PDF下载：
-->

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

扫码加我 拉你入群

分享

扫码加好友，拉您进群

扫码加我拉你入群