Search results: 1-1 of 1 record found for "Algebra Bayesian". Query time: 0.156 seconds.
The Non-Bayesian Restless Multi-Armed Bandit: a Case of Near-Logarithmic Regret
The Non-Bayesian Restless Multi-Armed Bandit: Near-Logarithmic Regret
2010/11/24
In the classic Bayesian restless multi-armed bandit (RMAB) problem, there are $N$ arms, with rewards on all arms evolving at each time as Markov chains with known parameters. A player seeks to activa...