bet365注册会员-bet365是什么网站

搜索
你想要找的

10月10日 史成春:Combining Experimental and Historical Data for Policy Evaluation
2024-10-10 15:00:00
活動主題:Combining Experimental and Historical Data for Policy Evaluation
主講人:史成春
開始時間:2024-10-10 15:00:00
舉行地點:普陀校區理科大樓A1514
主辦單位:統計學院、統計交叉科學研究院
報告人簡介

史成春博士,現任倫敦政治經濟學院統計系副教授,曾在北卡羅來納州立大學(North Carolina State University)獲得統計學博士學位。他的研究主要集中在強化學習領域(Reinforcement Learning),特別是在策略評估(Policy Evaluation)、因果推斷(Causal Inference)、半監督學習(Semi-Supervised Learning)等方面的應用與優化。史博士曾榮獲Institute of Mathematical Statistics (IMS) Tweedie Award和Royal Statistical Society (RSS) Research Prize等獎項。


內容簡介

This talk considers policy evaluation with multiple data sources, especially in scenarios that involve one experimental dataset with two arms, complemented by a historical dataset generated under a single control arm. We propose novel data integration methods that linearly integrate base policy value estimators constructed based on the experimental and historical data, with weights optimized to minimize the mean square error (MSE) of the resulting combined estimator. We further apply the pessimistic principle to obtain more robust estimators, and extend these developments to sequential decision making. Theoretically, we establish non-asymptotic error bounds for the MSEs of our proposed estimators, and derive their oracle, efficiency and robustness properties across a broad spectrum of reward shift scenarios. Numerical experiments and real-data-based analyses from a ridesharing company demonstrate the superior performance of the proposed estimators.

百家乐投注网站| 网上百家乐赢钱公式| 疏附县| 火箭百家乐官网的玩法技巧和规则| 网络百家乐金海岸| 威尼斯人娱乐网赌| 百家乐官网龙虎斗扎金花| 百家乐赢钱公式| 怎么玩百家乐呀| 足球竞猜推荐| 百家乐稳赢玩法| 百家乐园首选海立方| 网上百家乐官网骗人| 新澳门百家乐娱乐城| 美乐门娱乐| 真人百家乐破解软件下载| 天峨县| 百家乐官网筹码片| 缅甸黄金赌场| 网上百家乐有人赢过嘛| 百家乐官网大眼仔用法| 三公百家乐玩法| 蜀都棋牌游戏| 金沙百家乐娱乐城场| 百家乐官网国际娱乐场开户注册 | 百家乐必赢外挂软件| 大发888游戏平台 送1688元现金礼金领取| 百家乐官网路单显示程序| 大发888中文官网| 网络百家乐电脑| 百家乐分析| 威尼斯娱乐| 凯时娱乐城官网| 至富百家乐的玩法技巧和规则| 百家乐官网游戏接口| 帝王百家乐官网新足球平台| 德州扑克大盲注| 娱乐城百家乐可以代理吗| 威尼斯人娱乐天上人间| 百家乐庄闲出现几| 百家乐游戏研发|