bet365注册会员-bet365是什么网站

搜索
你想要找的

10月10日 史成春:Combining Experimental and Historical Data for Policy Evaluation
2024-10-10 15:00:00
活動主題:Combining Experimental and Historical Data for Policy Evaluation
主講人:史成春
開始時間:2024-10-10 15:00:00
舉行地點:普陀校區理科大樓A1514
主辦單位:統計學院、統計交叉科學研究院
報告人簡介

史成春博士,現任倫敦政治經濟學院統計系副教授,曾在北卡羅來納州立大學(North Carolina State University)獲得統計學博士學位。他的研究主要集中在強化學習領域(Reinforcement Learning),特別是在策略評估(Policy Evaluation)、因果推斷(Causal Inference)、半監督學習(Semi-Supervised Learning)等方面的應用與優化。史博士曾榮獲Institute of Mathematical Statistics (IMS) Tweedie Award和Royal Statistical Society (RSS) Research Prize等獎項。


內容簡介

This talk considers policy evaluation with multiple data sources, especially in scenarios that involve one experimental dataset with two arms, complemented by a historical dataset generated under a single control arm. We propose novel data integration methods that linearly integrate base policy value estimators constructed based on the experimental and historical data, with weights optimized to minimize the mean square error (MSE) of the resulting combined estimator. We further apply the pessimistic principle to obtain more robust estimators, and extend these developments to sequential decision making. Theoretically, we establish non-asymptotic error bounds for the MSEs of our proposed estimators, and derive their oracle, efficiency and robustness properties across a broad spectrum of reward shift scenarios. Numerical experiments and real-data-based analyses from a ridesharing company demonstrate the superior performance of the proposed estimators.

至尊百家乐20111110| 真人百家乐官网最高赌注| 哪个百家乐官网玩法平台信誉好 | 亿酷棋牌世界| 百家乐官网筹码方形| 澳门百家乐玩法心得技巧| 998棋牌游戏| 伯爵百家乐官网娱乐平台| 真人百家乐娱乐场开户注册| 东乡县| 大世界百家乐官网娱乐城| 乐九百家乐娱乐城| tt娱乐城网址| 圣淘沙百家乐官网的玩法技巧和规则 | 百家乐有方式赢钱吗| 六合彩报| 百家乐官网平注法规则| 大发888娱乐场下载yguard| 百家乐官网游戏机破解方法| 澳门百家乐必赢看| 太阳城百家乐官网杀祖玛| 网上百家乐导航| 镇原县| 免费百家乐娱乐城| 优博注册| 百家乐怎么稳赚| 华人博彩论坛| 云鼎百家乐官网的玩法技巧和规则| 大发888是什么东| 跪求百家乐官网打法| 玩百家乐澳门368娱乐城| 金冠娱乐城注册| 红宝石百家乐官网的玩法技巧和规则 | 赌博百家乐规则| 涞源县| 百家乐塑料扑克牌盒| 属蛇和属马合作做生意谁吃亏| 百家乐出千桌| 9人百家乐官网桌布| 威尼斯人娱乐备用网址| 百家乐官网游戏网上投注|