
October 10, Chengchun Shi (史成春): Combining Experimental and Historical Data for Policy Evaluation
2024-10-10 15:00:00
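Topic: Combining Experimental and Historical Data for Policy Evaluation
Speaker: Chengchun Shi (史成春)
Start time: 2024-10-10 15:00:00
Venue: Room A1514, Science Building, Putuo Campus
Organizers: School of Statistics; Academy of Statistics and Interdisciplinary Sciences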
About the Speaker

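Dr. Chengchun Shi is an Associate Professor in the Department of Statistics at the London School of Economics and Political Science. He received his PhD in statistics from North Carolina State University. His research focuses on reinforcement learning, in particular its application and optimization in policy evaluation, causal inference, and semi-supervised learning. Dr. Shi has received awards including the Institute of Mathematical Statistics (IMS) Tweedie Award and the Royal Statistical Society (RSS) Research Prize.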


Abstract

This talk considers policy evaluation with multiple data sources, especially in scenarios that involve one experimental dataset with two arms, complemented by a historical dataset generated under a single control arm. We propose novel data integration methods that linearly integrate base policy value estimators constructed based on the experimental and historical data, with weights optimized to minimize the mean square error (MSE) of the resulting combined estimator. We further apply the pessimistic principle to obtain more robust estimators, and extend these developments to sequential decision making. Theoretically, we establish non-asymptotic error bounds for the MSEs of our proposed estimators, and derive their oracle, efficiency and robustness properties across a broad spectrum of reward shift scenarios. Numerical experiments and real-data-based analyses from a ridesharing company demonstrate the superior performance of the proposed estimators.
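As a rough illustration of what an MSE-minimizing linear combination looks like, the sketch below uses a textbook simplification and is not the speaker's estimator. It assumes the experimental estimator is unbiased, the historical estimator has a known bias (standing in for a reward shift), and the two are independent. The function names and all numbers are hypothetical.

```python
def mse_optimal_weight(var_exp, var_hist, bias_hist):
    """Weight w on the historical estimator that minimizes the MSE of
    w * hist + (1 - w) * exp.

    Assumptions (illustrative only): exp is unbiased with variance var_exp,
    hist has variance var_hist and bias bias_hist, and the two estimators
    are independent. Then
        MSE(w) = w^2 * (var_hist + bias_hist^2) + (1 - w)^2 * var_exp,
    and setting the derivative to zero gives the closed form below.
    """
    return var_exp / (var_exp + var_hist + bias_hist ** 2)


def combined_mse(w, var_exp, var_hist, bias_hist):
    """MSE of the combined estimator under the same assumptions."""
    return w ** 2 * (var_hist + bias_hist ** 2) + (1 - w) ** 2 * var_exp


# Hypothetical inputs: a noisy experiment and a precise but shifted history.
w_star = mse_optimal_weight(var_exp=1.0, var_hist=0.25, bias_hist=0.5)
mse_star = combined_mse(w_star, var_exp=1.0, var_hist=0.25, bias_hist=0.5)
```

In this toy setting, a larger bias in the historical estimator pushes the weight toward the experimental data. In practice the bias and variances are unknown and must be estimated, and the abstract indicates that the talk's pessimistic variants are aimed at making the combined estimator more robust.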
