科学研究

科学研究

学术讲座
当前位置是: 首页 -> 科学研究 -> 学术讲座 -> 正文

Optimal Semi-supervised Subsampling via Predictive Inference

作者: 发布时间:2022-12-15 点击数:
主讲人:邹长亮
主讲人简介:
邹长亮,南开大学统计与数据科学学院教授。2008年于南开大学获博士学位,随后留校任教。主要从事统计学及其与数据科学领域的交叉研究和实际应用。研究兴趣包括:高维数据统计推断、大规模数据流分析、变点和异常点检测等,在Annals of Statistics、Biometrika、Journal of the American Statistical Association 、Mathematical Programming、Technometrics、IISE Transactions等统计学和工业工程领域期刊上发表论文几十篇,主持国家自然科学基金委优青、杰青、重点项目、重大项目等课题。
主持人:洪永淼
讲座简介:

In big data era, subsampling or sub-data selection techniques are often adopted to extract a fraction of informative individuals from the massive data. Existing subsampling algorithms focus mainly on obtaining a representative subset to achieve best estimation accuracy under a given class of models. In this talk, we consider a semi-supervised setting wherein a small or moderate sized “labeled” data is available in addition to a much larger sized “unlabeled” data. The goal is to sample from the unlabeled data with a give budget to obtain informative individuals that are characterized by their unobserved responses. I will introduce an optimal subsampling procedure that is able to maximize the diversity of the selected subsample and control false selection rate (FSR) simultaneously, allowing us to explore reliable information as much as possible.

时间:2022-12-20 (Tuesday) 16:30-18:00
地点:腾讯会议:37586125504,经济楼N302
讲座语言:中文
主办单位:中国科学院大学经济与管理学院、中国科学院预测科学研究中心、太阳成tyc7111cc邹至庄经济研究院、NSFC"计量建模与经济政策研究”基础科学中心
承办单位:
期数:“邹至庄讲座”青年学者论坛(第45期)
联系人信息:许老师,0592-2182991
TOP