A Conformal-Based Two-Sample Conditional Distribution Test

Abstract: We consider the problem of testing the equality of the conditional distribution of a response variable given a set of covariates between two populations. Such a testing problem is related to transfer learning and causal inference. We develop a nonparametric procedure by combining recent advances in conformal prediction with some new ingredients such as a novel choice of conformity score and data-driven choices of weight and score functions. To our knowledge, this is the first successful attempt of using conformal prediction for testing statistical hypotheses beyond exchangeability. The final test statistic reveals a natural connection between conformal inference and the classical rank-sum test. Our method is suitable for modern machine learning scenarios where the data has high dimensionality and the sample size is large, and can be effectively combined with existing classification algorithms to find good weight and score functions. The performance of the proposed method is demonstrated in synthetic and real data examples.


About the Speaker:

Jing Lei obtained B.S. in Probability and Statistics from the School of Mathematical Sciences at Peking University in 2005, and Ph.D. in Statistics at UC Berkeley in 2010. He is now  Professor of Statistics at Carnegie Mellon University. His research interests include model-free predictive inference, statistical machine learning, data privacy, and high dimensional statistics. He is IMS Fellow and receive the 2016 NSF CAREER Award and 2016 ASA Noether Young Scholar Award.

 

Tencent Meeting( ID:  288-521-104 )

Meeting Link:    https://meeting.tencent.com/dm/xO2C2RLUoTQn