I am a statistician working at Merck. My research interests include sparse linear models, representation learning, and computational reproducibility. I build software to improve my workflow. My favorites include msaenet, stackgbm, oneclust, liftr, ggsci, and pkglite.

Previously, I worked as a data scientist at Seven Bridges in Boston. Earlier in my career, I studied human genetics in Matthew Stephens Lab at the University of Chicago. I earned my PhD degree in statistics from Central South University, China. My thesis focused on developing statistical machine learning methods for high-dimensional data analysis, advised by Qing-Song Xu.