Hi! I’m Nan Xiao, a statistician working at Merck. My research interests include sparse linear models, representation learning, and computational reproducibility. I enjoy discovering the serendipitous connections between things. I build software to improve my workflow. My favorites include msaenet, oneclust, liftr, ggsci, and pkglite.

Previously, I worked as a data scientist at Seven Bridges in Boston, MA. Earlier in my career, I studied human genetics in the Matthew Stephens Lab at the University of Chicago. I earned my PhD in statistics from Central South University, China, where my thesis, advised by Qing-Song Xu, focused on developing statistical machine learning methods for high-dimensional data analysis. My Erdős number is 4.