Hi! I’m Nan Xiao, a statistician working at Merck. My current research interests include sparse linear models, group sequential design, and computational reproducibility.

Previously, I worked as a data scientist at Seven Bridges in Boston, Massachusetts. Earlier in my career, I studied Human Genetics in Matthew Stephens Lab at the University of Chicago. I earned my PhD degree in Statistics from Central South University, China. My thesis focused on developing statistical machine learning methods for high-dimensional data analysis, advised by Qing-Song Xu. My Erdős number is 4.

I’m interested in discovering the connections between things. I build software to improve my computational workflow. My favorite works include msaenet, stackgbm, oneclust, liftr, and ggsci.

I currently live in Cambridge, Massachusetts. When I’m not working, I enjoy playing squash and running along the Charles River.