Hi! I’m Nan Xiao, a statistician working at Merck. My research interests include sparse linear models, representation learning, and computational reproducibility. I enjoy discovering the serendipitous connections between things. I build software to improve my workflow. My favorites include msaenet, oneclust, liftr, ggsci, and pkglite.

Previously, I worked as a data scientist at Seven Bridges in Boston, MA. Earlier in my career, I studied human genetics in the Matthew Stephens Lab at the University of Chicago. I earned my PhD in statistics from Central South University, China. My thesis, advised by Qing-Song Xu, focused on developing statistical machine learning methods for high-dimensional data analysis. My Erdős number is 4.

I currently live in Cambridge, MA. In my free time, I love playing squash, running along the Charles River, and exploring downtown Boston.