About
I am a statistician at Merck & Co., Inc., Rahway, NJ, USA in Biostatistics and Research Decision Sciences (BARDS), Methodology Research, led by Keaven M. Anderson. I also contribute to cross-industry initiatives including the R Consortium Infrastructure Steering Committee, R Submissions Working Group, and pharmaverse.
My work sits at the intersection of statistical methodology and research software engineering. Interests include sparse linear models, representation learning, and computational reproducibility. I build software in R, Python, TypeScript, and Rust; selected projects include tinytopics, msaenet, ggsci, pkglite, and py-pkglite.
Previously, I was a data scientist at Seven Bridges (since acquired by Summa Equity) in Boston, Massachusetts. Earlier, I studied human genetics in the Matthew Stephens Lab at the University of Chicago. I hold a Ph.D. in Statistics from Central South University, China, where I developed statistical machine learning methods for high-dimensional data under the supervision of Qing-Song Xu.