Assistant Professor
Department of Statistics
University of California, Davis

Mathematical Sciences Building
One Shields Avenue
Davis, CA 95616

I am an assistant professor at the Statistics department at U.C. Davis.

I am interested in using large-scale, granular sources of data, and statistical and machine learning methods, to measure and study human behavior. My current research is focused on global public health and estimating the social and economic consequences of violent conflict.

Before this, I was a postdoctoral researcher at the School of Information at U.C. Berkeley, working with Josh Blumenstock at the Global Policy Lab (formerly the Data-Intensive Development Lab). I graduated from Carnegie Mellon University with a PhD in Statistics. I was advised by Bill Eddy, and also worked closely with Nicolas Christin’s group at CyLab, CMU’s university-wide security institute.

I am originally from Singapore and was previously a government statistician at the Department of Statistics. I also spent some time at J.P. Morgan Chase as a quantitative modeler.

My first name is “Xiao Hui” (not Xiao!), and my last name is Tai.

Publications

Pablo Busch, Paulo Rocha, Kyung Jin Lee, Luis Abdón Cifuentes and Xiao Hui Tai. Short-term exposure to fine particulate pollution and elderly mortality in Chile. Communications Earth & Environment. 2024. (conditionally accepted) [Code]

Yury Elena Garcia Puerta, Miryam Elizabeth Villa-Perez, Kuang Li, Xiao Hui Tai, Luis A. Trejo, Maria L. Daza–Torres, J. Cricelio Montesinos-Lopez and Miriam Nuno. Wildfires and Social Media Discourse: Exploring Mental Health and Emotional Well-Being Through Twitter. Frontiers in Public Health. 2024.

Arogya Koirala, Suraj R. Nair and Xiao Hui Tai. Mapping Opium Poppy Cultivation: Socioeconomic Insights from Satellite Imagery. ACM Journal on Computing and Sustainable Societies. 2024. [Code] Early versions: 4th ACM SIGCAS Conference on Computing and Sustainable Societies (COMPASS ’21) (poster track); 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD ’21), Workshop on Humanitarian Mapping

Xiao Hui Tai, Shikhar Mehra and Joshua E. Blumenstock. Mobile Phone Data Reveal the Effects of Violence on Internal Displacement. Nature Human Behavior. 2022. [Code]

Cornelia Ilin*, Sébastien Annan-Phan*, Xiao Hui Tai*, Shikhar Mehra, Solomon Hsiang and Joshua E. Blumenstock. Public Mobility Data Enables COVID-19 Forecasting and Management at Local and Global Scales. Nature Scientific Reports. 2021. (* indicates equal contribution) [Radio show]

Xiao Hui Tai and Kayla Frisoli. Benchmarking Minimax Linkage in Hierarchical Clustering. Data Analysis and Rationality in a Complex World. Springer International Publishing, 2021. [Code]

Xiao Hui Tai, Kyle Soska and Nicolas Christin. Adversarial Matching of Dark Net Market Vendor Accounts. Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD). 2019. [Code] [Video]

Xiao Hui Tai. Record Linkage and Matching Problems in Forensics. IEEE 18th International Conference on Data Mining Workshops (ICDMW). IEEE, 2018.

Xiao Hui Tai and William F. Eddy. A Fully Automatic Method for Comparing Cartridge Case Images. Journal of Forensic Sciences. 2018. [Code]

Book Chapters and Magazine Articles

Susan VanderPlas, Alicia Carriquiry, Heike Hofmann, James Hamby and Xiao Hui Tai. An Introduction to Firearms Examination for Researchers in Statistics. Handbook of Forensic Statistics. 2021.

Sam Tyner, Soyoung Park, Ganesh Krishnan, Karen Pan, Eric Hare, Amanda Luby, Xiao Hui Tai, Heike Hofmann, and Guillermo Basulto-Elias. 2019. OpenForSciR: Open Forensic Science in R. 2019.

Alicia Carriquiry, Heike Hofmann, Xiao Hui Tai and Susan VanderPlas. Machine Learning in Forensic Applications. Significance. 2019.

Pre-prints

Xiao Hui Tai. Nearby Armed Conflict Affects Girls’ Education in Africa. In submission.

Xiao Hui Tai, Suraj R. Nair, Shikhar Mehra and Joshua E. Blumenstock. Satellite and Mobile Phone Data Reveal How Violence Affects Seasonal Migration in Afghanistan. In revision.

Xiao Hui Tai and William F. Eddy. Automatically Matching Topographical Measurements of Cartridge Cases Using a Record Linkage Framework. arXiv:2003.00060.

Teaching

In Fall 2022 and 2023, I taught STA 35A, a new introductory statistical data science course for data science majors at UC Davis. In Fall 2023 I also taught STA 160 Practice in Statistical Data Science, an undergraduate capstone course.

In Winter 2023, I taught STA 250, a research seminar on Data Science for International Development.

I taught STA 141C Big Data and High Performance Statistical Computing in Spring 2024.

Last updated: 2024-08-05.