Email: huimin.li@utdallas.edu

Google Scholar

GitHub

LinkedIn

Huimin Li (李慧敏)

Ph.D. in Statistics

Department of Mathematical Sciences
The University of Texas at Dallas

I am currently a final year Statistics Ph.D. student in the Department of Mathematical Sciences at the University of Texas at Dallas (UTD). My doctoral advisor is Dr. Qiwei Li.

My research focuses on the development of Bayesian statistical methodologies to analyze high-dimensional data, spatial data, and shape data.

Education

Ph.D. in Statistics
The University of Texas at Dallas (UTD), Richardson, Texas, United States
Aug 2020 - May 2025 (expected)
M.S. in Statistics
The University of Texas Rio Grande Valley (UTRGV), Edinburg, Texas, United States
Jan 2017 - May 2019
Bachelor of Management in Business Administration
Xiangtan University, Xiangtan, Hunan, China
Sep 2009 - Jun 2013

Projects

Bayesian Clustering Approach with Feature Selection for Analyzing Spatially Resolved Transcriptomics Data

  • Developed Bayesian spatial clustering methods for high-dimensional count data, achieving more robust and accurate results in both simulatd and real datasets
  • Utilized feature selection to identifiy discriminating genes, improving model performance and interpretability
  • Implemented Markov Chain Monte Carlo (MCMC) methods with R and C++
  • Link

Robust Bayesian Integrative Modeling of Single Cell and Spatial Transcriptomics Data

  • Integrating spatial transcriptomics (ST) data with scRNA-seq reference data
  • Decomposing cell-type mixtures of regularly distributed spots and identifying the underlying spatial domains simultaneously
  • Capturing cell type sparsity using a zero-inflated Dirichlet distribution

Spatial Transcriptomics Arena (STAr): an Integrated Platform for Spatial Transcriptomics Methodology Research

  • Involved in the website design with curated datasets, reproducible methods, and analysis results to facilitate the spatial transcriptomics methodology research
  • Collected datasets, including data retrieval, categorization, organization and pre-processing
  • Selected spatially variable gene identification methods and applied them on simulated and real data
  • Developed and maintained an R package boost as an integrated tool of existing spatially variable gene identification methods
  • bioRxiv

BACON: Bayesian Clustering of n-gons via a Double Dirichlet Mixture Model

  • Developing a landmark-based Bayesian clustering method for closed polygonal chains, relying on intrinsic shape features
  • Employing a weighted version of a Double Truncated Dirichlet (DTDir) Model to account for geometric constraints on a simplex space
  • Employing a Mixture of Finite Mixtures (MFM) model to estimate the optimal number of clusters
  • GitHub

South Texas Early Prevention Study (STEPS) Pre-K Project

  • Designed a cluster randomized trial to assess the effect of the Bienestar coordinated school health program on children’s health outcomes
  • Examined social and health risk factors among preschool children living along the Texas-Mexico border, a region with high poverty and limited access to healthcare
  • Collected family characteristics, dietary intake, fitness, and anthropometric data from 1277 preschool students enrolled in 28 preschools
  • Revealed that children living in low-income areas are affected by high levels of social and health risk factors
  • Link

Teaching Experience

Graduate Teaching Assistant
The University of Texas at Dallas
  • STAT 7330: Bayesian Data Analysis
  • STAT 4355: Applied Linear Models
  • MATH 2417: Calculus I
  • STAT 3341: Probability and Statistics in Computer Science and Software Engineering
The University of Texas Rio Grande Valley
  • MATH 3337: Probability & Statistics
  • MATH 2413: Calculus I
  • MATH 2414: Calculus II
  • MATH 1343: Introduction to Biostatistics
  • MATH 1314: College Algebra


© Huimin Li | Last updated: