I am an assistant professor in the Department of Biostatistics at Harvard T.H. Chan School of Public Health. I obtained my Ph.D. in Operations Research and Financial Engineering at Princeton University.
Preprints
The Wreaths of KHAN: Uniform Graph Feature Selection with False Discovery Rate Control
[arXiv] [Package] |
Inference of Dependency Knowledge Graph for Electronic Health Records
[arXiv] |
FADI: Fast Distributed Principal Component Analysis With High Accuracy for Large-Scale Federated Data
[arXiv] ASA Statistical Learning and Data Science Paper Award |
ARCH: Large-scale Knowledge Graph via Aggregated Narrative Codified Health Records Analysis
[Pubmed][Package] |
Publications [by Topic]
StarTrek: Combinatorial Variable Selection with False Discovery Rate Control
The Annals of Statistics 52.1: 78-102. 2024 [Journal] [Package] |
Federated Offline Reinforcement Learning
Journal of the American Statistical Association, 2024. [Journal] [Package] |
Nonparametric Additive Value Functions: Interpretable Reinforcement Learning with an Application to Surgical Recovery
Annals of Applied Statistics, to appear. [arXiv] |
LATTE: Label-efficient incident phenotyping from longitudinal electronic health records
Patterns, 5(1), 2024. [Journal] [Package] |
Multi-source Learning via Completion of Block-wise
Overlapping Noisy Matrices
Journal of Machine Learning Research. 24(221), 1-43, 2023. [Journal] [Package] ASA Statistical Learning and Data Science Paper Award |
Prompt Discriminative Language Models for Domain Adaptation
Proceedings of the 5th Clinical Natural Language Processing Workshop, pp. 247-258. 2023. [Journal] |
Inferring Differential Hub Nodes on Differential Gaussian Graphical Models.
Statistica Sinica. 35(4), 2023. [Journal] |
Combinatorial-Probabilistic Trade-Off: Community Properties Test in the Stochastic Block Models
Conference version: International Conference on Learning Representations (spotlight paper). [Video] Journal version: IEEE Transactions on Information Theory, 2023. [Journal] WNAR Best Paper Award |
Inference on the optimal assortment in the multinomial logit model
ACM Conference on Economics and Computation, 2023. [arXiv] [Journal] |
Graph over-parameterization: Why the graph helps the training of deep graph convolutional network
Neurocomputing, 534, 77-85. 2023. [Journal] |
Multimodal representation learning for predicting molecule–disease relations
Bioinformatics, 39(2), btad085. 2023. [Journal] |
Lagrangian Inference for Ranking Problems
Operations Research 71.1: 202-223. 2023 [arXiv] [Journal] [Package] |
Penalized estimation of frailty-based illness–death models for semi-competing risks
Biometrics, 79(3), 1657-1669, 2023 [Journal] |
Multiview Incomplete Knowledge Graph Integration with Application to Cross-institutional EHR Data Harmonization
(*: co-senior author) Journal of Biomedical Informatics 133: 104147. 2022. [Journal] |
Clinical Knowledge Extraction via Sparse Embedding Regression (KESER) with Multi-Center Large Scale Electronic Health Record Data.
NPJ digital medicine 4, no. 1, 151. 2021 [Journal] [Package] |
Heteroskedastic and imbalanced deep learning with adaptive regularization.
International Conference on Learning Representations. 2021 [arXiv] |
Progression of traction bronchiectasis/bronchiolectasis in interstitial lung abnormalities is associated with increased all-cause mortality: Age Gene/Environment Susceptibility-Reykjavik Study.
European journal of radiology open 8 100334, 2021 [Journal] |
Interstitial lung abnormalities in patients with stage I non-small cell lung cancer are associated with shorter overall survival: the Boston lung cancer study.
Cancer Imaging 21, no. 1 1-7, 2021 [Journal] |
Inter-Subject Analysis: Inferring Sparse Interactions with Dense Intra-Graphs
Journal of the American Statistical Association, 116(534), 746-755, 2021 [arXiv][Journal] ICSA 2017 Student Paper Award |
Robust Scatter Matrix Estimation for High Dimensional Distributions with Heavy Tails
IEEE Transactions on Information Theory. vol. 67, no. 8, pp. 5283-5304, 2021. [paper][Journal] |
Estimating and inferring the maximum degree of stimulus-locked time-varying brain connectivity networks.
Biometrics. Jun;77(2):379-390, 2020 [Journal] |
Computational and Statistical Tradeoffs in Inferring Combinatorial Structures of Ising Model International Conference on Machine Learning, pp. 4901-4910. PMLR, 2020. [Journal] |
Expert-Supervised Reinforcement Learning for Offline Policy Learning and Evaluation
Advances in Neural Information Processing Systems 33: 18967-18977, 2020. [Journal] |
Kernel Meets Sieve: Post-Regularization Confidence Bands for Sparse Additive Model
Journal of the American Statistical Association, 92:4, pages 875-893, 2020. [arXiv] [Journal] ASA Best Student Paper in Nonparametric Statistics Finalist |
Symmetry, Saddle Points, and Global Geometry of Nonconvex Matrix Factorization
IEEE Transactions on Information Theory, 65(6):3489-3514, 2019. [arXiv][Journal] |
Combinatorial Inference for Graphical Models
(*: equal contribution) Annals of Statistics, 47(2), pp.795-827, 2018 [arXiv] [Journal] |
Sketching Method for Large Scale Combinatorial Inference
Advances in Neural Information Processing Systems 31, 10598-0607, 2018 [Journal] |
Distributed Testing and
Estimation under Sparse High Dimensional Models
(alphabetical order) Annals of Statistics, 46(3), 1352-1382, 2018 [arXiv][Journal] |
Post-Regularization Inference for Dynamic Nonparanormal Graphical Models
Journal of Machine Learning Research, 18, 1-78, 2018 [arXiv][Journal] |
The Edge Density Barrier: Computational-Statistical Tradeoffs in Combinatorial Inference
Proceedings of the 35th International Conference on Machine Learning, 80:3247-3256, 2018 [Journal] |
Provable Sparse Tensor Decomposition
Journal of the Royal Statistical Society: Series B, 79(3), 899-916, 2017 [arXiv][Journal] |
Application of the Strictly Contractive Peaceman-Rachford Splitting Method to Multi-block Separable Convex Programming
(alphabetical order) Splitting Methods in Communication, Imaging, Science, and Engineering (In Roland Glowinski, Stanley J. Osher, Wotao Yin (Eds.)), Springer, 2017 [Journal] |