Han Zhao | 赵晗

Assistant Professor
Department of Computer Science
Department of Electrical and Computer Engineering (affiliated)
University of Illinois at Urbana-Champaign
Email: hanzhao [AT] illinois (DOT) edu
Office: 3320 Siebel Center, 201 N Goodwin Ave Urbana, IL, 61801
[Curriculum Vitae] [Google Scholar] [DBLP] [Thesis] [Github]

About Me

I am an assistant professor at the Department of Computer Science, University of Illinois Urbana-Champaign, affiliated with the Department of Electrical and Computer Engineering. I am also an Amazon scholar at Amazon AI and Search Science. Before joining UIUC, I was a machine learning researcher at D. E. Shaw & Co. I obtained my Ph.D. from the Machine Learning Department, Carnegie Mellon University. Previously, I obtained my BEng degree from the Computer Science Department at Tsinghua University and MMath from the University of Waterloo.

I have a broad interest in trustworthy machine learning. In particular, I work on transfer learning (domain adaptation/generalization/distributional robustness, multitask/meta-learning), algorithmic fairness, probabilistic circuits, and their applications in natural language, signal processing and quantitative finance. My long-term goal is to build trustworthy ML systems that are efficient, robust, fair, and interpretable.

Acknowledgments Our group's research has been generously supported by Google Research, Meta AI, Amazon AI, Nvidia, IBM Research, the National Science Foundation (NSF), and the Defense Advanced Research Projects Agency (DARPA). Thank you!

Prospective students, please read this.

Publications [ show selected / show by date ]

Moment Alignment: Unifying Gradient and Hessian Matching for Domain Generalization
Y. Chen, H. Si, G. Zhang, H. Zhao
In Proceedings of the 41st conference on Uncertainty in Artificial Intelligence (UAI 2025)
[abs] [pdf]

Domain generalization (DG) seeks to develop models that generalize well to unseen target domains, addressing the prevalent issue of distribution shifts in real-world applications. One line of research in DG focuses on aligning domain-level gradients and Hessians to enhance generalization. However, existing methods are computationally inefficient and the underlying principles of these approaches are not well understood. In this paper, we develop the theory of moment alignment for DG. Grounded in \textit{transfer measure}, a principled framework for quantifying generalizability between two domains, we first extend the definition of transfer measure to domain generalization that includes multiple source domains and establish a target error bound. Then, we prove that aligning derivatives across domains improves transfer measure both when the feature extractor induces an invariant optimal predictor across domains and when it does not. Notably, moment alignment provides a unifying understanding of Invariant Risk Minimization, gradient matching, and Hessian matching, three previously disconnected approaches to DG. We further connect feature moments and derivatives of the classifier head, and establish the duality between feature learning and classifier fitting. Building upon our theory, we introduce \textbf{C}losed-Form \textbf{M}oment \textbf{A}lignment (CMA), a novel DG algorithm that aligns domain-level gradients and Hessians in closed-form. Our method overcomes the computational inefficiencies of existing gradient and Hessian-based techniques by eliminating the need for repeated backpropagation or sampling-based Hessian estimation. We validate the efficacy of our approach through two sets of experiments: linear probing and full fine-tuning. CMA demonstrates superior performance in both settings compared to Empirical Risk Minimization and state-of-the-art algorithms.

Multiobjective Distribution Matching
X. Zhang, P. Li, Y. Yu, Y. Zhang, H. Zhao, Q. Zhang
In Proceedings of the 42nd International Conference on Machine Learning (ICML 2025)
[abs] [pdf]

Scaling Laws for Multilingual Language Models
Y. He, A. Benhaim, B. Patra, P. Vaddamanu, S. Ahuja, P. Chopra, V. Chaudhary, H. Zhao, X. Song
In Proceedings of the 63th Annual Meeting of the Association for Computational Linguistics (ACL 2025 Findings)
[abs] [pdf]

We propose a novel scaling law for general-purpose decoder-only language models (LMs) trained on multilingual data, tackling the problem of balancing languages during multilingual pretraining. A primary challenge in studying multilingual scaling is the difficulty of analyzing individual language performance due to cross-lingual transfer. To address this, we shift the focus from individual languages to language families. We introduce and validate a hypothesis that the test cross-entropy loss for each language family is determined solely by its own sampling ratio, independent of other languages in the mixture. This insight simplifies the complexity of multilingual scaling and make the analysis scalable to an arbitrary number of languages. Building on this hypothesis, we derive a power-law relationship that links performance with dataset size, model size and sampling ratios. This relationship enables us to predict performance across various combinations of the above three quantities, and derive the optimal sampling ratios at different model scales. To demonstrate the effectiveness and accuracy of our proposed scaling law, we perform a large-scale empirical study, training more than 100 models on 23 languages spanning 5 language families. Our experiments show that the optimal sampling ratios derived from small models (85M parameters) generalize effectively to models that are several orders of magnitude larger (1.2B parameters), offering a resource-efficient approach for multilingual LM training at scale.

An Improved Autoregressive Evaluation Paradigm for Large Language Models
J. Zhang, R. Pan, Y. Hu, K. Shum, G. Yao, X. Liu, R. Pi, H. Dong, S. Diao, Y. Lin, H. Zhao, T. Zhang
In ACM Transactions on Intelligent Systems and Technology (TIST 2025).
[abs] [pdf]

The AI community has witnessed the emergence of various chat-style Large Language Models (LLMs) since the advent of ChatGPT. Despite significant progress in this area, evaluating these models remains a substantial challenge. The evaluations provided by humans or GPT-4 oracles are often taken as the gold standard, but they are neither automatic nor scalable. More recently, a series of (open-source) LLM-based judge models have been introduced, yet they often exhibit model-specific biases, e.g., a LLaMA-family judge favors a LLaMAfamily model. On the other hand, autoregressive evaluation metrics, which holds the potential to address the aforementioned issues, remains underexplored. Among them, likelihood-based metrics such as perplexity and negative log-likelihood (NLL) are widely adopted and has proven effective in tracking the pretraining progress of LLMs. However, they struggle to evaluate the generation capabilities of fine-tuned models due to exposure bias, a phenomenon where the distribution of the model's output gradually deviates from the ground-truth during inference. To address this key issue, in this paper, we propose a novel autoregressive metric, Normalized Discounted Cumulative Gain (NDCG), to improve the evaluation of fine-tuned LLMs. Our experimental results demonstrate that NDCG significantly outperforms likelihood-based metrics: it shows over 45% improvement in both Spearman and Kendall's tau correlation coefficients for commonsense QA tasks, and aligns more closely with GPT-4 Elo rankings for instruction-tuned models.

Learning Structured Representations by Embedding Class Hierarchy with Fast Optimal Transport
S. Zeng, S. Du, M. Yamada, H. Zhao
In Proceedings of the 13th International Conference on Learning Representations (ICLR 2025)
[abs] [pdf] [code]

Accelerating Neural ODEs: A Variational Formulation-based Approach
H. Zhao, Y. Wang, H. Qi, Z. Huang, H. Zhao, L. Sha, H. Shao
In Proceedings of the 13th International Conference on Learning Representations (ICLR 2025)
[abs] [pdf] [code]

Neural Ordinary Differential Equations (Neural ODEs or NODEs) excel at modeling continuous dynamical systems from observational data, especially when the data is irregularly sampled. However, existing training methods predominantly rely on numerical ODE solvers, which are time-consuming and prone to accumulating numerical errors over time due to autoregression. In this work, we propose VF-NODE, a novel approach based on the variational formulation (VF) to accelerate the training of NODEs. Unlike existing training methods, the proposed VF-NODEs implement a series of global integrals, thus evaluating Deep Neural Network (DNN)--based vector fields only at specific observed data points. This strategy drastically reduces the number of function evaluations (NFEs). Moreover, our method eliminates the use of autoregression, thereby reducing error accumulations for modeling dynamical systems. Nevertheless, the VF loss introduces oscillatory terms into the integrals when using the Fourier basis. We incorporate Filon's method to address this issue. To further enhance the performance for noisy and incomplete data, we employ the natural cubic spline regression to estimate a closed-form approximation. We provide a fundamental analysis of how our approach minimizes computational costs. Extensive experiments demonstrate that our approach accelerates NODE training by 10 to 1000 times compared to existing NODE-based methods, while achieving higher or comparable accuracy in dynamical systems. The code is available at https://github.com/ZhaoHongjue/VF-NODE-ICLR2025.

Towards Understanding the Fragility of Multilingual LLMs against Fine-Tuning Attacks
S. Poppi, ZX. Yong, Y. He, B. Chern, H. Zhao, A. Yang, J. Chi
In Findings of the 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL Findings 2025)
[abs] [pdf]

Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic
Y. He, Y. Hu, Y. Lin, T. Zhang, H. Zhao
Transactions on Machine Learning Research (TMLR 2025)
[abs] [pdf] [code]

Model merging offers an effective strategy to combine the strengths of multiple finetuned models into a unified model that preserves the specialized capabilities of each. Existing methods merge models in a global manner, performing arithmetic operations across all model parameters. However, such global merging often leads to task interference, degrading the performance of the merged model. In this work, we introduce Localize-and-Stitch, a novel approach that merges models in a localized way. Our algorithm works in two steps: i) Localization: identify tiny (1% of the total parameters) localized regions in the finetuned models containing essential skills for the downstream tasks, and ii) Stitching: reintegrate only these essential regions back into the pretrained model for task synergy. We demonstrate that our approach effectively locates sparse regions responsible for finetuned performance, and the localized regions could be treated as compact and interpretable representations of the finetuned models (tasks). Empirically, we evaluate our method on various vision and language benchmarks, showing that it outperforms existing model merging methods under different data availability scenarios. Beyond strong empirical performance, our algorithm also facilitates model compression and preserves pretrained knowledge, enabling flexible and continual skill composition from multiple finetuned models with minimal storage and computational overhead.

Most Influential Subset Selection: Challenges, Promises, and Beyond
Y. Hu, P. Hu, H. Zhao, J. Ma
In Proceedings of the 38th Advances in Neural Information Processing Systems (NeurIPS 2024)
[abs] [pdf] [poster]

How can we attribute the behaviors of machine learning models to their training data? While the classic influence function sheds light on the impact of individual samples, it often fails to capture the more complex and pronounced collective influence of a set of samples. To tackle this challenge, we study the Most Influential Subset Selection (MISS) problem, which aims to identify a subset of training samples with the greatest collective influence. We conduct a comprehensive analysis of the prevailing approaches in MISS, elucidating their strengths and weaknesses. Our findings reveal that influence-based greedy heuristics, a dominant class of algorithms in MISS, can provably fail even in linear regression. We delineate the failure modes, including the errors of influence function and the non-additive structure of the collective influence. Conversely, we demonstrate that an adaptive version of these heuristics which applies them iteratively, can effectively capture the interactions among samples and thus partially address the issues. Experiments on real-world datasets corroborate these theoretical findings, and further demonstrate that the merit of adaptivity can extend to more complex scenarios such as classification tasks and non-linear neural networks. We conclude our analysis by emphasizing the inherent trade-off between performance and computational efficiency, questioning the use of additive metrics such as the linear datamodeling score, and offering a range of discussions.

Learning Structured Representations with Hyperbolic Embeddings
A. Sinha, S. Zeng, M. Yamada, H. Zhao
In Proceedings of the 38th Advances in Neural Information Processing Systems (NeurIPS 2024)
[abs] [pdf] [code] [poster]

On the Expressive Power of Tree-Structured Probabilistic Circuits
L. Yin, H. Zhao
In Proceedings of the 38th Advances in Neural Information Processing Systems (NeurIPS 2024)
[abs] [pdf] [poster]

FedGTST: Boosting Global Transferability of Federated Models via Statistics Tuning
E. Ma, C. Pan, S. Rasoul Etesami, H. Zhao, O. Milenkovic
In Proceedings of the 38th Advances in Neural Information Processing Systems (NeurIPS 2024)
[abs] [pdf]

The performance of Transfer Learning (TL) heavily relies on effective pretraining, which demands large datasets and substantial computational resources. As a result, executing TL is often challenging for individual model developers. Federated Learning (FL) addresses these issues by facilitating collaborations among clients, expanding the dataset indirectly, distributing computational costs, and preserving privacy. However, key challenges remain unresolved. First, existing FL methods tend to optimize transferability only within local domains, neglecting the global learning domain. Second, most approaches rely on indirect transferability metrics, which do not accurately reflect the final target loss or true degree of transferability. To address these gaps, we propose two enhancements to FL. First, we introduce a client-server exchange protocol that leverages cross-client Jacobian (gradient) norms to boost transferability. Second, we increase the average Jacobian norm across clients at the server, using this as a local regularizer to reduce cross-client Jacobian variance. Our transferable federated algorithm, termed FedGTST (Federated Global Transferability via Statistics Tuning), demonstrates that increasing the average Jacobian and reducing its variance allows for tighter control of the target loss. This leads to an upper bound on the target loss in terms of the source loss and source-target domain discrepancy. Extensive experiments on datasets such as MNIST to MNIST-M and CIFAR10 to SVHN show that FedGTST outperforms relevant baselines, including FedSR. On the second dataset pair, FedGTST improves accuracy by 9.8% over FedSR and 7.6% over FedIIR when LeNet is used as the backbone.

LibMOON: A Gradient-based MultiObjective OptimizatioN Library in PyTorch
X. Zhang, L. Zhao, Y. Yu, X. Lin, Y. Chen, H. Zhao, Q. Zhang
In Proceedings of the 38th Advances in Neural Information Processing Systems, Track on Datasets and Benchmarks (NeurIPS 2024, D&B Track)
[abs] [pdf] [code]

Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts
H. Wang, W. Xiong, T. Xie, H. Zhao, T. Zhang
In Proceedings of the Association for Computational Linguistics: EMNLP 2024 (EMNLP 2024 Findings)
[abs] [pdf] [code]

Reinforcement learning from human feedback (RLHF) has emerged as the primary method for aligning large language models (LLMs) with human preferences. The RLHF process typically starts by training a reward model (RM) using human preference data. Conventional RMs are trained on pairwise responses to the same user request, with relative ratings indicating which response humans prefer. The trained RM serves as a proxy for human preferences. However, due to the black-box nature of RMs, their outputs lack interpretability, as humans cannot intuitively understand why an RM thinks a response is good or not. As RMs act as human preference proxies, we believe they should be human-interpretable to ensure that their internal decision processes are consistent with human preferences and to prevent reward hacking in LLM alignment. To build RMs with interpretable preferences, we propose a two-stage approach: i) train an Absolute-Rating Multi-Objective Reward Model (ArmoRM) with multi-dimensional absolute-rating data, each dimension corresponding to a human-interpretable objective (e.g., honesty, verbosity, safety); ii) employ a Mixture-of-Experts (MoE) strategy with a gating network that automatically selects the most suitable reward objectives based on the context. We efficiently trained an ArmoRM with Llama-3 8B and a gating network consisting of a shallow MLP on top of the ArmoRM. Our trained model, ArmoRM-Llama3-8B, obtains state-of-the-art performance on RewardBench, a benchmark evaluating RMs for language modeling. Notably, the performance of our model surpasses the LLM-as-a-judge method with GPT-4 judges by a margin, and approaches the performance of the much larger Nemotron-4 340B reward model.

Semi-Supervised Reward Modeling via Iterative Self-Training
Y. He, H. Wang, Z. Jiang, A. Papangelis, H. Zhao
In Proceedings of the Association for Computational Linguistics: EMNLP 2024 (EMNLP 2024 Findings)
[abs] [pdf] [code]

Mitigating the Alignment Tax of RLHF
Y. Lin, H. Lin, W. Xiong, S. Diao, J. Liu, J. Zhang, R. Pan, H. Wang, W. Hu, H. Zhang, H. Dong, R. Pi, H. Zhao, N. Jiang, H. Ji, Y. Yao, T. Zhang
In Proceedings of the Association for Computational Linguistics: EMNLP 2024 (EMNLP 2024)
[abs] [pdf] [code]

LLMs acquire a wide range of abilities during pre-training, but aligning LLMs under Reinforcement Learning with Human Feedback (RLHF) can lead to forgetting pretrained abilities, which is also known as the alignment tax. To investigate alignment tax, we conducted experiments with existing RLHF algorithms using OpenLLaMA-3B, which revealed a pronounced alignment tax in NLP tasks. Whereas, despite various techniques to mitigate forgetting, they are often at odds with the RLHF performance, leading to a trade-off between alignment performance and forgetting mitigation, leading to an alignment-forgetting trade-off. In this paper we show that model averaging, which simply interpolates between pre and post RLHF model weights, surprisingly achieves the most strongest alignment-forgetting Pareto front among a wide range of competing methods. To understand its effectiveness, we offer theoretical insights into model averaging, revealing that it enhances performance Pareto front by increasing feature diversity on the layers where tasks share overlapped feature spaces. Empirical evidence corroborates our analysis by showing the benefits of averaging low-level transformer layers. Building on the analysis and the observation that averaging different layers of the transformer leads to significantly different alignment-forgetting trade-offs, we propose Heterogeneous Model Averaging (HMA) to Heterogeneously find various combination ratios of model layers. HMA seeks to maximize the alignment performance while incurring minimal alignment tax. Moreover, we validate HMA's performance across a range of RLHF algorithms over OpenLLaMA-3B and further extend our findings to Mistral-7B which is evaluated by open-sourced preference model and GPT4.

Fair and Optimal Prediction via Post-Processing
H. Zhao
AI Magazine (an overview of our group's work on algorithmic fairness and more broadly, trustworthy machine learning)
[abs] [link]

With the development of machine learning algorithms and the increasing computational resources available, artificial intelligence has achieved great success in many application domains. However, the success of machine learning has also raised concerns about the fairness of the learned models. For instance, the learned models can perpetuate and even exacerbate the potential bias and discrimination in the training data. This issue has become a major obstacle to the deployment of machine learning systems in high-stakes domains, for example, criminal judgment, medical testing, online advertising, hiring process, and so forth. To mitigate the potential bias exhibited by machine learning models, fairness criteria can be integrated into the training process to ensure fair treatment across all demographics, but it often comes at the expense of model performance. Understanding such tradeoffs, therefore, is crucial to the design of optimal and fair algorithms. My research focuses on characterizing the inherent tradeoff between fairness and accuracy in machine learning, and developing algorithms that can achieve both fairness and optimality. In this article, I will discuss our recent work on designing post-processing algorithms for fair classification, which can be applied to a wide range of fairness criteria, including statistical parity, equal opportunity, and equalized odds, under both attribute-aware and attribute-blind settings, and is particularly suited to large-scale foundation models where retraining is expensive or even infeasible. I will also discuss the connections between our work and other related research on trustworthy machine learning, including the connections between algorithmic fairness and differential privacy as well as adversarial robustness.

An Empirical Study of Self-Supervised Learning with Wasserstein Distance
M. Yamada, Y. Takezawa, G. Houry, K. Düsterwald, D. Sulem, H. Zhao, Y. H. Tsai
Entropy (Entropy 2024)
[abs] [arXiv] [Entropy]

In this study, we delve into the problem of self-supervised learning (SSL) utilizing the 1-Wasserstein distance on a tree structure (a.k.a., Tree-Wasserstein distance (TWD)), where TWD is defined as the L1 distance between two tree-embedded vectors. In SSL methods, the cosine similarity is often utilized as an objective function; however, it has not been well studied when utilizing the Wasserstein distance. Training the Wasserstein distance is numerically challenging. Thus, this study empirically investigates a strategy for optimizing the SSL with the Wasserstein distance and finds a stable training procedure. More specifically, we evaluate the combination of two types of TWD (total variation and ClusterTree) and several probability models, including the softmax function, the ArcFace probability model, and simplicial embedding. We propose a simple yet effective Jeffrey divergence-based regularization method to stabilize optimization. Through empirical experiments on STL10, CIFAR10, CIFAR100, and SVHN, we find that a simple combination of the softmax function and TWD can obtain significantly lower results than the standard SimCLR. Moreover, a simple combination of TWD and SimSiam fails to train the model. We find that the model performance depends on the combination of TWD and probability model, and that the Jeffrey divergence regularization helps in model training. Finally, we show that the appropriate combination of the TWD and probability model outperforms cosine similarity-based representation learning.

Gradual Domain Adaptation: Theory and Algorithms
Y. He, H. Wang, B. Li, H. Zhao
Journal of Machine Learning Research (JMLR 2024)
(Extended version of our ICML 2022 paper under title "Understanding Gradual Domain Adaptation: Improved Analysis, Optimal Path and Beyond")
[abs] [arXiv] [JMLR] [code]

Unsupervised domain adaptation (UDA) adapts a model from a labeled source domain to an unlabeled target domain in a one-off way. Though widely applied, UDA faces a great challenge whenever the distribution shift between the source and the target is large. Gradual domain adaptation (GDA) mitigates this limitation by using intermediate domains to gradually adapt from the source to the target domain. In this work, we first theoretically analyze gradual self-training, a popular GDA algorithm, and provide a significantly improved generalization bound compared with Kumar et al. (2020). Our theoretical analysis leads to an interesting insight: to minimize the generalization error on the target domain, the sequence of intermediate domains should be placed uniformly along the Wasserstein geodesic between the source and target domains. The insight is particularly useful under the situation where intermediate domains are missing or scarce, which is often the case in real-world applications. Based on the insight, we propose Generative Gradual DOmain Adaptation with Optimal Transport (GOAT), an algorithmic framework that can generate intermediate domains in a data-dependent way. More concretely, we first generate intermediate domains along the Wasserstein geodesic between two given consecutive domains in a feature space, then apply gradual self-training to adapt the source-trained classifier to the target along the sequence of intermediate domains. Empirically, we demonstrate that our GOAT framework can improve the performance of standard GDA when the given intermediate domains are scarce, significantly broadening the real-world application scenarios of GDA. Our code is available at https://github.com/yifei-he/GOAT.

Efficient Modality Selection in Multimodal Learning
Y. He, R. Cheng, G. Balasubramaniam, Y. H. Tsai, and H. Zhao
Journal of Machine Learning Research (JMLR 2024)
[abs] [pdf]

Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards
H. Wang, Y. Lin, W. Xiong, R. Yang, S. Diao, S. Qiu, H. Zhao, T. Zhang
In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)
[abs] [arXiv] [code] [poster]

Fine-grained control over large language models (LLMs) remains a significant challenge, hindering their adaptability to diverse user needs. While Reinforcement Learning from Human Feedback (RLHF) shows promise in aligning LLMs, its reliance on scalar rewards often limits its ability to capture diverse user preferences in real-world applications. To address this limitation, we introduce the Directional Preference Alignment (DPA) framework. Unlike the scalar-reward RLHF, DPA incorporates multi-objective reward modeling to represent diverse preference profiles. Additionally, DPA models user preferences as directions (i.e., unit vectors) in the reward space to achieve user-dependent preference control. Our method involves training a multi-objective reward model and then fine-tuning the LLM with a preference-conditioned variant of Rejection Sampling Finetuning (RSF), an RLHF method adopted by Llama 2. This method enjoys a better performance trade-off across various reward objectives. In comparison with the scalar-reward RLHF, DPA offers users intuitive control over LLM generation: they can arithmetically specify their desired trade-offs (e.g., more helpfulness with less verbosity). We also validate the effectiveness of DPA with real-world alignment experiments on Mistral-7B. Our method provides straightforward arithmetic control over the trade-off between helpfulness and verbosity while maintaining competitive performance with strong baselines such as Direct Preference Optimization (DPO).

Differentially Private Post-Processing for Fair Regression
R. Xian, Q. Li, G. Kamath, H. Zhao
In Proceedings of the 41st International Conference on Machine Learning (ICML 2024)
[abs] [pdf] [code]

Pairwise Alignment Improves Graph Domain Adaptation
S. Liu, D. Zou, H. Zhao, P. Li
In Proceedings of the 41st International Conference on Machine Learning (ICML 2024, spotlight)
[abs] [pdf]

Robust Multi-Task Learning with Excess Risks
Y. He, S. Zhou, G. Zhang, H. Yun, Y. Xu, B. Zeng, T. Chilimbi, H. Zhao
In Proceedings of the 41st International Conference on Machine Learning (ICML 2024)
[abs] [pdf]

A Survey of Recent Methods for Addressing AI Fairness and Bias in Biomedicine
Y. Yang, M. Lin, H. Zhao, Y. Peng, F. Huang, Z. Lu
Journal of Biomedical Informatics (JBI 2024)
[abs] [pdf] [arXiv]

Artificial intelligence (AI) systems have the potential to revolutionize clinical practices, including improving diagnostic accuracy and surgical decision-making, while also reducing costs and manpower. However, it is important to recognize that these systems may perpetuate social inequities or demonstrate biases, such as those based on race or gender. Such biases can occur before, during, or after the development of AI models, making it critical to understand and address potential biases to enable the accurate and reliable application of AI models in clinical settings. To mitigate bias concerns during model development, we surveyed recent publications on different debiasing methods in the fields of biomedical natural language processing (NLP) or computer vision (CV). Then we discussed the methods that have been applied in the biomedical domain to address bias. We performed our literature search on PubMed, ACM digital library, and IEEE Xplore of relevant articles published between January 2018 and December 2023 using multiple combinations of keywords. We then filtered the result of 10,041 articles automatically with loose constraints, and manually inspected the abstracts of the remaining 890 articles to identify the 55 articles included in this review. Additional articles in the references are also included in this review. We discuss each method and compare its strengths and weaknesses. Finally, we review other potential methods from the general domain that could be applied to biomedicine to address bias and improve fairness. The bias of AIs in biomedicine can originate from multiple sources. Existing debiasing methods that focus on algorithms can be categorized into distributional or algorithmic.

Towards Practical Non-Adversarial Distribution Alignment via Variational Bounds
Z. Gong, B. Usman, H. Zhao and D. I. Inouye
In Proceedings of the 27th International Conference on Artificial Intelligence and Statistics (AISTATS 2024)
[abs] [pdf]

Fast 1-Wasserstein distance approximations using greedy strategies
G. Houry, H. Bao, H. Zhao and M. Yamada
In Proceedings of the 27th International Conference on Artificial Intelligence and Statistics (AISTATS 2024)
[abs] [pdf] [poster]

FFB: A Fair Fairness Benchmark for In-Processing Group Fairness Methods
X. Han, J. Chi, Y. Chen, Q. Wang, H. Zhao, N. Zou, X. Hu
In Proceedings of the 12th International Conference on Learning Representations (ICLR 2024)
[abs] [pdf]

RLHF Workflow: From Reward Modeling to Online RLHF
H. Dong, W. Xiong, B. Pang, H. Wang, H. Zhao, Y. Zhou, N. Jiang, D. Sahoo, C. Xiong, T. Zhang
Transactions on Machine Learning Research (TMLR 2024)
[abs] [pdf] [code]

We present the workflow of Online Iterative Reinforcement Learning from Human Feedback (RLHF) in this technical report, which is widely reported to outperform its offline counterpart by a large margin in the recent large language model (LLM) literature. However, existing open-source RLHF projects are still largely confined to the offline learning setting. In this technical report, we aim to fill in this gap and provide a detailed recipe that is easy to reproduce for online iterative RLHF. In particular, since online human feedback is usually infeasible for open-source communities with limited resources, we start by constructing preference models using a diverse set of open-source datasets and use the constructed proxy preference model to approximate human feedback. Then, we discuss the theoretical insights and algorithmic principles behind online iterative RLHF, followed by a detailed practical implementation. Our trained LLM, \texttt{SFR-Iterative-DPO-LLaMA-3-8B-R}, achieves impressive performance on LLM chatbot benchmarks, including AlpacaEval-2, Arena-Hard, and MT-Bench, as well as other academic benchmarks such as HumanEval and TruthfulQA. We have shown that supervised fine-tuning (SFT) and iterative RLHF can obtain state-of-the-art performance with fully open-source datasets. Further, we have made our models, curated datasets, and comprehensive step-by-step code guidebooks publicly available. Please refer to \url{https://github.com/RLHFlow/RLHF-Reward-Modeling} and \url{https://github.com/RLHFlow/Online-RLHF} for more detailed information.

Enhancing Compositional Generalization via Compositional Feature Alignment
H. Wang, H. Si, H. Shao, H. Zhao
Transactions on Machine Learning Research (TMLR 2024)
[abs] [pdf]

Real-world applications of machine learning models often confront data distribution shifts, wherein discrepancies exist between the training and test data distributions. In the common multi-domain multi-class setup, as the number of classes and domains scales up, it becomes infeasible to gather training data for every domain-class combination. This challenge naturally leads the quest for models with Compositional Generalization (CG) ability, where models can generalize to unseen domain-class combinations. To delve into the CG challenge, we develop CG-Bench, a suite of CG benchmarks derived from existing real-world image datasets, and observe that the prevalent pretraining-finetuning paradigm on foundational models, such as CLIP and DINOv2, struggles with the challenge. To address this challenge, we propose Compositional Feature Alignment (CFA), a simple two-stage finetuning technique that i) learns two orthogonal linear heads on a pretrained encoder with respect to class and domain labels, and ii) fine-tunes the encoder with the newly learned head frozen. We theoretically and empirically justify that CFA encourages compositional feature learning of pretrained models. We further conduct extensive experiments on CG-Bench for CLIP and DINOv2, two powerful pretrained vision foundation models. Experiment results show that CFA outperforms common finetuning techniques in compositional generalization, corroborating CFA's efficacy in compositional feature learning.

A General-Purpose Multi-Modal OOD Detection Framework
V. Duong, Q. Wu, Z. Zhou, E. Zavesky, W-L. Hsu, H. Zhao, H. Shao
Transactions on Machine Learning Research (TMLR 2024)
[abs] [pdf]

Out-of-distribution (OOD) detection seeks to identify test samples that deviate from the training data, which is critical to ensuring the safety and reliability of machine learning (ML) systems. While a plethora of methods have been developed to detect uni-modal OOD samples, only a few have focused on multi-modal OOD detection. Current contrastive learning-based methods primarily address multi-modal OOD detection in a scenario where an image is not related to the class labels in training data. However, ML systems in the real-world applications may encounter a broader spectrum of anomalies caused by different factors like systematic errors in labeling, environmental changes, and sensor malfunctions. Hence, we propose a new method to be able to simultaneously detect anomalies from multiple different OOD scenarios, arising from fine-grained image features and textual descriptions, instead of large categorical information. To achieve this goal, we propose a general-purpose weakly-supervised OOD detection framework, called WOOD, that combines a binary classifier and a contrastive learning module to reap the benefits of both. In order to better distinguish in-distribution (ID) samples from OOD ones, we employ the Hinge loss to constrain the similarity of their latent representations. Moreover, we devise a new scoring metric that fuses predictions from both the binary classifier and contrastive learning to enhance OOD detection. Extensive experimental results on multiple benchmarks demonstrate that the proposed WOOD significantly outperforms the state-of-the-art methods for multi-modal OOD detection. Importantly, our approach can achieve superior detection performance in a variety of OOD scenarios.

Personalized Federated Learning with Spurious Features: An Adversarial Approach
X. Wang, H. Zhao, K. Nahrstedt, S. Koyejo
Transactions on Machine Learning Research (TMLR 2024)
[abs] [pdf]

Revisiting Scalarization in Multi-Task Learning: A Theoretical Perspective
Y. Hu, R. Xian, Q. Wu, Q. Fan, L. Yin, and H. Zhao
In Proceedings of the 37th Advances in Neural Information Processing Systems (NeurIPS 2023)
[abs] [pdf] [poster] [slides] [video]

Linear scalarization, i.e., combining all loss functions by a weighted sum, has been the default choice in the literature of multi-task learning (MTL) since its inception. In recent years, there is a surge of interest in developing Specialized Multi-Task Optimizers (SMTOs) that treat MTL as a multi-objective optimization problem. However, it remains open whether there is a fundamental advantage of SMTOs over scalarization. In fact, heated debates exist in the community comparing these two types of algorithms, mostly from an empirical perspective. To approach the above question, in this paper, we revisit scalarization from a theoretical perspective. We focus on linear MTL models and study whether scalarization is capable of fully exploring the Pareto front. Our findings reveal that, in contrast to recent works that claimed empirical advantages of scalarization, scalarization is inherently incapable of full exploration, especially for those Pareto optimal solutions that strike the balanced trade-offs between multiple tasks. More concretely, when the model is under-parametrized, we reveal a multi-surface structure of the feasible region and identify necessary and sufficient conditions for full exploration. This leads to the conclusion that scalarization is in general incapable of tracing out the Pareto front. Our theoretical results partially answer the open questions in Xin et al. (2021), and provide a more intuitive explanation on why scalarization fails beyond non-convexity. We additionally perform experiments on a real-world dataset using both scalarization and state-of-the-art SMTOs. The experimental results not only corroborate our theoretical findings, but also unveil the potential of SMTOs in finding balanced solutions, which cannot be achieved by scalarization.

Efficient Learning of Linear Graph Neural Networks via Node Subsampling
S. Shin, I. Shomorony, and H. Zhao
In Proceedings of the 37th Advances in Neural Information Processing Systems (NeurIPS 2023)
[abs] [pdf] [poster]

Graph Neural Networks (GNNs) are a powerful class of machine learning models with applications in recommender systems, drug discovery, social network analysis, and computer vision. One challenge with their implementation is that GNNs often take large-scale graphs as inputs, which imposes significant computational/storage costs in the training and testing phases. In particular, the message passing operations of a GNN require multiplication of the graph adjacency matrix A ∈\R^n \times n and the data matrix X ∈\R^n \times d, and the O(n^2 d) time complexity can be prohibitive for large n. Thus, a natural question is whether it is possible to perform the GNN operations in (quasi-)linear time by avoiding the full computation of A X. To study this question, we consider the setting of a regression task on a two-layer Linear Graph Convolutional Network (GCN). We develop an efficient training algorithm based on (1) performing node subsampling, (2) estimating the leverage scores of A X based on the subsampled graph, and (3) performing leverage score sampling on A X. We show that our proposed scheme learns the regression model observing only O(nd\eps^-2\log n) entries of A in time O(nd^2 \eps^-2\log n), with the guarantee that the learned weights deviate by at most εunder the \ell_2 norm from the model learned using the entire adjacency matrix A. We present empirical results for regression problems on two real-world graphs and show that our algorithm significantly outperforms other baseline sampling strategies that exploit the same number of observations.

Learning List-Level Domain-Invariant Representations for Ranking
R. Xian, H. Zhuang, Z. Qin, H. Zamani, J. Lu, J. Ma, K. Hui, H. Zhao, X. Wang, M. Bendersky
In Proceedings of the 37th Advances in Neural Information Processing Systems (NeurIPS 2023, spotlight)
[abs] [pdf] [poster]

Adaptation Augmented Model-based Policy Optimization
J. Shen, H. Lai, M. Liu, H. Zhao, Y. Yu, and W. Zhang
Journal of Machine Learning Research (JMLR 2023)
(Extended version of our NeurIPS 2020 paper under title "Model-based Policy Optimization with Unsupervised Model Adaptation")
[abs] [pdf]

Compared to model-free reinforcement learning (RL), model-based RL is often more sample efficient by leveraging a learned dynamics model to help decision-making. However, the learned model is usually not perfectly accurate and the error will compound in multi-step predictions, which can lead to poor asymptotic performance. In this paper, we first derive an upper bound of the return discrepancy between the real dynamics and the learned model, which reveals the fundamental problem of distribution shift between simulated data and real data. Inspired by the theoretical analysis, we propose an adaptation augmented model-based policy optimization (AMPO) framework to address the distribution shift problem from the perspectives of feature learning and instance re-weighting, respectively. Specifically, the feature-based variant, namely FAMPO, introduces unsupervised model adaptation to minimize the integral probability metric (IPM) between feature distributions from real and simulated data, while the instance-based variant, termed as IAMPO, utilizes importance sampling to re-weight the real samples used to train the model. Besides model learning, we also investigate how to improve policy optimization in the model usage phase by selecting simulated samples with different probabilities according to their uncertainty. Extensive experiments on challenging continuous control tasks show that FAMPO and IAMPO, coupled with our model usage technique, achieve superior performance against baselines, which demonstrates the effectiveness of the proposed methods.

Train Your Own GNN Teacher: Graph-Aware Distillation on Textual Graphs
C. Mavromatis, V. N. Ioannidis, S. Wang, D. Zheng, S. Adeshina, J. Ma, H. Zhao, C. Faloutsos, G. Karypis
In Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD 2023)
[abs] [pdf]

Fair and Optimal Classification via Post-Processing
R. Xian, L. Yin, H. Zhao
In Proceedings of the 40th International Conference on Machine Learning (ICML 2023)
[abs] [pdf] [code] [slides] [video]

Understanding the Impact of Adversarial Robustness on Accuracy Disparity
Y. Hu, F. Wu, H. Zhang, and H. Zhao
In Proceedings of the 40th International Conference on Machine Learning (ICML 2023)
[abs] [pdf]

Structural Re-weighting Improves Graph Domain Adaptation
S. Liu, T. Li, Y. Feng, N. Tran, H. Zhao, Q. Qiu, and Pan Li
In Proceedings of the 40th International Conference on Machine Learning (ICML 2023)
[abs] [pdf]

Understanding and Constructing Latent Modality Structures in Multi-modal Representation Learning
Q. Jiang, C. Chen, H. Zhao, L. Chen, Q. Ping, S. Dinh Tran, Y. Xu, B. Zeng, T. Chilimbi
In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2023)
[abs] [pdf]

Contrastive loss has been increasingly used in learning representations from multiple modalities. In the limit, the nature of the contrastive loss encourages modalities to exactly match each other in the latent space. Yet it remains an open question how the modality alignment affects the downstream task performance. In this paper, based on an information-theoretic argument, we first prove that exact modality alignment is sub-optimal in general for downstream prediction tasks. Hence we advocate that the key of better performance lies in meaningful latent modality structures instead of perfect modality alignment. To this end, we propose three general approaches to construct latent modality structures. Specifically, we design 1) a deep feature separation loss for intra-modality regularization; 2) a Brownian-bridge loss for inter-modality regularization; and 3) a geometric consistency loss for both intra- and inter-modality regularization. Extensive experiments are conducted on two popular multi-modal representation learning frameworks: the CLIP-based two-tower model and the ALBEF-based fusion model. We test our model on a variety of tasks including zero/few-shot image classification, image-text retrieval, visual question answering, visual reasoning, and visual entailment. Our method achieves consistent improvements over existing methods, demonstrating the effectiveness and generalizability of our proposed approach on latent modality structure regularization.

Costs and Benefits of Fair Regression
H. Zhao
Transactions on Machine Learning Research (TMLR 2023)
[abs] [pdf]

Real-world applications of machine learning tools in high-stakes domains are often regulated to be fair, in the sense that the predicted target should satisfy some quantitative notion of parity with respect to a protected attribute. However, the exact tradeoff between fairness and accuracy with a real-valued target is not entirely clear. In this paper, we characterize the inherent tradeoff between statistical parity and accuracy in the regression setting by providing a lower bound on the error of any attribute-blind fair regressor. Our lower bound is sharp, algorithm-independent, and admits a simple interpretation: when the moments of the target differ between groups, any fair algorithm has to make an error on at least one of the groups. We further extend this result to give a lower bound on the joint error of any (approximately) fair algorithm, using the Wasserstein distance to measure the quality of the approximation. With our novel lower bound, we also show that the price paid by a fair regressor that does not take the protected attribute as input is less than that of a fair regressor with explicit access to the protected attribute. On the upside, we establish the first connection between individual fairness, accuracy parity, and the Wasserstein distance by showing that if a regressor is individually fair, it also approximately verifies the accuracy parity, where the gap is again given by the Wasserstein distance between the two groups. Inspired by our theoretical results, we develop a practical algorithm for fair regression through the lens of representation learning, and conduct experiments on a real-world dataset to corroborate our findings.

Learning Structured Representations by Embedding Class Hierarchy
S. Zeng, R. des Combes, H. Zhao
In Proceedings of the 11th International Conference on Learning Representations (ICLR 2023)
[abs] [pdf] [code]

Adaptive Power Method: Eigenvector Estimation from Sampled Data
S. Shin, H. Zhao, I. Shomorony
In Proceedings of the 34th International Conference on Algorithmic Learning Theory (ALT 2023)
[abs] [pdf]

FedMM: A Communication Efficient Solver for Federated Adversarial Domain Adaptation
Y. Shen, J. Du, H. Zhao, Z. Ji, C. Ma, M. Gao
In Proceedings of the 22nd International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2023)
[abs] [pdf]

Federated adversary domain adaptation is a unique distributed minimax training task due to the heterogeneous data among different local clients, where each client only sees a subset of the data that merely belongs to either the source or target domain. Despite the extensive research in distributed minimax optimization, existing communication efficient solvers that exploit multiple steps of the local update are still not able to generate satisfactory solutions for federated adversarial domain adaptation because of the gradient divergence issue among clients. To tackle this problem, we propose a distributed minimax optimizer, referred to as FedMM, by introducing dual variables to bridge the gradient gap among clients. This algorithm is effective even in the extreme case where each client has different label classes and some clients only have unlabeled data. We prove that FedMM admits benign convergence to a stationary point under domain-shifted unlabeled data. On a variety of benchmark datasets, extensive experiments show that FedMM consistently achieves both better communication savings and significant accuracy improvements over existing federated optimizers based on the stochastic gradient descent ascent (SGDA) algorithm. When training from scratch, for example, it outperforms other SGDA based federated average methods by around 20% in accuracy over the same communication rounds; and it consistently outperforms when training from pre-trained models.

Invariant Feature Subspace Recovery for Multi-Class Classification
G. Balasubramaniam, H. Wang, H. Zhao
NeurIPS Distribution Shifts (DistShift) Workshop, 2022 (NeurIPS 2022)
[abs] [pdf]

Algorithms and Theory for Supervised Gradual Domain Adaptation
J. Dong, S. Zhou, B. Wang, H. Zhao
Transactions on Machine Learning Research (TMLR 2022)
[abs] [pdf]

The phenomenon of data distribution evolving over time has been observed in a range of applications, calling for the need for adaptive learning algorithms. We thus study the problem of supervised gradual domain adaptation, where labeled data from shifting distributions are available to the learner along the trajectory, and we aim to learn a classifier on a target data distribution of interest. Under this setting, we provide the first generalization upper bound on the learning error under mild assumptions. Our results are algorithm agnostic, general for a range of loss functions, and only depend linearly on the averaged learning error across the trajectory. This shows significant improvement compared to the previous upper bound for unsupervised gradual domain adaptation, where the learning error on the target domain depends exponentially on the initial error on the source domain. Compared with the offline setting of learning from multiple domains, our results also suggest the potential benefits of the temporal structure among different domains in adapting to the target one. Empirically, our theoretical results imply that learning proper representations across the domains will effectively mitigate learning errors. Motivated by these theoretical insights, we propose a min-max learning objective to learn the representation and classifier simultaneously. Experimental results on both semi-synthetic and large-scale real datasets corroborate our findings and demonstrate the effectiveness of our objectives.

Fundamental Limits and Tradeoffs in Invariant Representation Learning
H. Zhao*, C. Dan*, B. Aragam, T. Jaakkola, G. Gordon, and P. Ravikumar
Journal of Machine Learning Research (JMLR 2022)
(Also presented at NeurIPS 2023 Journal Track)
[abs] [JMLR] [arXiv] [poster]

A wide range of machine learning applications such as privacy-preserving learning, algorithmic fairness, and domain adaptation/generalization among others, involve learning \emph{invariant representations} of the data that aim to achieve two competing goals: (a) maximize information or accuracy with respect to a target response, and (b) maximize invariance or independence with respect to a set of protected features (e.g.\ for fairness, privacy, etc). Despite their wide applicability, theoretical understanding of the optimal tradeoffs --- with respect to accuracy, and invariance --- achievable by invariant representations is still severely lacking. In this paper, we provide an information theoretic analysis of such tradeoffs under both classification and regression settings. More precisely, we provide a geometric characterization of the accuracy and invariance achievable by any representation of the data; we term this feasible region the information plane. We provide an inner bound for this feasible region for the classification case, and an exact characterization for the regression case, which allows us to either bound or exactly characterize the Pareto optimal frontier between accuracy and invariance. Although our contributions are mainly theoretical, a key practical application of our results is in certifying the potential sub-optimality of any given representation learning algorithm for either classification or regression tasks. Our results shed new light on the fundamental interplay between accuracy and invariance, and may be useful in guiding the design of future representation learning algorithms.

Conditional Supervised Contrastive Learning for Fair Text Classification
J. Chi, W. Shand, Y. Yu, K.-W. Chang, H. Zhao, and Y. Tian
In Proceedings of the 2022 Empirical Methods in Natural Language Processing (EMNLP 2022 Findings)
[abs] [pdf]

Exploring Gradient-based Multi-directional Controls in GANs
Z. Chen, R. Jiang, B. Duke, H. Zhao, and P. Aarabi
In Proceedings of the 17th European Conference on Computer Vision (ECCV 2022, oral)
[abs] [pdf]

Generative Adversarial Networks (GANs) have been widely applied in modeling diverse image distributions. However, despite its impressive applications, the structure of the latent space in GANs largely remains as a black-box, leaving its controllable generation an open problem, especially when spurious correlations between different semantic attributes exist in the image distributions. To address this problem, previous methods typically learn linear directions or individual channels that control semantic attributes in the image space. However, they often suffer from imperfect disentanglement, or are unable to obtain multi-directional controls. In this work, in light of the above challenges, we propose a novel approach that discovers nonlinear controls, which enables multi-directional manipulation as well as effective disentanglement, based on gradient information in the learned GAN latent space. More specifically, we first learn interpolation directions by following the gradients from classification networks trained separately on the attributes, and then navigate the latent space by exclusively controlling channels activated for the target attribute in the learned directions. Empirically, with small training data, our approach is able to gain fine-grained controls over a diverse set of bi-directional and multi-directional attributes, and we showcase its ability to achieve disentanglement significantly better than state-of-the-art methods both qualitatively and quantitatively.

Greedy Modality Selection via Approximate Submodular Maximization
R. Cheng, G. Balasubramaniam, Y. He, Y. H. Tsai, H. Zhao
In Proceedings of the 38th Conference on Uncertainty in Artificial Intelligence (UAI 2022)
[abs] [pdf]

Understanding Gradual Domain Adaptation: Improved Analysis, Optimal Path and Beyond
H. Wang, B. Li, H. Zhao
In Proceedings of the 39th International Conference on Machine Learning (ICML 2022)
[abs] [pdf] [code]

The vast majority of existing algorithms for unsupervised domain adaptation (UDA) focus on adapting from a labeled source domain to an unlabeled target domain directly in a one-off way. Gradual domain adaptation (GDA), on the other hand, assumes a path of ($T\mathrm{-}1$) unlabeled intermediate domains bridging the source and the target, and aims to provide better generalization on the target domain by leveraging the intermediate ones. Under certain assumptions, \citet{kumar2020understanding} proposed a simple algorithm, \textit{gradual self-training}, along with a generalization bound in the order of $e^{\mathcal O(T)}(\eps_0 \mathrm{+}\mathcal O\bigl(\sqrt {\frac{\log T}{n}}\bigr) \bigr)$ for the target domain error, where $\eps_0$ is the source domain error and $n$ is the data size of each domain. Due to the exponential factor, this upper bound becomes vacuous when $T$ is only moderately large. In this work, we analyze gradual self-training under more general and relaxed assumptions, and prove a significantly improved generalization bound as $\eps_0\mathrm{+}\widetilde{\mathcal O}\bigl(T\Delta \mathrm{+} \frac{T}{\sqrt{n}} \mathrm{+} \frac{1}{\sqrt{nT}}\bigr)$, where $\Delta$ is the average distributional distance between consecutive domains. Compared with the existing bound with an \emph{exponential} dependency on $T$ as a \textit{multiplicative} factor, our bound only depends on $T$ \emph{linearly and additively}. Perhaps more interestingly, our result implies the existence of an optimal choice of $T$ that minimizes the generalization error, and it also naturally suggests an optimal way to construct the path of intermediate domains so as to minimize the accumulative path length $T\Delta$ between the source and the target. To corroborate the implications of our theory, we examine gradual self-training on multiple semi-synthetic and real datasets, which confirms our findings. We believe our insights provide a path forward towards the design of future GDA algorithms.

Provable Domain Generalization via Invariant-Feature Subspace Recovery
H. Wang, H. Si, B. Li, H. Zhao
In Proceedings of the 39th International Conference on Machine Learning (ICML 2022)
[abs] [pdf] [code]

Domain generalization asks for models trained on a set of training environments to perform well on unseen test environments. Recently, a series of algorithms such as Invariant Risk Minimization (IRM) has been proposed for domain generalization. However, Rosenfeld et al. (2021) shows that in a simple linear data model, even if non-convexity issues are ignored, IRM and its extensions cannot generalize to unseen environments with less than ds+1 training environments, where ds is the dimension of the spurious-feature subspace. In this paper, we propose to achieve domain generalization with Invariant-feature Subspace Recovery (ISR). Our first algorithm, ISR-Mean, can identify the subspace spanned by invariant features from the first-order moments of the class-conditional distributions, and achieve provable domain generalization with ds+1 training environments under the data model of Rosenfeld et al. (2021). Our second algorithm, ISR-Cov, further reduces the required number of training environments to O(1) using the information of second-order moments. Notably, unlike IRM, our algorithms bypass non-convexity issues and enjoy global convergence guarantees. Empirically, our ISRs can obtain superior performance compared with IRM on synthetic benchmarks. In addition, on three real-world image and text datasets, we show that ISR-Mean can be used as a simple yet effective post-processing method to increase the worst-case accuracy of trained models against spurious correlations and group shifts.

Rethinking Controllable Variational Autoencoders
H. Shao, Y. Yang, H. Lin, L. Lin, Y. Chen, Q. Yang, H. Zhao
In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2022)
[abs] [pdf]

The Controllable Variational Autoencoder (ControlVAE) combines automatic control theory with the basic VAE model to manipulate the KL-divergence for overcoming posterior collapse and learning disentangled representations. It has shown success in a variety of applications, such as image generation, disentangled representation learning, and language modeling. However, when it comes to disentangled representation learning, ControlVAE does not delve into the rationale behind it. The goal of this paper is to develop a deeper understanding of ControlVAE in learning disentangled representations, including the choice of a desired KL-divergence (i.e, set point), and its stability during training. We first fundamentally explain its ability to disentangle latent variables from an information bottleneck perspective. We show that KL-divergence is an upper bound of the variational information bottleneck. By controlling the KL-divergence gradually from a small value to a target value, ControlVAE can disentangle the latent factors one by one. Based on this finding, we propose a new DynamicVAE that leverages a modified incremental PI (proportional-integral) controller, a variant of the proportional-integral-derivative (PID) algorithm, and employs a moving average as well as a hybrid annealing method to evolve the value of KL-divergence smoothly in a tightly controlled fashion. In addition, we analytically derive a lower bound of the set point for disentangling. We then theoretically prove the stability of the proposed approach. Evaluation results on multiple benchmark datasets demonstrate that DynamicVAE achieves a good trade-off between the disentanglement and reconstruction quality. We also discover that it can separate disentangled representation learning and reconstruction via manipulating the desired KL-divergence.

Cross-Lingual Transfer with Class-weighted Language-Invariant Representations
R. Xian, H. Ji, H. Zhao
In Proceedings of the 10th International Conference on Learning Representations (ICLR 2022)
[abs] [pdf] [code]

Conditional Contrastive Learning with Kernel
Y. H. Tsai, T. Li, M. Q. Ma, H. Zhao, K. Zhang, L-P. Morency, R. Salakhutdinov
In Proceedings of the 10th International Conference on Learning Representations (ICLR 2022)
[abs] [pdf] [code]

Conditional contrastive learning frameworks consider the conditional sampling procedure that constructs positive or negative data pairs conditioned on specific variables. Fair contrastive learning constructs negative pairs, for example, from the same gender (conditioning on sensitive information), which in turn reduces undesirable information from the learned representations; weakly supervised contrastive learning constructs positive pairs with similar annotative attributes (conditioning on auxiliary information), which in turn are incorporated into the representations. Although conditional contrastive learning enables many applications, the conditional sampling procedure can be challenging if we cannot obtain sufficient data pairs for some values of the conditioning variable. This paper presents Conditional Contrastive Learning with Kernel (CCL-K) that converts existing conditional contrastive objectives into alternative forms that mitigate the insufficient data problem. Instead of sampling data according to the value of the conditioning variable, CCLK uses the Kernel Conditional Embedding Operator that samples data from all available data and assigns weights to each sampled data given the kernel similarity between the values of the conditioning variable. We conduct experiments using weakly supervised, fair, and hard negatives contrastive learning, showing CCL-K outperforms state-of-the-art baselines.

Inherent Tradeoffs in Learning Fair Representations
H. Zhao and G. Gordon
Journal of Machine Learning Research (JMLR 2022)
(Extended version of an earlier paper with the same title appearing in NeurIPS 2019)
[abs] [JMLR] [arXiv]

Real-world applications of machine learning tools in high-stakes domains are often regulated to be fair, in the sense that the predicted target should satisfy some quantitative notion of parity with respect to a protected attribute. However, the exact tradeoff between fairness and accuracy is not entirely clear, even for the basic paradigm of classification problems. In this paper, we characterize an inherent tradeoff between statistical parity and accuracy in the classification setting by providing a lower bound on the sum of group-wise errors of any fair classifiers. Our impossibility theorem could be interpreted as a certain uncertainty principle in fairness: if the base rates differ among groups, then any fair classifier satisfying statistical parity has to incur a large error on at least one of the groups. We further extend this result to give a lower bound on the joint error of any (approximately) fair classifiers, from the perspective of learning fair representations. To show that our lower bound is tight, assuming oracle access to Bayes (potentially unfair) classifiers, we also construct an algorithm that returns a randomized classifier which is both optimal (in terms of accuracy) and fair. Interestingly, when the protected attribute can take more than two values, an extension of this lower bound does not admit an analytic solution. Nevertheless, in this case, we show that the lower bound can be efficiently computed by solving a linear program, which we term as the TV-Barycenter problem, a barycenter problem under the TV-distance. On the upside, we prove that if the group-wise Bayes optimal classifiers are close, then learning fair representations leads to an alternative notion of fairness, known as the accuracy parity, which states that the error rates are close between groups. Finally, we also conduct experiments on real-world datasets to confirm our theoretical findings.

Towards Return Parity in Markov Decision Processes
J. Chi, J. Shen, X. Dai, W. Zhang, Y. Tian, and H. Zhao
In Proceedings of the 25th International Conference on Artificial Intelligence and Statistics (AISTATS 2022)
[abs] [pdf]

Online Continual Adaptation with Active Self-Training
S. Zhou, H. Zhao, S. Zhang, L. Wang, H. Chang, Z. Wang, and W. Zhu
In Proceedings of the 25th International Conference on Artificial Intelligence and Statistics (AISTATS 2022)
[abs] [pdf]

Models trained with offline data often suffer from continual distribution shifts and expensive labeling in changing environments. This calls for a new online learning paradigm where the learner can continually adapt to changing environments with limited labels. In this paper, we propose a new online setting -- Online Active Continual Adaptation, where the learner aims to continually adapt to changing distributions using both unlabeled samples and active queries of limited labels. To this end, we propose Online Self-Adaptive Mirror Descent (OSAMD), which adopts an online teacher-student structure to enable online self-training from unlabeled data, and a margin-based criterion that decides whether to query the labels to track changing distributions. Theoretically, we show that, in the separable case, OSAMD has an $O({T}^{1/2})$ dynamic regret bound under mild assumptions, which is even tighter than the lower bound $\Omega(T^{2/3})$ of traditional online learning with full labels. In the general case, we show a regret bound of $O({\alpha^*}^{1/3} {T}^{2/3} + \alpha^* T)$, where $\alpha^*$ denotes the separability of domains and is usually small. Our theoretical results show that OSAMD can fast adapt to changing environments with active queries. Empirically, we demonstrate that OSAMD achieves favorable regrets under changing environments with limited labels on both simulated and real-world data, which corroborates our theoretical findings.

Invariant Information Bottleneck for Domain Generalization
B. Li, Y. Shen, Y. Wang, W. Zhu, C. J. Reed, J. Zhang, D. Li, K. Keutzer, and H. Zhao
In Proceedings of the 36th AAAI Conference on Artificial Intelligence (AAAI 2022)
[abs] [pdf] [poster]

Quantifying and Improving Transferability in Domain Generalization
G. Zhang, H. Zhao, Y. Yu, and P. Poupart
In Proceedings of the 35th Advances in Neural Information Processing Systems (NeurIPS 2021)
[abs] [pdf] [poster] [code]

Out-of-distribution generalization is one of the key challenges when transferring a model from the lab to the real world. Existing efforts mostly focus on building invariant features among source and target domains. Based on invariant features, a high-performing classifier on source domains could hopefully behave equally well on a target domain. In other words, the invariant features are \emph{transferable}. However, in practice, there are no perfectly transferable features, and some algorithms seem to learn ''more transferable'' features than others. How can we understand and quantify such \emph{transferability}? In this paper, we formally define transferability that one can quantify and compute in domain generalization. We point out the difference and connection with common discrepancy measures between domains, such as total variation and Wasserstein distance. We then prove that our transferability can be estimated with enough samples and give a new upper bound for the target error based on our transferability. Empirically, we evaluate the transferability of the feature embeddings learned by existing algorithms for domain generalization. Surprisingly, we find that many algorithms are not quite learning transferable features, although few could still survive. In light of this, we propose a new algorithm for learning transferable features and test it over various benchmark datasets, including RotatedMNIST, PACS, Office-Home and WILDS-FMoW. Experimental results show that the proposed algorithm achieves consistent improvement over many state-of-the-art algorithms, corroborating our theoretical findings.

EventKE: Event-Enhanced Knowledge Graph Embedding
Z. Zhang, H. Wang, H. Zhao, H. Tong and H. Ji
In Proceedings of the 2021 Empirical Methods in Natural Language Processing (EMNLP 2021 Findings)
[abs] [pdf] [code]

Understanding and Mitigating Accuracy Disparity in Regression
J. Chi, Y. Tian, G. Gordon and H. Zhao
In Proceedings of the 38th International Conference on Machine Learning (ICML 2021)
[abs] [pdf] [code]

Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Training and Effective Adaptation
H. Wang, H. Zhao, B. Li
In Proceedings of the 38th International Conference on Machine Learning (ICML 2021)
[abs] [pdf] [code]

Multi-task learning (MTL) aims to improve the generalization of several related tasks by learning them jointly. As a comparison, in addition to the joint training scheme, modern meta-learning allows unseen tasks with limited labels during the test phase, in the hope of fast adaptation over them. Despite the subtle difference between MTL and meta-learning in the problem formulation, both learning paradigms share the same insight that the shared structure between existing training tasks could lead to better generalization and adaptation. In this paper, we take one important step further to understand the close connection between these two learning paradigms, through both theoretical analysis and empirical investigation. Theoretically, we first demonstrate that MTL shares the same optimization formulation with a class of gradient-based meta-learning (GBML) algorithms. We then prove that for over-parameterized neural networks with sufficient depth, the learned predictive functions of MTL and GBML are close. In particular, this result implies that the predictions given by these two models are similar over the same unseen task. Empirically, we corroborate our theoretical findings by showing that, with proper implementation, MTL is competitive against state-of-the-art GBML algorithms on a set of few-shot image classification benchmarks. Since existing GBML algorithms often involve costly second-order bi-level optimization, our first-order MTL method is an order of magnitude faster on large-scale datasets such as mini-ImageNet. We believe this work could help bridge the gap between these two learning paradigms, and provide a computationally efficient alternative to GBML that also supports fast task adaptation.

Information Obfuscation of Graph Neural Networks
P. Liao, H. Zhao, K. Xu, T. Jaakkola, G. Gordon, S. Jegelka and R. Salakhutdinov
In Proceedings of the 38th International Conference on Machine Learning (ICML 2021)
[abs] [pdf] [code]

Learning Invariant Representations and Risks for Semi-supervised Domain Adaptation
B. Li, Y. Wang, S. Zhang, D. Li, T. Darrell, K. Keutzer and H. Zhao
In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2021)
[abs] [pdf] [code]

The success of supervised learning hinges on the assumption that the training and test data come from the same underlying distribution, which is often not valid in practice due to potential distribution shift. In light of this, most existing methods for unsupervised domain adaptation focus on achieving domain-invariant representations and small source domain error. However, recent works have shown that this is not sufficient to guarantee good generalization on the target domain, and in fact, is provably detrimental under label distribution shift. Furthermore, in many real-world applications it is often feasible to obtain a small amount of labeled data from the target domain and use them to facilitate model training with source data. Inspired by the above observations, in this paper we propose the first method that aims to simultaneously learn invariant representations and risks under the setting of semi-supervised domain adaptation (Semi-DA). First, we provide a finite sample bound for both classification and regression problems under Semi-DA. The bound suggests a principled way to obtain target generalization, i.e. by aligning both the marginal and conditional distributions across domains in feature space. Motivated by this, we then introduce the LIRR algorithm for jointly \textbf{L}earning \textbf{I}nvariant \textbf{R}epresentations and \textbf{R}isks. Finally, extensive experiments are conducted on both classification and regression tasks, which demonstrates LIRR consistently achieves state-of-the-art performance and significant improvements compared with the methods that only learn invariant representations or invariant risks.

Self-supervised Representation Learning with Relative Predictive Coding
Y. H. Tsai*, M. Q. Ma*, M. Yang, H. Zhao, L-P Morency, and R. Salakhutdinov
In Proceedings of the 9th International Conference on Learning Representations (ICLR 2021)
[abs] [pdf] [video]

On Dyadic Fairness: Exploring and Mitigating Bias in Graph Connections
P. Li, Y. Wang, H. Zhao, P. Hong, H. Liu
In Proceedings of the 9th International Conference on Learning Representations (ICLR 2021)
[abs] [pdf] [video]

Model-based Policy Optimization with Unsupervised Model Adaptation
J. Shen, H. Zhao, W. Zhang and Y. Yu
In Proceedings of the 34th Advances in Neural Information Processing Systems (NeurIPS 2020, Spotlight)
[abs] [pdf] [code] [video]

Neural Methods for Point-wise Dependency Estimation
Y. H. Tsai, H. Zhao, M. Yamada, L-P. Morency and R. Salakhutdinov
In Proceedings of the 34th Advances in Neural Information Processing Systems (NeurIPS 2020, Spotlight)
[abs] [pdf] [video]

Domain Adaptation with Conditional Distribution Matching and Generalized Label Shift
R. Combes*, H. Zhao*, Y.X. Wang and G. Gordon
In Proceedings of the 34th Advances in Neural Information Processing Systems (NeurIPS 2020)
[abs] [pdf] [code] [video]

Trade-offs and Guarantees of Adversarial Representation Learning for Information Obfuscation
H. Zhao*, J. Chi*, Y. Tian and G. Gordon
In Proceedings of the 34th Advances in Neural Information Processing Systems (NeurIPS 2020)
[abs] [pdf] [video] [slides] [poster]

A Review of Single-Source Deep Unsupervised Visual Domain Adaptation
S. Zhao, X. Yue, S. Zhang, B. Li, H. Zhao, B. Wu, R. Krishna, J. Gonzalez, A. Sangiovanni-Vincentelli, S. and K. Keutzer
IEEE Transactions on Neural Networks and Learning Systems (TNNLS)
[abs] [pdf]

On Learning Language-Invariant Representations for Universal Machine Translation
H. Zhao, J. Hu and A. Risteski
In Proceedings of the 37th International Conference on Machine Learning (ICML 2020)
[abs] [pdf] [video] [slides] [blog]

The goal of universal machine translation is to learn to translate between any pair of languages, given a corpus of paired translated documents for a small subset of all pairs of languages. Despite impressive empirical results and an increasing interest in massively multilingual models, theoretical analysis on translation errors made by such universal machine translation models is only nascent. In this paper, we formally prove certain impossibilities of this endeavour in general, as well as prove positive results in the presence of additional (but natural) structure of data. For the former, we derive a lower bound on the translation error in the many-to-one translation setting, which shows that any algorithm aiming to learn shared sentence representations among multiple language pairs has to make a large translation error on at least one of the translation tasks, if no assumption on the structure of the languages is made. For the latter, we show that if the paired documents in the corpus follow a natural encoder-decoder generative process, we can expect a natural notion of ``generalization'': a linear number of language pairs, rather than quadratic, suffices to learn a good representation. Our theory also explains what kinds of connection graphs between pairs of languages are better suited: ones with longer paths result in worse sample complexity in terms of the total number of documents per language pair needed. We believe our theoretical insights and implications contribute to the future algorithmic design of universal machine translation.

Deep Fair Clustering for Visual Learning
P. Li, H. Zhao and H. Liu
In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2020)
[abs] [pdf]

DyCRS: Dynamic Interpretable Postoperative Complication Risk Scoring
W. Wang, H. Zhao, H. Zhuang, N. Shah and R. Padman
In Proceedings of The Web Conference 2020 (WWW 2020, Oral)
[abs] [pdf]

Early identification of patients at risk for postoperative complications can facilitate timely workups and treatments and improve health outcomes. Currently, a widely-used surgical risk calculator online web system developed by the American College of Surgeons (ACS) uses patients’ static features, e.g. gender, age, to assess the risk of postoperative complications. However, the most crucial signals that reflect the actual postoperative physical conditions of patients are usually real-time dynamic signals, including the vital signs of patients (e.g., heart rate, blood pressure) collected from postoperative monitoring. In this paper, we develop a dynamic postoperative complication risk scoring framework (DyCRS) to detect the “at-risk” patients in a real-time way based on postoperative sequential vital signs and static features. DyCRS is based on adaptations of the Hidden Markov Model (HMM) that captures hidden states as well as observable states to generate a real-time, probabilistic, complication risk score. Evaluating our model using electronic health record (EHR) on elective Colectomy surgery from a major health system, we show that DyCRS significantly outperforms the state-of-the-art ACS calculator and real-time predictors with 50.16% area under precision-recall curve (AUCPRC) gain on average in terms of detection effectiveness. In terms of earliness, our DyCRS can predict 15hrs55mins earlier on average than clinician's diagnosis with the recall of 60% and precision of 55%. Furthermore, Our DyCRS can extract interpretable patients' stages, which are consistent with previous medical postoperative complication studies. We believe that our contributions demonstrate significant promise for developing a more accurate, robust and interpretable postoperative complication risk scoring system, which can benefit more than 50 million annual surgeries in the US by substantially lowering adverse events and healthcare costs.

Conditional Learning of Fair Representations
H. Zhao, A. Coston, T. Adel and G. Gordon
In Proceedings of the 8th International Conference on Learning Representations (ICLR 2020, Spotlight)
NeurIPS 2019 Workshop on Machine Learning with Guarantees (NeurIPS 2019)
[abs] [pdf] [video] [slides] [code]

Continual Learning with Adaptive Weights (CLAW)
T. Adel, H. Zhao and R. E. Turner
In Proceedings of the 8th International Conference on Learning Representations (ICLR 2020)
[abs] [pdf] [video]

Inherent Tradeoffs in Learning Fair Representations
H. Zhao and G. Gordon
In Proceedings of the 33rd Advances in Neural Information Processing Systems (NeurIPS 2019)
[abs] [pdf] [poster] [slides] [blog]

With the prevalence of machine learning in high-stakes applications, especially the ones regulated by anti-discrimination laws or societal norms, it is crucial to ensure that the predictive models do not propagate any existing bias or discrimination. Due to the ability of deep neural nets to learn rich representations, recent advances in algorithmic fairness have focused on learning fair representations with adversarial techniques to reduce bias in data while preserving utility simultaneously. In this paper, through the lens of information theory, we provide the first result that quantitatively characterizes the tradeoff between demographic parity and the joint utility across different population groups. Specifically, when the base rates differ between groups, we show that any method aiming to learn fair representations admits an information-theoretic lower bound on the joint error across these groups. To complement our negative results, we also prove that if the optimal decision functions across different groups are close, then learning fair representations leads to an alternative notion of fairness, known as the accuracy parity, which states that the error rates are close between groups. Finally, our theoretical findings are also confirmed empirically on real-world datasets.

Adversarial Privacy Preservation under Attribute Inference Attack
H. Zhao*, J. Chi*, Y. Tian and G. Gordon
In NeurIPS 2019 Workshop on Machine Learning with Guarantees (NeurIPS 2019)
[abs] [pdf]

On Learning Invariant Representations for Domain Adaptation
H. Zhao, R. Combes, K. Zhang and G. Gordon
In Proceedings of the 36th International Conference on Machine Learning (ICML 2019, Long Oral)
[abs] [pdf] [supplement] [poster] [slides] [blog]

Due to the ability of deep neural nets to learn rich representations, recent advances in unsupervised domain adaptation have focused on learning domain-invariant features that achieve a small error on the source domain. The hope is that the learnt representation, together with the hypothesis learnt from the source domain, can generalize to the target domain. In this paper, we first construct a simple counterexample showing that, contrary to common belief, the above conditions are not sufficient to guarantee successful domain adaptation. In particular, the counterexample (Fig. 1) exhibits \emph{conditional shift}: the class-conditional distributions of input features change between source and target domains. To give a sufficient condition for domain adaptation, we propose a natural and interpretable generalization upper bound that explicitly takes into account the aforementioned shift. Moreover, we shed new light on the problem by proving an information-theoretic lower bound on the joint error of \emph{any} domain adaptation method that attempts to learn invariant representations. Our result characterizes a fundamental tradeoff between learning invariant representations and achieving small joint error on both domains when the marginal label distributions differ from source to target. Finally, we conduct experiments on real-world datasets that corroborate our theoretical findings. We believe these insights are helpful in guiding the future design of domain adaptation and representation learning algorithms.

Learning Neural Networks with Adaptive Regularization
H. Zhao*, Y. H. Tsai*, R. Salakhutdinov and G. Gordon
In Proceedings of the 33rd Advances in Neural Information Processing Systems (NeurIPS 2019)
[abs] [pdf] [poster] [slides] [code]

Feed-forward neural networks can be understood as a combination of an intermediate representation and a linear hypothesis. While most previous works aim to diversify the representations, we explore the complementary direction by performing an adaptive and data-dependent regularization motivated by the empirical Bayes method. Specifically, we propose to construct a matrix-variate normal prior (on weights) whose covariance matrix has a Kronecker product structure. This structure is designed to capture the correlations in neurons through backpropagation. Under the assumption of this Kronecker factorization, the prior encourages neurons to borrow statistical strength from one another. Hence, it leads to an adaptive and data-dependent regularization when training networks on small datasets. To optimize the model, we present an efficient block coordinate descent algorithm with analytical solutions. Empirically, we demonstrate that the proposed method helps networks converge to local optima with smaller stable ranks and spectral norms. These properties suggest better generalizations and we present empirical results to support this expectation. We also verify the effectiveness of the approach on multiclass classification and multitask regression problems with various network structures.

Efficient Multitask Feature and Relationship Learning
H. Zhao, O. Stretcu, A. Smola and G. Gordon
In Proceedings of the 35th Conference on Uncertainty in Artificial Intelligence (UAI 2019)
Also In Learning with Limited Labeled Data: Weak Supervision and Beyond workshop at NIPS 2017
[abs] [pdf] [supplement] [poster]

On Strategyproof Conference Peer Review
Y. Xu*, H. Zhao*, X. Shi and N. B. Shah
In Proceedings of the 28th International Joint Conference on Artificial Intelligence (IJCAI 2019)
[abs] [pdf] [supplement] [Full arXiv version]

We consider peer review under a conference setting where there are conflicts between the reviewers and the submissions. Under such conflicts, reviewers can manipulate their reviews in a strategic manner to influence the final rankings of their own papers. Present-day peer-review systems are not designed to guard against such strategic behavior, beyond minimal (and insufficient) checks such as not assigning a paper to a conflicted reviewer. In this work, we address this problem through the lens of social choice, and present a theoretical framework for strategyproof and efficient peer review. Given the conflict graph which satisfies a simple property, we first present and analyze a flexible framework for reviewer-assignment and aggregation for the reviews that guarantees not only strategyproofness but also a natural efficiency property (unanimity). Our framework is based on the so-called partitioning method, and can be treated as a generalization of this type of method to conference peer review settings. We then empirically show that the requisite property on the (authorship) conflict graph is indeed satisfied in the ICLR-17 submissions data, and further demonstrate a simple trick to make the partitioning method more practically appealing under conference peer-review settings. Finally, we complement our positive results with negative theoretical results where we prove that under slightly stronger requirements, it is impossible for any algorithm to be both strategyproof and efficient.

Active Learning of Strict Partial Orders: A Case Study on Concept Prerequisite Relations
C. Liang, J. Ye, H. Zhao, B. Pursel and C. Lee Giles
In Proceedings of the 12th International Conference on Educational Data Mining (EDM 2019)
[abs] [pdf]

Deep Generative and Discriminative Domain Adaptation
H. Zhao, J. Hu, Z. Zhu, A. Coates and G. Gordon
In Proceedings of the 18th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2019)
[abs] [pdf] [poster]

Adversarial Multiple Source Domain Adaptation
H. Zhao*, S. Zhang*, G. Wu, J. Costeira, J. Moura and G. Gordon
In Proceedings of the 32nd Advances in Neural Information Processing Systems (NeurIPS 2018)
[abs] [pdf] [supplement] [poster] [code]

Frank-Wolfe Optimization for Symmetric-NMF under Simplicial Constraint
H. Zhao and G. Gordon
In Proceedings of the 34th Conference on Uncertainty in Artificial Intelligence (UAI 2018)
[abs] [pdf] [supplement]

Convolutional-Recurrent Neural Networks for Speech Enhancement
H. Zhao, S. Zarar, I. Tashev and C. H. Lee
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018, Oral)
[abs] [pdf] [slides]

Approximate Empirical Bayes for Deep Neural Networks
H. Zhao*, Y. H. Tsai*, R. Salakhutdinov and G. Gordon
In Uncertainty in Deep Learning workshop at UAI (UAI UDL 2018)
[abs] [pdf] [poster]

Multiple Source Domain Adaptation with Adversarial Learning
H. Zhao*, S. Zhang*, G. Wu, J. Costeira, J. Moura and G. Gordon
In 6th International Conference on Learning Representations (ICLR 2018 workshop track)
[abs] [pdf] [poster]

While domain adaptation has been actively researched in recent years, most theoretical results and algorithms focus on the single-source-single-target adaptation setting. Naive application of such algorithms on multiple source domain adaptation problem may lead to suboptimal solutions. We propose a new generalization bound for domain adaptation when there are multiple source domains with labeled instances and one target domain with unlabeled instances. Compared with existing bounds, the new bound does not require expert knowledge about the target distribution, nor the optimal combination rule for multisource domains. Interestingly, our theory also leads to an efficient learning strategy using adversarial neural networks: we show how to interpret it as learning feature representations that are invariant to the multiple domain shifts while still being discriminative for the learning task. To this end, we propose two models, both of which we call multisource domain adversarial networks (MDANs): the first model optimizes directly our bound, while the second model is a smoothed approximation of the first one, leading to a more data-efficient and task-adaptive model. The optimization tasks of both models are minimax saddle point problems that can be optimized by adversarial training. To demonstrate the effectiveness of MDANs, we conduct extensive experiments showing superior adaptation performance on three real-world datasets: sentiment analysis, digit classification, and vehicle counting.

Linear Time Computation of Moments in Sum-Product Networks
H. Zhao and G. Gordon
In Proceedings of the 31st Advances in Neural Information Processing Systems (NIPS 2017)
[abs] [pdf] [poster]

Bayesian online algorithms for Sum-Product Networks (SPNs) need to update their posterior distribution after seeing one single additional instance. To do so, they must compute moments of the model parameters under this distribution. The best existing method for computing such moments scales quadratically in the size of the SPN, although it scales linearly for trees. This unfortunate scaling makes Bayesian online algorithms prohibitively expensive, except for small or tree-structured SPNs. We propose a linear-time algorithm that works even when the SPN is a general directed acyclic graph (DAG). Our algorithm significantly broadens the applicability of Bayesian online algorithms for SPNs. There are three key ingredients in the design and analysis of our algorithm: 1). For each edge in the graph, we find a linear time reduction from the moment computation problem to a joint inference problem in SPNs. 2). Using the property that each SPN computes a multilinear polynomial, we construct an efficient procedure for polynomial evaluation by differentiation without expanding the network that may contain exponentially many positive monomials. 3). We propose a dynamic programming method to further reduce the computation of the moments of all the edges in the graph from quadratic to linear. We demonstrate the usefulness of our linear time moment computation algorithm by applying it to develop a linear time assume density filter (ADF) for SPNs.

Unsupervised Domain Adaptation with a Relaxed Covariate Shift Assumption
T. Adel, H. Zhao and A. Wong
In Proceedings of the 31th AAAI Conference on Artificial Intelligence (AAAI 2017)
[abs] [pdf]

Discovering Order in Unordered Datasets: Generative Markov Networks
Y. H. Tsai, H. Zhao, R. Salakhutdinov and N. Jojic
In Time Series workshop at NIPS (NIPS TSW 2017)
[abs] [pdf] [slides] [poster]

A Unified Approach for Learning the Parameters of Sum-Product Networks
H. Zhao, P. Poupart and G. Gordon
In Proceedings of the 30th Advances in Neural Information Processing Systems (NIPS 2016)
[abs] [pdf] [supplement] [poster] [code]

We present a unified approach for learning the parameters of Sum-Product networks (SPNs). We prove that any complete and decomposable SPN is equivalent to a mixture of trees where each tree corresponds to a product of univariate distributions. Based on the mixture model perspective, we characterize the objective function when learning SPNs based on the maximum likelihood estimation (MLE) principle and show that the optimization problem can be formulated as a signomial program. Both the projected gradient descent (PGD) and the exponentiated gradient (EG) in this setting can be viewed as first order approximations of the signomial program after proper transformation of the objective function. Based on the signomial program formulation, we construct two parameter learning algorithms for SPNs by using sequential monomial approximations (SMA) and the concave-convex procedure (CCCP), respectively. The two proposed methods naturally admit multiplicative updates, hence effectively avoiding the projection operation. With the help of the unified framework, we also show that, in the case of SPNs, CCCP leads to the same algorithm as Expectation Maximization (EM) despite the fact that they are different in general. Extensive experiments on 20 data sets demonstrate the effectiveness and efficiency of the two proposed approaches for learning SPNs. We also show that the proposed methods can improve the performance of structure learning and yield state-of-the-art results.

Collapsed Variational Inference for Sum-Product Networks
H. Zhao, T. Adel, G. Gordon and B. Amos
In Proceedings of the 33rd International Conference on Machine Learning (ICML 2016)
[abs] [pdf] [poster] [slides] [code]

Online Algorithms for Sum-Product Networks with Continuous Variables
P. Jaini, A. Rashwan, H. Zhao, Y. Liu, E. Banijamali, Z. Chen and P. Poupart
In Proceedings of the 8th International Conference on Probabilistic Graphical Models (PGM 2016)
[abs] [pdf]

Online and Distributed Bayesian Moment Matching for Parameter Learning in Sum-Product Networks
A. Rashwan, H. Zhao and P. Poupart
In Proceedings of the 19th International Conference on Artificial Intelligence and Statistics (AISTATS 2016)
[abs] [pdf]

On the Relationship between Sum-Product Networks and Bayesian Networks
H. Zhao, M. Melibari and P. Poupart
In Proceedings of the 32nd International Conference on Machine Learning (ICML 2015)
[abs] [pdf] [supplement] [Full arXiv version] [slides] [poster]

Self-Adaptive Hierarchical Sentence Model
H. Zhao, Z. Lu and P. Poupart
In Proceedings of the 24th International Joint Conference on Artificial Intelligence (IJCAI 2015)
[abs] [pdf] [slides] [poster] [code]

SoF: Soft-Cluster Matrix Factorization for Probabilistic Clustering
H. Zhao, P. Pouart, Y. Zhang and M. Lysy
In Proceedings of the 29th AAAI Conference on Artificial Intelligence (AAAI 2015)
[abs] [pdf] [poster]

Global Network Alignment in the Context Of Aging
F. Faisal, H. Zhao and T. Milenkovic
IEEE/ACM Transactions on Computational Biology and Bioinformatics (IEEE/ACM TCBB 2014)
In Proceedings of the 4th ACM International Conference on Bioinformatics, Computational Biology and Biomedicine (ACM BCB 2013)
[abs] [pdf] [supplement]

Analogous to sequence alignment, network alignment (NA) can be used to transfer biological knowledge across species between conserved network regions. NA faces two algorithmic challenges: 1) Which cost function to use to capture “similarities” between nodes in different networks? 2) Which alignment strategy to use to rapidly identify “high-scoring” alignments from all possible alignments? We “break down” existing state-of-the-art methods that use both different cost functions and different alignment strategies to evaluate each combination of their cost functions and alignment strategies. We find that a combination of the cost function of one method and the alignment strategy of another method beats the existing methods. Hence, we propose this combination as a novel superior NA method. Then, since human aging is hard to study experimentally due to long lifespan, we use NA to transfer aging-related knowledge from well annotated model species to poorly annotated human. By doing so, we produce novel human aging-related knowledge, which complements currently available knowledge about aging that has been obtained mainly by sequence alignment. We demonstrate significant similarity between topological and functional properties of our novel predictions and those of known aging-related genes. We are the first to use NA to learn more about aging.

A Sober Look at Spectral Learning
H. Zhao and P. Poupart
In Method of Moments and Spectral Learning workshop at ICML (ICML MM 2014)
[abs] [pdf] [slides] [poster] [code]

Pre-prints

MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning
J. Shen, J. Yao, R. Yang, Y. Sun, F. Luo, R. Pan, T. Zhang, H. Zhao
arXiv preprint
[abs] [pdf]

Reward modeling is a key step in building safe foundation models when applying reinforcement learning from human feedback (RLHF) to align Large Language Models (LLMs). However, reward modeling based on the Bradley-Terry (BT) model assumes a global reward function, failing to capture the inherently diverse and heterogeneous human preferences. Hence, such oversimplification limits LLMs from supporting personalization and pluralistic alignment. Theoretically, we show that when human preferences follow a mixture distribution of diverse subgroups, a single BT model has an irreducible error. While existing solutions, such as multi-objective learning with fine-grained annotations, help address this issue, they are costly and constrained by predefined attributes, failing to fully capture the richness of human values. In this work, we introduce MiCRo, a two-stage framework that enhances personalized preference learning by leveraging large-scale binary preference datasets without requiring explicit fine-grained annotations. In the first stage, MiCRo introduces context-aware mixture modeling approach to capture diverse human preferences. In the second stage, MiCRo integrates an online routing strategy that dynamically adapts mixture weights based on specific context to resolve ambiguity, allowing for efficient and scalable preference adaptation with minimal additional supervision. Experiments on multiple preference datasets demonstrate that MiCRo effectively captures diverse human preferences and significantly improves downstream personalization.

MergeBench: A Benchmark for Merging Domain-Specialized LLMs
Y. He, S. Zeng, Y. Hu, R. Yang, T. Zhang, and H. Zhao
arXiv preprint
[abs] [pdf] [project page] [code] [Hugging Face models]

Model merging provides a scalable alternative to multi-task training by combining specialized finetuned models through parameter arithmetic, enabling efficient deployment without the need for joint training or access to all task data. While recent methods have shown promise, existing evaluations are limited in both model scale and task diversity, leaving open questions about their applicability to large, domain-specialized LLMs. To tackle the challenges, we introduce MergeBench, a comprehensive evaluation suite designed to assess model merging at scale. MergeBench builds on state-of-the-art open-source language models, including Llama and Gemma families at 2B to 9B scales, and covers five key domains: instruction following, mathematics, multilingual understanding, coding and safety. We standardize finetuning and evaluation protocols, and assess eight representative merging methods across multi-task performance, forgetting and runtime efficiency. Based on extensive experiments, we provide practical guidelines for algorithm selection and share insights showing that model merging tends to perform better on stronger base models, with techniques such as merging coefficient tuning and sparsification improving knowledge retention. However, several challenges remain, including the computational cost on large models, the gap for in-domain performance compared to multi-task models, and the underexplored role of model merging in standard LLM training pipelines. We hope MergeBench provides a foundation for future research to advance the understanding and practical application of model merging. Our project page is at https://yifei-he.github.io/mergebench/.

Neural Probabilistic Circuits: Enabling Compositional and Interpretable Predictions through Logical Reasoning
W. Chen, S. Yu, H. Shao, L. Sha, H. Zhao
arXiv preprint
[abs] [pdf]

End-to-end deep neural networks have achieved remarkable success across various domains but are often criticized for their lack of interpretability. While post hoc explanation methods attempt to address this issue, they often fail to accurately represent these black-box models, resulting in misleading or incomplete explanations. To overcome these challenges, we propose an inherently transparent model architecture called Neural Probabilistic Circuits (NPCs), which enable compositional and interpretable predictions through logical reasoning. In particular, an NPC consists of two modules: an attribute recognition model, which predicts probabilities for various attributes, and a task predictor built on a probabilistic circuit, which enables logical reasoning over recognized attributes to make class predictions. To train NPCs, we introduce a three-stage training algorithm comprising attribute recognition, circuit construction, and joint optimization. Moreover, we theoretically demonstrate that an NPC's error is upper-bounded by a linear combination of the errors from its modules. To further demonstrate the interpretability of NPC, we provide both the most probable explanations and the counterfactual explanations. Empirical results on four benchmark datasets show that NPCs strike a balance between interpretability and performance, achieving results competitive even with those of end-to-end black-box models while providing enhanced interpretability.

Gradient-Based Multi-Objective Deep Learning: Algorithms, Theories, Applications, and Beyond
W. Chen, X. Zhang, B. Lin, X. Lin, H. Zhao, Q. Zhang, J. T. Kwok
arXiv preprint
[abs] [pdf] [github]

A Unified Post-Processing Framework for Group Fairness in Classification
R. Xian, H. Zhao
arXiv preprint
[abs] [pdf] [code]

Efficient Model Editing with Task Vector Bases: A Theoretical Framework and Scalable Approach
S. Zeng, Y. He, W. You, Y. Hao, Y. H. Tsai, M. Yamada, H. Zhao
arXiv preprint
[abs] [pdf]

Invariant-Feature Subspace Recovery: A New Class of Provable Domain Generalization Algorithms
H. Wang, G. Balasubramaniam, H. Si, B. Li, H. Zhao
arXiv preprint
[abs] [pdf]

Domain generalization asks for models trained over a set of training environments to generalize well in unseen test environments. Recently, a series of algorithms such as Invariant Risk Minimization (IRM) have been proposed for domain generalization. However, \citet{risks-of-IRM} shows that in a simple linear data model, even if non-convexity issues are ignored, IRM and its extensions cannot generalize to unseen environments with less than $d_s\mathrm{+}1$ training environments, where $d_s$ is the dimension of the spurious-feature subspace. In this work, we propose \textbf{I}nvariant-feature \textbf{S}ubspace \textbf{R}ecovery (ISR): a new class of algorithms to achieve provable domain generalization across the settings of classification and regression problems. First, in the binary classification setup of \citet{risks-of-IRM}, we show that our first algorithm, \textbf{ISR-Mean}, can identify the subspace spanned by invariant features from the first-order moments of the class-conditional distributions, and achieve provable domain generalization with $d_s\mathrm{+}1$ training environments. Our second algorithm, \textbf{ISR-Cov}, further reduces the required number of training environments to $\cO(1)$ using the information of second-order moments. Notably, unlike IRM, our algorithms bypass non-convexity issues and enjoy global convergence guarantees. Next, we extend ISR-Mean to the more general setting of multi-class classification and propose \textbf{ISR-Multiclass}, which leverages class information and provably recovers the invariant-feature subspace with $\lceil d_s/k \rceil + 1$ training environments for $k$-class classification. Finally, for regression problems, we propose \textbf{ISR-Regression} that can identify the invariant-feature subspace with $d_s + 1$ training environments. Empirically, we demonstrate the superior performance of our ISRs compared with IRM on synthetic benchmarks. Furthermore, ISRs can be used as simple yet effective post-processing methods for any given black-box feature extractors such as neural nets, and we show they can improve the worst-case accuracy of (pre-)trained models against spurious correlations and group shifts over multiple real-world datasets.

Online Mirror Descent for Tchebycheff Scalarization in Multi-Objective Optimization
M. Liu, X. Zhang, C. Xie, K. Donahue, H. Zhao
arXiv preprint
[abs] [pdf] [slides]

People

Current (by alphabetical order)

Weixin Chen (PhD in CS)

Yuen Chen (PhD in CS, co-advised with Hari Sundaram)

Yifei He (PhD in CS)

Yuzheng Hu (PhD in CS)

Seiyun Shin (PhD in ECE, co-advised with Ilan Shomorony, Mavis Future Faculty Fellows)

Haozhe Si (PhD in ECE)

Ruicheng Xian (PhD in CS)

Siqi (Cindy) Zeng (PhD in CS)

Meitong Liu (HKU CS undergrad)

Samuel Schapiro (UIUC CS undergrad)

Yuxuan Wan (UIUC Math undergrad)

Alumni

Haoxiang Wang (PhD in ECE, co-advised with Bo Li, Mavis Future Faculty Fellows -> Research Scientist, Nvidia)

Aditya Sinha (MSCS @ UIUC -> Research Scientist, Netflix)

Qilong Wu (MSCS @ UIUC -> PhD in CS @ UIUC)

Gargi Balasubramaniam (MSCS @ UIUC, Siebel Scholar -> Research Engineer, Google DeepMind)

Yifei He (MSCS @ UIUC -> PhD in CS @ UIUC)

Siqi (Cindy) Zeng (undergrad @ CMU Math -> PhD in CS @ UIUC)

Haozhe Si (undergrad in ECE @ UIUC -> PhD in ECE @ UIUC)

Sixian Du (undergrad in CS @ PKU -> Stanford MSEE)

Peiyuan (Alex) Liao (undergrad in CS @ CMU -> PhD in CS @ Stanford)

(Brian) Bo Li (undergrad in CS @ Harbin Institute of Technology -> PhD in CS @ Nanyang Technological University)

Ashutosh Sharma (MSCS, Siebel Scholar -> Research Engineer, MIT-IBM Watson AI Lab)

Jingyan Shen (Pinterest -> PhD in CS @ New York University)

Teaching

Term	Course	Location	Time
Spring 2025	CS 442 - Trustworthy Machine Learning	Siebel Center 1302	TR 2PM - 3:15PM
Spring 2024	CS 446 - Machine Learning	1320 Digital Computer Laboratory	TR 12:30PM - 1:45PM
Fall 2023	CS 442 - Trustworthy Machine Learning	1310 Digital Computer Laboratory	WF 12:30PM - 1:45PM
Spring 2023	CS 598: Transfer Learning	Siebel Center 0216	WF 12:30PM - 1:45PM
Fall 2022	CS 498 ML - Trustworthy Machine Learning	4025 Campus Instructional Facility	TR 2PM - 3:15PM
Spring 2022	CS 442 - Trustworthy Machine Learning	Siebel Center 1109	WF 3:30PM - 4:45PM
Fall 2021	CS 598 - Special Topics: Transfer Learning	Siebel Center 0216	WF 2PM - 3:15PM

Misc

I enjoy sketching and calligraphy at my spare time. If I have a long vacation, I also enjoy traveling. My math genealogy can be found here.