I am a Ph.D. student at MIT EECS, advised by Prof. Ashia Wilson. I am affiliated with LIDS. I received my bachelor’s and master’s degrees in computer science from Wuhan University in 2021 and 2024, respectively, advised by Prof. Weiwei Liu.
My primary research interest lies in generative modeling, with a particular emphasis on trustworthy machine learning.
Contrastive adversarial training has successfully improved the robustness of contrastive learning (CL). However, the robustness metric in these methods depends on attack algorithms, image labels, and downstream tasks, introducing reliability concerns. To address these issues, this paper proposes a novel Robustness Verification framework for Contrastive Learning (RVCL). Specifically, we define the verification problem of CL from deterministic and probabilistic perspectives, then provide several effective metrics to evaluate the robustness of the CL encoder. Furthermore, we use extreme value theory to reveal the relationship between the robust radius of the CL encoder and that of the supervised downstream task. Extensive experiments on various benchmark models and datasets validate our theoretical findings and further demonstrate RVCL’s capability to evaluate the robustness of both CL encoders and images.
@article{wang2023rvcl,title={RVCL: Evaluating the Robustness of Contrastive Learning via Verification},author={Wang, Zekai and Liu, Weiwei},journal={Journal of Machine Learning Research},volume={24},number={396},pages={1--43},year={2023}}
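RVCL's exact verification procedure is defined in the paper; as context for the extreme value theory part, the following is a minimal, hedged sketch of how sampled gradient norms can be turned into a robust-radius estimate (in the spirit of CLEVER-style bounds), not the paper's algorithm. The `grad_norm_fn` and `margin` inputs and all sampling sizes are illustrative assumptions; for a CL encoder, the margin could be the gap between positive-pair and negative-pair similarities.

```python
# Hypothetical sketch: extreme-value estimation of a local Lipschitz constant
# and a derived robust radius. Illustrative only; not the RVCL procedure.
import numpy as np
from scipy.stats import weibull_max

def estimate_robust_radius(grad_norm_fn, x, margin, ball_radius=0.5,
                           n_batches=50, batch_size=128, seed=0):
    """Estimate a robust radius as margin / (estimated local Lipschitz constant).

    grad_norm_fn(x_perturbed) -> gradient norm of the margin function at that
    point (assumed helper); margin -> value of the margin function at x.
    """
    rng = np.random.default_rng(seed)
    batch_maxima = []
    for _ in range(n_batches):
        # Sample random perturbations inside an L2 ball around x.
        noise = rng.normal(size=(batch_size,) + x.shape)
        norms = np.linalg.norm(noise.reshape(batch_size, -1), axis=1)
        noise *= (rng.uniform(0, ball_radius, size=batch_size) / norms).reshape(
            (batch_size,) + (1,) * x.ndim)
        batch_maxima.append(max(grad_norm_fn(x + n) for n in noise))
    # Fit a reverse Weibull to the per-batch maxima; its location parameter
    # estimates the maximum local gradient norm (local Lipschitz constant).
    _, loc, _ = weibull_max.fit(np.asarray(batch_maxima))
    return margin / max(loc, 1e-12)
```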
It has been recognized that the data generated by the denoising diffusion probabilistic model (DDPM) improves adversarial training. After two years of rapid development in diffusion models, a question naturally arises: can better diffusion models further improve adversarial training? This paper gives an affirmative answer by employing the most recent diffusion model, which offers higher efficiency (∼20 sampling steps) and better image quality (lower FID score) than DDPM. Our adversarially trained models achieve state-of-the-art performance on RobustBench using only generated data (no external datasets). Under the L∞-norm threat model with ε=8/255, our models achieve 70.69% and 42.67% robust accuracy on CIFAR-10 and CIFAR-100, respectively, i.e., improving upon the previous state-of-the-art models by +4.58% and +8.03%. Under the L2-norm threat model with ε=128/255, our models achieve 84.86% on CIFAR-10 (+4.44%). These results also beat previous works that use external data. We also provide compelling results on the SVHN and TinyImageNet datasets.
@inproceedings{wang2023better,title={Better Diffusion Models Further Improve Adversarial Training},author={Wang, Zekai and Pang, Tianyu and Du, Chao and Lin, Min and Liu, Weiwei and Yan, Shuicheng},booktitle={International Conference on Machine Learning},volume={202},pages={36246--36263},year={2023}}
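The paper's full recipe (including the specific diffusion model, data mixing ratios, and training objective) is given in the paper and its code release; the sketch below only illustrates, under simplifying assumptions, how diffusion-generated samples can be mixed into each batch of a standard PGD-based adversarial training loop. `real_loader`, `generated_loader`, and all hyperparameters are placeholders, not the paper's settings.

```python
# Hedged sketch: L-infinity PGD adversarial training on batches that mix
# original and diffusion-generated samples. Not the paper's exact method.
import torch
import torch.nn.functional as F

def pgd_attack(model, x, y, eps=8/255, alpha=2/255, steps=10):
    """Maximize cross-entropy within an L-infinity ball of radius eps around x."""
    delta = torch.empty_like(x).uniform_(-eps, eps).requires_grad_(True)
    for _ in range(steps):
        loss = F.cross_entropy(model(x + delta), y)
        grad, = torch.autograd.grad(loss, delta)
        delta = (delta + alpha * grad.sign()).clamp(-eps, eps).detach().requires_grad_(True)
    return (x + delta).clamp(0, 1).detach()

def train_epoch(model, optimizer, real_loader, generated_loader, device="cuda"):
    model.train()
    for (x_r, y_r), (x_g, y_g) in zip(real_loader, generated_loader):
        # Mix original and generated samples in every batch.
        x = torch.cat([x_r, x_g]).to(device)
        y = torch.cat([y_r, y_g]).to(device)
        x_adv = pgd_attack(model, x, y)
        optimizer.zero_grad()
        F.cross_entropy(model(x_adv), y).backward()
        optimizer.step()
```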
Contrastive adversarial training has successfully improved the robustness of contrastive learning (CL). However, the robustness metric used in these methods is linked to attack algorithms, image labels, and downstream tasks, all of which may affect the consistency and reliability of the robustness metric for CL. To address these problems, this paper proposes a novel Robustness Verification framework for Contrastive Learning (RVCL). Furthermore, we use extreme value theory to reveal the relationship between the robust radius of the CL encoder and that of the supervised downstream task. Extensive experimental results on various benchmark models and datasets verify our theoretical findings, and further demonstrate that our proposed RVCL is able to evaluate the robustness of both models and images.
@inproceedings{wang2022robustness,title={Robustness Verification for Contrastive Learning},author={Wang, Zekai and Liu, Weiwei},booktitle={International Conference on Machine Learning},volume={162},pages={22865--22883},year={2022}}
AAAI’24
DRF: Improving Certified Robustness via Distributional Robustness Framework
Randomized smoothing (RS) has provided state-of-the-art (SOTA) certified robustness against adversarial perturbations for large neural networks. Among studies in this field, methods based on adversarial training (AT) achieve remarkably robust performance by applying adversarial examples to construct the smoothed classifier. These AT-based RS methods typically seek a pointwise adversary that generates the worst-case adversarial examples by perturbing each input independently. However, there are unexplored benefits to considering such adversarial robustness across the entire data distribution. To this end, we provide a novel framework called DRF, which connects AT-based RS methods with distributional robustness (DR), and show that these methods are special cases of their counterparts in our framework. Due to the advantages conferred by DR, our framework can control the trade-off between the clean accuracy and certified robustness of smoothed classifiers to a significant extent. Our experiments demonstrate that DRF can substantially improve the certified robustness of AT-based RS.
@inproceedings{wang2024drf,title={DRF: Improving Certified Robustness via Distributional Robustness Framework},author={Wang, Zekai and Zhou, Zhengyu and Liu, Weiwei},booktitle={AAAI Conference on Artificial Intelligence},volume={38},number={14},pages={15752--15760},year={2024}}
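DRF's distributional-robustness construction is the paper's contribution; for context, here is a hedged sketch of the standard randomized-smoothing certification procedure it builds on (Cohen et al., 2019). The `base_classifier` callable and the sample sizes are illustrative assumptions.

```python
# Minimal sketch of randomized-smoothing certification: estimate the top class
# under Gaussian noise and certify an L2 radius from a lower confidence bound.
import numpy as np
from scipy.stats import beta, norm

def certify(base_classifier, x, sigma=0.25, n0=100, n=10_000, alpha=0.001, seed=0):
    """Return (predicted class, certified L2 radius), or (None, 0.0) to abstain.

    base_classifier(x) -> predicted class label for a single input (assumed).
    """
    rng = np.random.default_rng(seed)

    def noisy_counts(num_samples):
        counts = {}
        for _ in range(num_samples):
            pred = base_classifier(x + sigma * rng.normal(size=x.shape))
            counts[pred] = counts.get(pred, 0) + 1
        return counts

    # Guess the top class from a small sample, then certify it with a larger one.
    counts0 = noisy_counts(n0)
    guess = max(counts0, key=counts0.get)
    k = noisy_counts(n).get(guess, 0)

    # Clopper-Pearson lower confidence bound on P(base prediction == guess).
    p_lower = beta.ppf(alpha, k, n - k + 1) if k > 0 else 0.0
    if p_lower <= 0.5:
        return None, 0.0  # abstain
    return guess, sigma * norm.ppf(p_lower)
```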
Interestingly, recent experimental results have identified a robust fairness phenomenon in adversarial training (AT), namely that a robust model well-trained by AT exhibits a remarkable disparity in standard accuracy and robust accuracy among different classes compared with natural training. However, the effect of different perturbation radii in AT on robust fairness has not been studied, and one natural question arises: does a tradeoff exist between average robustness and robust fairness? Our extensive experimental results provide an affirmative answer to this question: with an increasing perturbation radius, stronger AT leads to a larger class-wise disparity of robust accuracy. Theoretically, we analyze the class-wise performance of adversarially trained linear models under a Gaussian mixture distribution. Our theoretical results support our observations. Moreover, our theory shows that adversarial training leads to a more serious robust fairness issue than natural training does. Motivated by these theoretical results, we propose a fairly adversarial training (FAT) method to mitigate the tradeoff between average robustness and robust fairness. Experimental results validate the effectiveness of our proposed method.
@inproceedings{ma2022tradeoff,title={On the Tradeoff Between Robustness and Fairness},author={Ma, Xinsong and Wang, Zekai and Liu, Weiwei},booktitle={Advances in Neural Information Processing Systems},volume={35},pages={26230--26241},year={2022}}
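The robust fairness phenomenon above is a disparity in class-wise robust accuracy; the hedged sketch below shows one way such a disparity can be measured. The `attack_fn` callable (e.g., a PGD attack) and the data loader are assumptions, and this is an evaluation sketch, not the paper's FAT training method.

```python
# Hedged sketch: per-class robust accuracy and the best-vs-worst class gap.
import torch
from collections import defaultdict

def classwise_robust_accuracy(model, loader, attack_fn, device="cuda"):
    """attack_fn(model, x, y) -> adversarial examples (assumed, e.g. PGD)."""
    correct, total = defaultdict(int), defaultdict(int)
    model.eval()
    for x, y in loader:
        x, y = x.to(device), y.to(device)
        x_adv = attack_fn(model, x, y)
        with torch.no_grad():
            pred = model(x_adv).argmax(dim=1)
        for c in y.unique().tolist():
            mask = y == c
            correct[c] += (pred[mask] == c).sum().item()
            total[c] += mask.sum().item()
    acc = {c: correct[c] / total[c] for c in total}
    disparity = max(acc.values()) - min(acc.values())  # robust fairness gap
    return acc, disparity
```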
TKDE
Task Variance Regularized Multi-Task Learning
Yuren Mao, Zekai Wang, Weiwei Liu, Xuemin Lin, and Wenbin Hu
IEEE Transactions on Knowledge and Data Engineering, 2023
Multi-task Learning (MTL), which involves the simultaneous learning of multiple tasks, can achieve better performance than learning each task independently. It has achieved great success in various applications, ranging from Computer Vision (CV) to Natural Language Processing (NLP). In MTL, the losses of the constituent tasks are jointly optimized. However, it is common for these tasks to compete: minimizing the losses of some tasks increases the losses of others, which accordingly increases the task variance (the variance among the task-specific losses); furthermore, it induces under-fitting in some tasks and over-fitting in others, which degrades the generalization performance of an MTL model. To address this issue, it is necessary to control the task variance; thus, task variance regularization is a natural choice. While intuitive, task variance regularization remains unexplored in MTL. Accordingly, to fill this gap, we study the generalization error bound of MTL through the lens of task variance and show that the task variance matters for the generalization performance of MTL. Furthermore, this paper investigates how the task variance might be effectively regularized, and consequently proposes a multi-task learning method based on adversarial multi-armed bandit. The proposed method, dubbed BanditMTL, regularizes the task variance by means of a mirror gradient ascent-descent algorithm. Adopting BanditMTL in both CV and NLP applications is found to achieve state-of-the-art performance. The results of extensive experiments back up our theoretical analysis and validate the superiority of our proposals.
@article{mao2023task,title={Task Variance Regularized Multi-Task Learning},author={Mao, Yuren and Wang, Zekai and Liu, Weiwei and Lin, Xuemin and Hu, Wenbin},journal={IEEE Transactions on Knowledge and Data Engineering},volume={35},number={8},pages={8615--8629},year={2023}}
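BanditMTL's precise update rule and the constraint set on the task weights are given in the paper; the sketch below only illustrates the general mirror ascent-descent idea with a simple exponentiated-gradient step on simplex-constrained weights. The `task_losses_fn` helper, the step size, and the plain-simplex constraint are simplifying assumptions.

```python
# Simplified sketch of a mirror ascent-descent step: the model descends on the
# weighted loss while the task weights ascend toward high-loss tasks.
import torch

def banditmtl_step(model, optimizer, task_losses_fn, batch, weights, eta=0.1):
    """task_losses_fn(model, batch) -> 1-D tensor of per-task losses (assumed);
    weights -> 1-D tensor on the probability simplex, one entry per task."""
    losses = task_losses_fn(model, batch)

    # Descent: minimize the weighted sum of task losses w.r.t. model parameters.
    optimizer.zero_grad()
    (weights.detach() * losses).sum().backward()
    optimizer.step()

    # Ascent (mirror step): exponentiated-gradient update of the task weights,
    # upweighting tasks with larger losses, then renormalize to the simplex.
    with torch.no_grad():
        weights = weights * torch.exp(eta * losses.detach())
        weights = weights / weights.sum()
    return weights
```

In this sketch the weights would typically start uniform, e.g. `weights = torch.ones(num_tasks) / num_tasks`, and be carried across steps.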
Task variance regularization, which can be used to improve the generalization of Multi-task Learning (MTL) models, remains unexplored in multi-task text classification. Accordingly, to fill this gap, this paper investigates how the task variance might be effectively regularized, and consequently proposes a multi-task learning method based on adversarial multi-armed bandit. The proposed method, named BanditMTL, regularizes the task variance by means of a mirror gradient ascent-descent algorithm. Adopting BanditMTL in the multi-task text classification context is found to achieve state-of-the-art performance. The results of extensive experiments back up our theoretical analysis and validate the superiority of our proposals.
@inproceedings{mao2021banditmtl,title={BanditMTL: Bandit-based Multi-task Learning for Text Classification},author={Mao, Yuren and Wang, Zekai and Liu, Weiwei and Lin, Xuemin and Hu, Wenbin},booktitle={Annual Meeting of the Association for Computational Linguistics},pages={5506--5516},year={2021}}