Publications

Published 30+ papers in top venues on robust learning and multimodal learning, including 20+ as first or corresponding author.

and denote equal contribution and corresponding authorship. You can find full list of my publications on my Google Scholar.

By Venue and Authorship
Venue Papers 1st and
NeurIPS/ICLR 12 7
CVPR/ICCV 7 3
AAAI 5 3
MM 2 2
TIP 2 2
Others 6 5
Total 34 22
By Research Topic
Category Topic Papers
Robust Learning
(12)
Diffusion Solver 3
Group Robustness 1
Imbalanced Learning 3
OOD Generalization 2
Robustness 3
Multimodal Learning
(15)
Image Generation 3
MLLMs De-Hallucination 2
MLLMs Reasoning 2
MLLMs Safety 1
Robust Adaptation for VLMs 5
Video Generation 2
Others 8

2026

  1. TIP
    Hybrid granularity distribution estimation for few-shot learning: statistics transfer from categories and instances   Few-Shot Learning
    Shuo Wang, Tianyu Qi, Xingyu Zhu, Yanbin Hao, Beier Zhu, and 2 more authors
    IEEE Transactions on Image Processing
  2. ICLR
    Reducing class-wise performance disparity via margin regularization   Robustness
    Beier Zhu, Kesen Zhao, Jiequan Cui, and 4 more authors
    In International Conference on Learning Representations
  3. ICLR
    Real-time motion-controllable autoregressive video diffusion   Video Generation
    Kesen Zhao, Jiaxin Shi, Beier Zhu, and 5 more authors
    In International Conference on Learning Representations
  4. ICLR
    PMI: flow-based inversion correction via proximal operator   Image Generation
    Chenru Wang, Beier Zhu, and Chi Zhang
    In International Conference on Learning Representations
  5. ICLR
    Look carefully: adaptive visual reinforcements in multimodal large language models for hallucination mitigation   MLLMs De-Hallucination
    Xingyu Zhu, Kesen Zhao, Liang Yi, Shuo Wang, Zhicai Wang, Beier Zhu, and 2 more authors
    In International Conference on Learning Representations
  6. ICLR
    GuardAlign: robust safety alignment in multimodal large language models   MLLMs Safety
    Xingyu Zhu, Beier Zhu, Junfeng Fang, and 4 more authors
    In International Conference on Learning Representations
  7. ICLR
    Streaming drag-oriented interactive video manipulation: drag anything, anytime!   Video Generation
    Junbao Zhou, Yuan Zhou, Kesen Zhao, Qingshan Xu, Beier Zhu, and 2 more authors
    In International Conference on Learning Representations
  8. ICLR
    Subject-consistent and pose-diverse text-to-image generation   Image Generation
    Zhanxin Gao, Beier Zhu, Liang Yao, and 2 more authors
    In International Conference on Learning Representations
  9. AAAI
    Hierarchical semantic alignment for image clustering   Image Clustering
    Xingyu Zhu, Beier Zhu, Yunfan Li, and 4 more authors
    In AAAI Conference on Artificial Intelligence
  10. AAAI
    DEPO: Dual-efficiency preference optimization for LLM agents   LLM Agent
    Sirui Chen, Mengshi Zhao, Lei Xu, Yuying Zhao, Beier Zhu, and 3 more authors
    In AAAI Conference on Artificial Intelligence

2025

  1. arXiv
    Parallel diffusion solver via residual dirichlet policy optimization   Diffusion Solver
    Ruoyu Wang, Ziyu LiBeier Zhu, and 5 more authors
  2. NeurIPS
    Adaptive stochastic coefficients for accelerating diffusion sampling   Diffusion Solver
    Ruoyu WangBeier Zhu, Junzhi Li, and 2 more authors
    In Advances in Neural Information Processing Systems
  3. NeurIPS
    Spotlight
    Enhancing CLIP robustness via cross-modality alignment   Robust Adaptation for VLMs
    Xingyu Zhu, Beier Zhu, Shuo Wang, and 2 more authors
    In Advances in Neural Information Processing Systems
  4. MM
    Oral
    Benchmarking and Bridging Emotion Conflicts for Multimodal Emotion Reasoning   MLLMs Reasoning
    Zhiyuan Han, Beier Zhu, Yanlong Xu, and 2 more authors
    In ACM International Conference on Multimedia
  5. ICCV
    Unsupervised visual chain-of-thought reasoning via preference optimization   MLLMs Reasoning
    Kesen Zhao, Beier Zhu, Qianru Sun, and 1 more author
    In International Conference on Computer Vision
  6. ICCV
    Distilling parallel gradients for fast ODE solvers of diffusion models   Diffusion Solver
    Beier Zhu, Ruoyu Wang, Tong Zhao, and 2 more authors
    In International Conference on Computer Vision
  7. ICCV
    Dynamic Multimodal Prototype Learning in Vision-Language Models   Robust Adaptation for VLMs
    Xingyu Zhu, Shuo Wang, Beier Zhu, and 6 more authors
    In International Conference on Computer Vision
  8. FCS
    Debiasing vision-language models for vision tasks: a survey   Survey
    Beier Zhu, and Hanwang Zhang
    Frontiers of Computer Science
  9. CVPR
    Highlight
    Project-probe-aggregate: efficient fine-tuning for group robustness   Group Robustness
    Beier Zhu, Jiequan Cui, Hanwang Zhang, and 1 more author
    In Computer Vision and Pattern Recognition Conference
  10. CVPR
    StyleStudio: text-driven style transfer with selective control of style elements   Image Generation
    Mingkun Lei, Xue Song, Beier Zhu, and 2 more authors
    In Computer Vision and Pattern Recognition Conference
  11. CVPR
    Devils in middle layers of large vision-language models: Interpreting, detecting and mitigating object hallucinations via attention lens   MLLMs De-Hallucination
    Zhangqi Jiang, Junkai Chen, Beier Zhu, and 3 more authors
    In Computer Vision and Pattern Recognition Conference
  12. arXiv
    Generalized kullback-leibler divergence loss   Robustness
    Jiequan Cui, Beier Zhu, Qingshan Xu, and 5 more authors

2024

  1. Thesis
    Towards unbiased, accurate and robust fine-tuning of zero-shot vision models   others
    Zhu Beier
  2. NeurIPS
    Spotlight
    Enhancing zero-shot vision models by label-free prompt distribution learning and bias correcting   Imbalanced Learning
    Xingyu ZhuBeier Zhu, Yi Tan, and 3 more authors
    In Advances in Neural Information Processing Systems
  3. NeurIPS
    Robust fine-tuning of zero-shot models via variance reduction   OOD Generalization
    Beier Zhu, Jiequan Cui, and Hanwang Zhang
    In Advances in Neural Information Processing Systems
  4. MM
    Oral
    Selective vision-language subspace projection for few-shot CLIP   Robust Adaptation for VLMs
    Xingyu ZhuBeier Zhu, Yi Tan, and 3 more authors
    In ACM International Conference on Multimedia
  5. CVPR
    Classes are not equal: an empirical study on image recognition fairness   Robustness
    Jiequan Cui, Beier Zhu, Xin Wen, and 3 more authors
    In Computer Vision and Pattern Recognition Conference

2023

  1. NeurIPS
    Generalized logit adjustment: Calibrating fine-tuned models by removing label bias in foundation models   Imbalanced Learning
    Beier Zhu, Kaihua Tang, Qianru Sun, and 1 more author
    In Advances in Neural Information Processing Systems
  2. AAAI
    Oral
    Debiased fine-tuning for vision-language models by prompt regularization   Robust Adaptation for VLMs
    Beier Zhu, Yulei Niu, Saeil Lee, and 2 more authors
    In AAAI Conference on Artificial Intelligence
  3. AAAI
    Oral
    Leveraging modality-specific representations for audio-visual speech recognition via reinforcement learning   Speech Recognition
    Chen Chen, Yuchen Hu, Qiang Zhang, Heqing Zou, Beier Zhu, and 1 more author
    In AAAI conference on artificial intelligence
  4. ICCV
    Prompt-aligned gradient for prompt tuning   Robust Adaptation for VLMs
    Beier Zhu, Yulei Niu, Yucheng Han, and 2 more authors
    In International Conference on Computer Vision

2022

  1. AAAI
    Oral
    Cross-domain empirical risk minimization for unbiased long-tailed classification   Imbalanced Learning
    Beier Zhu, Yulei Niu, Xian-Sheng Hua, and 1 more author
    In Proceedings of the AAAI conference on artificial intelligence

2021

  1. TIP
    Structure-coherent deep feature learning for robust face alignment   Face Alignment
    Chunze LinBeier Zhu, Quan Wang, and 4 more authors
    IEEE Transactions on Image Processing

2019

  1. TSG
    Fault location for radial distribution network via topology and reclosure-generating traveling waves   Power System
    Shenxing Shi, Beier Zhu, Aoyu Lei, and 1 more author
    IEEE Transactions on Smart Grid

2018

  1. TSG
    Fault classification for transmission lines based on group sparse representation   Power System
    Shenxing Shi, Beier Zhu, Sohrab Mirsaeidi, and 1 more author
    IEEE Transactions on Smart Grid