Selected Publications

See the full list on my Google Scholar.

Equal Contribution, *Corresponding Author(s)

2024

  1. HandDiff: 3D Hand Pose Estimation with Diffusion on Image-Point Cloud
     Wencan Cheng,  Hao Tang,  Luc Van Gool,  Jong Hwan Ko
    In CVPR 2024, Seattle, USA
  2. Versatile Navigation under Partial Observability via Value-guided Diffusion Policy
     Gengyu Zhang,  Hao Tang,  Yan Yan
    In CVPR 2024, Seattle, USA
  3. Towards Robust 3D Pose Transfer with Adversarial Learning
     Haoyu Chen,  Hao Tang,  Ehsan Adeli,  Guoying Zhao
    In CVPR 2024, Seattle, USA
  4. On the Faithfulness of Vision Transformer Explanations
     Junyi Wu,  Weitai Kang,  Hao Tang,  Yuan Hong,  Yan Yan
    In CVPR 2024, Seattle, USA
  5. Token Transformation Matters: Towards Faithful Post-hoc Explanation for Vision Transformer
     Junyi Wu,  Bin Duan,  Weitai Kang,  Hao Tang,  Yan Yan
    In CVPR 2024, Seattle, USA
  6. SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation
     Yuxuan Zhang,  Jiaming Liu,  Yiren Song,  Rui Wang,  Hao Tang,  Jinpeng Yu,  Huaxia Li,  Xu Tang,  Yao Hu,  Han Pan,  Zhongliang Jing
    In CVPR 2024, Seattle, USA
  7. Distilling ODE Solvers of Diffusion Models into Smaller Steps
     Sanghwan Kim,  Hao Tang*,  Fisher Yu
    In CVPR 2024, Seattle, USA
  8. Towards Online Real-Time Memory-based Video Inpainting Transformers
     Guillaume Thiry,  Hao Tang*,  Radu Timofte,  Luc Van Gool
    In CVPR 2024, Seattle, USA
  9. Graph Transformer GANs with Graph Masked Modeling for Architectural Layout Generation
     Hao Tang,  Ling Shao,  Nicu Sebe,  Luc Van Gool
    IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
  10. 2023

    1. G2P-DDM: Generating Sign Pose Sequence from Gloss Sequence with Discrete Diffusion Model
       Pan Xie,  Qipeng Zhang,  Peng Taiying,  Hao Tang*,  Yao Du,  Zexian Li
      In AAAI 2024, Vancouver, Canada
    2. HotBEV: Hardware-oriented Transformer-based Multi-View 3D Detector for BEV Perception
       Peiyan Dong,  Zhenglun Kong,  Xin Meng,  Pinrui Yu,  Yifan Gong,  Geng Yuan,  Hao Tang*, Yanzhi Wang
      In NeurIPS 2023, New Orleans, USA
    3. PackQViT: Faster Sub-8-bit Vision Transformers via Full and Packed Quantization on the Mobile
       Peiyan Dong,  Lei Lu,  Chao Wu,  Cheng Lyu,  Geng Yuan,  Hao Tang*, Yanzhi Wang
      In NeurIPS 2023, New Orleans, USA
    4. LART: Neural Correspondence Learning with Latent Regularization Transformer for 3D Motion Transfer
       Haoyu Chen,  Hao Tang,  Radu Timofte,  Luc Van Gool,  Guoying Zhao
      In NeurIPS 2023, New Orleans, USA
    5. Does Graph Distillation See Like Vision Dataset Counterpart?
       Beining Yang,  Kai Wang,  Qingyun Sun,  Cheng Ji,  Xingcheng Fu,  Hao Tang,  Yang You,  Jianxin Li
      In NeurIPS 2023, New Orleans, USA
    6. Edge Guided GANs with Multi-Scale Contrastive Learning for Semantic Image Synthesis
       Hao Tang,  Guolei Sun,  Nicu Sebe,  Luc Van Gool
      IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
    7. Learning Concordant Attention via Target-aware Alignment for Visible-Infrared Person Re-identification
       Jianbing Wu,  Hong Liu,  Yuxin Su,  Wei Shi,  Hao Tang
      In ICCV 2023, Paris, France
    8. SpeedDETR: Speed-aware Transformers for End-to-end Object Detection
       JPeiyan Dong,  JZhenglun Kong,  JXin Meng,  JPeng Zhang,  Hao Tang*,  JYanzhi Wang,  JChih-Hsien Chou
      In ICML 2023, Hawaii, USA
    9. Data Level Lottery Ticket Hypothesis for Vision Transformers
       Xuan Shen,  Zhenglun Kong,  Minghai Qin,  Peiyan Dong,  Geng Yuan,  Xin Meng,  Hao Tang,  Xiaolong Ma,  Yanzhi Wang
      In IJCAI 2023, Macao, China
    10. RZCR: Zero-shot Character Recognition via Radical-based Reasoning
       Xiaolei Diao,  Daqian Shi,  Hao Tang,  Qiang Shen,  Yanzeng Li,  Lei Wu,  Hao Xu
      In IJCAI 2023, Macao, China
    11. Measuring the Consistency and Diversity of 3D Face Generation
       Kunlin Liu,  Wenbo Zhou,  Zhenyu Zhang,  Yanhao Ge,  Hao Tang,  Weiming Zhang,  Nenghai Yu
      IEEE Journal of Selected Topics in Signal Processing (JSTSP), 2023
    12. Transductive Prototypical Attention Reasoning Network for Few-shot SAR Target Recognition
       Haohao Ren,  Sen Liu,  Xuelian Yu,  Lin Zou,  Yun Zhou,  Xuegang Wang,  Hao Tang
      IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2023
    13. Multi-Hypothesis Representation Learning for Transformer-Based 3D Human Pose Estimation
       Wenhao Li,  Hong Liu,  Hao Tang,  Pichao Wang
      Elsevier Pattern Recognition (PR), 2023
    14. Graph Transformer GANs for Graph-Constrained House Generation
       Hao Tang,  Zhenyu Zhang,  Humphrey Shi,  Bo Li,  Ling Shao,  Nicu Sebe,  Radu Timofte,  Luc Van Gool
      In CVPR 2023, Vancouver, Canada
    15. Unsupervised Deep Probabilistic Approach for Partial Point Cloud Registration
       Guofeng Mei,  Hao Tang,  Xiaoshui Huang,  Weijie Wang,  Juan Liu,  Jian Zhang,  Luc Van Gool,  Qiang Wu
      In CVPR 2023, Vancouver, Canada
    16. SMAE: Few-shot Learning for HDR Deghosting with Saturation-Aware Masked Autoencoders
       Qingsen Yan,  Song Zhang,  Weiye Chen,  Hao Tang,  Yu Zhu,  Jinqiu Sun,  Luc Van Gool,  Yanning Zhang
      In CVPR 2023, Vancouver, Canada
    17. DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network
       Xuan Shen,  Yaohua Wang,  Ming Lin,  Yilun Huang,  Hao Tang,  Xiuyu Sun,  Yanzhi Wang
      In CVPR 2023, Vancouver, Canada
    18. GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis
       Ming Tao,  Bingkun Bao,  Hao Tang,  Changsheng Xu
      In CVPR 2023, Vancouver, Canada
    19. Pruning Parameterization with Bi-level Optimization for Efficient Semantic Segmentation on the Edge
       Changdi Yang,  Pu Zhao,  Yanyu Li,  Wei Niu,  Jiexiong Guan,  Hao Tang,  Minghai Qin,  Bin Ren,  Xue Lin,  Yanzhi Wang
      In CVPR 2023, Vancouver, Canada
    20. Go Closer To See Better: Camouflaged Object Detection via Object Area Amplification and Figure-ground Conversion
      Haozhe Xing,  Yan Wang,  Xujun Wei,  Hao Tang,  Shuyong Gao,  Wenqiang Zhang
      IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2022
    21. MLP-GAN for Brain Vessel Image Segmentation
       Bin Xie,  Hao Tang,  Bin Duan,  Dawen Cai,  Yan Yan
      In ICASSP 2023, Rhodes Island, Greece
    22. PI-Trans: Parallel-ConvMLP and Implicit-Transformation Based GAN for Cross-View Image Translation
       Bin Ren,  Hao Tang,  Yiming Wang,  Xia Li,  Wei Wang,  Nicu Sebe
      In ICASSP 2023, Rhodes Island, Greece
    23. TinyCOD: Tiny and Effective Model for Camouflaged Object Detection
       Haozhe Xing,  Shuyong Gao,  Hao Tang,  Tsui Qin Mok,  Yanlan Kang,  Wenqiang Zhang
      In ICASSP 2023, Rhodes Island, Greece
    24. Edge Guided GANs with Contrastive Learning for Semantic Image Synthesis
       Hao Tang,  Xiaojuan Qi,  Guolei Sun,  Dan Xu,  Nicu Sebe,  Radu Timofte,  Luc Van Gool
      In ICLR 2023, Kigali, Rwanda
    25. Interaction Transformer for Human Reaction Generation
       Baptiste Chopin,  Hao Tang,  Naima Otberdout,  Mohamed Daoudi,  Nicu Sebe
      IEEE Transactions on Multimedia (TMM), 2023
    26. DE-Net: Dynamic Text-guided Image Editing Adversarial Networks
       Ming Tao,  Bingkun Bao,  Hao Tang,  Fei Wu,  Longhui Wei,  Qi Tian
      In AAAI 2023, Washington DC, USA
    27. Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training
       Zhenglun Kong,  Haoyu Ma,  Geng Yuan,  Mengshu Sun,  Yanyue Xie,  Peiyan Dong,  Xin Meng,  Xuan Shen,  Hao Tang,  Minghai Qin,  Tianlong Chen,  Xiaolong Ma,  Xiaohui Xie,  Zhangyang Wang,  Yanzhi Wang
      In AAAI 2023, Washington DC, USA
    28. HOTCOLD Block: Fooling Thermal Infrared Detectors with a Novel Wearable Design
       Hui Wei,  Zhixiang Wang,  Xuemei Jia,  Yinqiang Zheng,  Hao Tang,  Shin'ichi Satoh,  Zheng Wang
      In AAAI 2023, Washington DC, USA
    29. Towards Real-Time Segmentation on the Edge
       Yanyu Li,  Changdi Yang,  Pu Zhao,  Geng Yuan,  Wei Niu,  Jiexiong Guan,  Hao Tang,  Minghai Qin,  Qing Jin,  Bin Ren,  Xue Lin,  Yanzhi Wang
      In AAAI 2023, Washington DC, USA

    2022

    1. Bipartite Graph Reasoning GANs for Person Pose and Facial Image Synthesis
       Hao Tang,  Ling Shao,  Philip HS Torr,  Nicu Sebe
      Springer International Journal of Computer Vision (IJCV), 2022
    2. Quasi-equilibrium Feature Pyramid Network for Salient Object Detection
       Yue Song,  Hao Tang,  Mengyi Zhao,  Nicu Sebe,  Wei Wang
      IEEE Transactions on Image Processing (TIP), 2022
    3. AO2-DETR: Arbitrary-Oriented Object Detection Transformer
       Linhui Dai,  Hong Liu,  Hao Tang,  Zhiwei Wu,  Pinhao Song
      IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2022
    4. SiNeRF: Sinusoidal Neural Radiance Fields for Joint Pose Estimation and Scene Reconstruction
       Yitong Xia,  Hao Tang,  Radu Timofte,  Luc Van Gool
      In BMVC 2022, London, UK
    5. Multi-Channel Attention Selection GANs for Guided Image-to-Image Translation
       Hao Tang,  Philip HS Torr,  Nicu Sebe
      IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
    6. Facial Expression Translation using Landmark Guided GANs
       Hao Tang,  Nicu Sebe
      IEEE Transactions on Affective Computing (TAFFC), 2022
    7. Supervised Multi-scale Attention-guided Ship Detection in Optical Remote Sensing Images
       Jianming Hu,  Xiyang Zhi,  Shikai Jiang,  Hao Tang,  Wei Zhang,  Lorenzo Bruzzone
      IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2022
    8. 3D-Aware Semantic-Guided Generative Model for Human Synthesis
       Jichao Zhang,  Enver Sangineto,  Hao Tang,  Aliaksandr Siarohin,  Zhun Zhong,  Nicu Sebe,  Wei Wang
      In ECCV 2022, Tel Aviv, Israel
    9. Mining Relations among Cross-Frame Affinities for Video Semantic Segmentation
       Guolei Sun,  Yun Liu,  Hao Tang,  Ajad Chhatkuli,  Le Zhang,  Luc Van Gool
      In ECCV 2022, Tel Aviv, Israel
    10. Towards Interpretable Video Super-Resolution via Alternative Optimization
       Jiezhang Cao,  Jingyun Liang,  Kai Zhang,  Wenguan Wang,  Qin Wang,  Yulun Zhang,  Hao Tang,  Luc Van Gool
      In ECCV 2022, Tel Aviv, Israel
    11. Compiler-Aware Neural Architecture Search for On-Mobile Real-time Super-Resolution
       Yushu Wu,  Yifan Gong,  Pu Zhao,  Yanyu Li,  Zheng Zhan,  Wei Niu,  Hao Tang,  Minghai Qin,  Bin Ren,  Yanzhi Wang
      In ECCV 2022, Tel Aviv, Israel
    12. SPViT: Enabling Faster Vision Transformers via Soft Token Pruning
       Zhenglun Kong,  Peiyan Dong,  Xiaolong Ma,  Xin Meng,  Wei Niu,  Mengshu Sun,  Xuan Shen,  Geng Yuan,  Bin Ren,  Minghai Qin,  Hao Tang,  Yanzhi Wang
      In ECCV 2022, Tel Aviv, Israel
    13. Unsupervised High-Resolution Portrait Gaze Correction and Animation
       Jichao Zhang,  Jingjing Chen,  Hao Tang,  Enver Sangineto,  Peng Wu,  Yan Yan,  Nicu Sebe,  Wei Wang
      IEEE Transactions on Image Processing (TIP), 2022
    14. Cross-view Panorama Image Synthesis with Progressive Attention GANs
       Songsong Wu,  Hao Tang,  Xiaoyuan Jing,  Jianjun Qian,  Nicu Sebe,  Yan Yan,  Qinghua Zhang
      Elsevier Pattern Recognition (PR), 2022
    15. RCRN: Real-world Character Image Restoration Network via Skeleton Extraction
       Daqian Shi,  Xiaolei Diao,  Hao Tang,  Xiaomin Li,  Hao Xing,  Hao Xu
      In ACM MM 2022, Lisbon, Portugal
    16. CharFormer: A Glyph Fusion based Attentive Framework for High-precision Character Image Denoising
       Daqian Shi,  Xiaolei Diao,  Lida Shi,  Hao Tang,  Yang Chi,  Chuntao Li,  Hao Xu
      In ACM MM 2022, Lisbon, Portugal
    17. Real-Time Portrait Stylization on the Edge
       Yanyu Li,  Xuan Shen,  Geng Yuan,  Jiexiong Guan,  Wei Niu,  Hao Tang,  Bin Ren,  Yanzhi Wang
      In IJCAI Demo 2022, Vienna, Austria
    18. Looking Outside The Window: Wide-Context Transformer for The Semantic Segmentation of High-Resolution Remote Sensing Images
       Lei Ding,  Dong Lin,  Shaofu Lin,  Jing Zhang,  Xiaojie Cui,  Yuebin Wang,  Hao Tang,  Lorenzo Bruzzone
      IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2022
    19. Continual Attentive Fusion for Incremental Learning in Semantic Segmentation
       Guanglei Yang,  Enrico Fini,  Dan Xu,  Paolo Rota,  Mingli Ding,  Hao Tang,  Xavier Alameda-Pineda,  Elisa Ricci
      IEEE Transactions on Multimedia (TMM), 2022
    20. DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis
       Ming Tao,  Hao Tang,  Fei Wu,  Xiaoyuan Jing,  Bingkun Bao,  Changsheng Xu
      In CVPR 2022, New Orleans, USA
    21. MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation
       Wenhao Li,  Hong Liu,  Hao Tang,  Pichao Wang,  Luc Van Gool
      In CVPR 2022, New Orleans, USA
    22. Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model
       Zipeng Xu,  Tianwei Lin,  Hao Tang,  Fu Li,  Dongliang He,  Nicu Sebe,  Radu Timofte,  Luc Van Gool,  Errui Ding
      In CVPR 2022, New Orleans, USA
    23. Physically-guided Disentangled Implicit Rendering for 3D Face Modeling
       Zhenyu Zhang,  Yanhao Ge,  Ying Tai,  Weijian Cao,  Renwang Chen,  Kunlin Liu,  Hao Tang,  Xiaoming Huang,  Chengjie Wang,  Zhifeng Xie,  Dongjin Huang
      In CVPR 2022, New Orleans, USA
    24. Learning to Restore 3D Face from In-the-Wild Degraded Images
       Zhenyu Zhang,  Yanhao Ge,  Ying Tai,  Xiaoming Huang,  Chengjie Wang,  Hao Tang,  Dongjin Huang,  Zhifeng Xie
      In CVPR 2022, New Orleans, USA
    25. Local and Global GANs with Semantic-Aware Upsampling for Image Generation
       Hao Tang,  Ling Shao,  Philip HS Torr,  Nicu Sebe
      IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
    26. Cross-View Panorama Image Synthesis
       Songsong Wu,  Hao Tang,  Xiaoyuan Jing,  Haifeng Zhao,  Jianjun Qian,  Nicu Sebe,  Yan Yan
      IEEE Transactions on Multimedia (TMM), 2022
    27. Geometry-Contrastive Transformer for Generalized 3D Pose Transfer
       Haoyu Chen,  Hao Tang,  Zitong Yu,  Nicu Sebe,  Guoying Zhao
      In AAAI 2022, Vancouver, Canada
    28. Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking
       Yidi Li,  Hong Liu,  Hao Tang
      In AAAI 2022, Vancouver, Canada

    2021

    1. Adversarial Shape Learning for Building Extraction in VHR Remote Sensing Images
       Lei Ding,  Hao Tang,  Yahui Liu,  Yilei Shi,  Xiaoxiang Zhu,  Lorenzo Bruzzone
      IEEE Transactions on Image Processing (TIP), 2021
    2. Cascaded Cross MLP-Mixer GANs for Cross-View Image Translation
       Bin Ren,  Hao Tang,  Nicu Sebe
      In BMVC 2021, Virtual
    3. AniFormer: Data-driven 3D Animation with Transformer
       Haoyu Chen,  Hao Tang,  Nicu Sebe,  Guoying Zhao
      In BMVC 2021, Virtual
    4. Highly Efficient Natural Image Matting
       Yijie Zhong,  Bo Li,  Lv Tang,  Hao Tang,  Shouhong Ding
      In BMVC 2021, Virtual
    5. Layout-to-Image Translation with Double Pooling Generative Adversarial Networks
       Hao Tang,  Nicu Sebe
      IEEE Transactions on Image Processing (TIP), 2021
    6. AttentionGAN: Unpaired Image-to-Image Translation Using Attention-Guided Generative Adversarial Networks
       Hao Tang,  Hong Liu,  Dan Xu,  Philip HS Torr,  Nicu Sebe
      IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
    7. Transformer-Based Attention Networks for Continuous Pixel-Wise Prediction
       Guanglei Yang,  Hao Tang,  Mingli Ding,  Nicu Sebe,  Elisa Ricci
      In ICCV 2021, Montreal, Canada
    8. Intrinsic-Extrinsic Preserved GANs for Unsupervised 3D Pose Transfer
       Haoyu Chen,  Hao Tang,  Henglin Shi,  Wei Peng,  Nicu Sebe,  Guoying Zhao
      In ICCV 2021, Montreal, Canada
    9. Cross-View Exocentric to Egocentric Video Synthesis
       Gaowen Liu,  Hao Tang,  Hugo Latapie,  Jason Corso,  Yan Yan
      In ACM MM 2021, Chengdu, China
    10. Total Generate: Cycle in Cycle Generative Adversarial Networks for Generating Human Faces, Hands, Bodies, and Natural Scenes
       Hao Tang,  Nicu Sebe
      IEEE Transactions on Multimedia (TMM), 2021

    2020

    1. Bipartite Graph Reasoning GANs for Person Image Generation
       Hao Tang,  Song Bai,  Philip H.S. Torr,  Nicu Sebe
      In BMVC 2020, Manchester, UK
    2. Dual Attention GANs for Semantic Image Synthesis
       Hao Tang,  Song Bai,  Nicu Sebe
      In ACM MM 2020, Seattle, USA
    3. Dual In-painting Model for Unsupervised Gaze Correction and Animation in the Wild
       Jichao Zhang,  Jingjing Chen,  Hao Tang,  Wei Wang,  Yan Yan,  Enver Sangineto,  Nicu Sebe
      In ACM MM 2020, Seattle, USA
    4. Unified Generative Adversarial Networks for Controllable Image-to-Image Translation
       Hao Tang,  Hong Liu,  Nicu Sebe
      IEEE Transactions on Image Processing (TIP), 2020
    5. XingGAN for Person Image Generation
       Hao Tang,  Song Bai,  Li Zhang,  Philip H.S. Torr,  Nicu Sebe
      In ECCV 2020, Glasgow, UK
    6. When Dictionary Learning Meets Deep Learning: Deep Dictionary Learning and Coding Network for Image Recognition with Limited Data
       Hao Tang,  Hong Liu,  Wei Xiao,  Nicu Sebe
      IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2020
    7. LANet: Local Attention Embedding to Improve the Semantic Segmentation of Remote Sensing Images
       Lei Ding,  Hao Tang,  Lorenzo Bruzzone
      IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2020
    8. Local Class-Specific and Global Image-Level Generative Adversarial Networks for Semantic-Guided Scene Generation
       Hao Tang,  Dan Xu,  Yan Yan,  Philip H.S. Torr,  Nicu Sebe
      In CVPR 2020, Seattle, USA

    2019

    1. Cycle In Cycle Generative Adversarial Networks for Keypoint-Guided Image Generation
       Hao Tang,  Dan Xu,  Gaowen Liu,  Wei Wang,  Yan Yan,  Nicu Sebe
      In ACM MM 2019, Nice, France
    2. Multi-Channel Attention Selection GAN with Cascaded Semantic Guidance for Cross-View Image Translation
       Hao Tang,  Dan Xu,  Nicu Sebe,  Yanzhi Wang,  Jason J. Corso,  Yan Yan
      In CVPR 2019, Long Beach, USA

    2018

    1. GestureGAN for Hand Gesture-to-Gesture Translation in the Wild
       Hao Tang,  Wei Wang,  Dan Xu,  Yan Yan,  Nicu Sebe
      In ACM MM 2018, Seoul, South Korea
    2. Attention Guided Convolutional Neural Fields for Monocular Depth Estimation
       Dan Xu,  Wei Wang,  Hao Tang,  Hong Liu,  Nicu Sebe,  Elisa Ricci
      In CVPR 2019, Salt Lake City, USA

    2016

    1. A Novel Feature Matching Strategy for Large Scale Image Retrieval
       Hao Tang,  Hong Liu
      In IJCAI 2016, New York, USA

    2015

    1. Gender Classification Using Pyramid Segmentation for Unconstrained Back-facing Video Sequences
       Hao Tang,  Hong Liu,  Wei Xiao
      In ACM MM 2015, Brisbane, Australia