Hao Zhang (张浩)

Ph.D. in Computer Science
Google Scholar
Video Retrieval Group (VIREO)
Department of Computer Science
City University of Hong Kong


Email: zhanghaoinf AT gmail DOT com

Short Bio

Hao Zhang received the Ph.D. degree from the City University of Hong Kong, supervised by Prof. Chong-Wah Ngo.
He received the M.S. degree from the Chinese University of Hong Kong (2013) and the B.S. degree from Nanjing University (2012).

His research interests lie in multimedia content analysis, including atomic visual action detection, semantic concept indexing, and multimedia event detection in large-scale videos. He serves as a technical program committee (TPC) member for ACM Multimedia 2019/2020 and as a reviewer for ACM Multimedia 2021, CVPR 2021/2022, ICCV 2021, ECCV 2022, IEEE TCSVT, IEEE TCDS, ACM TOMM, Neurocomputing, ICCV 2019 Workshop, ICME 2020, and 《中国科学: 信息科学》 (SCIENTIA SINICA Informationis).

Publications

Selected Works (* corresponding author)
  • Long-term Leap Attention, Short-term Periodic Shift for Video Classification (Poster)
    H. Zhang, L. C. Cheng, Y. B. Hao*, C. W. Ngo, ACM Multimedia (MM), 2022, Oral.

  • Token Shift Transformer for Video Classification (Poster)
    H. Zhang, Y. B. Hao*, C. W. Ngo, ACM Multimedia (MM), 2021.

  • A Fine Granularity Object-level Representation for Event Detection and Recounting
    H. Zhang, C. W. Ngo, IEEE Trans. on Multimedia (TMM), 2018.

  • Group Contextualization for Video Recognition
    Y. B. Hao, H. Zhang*, C. W. Ngo, X. N. He, Computer Vision and Pattern Recognition (CVPR), 2022.

  • Compact Bilinear Augmented Query Structured Attention for Sport Highlights Classification
    Y. B. Hao, H. Zhang*, C. W. Ngo, Q. Liu, X. J. Hu, ACM Multimedia (MM), 2020, Oral.

  • Unsupervised Video Hashing with Multi-granularity Contextualization and Multi-structure Preservation
    Y. B. Hao, J. R. Duan, H. Zhang, B. Zhu, P. Y. Zhou, X. N. He, ACM Multimedia (MM), 2022.

  • Hierarchical Hourglass Convolutional Network for Efficient Video Classification
    Y. Tan, Y. B. Hao, H. Zhang, S. Wang, X. N. He, ACM Multimedia (MM), 2022.

  • Parameterization of Cross-token Relations with Relative Positional Encoding for Vision MLP
    Z. C. Wang, Y. B. Hao, X. Y. Gao, H. Zhang, S. Wang, T. T. Mu, X. N. He, ACM Multimedia (MM), 2022.

  • Fine-grained Cross-modal Alignment Network for Text-Video Retrieval
    N. Han, J. J. Chen, G. Xiao, H. Zhang, Y. Zeng, C. H. Chen, ACM Multimedia (MM), 2021, Oral.

Collaborative Works
  • Adversarial Multi-Grained Embedding Network for Cross-Modal Text-Video Retrieval
    N. Han, J. J. Chen, H. Zhang, H. W. Wang, H. Chen, ACM Trans. on Multimedia Computing, Communications, and Applications (TOMM), 2022.

  • Adaptive Temporal Grouping for Black-box Adversarial Attacks on Videos
    Z. P. Wei, J. J. Chen, H. Zhang, L. X. Jiang, Y. G. Jiang, ACM International Conference on Multimedia Retrieval (ICMR), 2022.

  • Chinese White Dolphin Detection in the Wild
    H. Zhang, Q. Zhang, P. A. Nguyen, V. Lee, A. Chan, ACM Multimedia Asia (MM-Asia), 2021.

  • Person-level Action Recognition in Complex Events via TSD-TSM networks
    Y. B. Hao, Z. N. Liu, H. Zhang*, B. Zhu, J. J. Chen, Y. G. Jiang, C. W. Ngo, ACM Multimedia Workshop (MMW), 2020.
    (Ranked 3rd in the HiEve2020 Grand Challenge, Track-4)

  • Visual Relations Augmented Cross-modal Retrieval
    Y. T. Guo, J. J. Chen, H. Zhang, Y. G. Jiang, ACM International Conference on Multimedia Retrieval (ICMR), Oral, October 2020.

  • Enhanced VIREO KIS at VBS 2018
    P. A. Nguyen, Y. J. Lu, H. Zhang, C. W. Ngo, International Conference on Multimedia Modeling (MMM), February 2018.

  • On the Selection of Anchors and Targets for Video Hyperlinking
    Z. Q. Cheng, H. Zhang, X. Wu, C. W. Ngo, International Conference on Multimedia Retrieval (ICMR), Bucharest, Romania, June 2017.

  • Concept-Based Interactive Search System
    Y. J. Lu, P. A. Nguyen, H. Zhang, C. W. Ngo, International Conference on Multimedia Modeling (MMM), Reykjavik, Iceland, January 2017.

  • Object Pooling for Multimedia Event Detection and Evidence Localization
    H. Zhang, C. W. Ngo, ITE Trans. on Media Technology and Applications, vol. 4, no. 3, pp. 218-226, 2016.

  • Semantic Reasoning in Zero Example Video Event Retrieval
    M. D. Boer, Y. J. Lu, H. Zhang, K. Schutte, W. Kraaij, C. W. Ngo, ACM Trans. on Multimedia Computing, Communications, and Applications (TOMM), 2017.

  • Blind late fusion in multimedia event retrieval
    M. D. Boer, K. Schutte, H. Zhang, Y. J. Lu, C. W. Ngo, W. Kraaij, International Journal of Multimedia Information Retrieval (IJMIR), Sept 2016.

  • VIREO @ TRECVID 2016: Multimedia Event Detection, Ad-hoc Video Search, Video-to-Text Description (PPT, Poster)
    H. Zhang, L. Pang, Y. J. Lu, C. W. Ngo, NIST TRECVID Workshop (TRECVID'16), Gaithersburg, USA, Nov 2016.

  • Event Detection with Zero Example: Select the Right and Suppress the Wrong Concepts
    Y. J. Lu, H. Zhang, M. D. Boer, C. W. Ngo, ACM International Conference on Multimedia Retrieval (ICMR), New York, USA, June 2016. (Oral)

  • VIREO-TNO @ TRECVID 2015: Multimedia Event Detection
    H. Zhang, Y. J. Lu, M. D. Boer, F. T. Haar, Z. F. Qiu, K. Schutte, W. Kraaij, C. W. Ngo, NIST TRECVID Workshop (TRECVID'15), Gaithersburg, USA, Nov 2015.

  • VIREO-TNO @ TRECVID 2014: Multimedia Event Detection and Recounting (MED and MER)
    C. W. Ngo, Y. J. Lu, H. Zhang, T. Yao, C. C. Tan, L. Pang, M. D. Boer, J. Schavemaker, K. Schutte, W. Kraaij, NIST TRECVID Workshop (TRECVID'14), Orlando, USA, 2014.

  • VIREO @ TRECVID 2014: Instance Search and Semantic Indexing
    W. Zhang, H. Zhang, T. Yao, Y. J. Lu, J. J. Chen, C. W. Ngo, NIST TRECVID Workshop (TRECVID'14), Orlando, USA, 2014.

