Publications

Rongjie Li, Yu Wu, Xuming He (2024). Learning by Correction: Efficient Tuning Task for Zero-Shot Generative Vision-Language Reasoning. In CVPR 2024.

PDF Code

Rongjie Li, Songyang Zhang, Dahua Lin, Kai Chen, Xuming He (2024). From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models. In CVPR 2024.

PDF Code

Longtian Qiu*, Shan Ning*, Xuming He (2024). Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training. In AAAI 2024.

PDF Code DOI

Rongjie Li, Songyang Zhang, Xuming He (2023). SGTR+: End-to-end Scene Graph Generation with Transformer. In TPAMI 2023.

PDF Code

Yu Wu, Yana Wei, Haozhe Wang, Yongfei Liu, Sibei Yang, Xuming He (2023). Grounded Image Text Matching with Mismatched Relation Reasoning. In ICCV 2023.

PDF Code

Peiyan Gu, Chuyu Zhang, Ruijie Xu, Xuming He (2023). Class-relation Knowledge Distillation for Novel Class Discovery. In ICCV 2023.

PDF Code

Chuyu Zhang, Ruijie Xu, Xuming He (2023). Novel Class Discovery for Long-tailed Recognition. In TMLR 2023.

PDF Code

Zhitong Gao, Yucong Chen, Chuyu Zhang, Xuming He (2023). Modeling Multimodal Aleatoric Uncertainty in Segmentation with Mixture of Stochastic Experts. In ICLR 2023.

PDF Code Poster Video

Shan Ning*, Longtian Qiu*, Yongfei Liu, Xuming He (2023). HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models. In CVPR 2023.

PDF Code

Ziyu Guo*, Renrui Zhang*, Longtian Qiu*, Xianzheng Ma, Xupeng Miao, Xuming He, Bin Cui (2023). CALIP: Zero-shot Enhancement of CLIP with Parameter-free Attention. In AAAI 2023.

PDF Code DOI

Tailin Chen, Desen Zhou, Jian Wang, Shidong Wang, Qian He, Chuanyang Hu, Errui Ding, Yu Guan, Xuming He (2023). Part-aware Prototypical Graph Network for One-shot Skeleton-based Action Recognition (*Best Student Paper*). In FG 2023.

PDF Code

Haozhe Wang, Chao Du, Panyan Fang, Shuo Yuan, Xuming He, Liang Wang, Bo Zheng (2022). ROI-Constrained Bidding via Curriculum-Guided Bayesian Reinforcement Learning. In KDD 2022.

PDF Code

Yongfei Liu, Shao-yen Tseng, Chenfei Wu, Vasudev Lal, Xuming He, Nan Duan (2022). KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation. In Findings of NAACL 2022.

PDF

Rongjie Li, Songyang Zhang, Xuming He (2022). SGTR: End-to-end Scene Graph Generation with Transformer. In CVPR 2022.

PDF Code

Jiangwei Xie, Shipeng Yan, Xuming He (2022). General Incremental Learning with Domain-aware Categorical Representations. In CVPR 2022.

PDF

Weizhen Liu, Qian He, Xuming He (2022). Weakly Supervised Nuclei Segmentation via Instance Learning. In IEEE ISBI 2022.

PDF Code

Lin Song, Songyang Zhang, Songtao Liu, Zeming Li, Xuming He, Hongbin Sun, Jian Sun, Nanning Zheng (2021). Dynamic Grained Encoder for Vision Transformers. In NeurIPS 2021.

PDF

Shuailin Li, Zhitong Gao, Xuming He (2021). Superpixel-guided Iterative Learning from Noisy Labels for Medical Image Segmentation. In International Conference on Medical Image Computing and Computer Assisted Intervention 2021.

PDF Code

Tailin Chen, Desen Zhou, Jian Wang, Shidong Wang, Yu Guan, Xuming He, Errui Ding (2021). Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition. In ACM International Conference on Multimedia 2021.

PDF

Songyang Zhang, Jiale Zhou, Xuming He (2021). Learning Implicit Temporal Alignment for Few-shot Video Classification. In International Joint Conference on Artificial Intelligence 2021.

PDF Code

Shipeng Yan, Jiale Zhou, Jiangwei Xie, Songyang Zhang, Xuming He (2021). An EM Framework for Online Incremental Learning of Semantic Segmentation. In ACM International Conference on Multimedia 2021.

PDF

Qian He, Desen Zhou, Bo Wan, Xuming He (2021). Single Image 3D Object Estimation with Primitive Graph Networks. In ACM International Conference on Multimedia 2021.

PDF Code

Qian He, Shuailin Li, Xuming He (2021). Weakly Supervised Volumetric Segmentation via Self-taught Shape Denoising Model. In Medical Imaging with Deep Learning 2021.

PDF Code

Songyang Zhang, Zeming Li, Shipeng Yan, Xuming He, Jian Sun (2021). Distribution Alignment: A Unified Framework for Long-tail Visual Recognition. In IEEE/CVF Conference on Computer Vision and Pattern Recognition 2021.

PDF Code

Yongfei Liu, Bo Wan, Lin Ma, Xuming He (2021). Relation-aware Instance Refinement for Weakly Supervised Visual Grounding. In IEEE/CVF Conference on Computer Vision and Pattern Recognition 2021.

PDF Code

Shipeng Yan, Jiangwei Xie, Xuming He (2021). DER: Dynamically Expandable Representation for Class Incremental Learning (Oral). In IEEE/CVF Conference on Computer Vision and Pattern Recognition 2021.

PDF Code

Rongjie Li, Songyang Zhang, Bo Wan, Xuming He (2021). Bipartite Graph Network with Adaptive Message Passing for Unbiased Scene Graph Generation. In IEEE/CVF Conference on Computer Vision and Pattern Recognition 2021.

PDF Code

Shuaiyi Huang, Qiuyue Wang, Xuming He (2020). Confidence-aware Adversarial Learning for Self-supervised Semantic Matching. In Chinese Conference on Pattern Recognition and Computer Vision 2020.

PDF Code

Yongfei Liu, Xiangyi Zhang, Songyang Zhang, Xuming He (2020). Part-aware Prototype Network for Few-shot Semantic Segmentation. In European Conference on Computer Vision 2020.

PDF Code

Shuailin Li, Chuyu Zhang, Xuming He (2020). Shape-aware Semi-supervised 3D Semantic Segmentation for Medical Images. In International Conference on Medical Image Computing and Computer Assisted Intervention 2020.

PDF Code

Yongfei Liu, Bo Wan, Xiaodan Zhu, Xuming He (2020). Learning Cross Modal Context Graph for Visual Grounding. In AAAI Conference on Artificial Intelligence, 2020.

PDF Code

Haozhe Wang, Jiale Zhou, Xuming He (2020). Learning Context-aware Task Reasoning for Efficient Meta-reinforcement Learning. In International Conference on Autonomous Agents and Multiagent Systems, 2020.

PDF Code

Bo Wan, Desen Zhou, Yongfei Liu, Rongjie Li, Xuming He (2019). Pose-aware Multi-level Feature Network for Human Object Interaction Detection. In International Conference on Computer Vision, 2019.

PDF Code

Shuaiyi Huang, Qiuyue Wang, Songyang Zhang, Shipeng Yan, Xuming He (2019). Dynamic Context Correspondence Network for Semantic Alignment. In International Conference on Computer Vision, 2019.

PDF Code Poster

Songyang Zhang, Shipeng Yan, Xuming He (2019). LatentGNN: Learning Efficient Non-local Relations for Visual Recognition. In International Conference on Machine Learning, 2019.

PDF Code Poster

Shipeng Yan, Songyang Zhang, Xuming He (2019). A Dual Attention Network With Semantic Embedding for Few-shot Learning. In AAAI Conference on Artificial Intelligence, 2019.

PDF Code

Qian He, Desen Zhou, Xuming He (2018). 3D Object Structure Recovery via Semi-supervised Learning on Videos. In British Machine Vision Conference, 2018.

PDF Code

Alexander Mathews, Lexing Xie, Xuming He (2018). SemStyle: Learning to Generate Stylised Image Captions using Unaligned Text. In IEEE Conference on Computer Vision and Pattern Recognition, 2018.

PDF

Hongtao Yang, Xuming He, Fatih Porikli (2018). One-shot Action Localization by Learning Sequence Matching Network. In IEEE Conference on Computer Vision and Pattern Recognition, 2018.

PDF

Miaomiao Liu, Xuming He, Mathieu Salzmann (2018). Geometry-aware Deep Network for Single-Image Novel View Synthesis. In IEEE Conference on Computer Vision and Pattern Recognition, 2018.

PDF

Hongtao Yang, Xuming He, Fatih Porikli (2018). Instance-aware Detailed Action Labeling in Videos. In IEEE Winter Conference on Applications of Computer Vision, 2018.

PDF

Wei Zhuo, Mathieu Salzmann, Xuming He, Miaomiao Liu (2018). 3D Box Proposals from a Single Monocular Image of an Indoor Scene. In AAAI Conference on Artificial Intelligence, 2018.

PDF

Feiyang Cheng, Xuming He, Hong Zhang (2017). Stacked Learning to Search for Scene Labeling. In IEEE Transactions on Image Processing, 2017.

PDF

Salman H. Khan, Xuming He, Fatih Porikli, Mohammed Bennamoun (2017). Forest Change Detection in Incomplete Satellite Images With Deep Neural Networks. In IEEE Transactions on Geoscience and Remote Sensing, 2017.

PDF

Yufan Liu, Songyang Zhang, Mai Xu, Xuming He (2017). Predicting Salient Face in Multiple-face Videos. In IEEE Conference on Computer Vision and Pattern Recognition, 2017.

PDF

Salman Khan, Xuming He, Fatih Porikli, Ferdous Sohel, Roberto Togneri, Mohammed Bennamoun (2017). Learning Deep Structured Network for Weakly Supervised Change Detection. In International Joint Conference on Artificial Intelligence, 2017.

PDF

Haoyang Zhang, Xuming He (2017). Deep Free-Form Deformation Network for Object-Mask Registration. In International Conference on Computer Vision, 2017.

PDF

Haoyang Zhang, Xuming He, Fatih Porikli (2017). Learning Spatial Transforms for Refining Object Segment Proposals. In IEEE Winter Conference on Applications of Computer Vision, 2017.

PDF

Wei Zhuo, Mathieu Salzmann, Xuming He, Miaomiao Liu (2017). Indoor Scene Parsing with Instance Segmentation, Semantic Labeling and Support Relationship Inference. In IEEE Conference on Computer Vision and Pattern Recognition, 2017.

PDF

Tao Wang, Xuming He, Songzhi Su, Yin Guan (2017). Efficient Scene Layout Aware Object Detection for Traffic Surveillance. In IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017.

PDF

Zeeshan Hayder, Xuming He, Mathieu Salzmann (2017). Boundary-aware Instance Segmentation. In IEEE Conference on Computer Vision and Pattern Recognition, 2017.

PDF

Yansheng Ming, Hongdong Li, Xuming He (2016). Contour Completion without Region Segmentation. In IEEE Transactions on Image Processing, 2016.

PDF

Alexander Mathews, Lexing Xie, Xuming He (2016). SentiCap: Generating Image Descriptions with Sentiments. In AAAI Conference on Artificial Intelligence, 2016.

PDF

Haoyang Zhang, Xuming He, Fatih Porikli, Laurent Kneip (2016). Semantic Context and Depth-aware Object Proposal Generation. In IEEE International Conference on Image Processing, 2016.

PDF

Yurui Xie, Fatih Porikli, Xuming He (2016). Object-Aware Dictionary Learning with Deep Features. In IEEE Asian Conference on Computer Vision, 2016.

PDF

Haoyang Zhang, Xuming He, Fatih Porikli (2016). Learning to Generate Object Segment Proposals with Multi-modal Cue. In IEEE Asian Conference on Computer Vision, 2016.

PDF

Zeeshan Hayder, Xuming He, Mathieu Salzmann (2016). Learning to Co-Generate Object Proposals with a Deep Structured Network. In IEEE Conference on Computer Vision and Pattern Recognition, 2016.

PDF

Buyu Liu, Xuming He (2016). Learning Dynamic Hierarchical Models for Anytime Scene Labeling. In IEEE European Conference on Computer Vision, 2016.

PDF

Miaomiao Liu, Xuming He, Mathieu Salzmann (2016). Building Scene Models by Completing and Hallucinating Depth and Semantics. In IEEE European Conference on Computer Vision, 2016.

PDF