Publications

For a more complete list of publications from Prof Xuming He, please refer to his homepage.

(2023). Grounded Image Text Matching with Mismatched Relation Reasoning. In ICCV 2023.

PDF Code

(2023). HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models. In CVPR 2023.

PDF Code

(2023). Calip: Zero-shot enhancement of clip with parameter-free attention. In AAAI 2023.

PDF Code DOI

(2023). Part-aware Prototypical Graph Network for One-shot Skeleton-based Action Recognition(*Best Student Paper*). In FG 2023.

PDF Code

(2022). ROI-Constrained Bidding via Curriculum-Guided Bayesian Reinforcement Learning. In KDD 22.

PDF Code

(2022). KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation. In Findings of NAACL 2022.

PDF

(2022). SGTR: End-to-end Scene Graph Generation with Transformer. In CVPR 2022.

PDF Code

(2022). General Incremental Learning with Domain-aware Categorical Representations. In CVPR 2022.

PDF

(2021). Dynamic Grained Encoder for Vision Transformers. In NeurIPS 2021.

PDF

(2021). Superpixel-guided Iterative Learning from Noisy Labels for Medical Image Segmentation. In Medical Image Computing and Computer Assisted Intervention Society 2021.

PDF Code

(2021). Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition. In ACM International Conference on Multimedia 2021.

PDF

(2021). Learning Implicit Temporal Alignment for Few-shot Video Classification. In International Joint Conference on Artificial Intelligence 2021.

PDF Code

(2021). Single Image 3D Object Estimation with Primitive Graph Networks. In ACM International Conference on Multimedia 2021.

PDF Code

(2021). Distribution Alignment: A Unified Framework for Long-tail Visual Recognition. IEEE/CVF Conference on Computer Vision and Pattern Recognition 2021.

PDF Code

(2021). Relation-aware Instance Refinement for Weakly Supervised Visual Grounding. IEEE/CVF Conference on Computer Vision and Pattern Recognition 2021.

PDF Code

(2021). DER: Dynamically Expandable Representation for Class Incremental Learning(Oral). IEEE/CVF Conference on Computer Vision and Pattern Recognition 2021.

PDF Code

(2020). Confidence-aware Adversarial Learning for Self-supervised Semantic Matching. In Chinese Conference on Pattern Recognition and Computer Vision 2020.

PDF Code

(2020). Shape-aware Semi-supervised 3D Semantic Segmentation for Medical Images. In Medical Image Computing and Computer Assisted Intervention Society 2020.

PDF Code

(2020). Learning Cross Modal Context Graph for Visual Grounding. In Association for the Advancement of Artificial Intelligence, 2020.

PDF Code

(2020). Learning Context-aware Task Reasoning for Efficient Meta-reinforcement Learning. In International Conference on Autonomous Agents and Multiagent Systems, 2020.

PDF Code

(2019). Pose-aware Multi-level Feature Network for Human Object Interaction Detection. In International Conference on Computer Vision, 2019.

PDF Code

(2019). A Dual Attention Network With Semantic Embedding for Few-shot Learning. In Association for the Advancement of Artificial Intelligence, 2019.

PDF Code

(2018). 3D Object Structure Recovery via Semi-supervised Learning on Videos. In British Machine Vision Conference, 2018.

PDF Code

(2018). SemStyle: Learning to Generate Stylised Image Captions using Unaligned Text. In IEEE Conference on Computer Vision and Pattern Recognition, 2018.

PDF

(2018). One-shot Action Localization by Learning Sequence Matching Network. In IEEE Conference on Computer Vision and Pattern Recognition, 2018.

PDF

(2018). Geometry-aware Deep Network for Single-Image Novel View Synthesis. In IEEE Conference on Computer Vision and Pattern Recognition, 2018.

PDF

(2018). Instance-aware Detailed Action Labeling in Videos. In IEEE Winter Conference on Applications of Computer Vision, 2018.

PDF

(2018). 3D Box Proposals from a Single Monocular Image of an Indoor Scene. In Association for the Advancement of Artificial Intelligence, 2018.

PDF

(2017). Stacked Learning to Search for Scene Labeling. In IEEE Transactions on Image Processing, 2017.

PDF

(2017). Forest Change Detection in Incomplete Satellite Images With Deep Neural Networks. In IEEE Transactions on Geoscience and Remote Sensing, 2017.

PDF

(2017). Predicting Salient Face in Multiple-face Videos. In IEEE Conference on Computer Vision and Pattern Recognition, 2017.

PDF

(2017). Learning deep structured network for weakly supervised change detection. In International Joint Conference on Artificial Intelligence, 2017.

PDF

(2017). Deep Free-Form Deformation Network for Object-Mask Registration. In International Conference on Computer Vision, 2017.

PDF

(2017). Learning Spatial Transforms for Refining Object Segment Proposals. In IEEE Winter Conference on Applications of Computer Vision, 2017.

PDF

(2017). Indoor Scene Parsing with Instance Segmentation, Semantic Labeling and Support Relationship Inference. In IEEE Conference on Computer Vision and Pattern Recognition, 2017.

PDF

(2017). Efficient Scene Layout Aware Object Detection for Traffic Surveillance. In IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017.

PDF

(2017). Boundary-aware Instance Segmentation. In IEEE Conference on Computer Vision and Pattern Recognition, 2017.

PDF

(2016). Contour Completion without Region Segmentation. In IEEE Transactions on Image Processing, 2016.

PDF

(2016). SentiCap: Generating Image Descriptions with Sentiments. In AAAI Conference on Artificial Intelligence, 2016.

PDF

(2016). Semantic Context and Depth-aware Object Proposal Generation. In IEEE International Conference on Image Processing, 2016.

PDF

(2016). Object-Aware Dictionary Learning with Deep Features. In IEEE Asian Conference on Computer Vision, 2016.

PDF

(2016). Learning to Generate Object Segment Proposals with Multi-modal Cue. In IEEE Asian Conference on Computer Vision, 2016.

PDF

(2016). Learning to Co-Generate Object Proposals with a Deep Structured Network. In IEEE Conference on Computer Vision and Pattern Recognition, 2016.

PDF

(2016). Learning Dynamic Hierarchical Models for Anytime Scene Labeling. In IEEE European Conference on Computer Vision, 2016.

PDF

(2016). Building Scene Models by Completing and Hallucinating Depth and Semantics. In IEEE European Conference on Computer Vision, 2016.

PDF