Human Object Interaction Detection

HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models

This paper proposes an efficient HOI detection framework that leverages CLIP's knowledge for better generalization.

Pose-aware Multi-level Feature Network for Human Object Interaction Detection

Reasoning human object interactions is a core problem in human-centric scene understanding and detecting such relations poses a unique challenge to vision systems due to large variations in human-object configurations, multiple co-occurring relation …