Multimodal Learning

HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models

This paper proposes an efficient HOI detection framework that leverages CLIP's knowledge for better generalization.