Abstract: Multimodal feature fusion for object detection aims to obtain a more complete representation of object features by integrating information from multiple modalities. However, the main ...