Abstract: The language-guided robot grasping task requires a robot agent to integrate multimodal information from both visual and linguistic inputs to predict actions for target-driven grasping. While ...
Abstract: Detecting distant small objects under adverse visual conditions such as rain, fog, or low light remains a critical challenge in autonomous driving. To address this issue, we propose a novel ...
The Adorant figurine from Geißenklösterle Cave, approximately 38,000 years old, consists of a small ivory plate bearing an anthropomorphic figure and multiple sequences of notches and dots. The ...
WS-DETR is a real-time traffic object detection model built upon the RT-DETR framework. The model incorporates two key innovations: Wavelet-Mamba Dual Path Block (WM-Dual Block) for improving ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results