A fundamental challenge for GUI agents is robustly grounding natural language instructions, which requires not only precise spatial alignment (locating elements accurately) but also correct semantic ...
InstaScene: Towards Complete 3D Instance Decomposition and Reconstruction from Cluttered Scenes, Zesong Yang, Bangbang Yang, Wenqi Dong, Chenxuan Cao, Liyuan Cui, Yuewen Ma, Zhaopeng Cui, Hujun Bao It ...
Abstract: Efficient image compression is crucial for remote sensing (RS) satellite systems, as it determines the performance of machine vision applications analyzing the downlinked image data at ...
Abstract: In recent years, the semantic segmentation of multimodal remote-sensing images using convolutional methods has received significant attention. Owing to the localized nature of convolutional ...