InstructSAM is a training-free framework for Instruction-Oriented Object Counting, Detection, and Segmentation (InstructCDS). We construct EarthInstruct, an InstructCDS benchmark for remote sensing.
Abstract: Content creation applications have become a cornerstone of nextgeneration personal devices. A prime example is video generation, which involves generation, language encoding, editing, and ...