RS-Neg is the first benchmark to evaluate negation comprehension in remote sensing tasks, covering both region-level and scene-level scenarios. It reveals that advanced remote sensing multimodal large language models (MLLMs) struggle with negation, exhibiting hallucinations and performance drops. NeFo, a test-time learning method, improves negation understanding using only 5% of the unlabeled test data and generalizes well to new tasks.
RS-Neg Benchmark and NeFo Method for Negation Understanding in Remote Sensing MLLMs