WBCMor VQA: A Bilingual English-Urdu Hematology Visual Question Answering Benchmark
Researchers have introduced WBCMor VQA, a clinically validated bilingual benchmark for leukemia and normal white blood cell analysis in English and Urdu. This resource addresses the gap in multilingual healthcare technologies, particularly in regions like Pakistan where clinical documentation often mismatches patient communication languages. The dataset comprises 110,000 bilingual question-answer pairs annotated across 20,000 single-cell images of leukemic and normal white blood cells. To ensure linguistic consistency and clinical correctness, the benchmark utilizes morphology-aware annotations from the LeukemiaAttri and WBCAtt datasets alongside a domain-specific Urdu hematology dictionary. The study also highlights the limitations of existing English-centric vision-language resources in diverse healthcare environments. Baseline performance metrics were established by evaluating multiple open-source Vision Language Models on this new benchmark. This resource aims to facilitate the development of accessible AI systems for multilingual medical contexts.