A systematic evaluation of vision-language models for observational astronomical reasoning tasks — Wenke Ren, Hengxiao Guo, Wenwen Zuo, Xiaoman Zhang