Improving Reasoning Capabilities in Small Models through Mixture-of-Layers Distillation with Stepwise Attention on Key Information - Yao Chen, Jiawei Sheng, Wenyuan Zhang, Tingwen Liu