智能新闻 The Counterintuitive Challenge of Post-Training in Multimodal Models The Synergy Dilemma of Long-Chain SFT and RL 2025年8月3日 Introduction In the …