M2Chat: Empowering VLM for Multimodal LLM Interleaved

M2Chat is a novel unified multimodal LLM framework for generating interleaved text-image conversation across various scenarios.

BibTex: