Convert image patches to 1D token embeddings and feed them to the LLM as input

ㅇㅇ 2025.12.15
right?
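
A minimal sketch of the idea in the title, using only the Python standard library: cut the image into patches, flatten each patch to a 1D vector, project it to the LLM's embedding width, and put the resulting vectors in the same sequence as the text token embeddings. All names and sizes here (`patchify`, `linear`, `PATCH`, `D_MODEL`, the random weights) are made up for illustration and not taken from any particular model. Many real vision-language models also run the patches through a vision encoder before the projection; this sketch skips that step.

```python
import random

def patchify(image, patch):
    """Split an H x W x C image (nested lists) into flattened patches, row-major."""
    h, w = len(image), len(image[0])
    assert h % patch == 0 and w % patch == 0, "image size must be divisible by patch size"
    patches = []
    for top in range(0, h, patch):
        for left in range(0, w, patch):
            flat = [
                value
                for y in range(top, top + patch)
                for x in range(left, left + patch)
                for value in image[y][x]
            ]
            patches.append(flat)  # length = patch * patch * C
    return patches

def linear(vec, weight):
    """Project a vector with a (d_in x d_out) weight matrix, no bias."""
    d_out = len(weight[0])
    return [sum(v * weight[i][j] for i, v in enumerate(vec)) for j in range(d_out)]

rng = random.Random(0)

# Toy sizes (hypothetical): a 4x4 RGB image, 2x2 patches, embedding width 8.
H, W, C = 4, 4, 3
PATCH = 2
D_MODEL = 8

image = [[[rng.random() for _ in range(C)] for _ in range(W)] for _ in range(H)]

# Stand-in for a learned projection from flattened patch to embedding space.
proj = [[rng.gauss(0.0, 0.02) for _ in range(D_MODEL)] for _ in range(PATCH * PATCH * C)]

patches = patchify(image, PATCH)                     # 4 patches of length 12
image_tokens = [linear(p, proj) for p in patches]    # 4 vectors of length 8

# Stand-in for the embeddings of 5 text tokens (normally an embedding-table lookup).
text_tokens = [[rng.gauss(0.0, 0.02) for _ in range(D_MODEL)] for _ in range(5)]

# The LLM sees one sequence of vectors; image tokens sit alongside text tokens.
llm_input = image_tokens + text_tokens
print(len(image_tokens), len(llm_input), len(llm_input[0]))  # 4 9 8
```

So each patch ends up as one "token" whose embedding has the same width as a text token embedding, which is why the LLM can take both in a single input sequence.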