Each image patch gets converted to a 1D token embedding and fed into the LLM's input

ㅇㅇ · 2025.12.15
right?
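
Roughly, yes. Here is a minimal sketch of that idea in plain Python, just to make the shapes concrete. Everything in it is an assumption for illustration: the toy sizes (`H`, `W`, `C`, `PATCH`, `D_MODEL`), the helper names `patchify` and `project`, and the random weights standing in for a learned projection. Many real models also run the patches through a vision encoder before projecting them into the LLM's embedding space; this sketch skips that step.

```python
import random


def patchify(image, patch):
    """Split an H x W x C image (nested lists) into flattened patches, row-major."""
    h, w = len(image), len(image[0])
    patches = []
    for top in range(0, h, patch):
        for left in range(0, w, patch):
            # Flatten one patch into a single 1D vector of patch*patch*C values.
            flat = [
                value
                for row in image[top:top + patch]
                for pixel in row[left:left + patch]
                for value in pixel
            ]
            patches.append(flat)
    return patches


def project(vectors, weight):
    """Linear projection. `weight` holds d_out rows, each of length d_in."""
    return [
        [sum(x * w for x, w in zip(vec, row)) for row in weight]
        for vec in vectors
    ]


# Toy sizes (assumptions): a 4x4 RGB image, 2x2 patches, embedding width 8.
H, W, C, PATCH, D_MODEL = 4, 4, 3, 2, 8
rng = random.Random(0)

image = [[[rng.random() for _ in range(C)] for _ in range(W)] for _ in range(H)]

# Random weights stand in for a learned projection matrix.
weight = [
    [rng.gauss(0.0, 0.02) for _ in range(PATCH * PATCH * C)]
    for _ in range(D_MODEL)
]

# One embedding per patch: 4 patches, each of width D_MODEL.
image_tokens = project(patchify(image, PATCH), weight)

# Stand-in for text tokens that were already embedded by the LLM's own table.
text_tokens = [[rng.gauss(0.0, 0.02) for _ in range(D_MODEL)] for _ in range(3)]

# The LLM sees one sequence of same-width vectors: image tokens, then text tokens.
llm_input = image_tokens + text_tokens
```

The point is that after projection an image token and a text token are the same kind of object (a vector of width `D_MODEL`), so the LLM can process them in a single sequence.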