AI ResearchKR

We Benchmarked MiniCPM-o 4.5 in Korean. Here's What Actually Happens.

We benchmarked MiniCPM-o 4.5's Korean performance side by side with English. Image descriptions, OCR, document extraction — what works, what breaks, and why the root cause is architecture, not prompts.

We Benchmarked MiniCPM-o 4.5 in Korean. Here's What Actually Happens.

We Benchmarked MiniCPM-o 4.5 in Korean. Here's What Actually Happens.

MiniCPM-o 4.5 is an omni model optimized for English and Chinese. How well does it handle Korean?

We tested with the same images, same questions — one in Korean, one in English, side by side. Image description, OCR, document extraction, and fine-tuning, all tested hands-on.

The short answer: Korean works. But there are fascinating failure modes, and the root cause isn't what you'd expect.

Test Setup

ItemSpec
ModelMiniCPM-o 4.5 (BF16, 17.6GB VRAM)
Frameworktransformers 4.51.0, PyTorch 2.x
MethodSame image + semantically identical Korean/English prompts
Decodingsampling=True, temperature=0.7, repetition_penalty=1.2

System prompts were set per language:

python
system_prompts = {
    "ko": "당신은 한국어 전문 어시스턴트입니다. 반드시 한국어로만 답변하세요. 중국어, 영어, 러시아어 등 다른 언어를 섞지 마세요.",
    "en": "You are a helpful assistant. Respond only in English.",
}

What Works Well

Image Description: Eiffel Tower

🔒

Sign in to continue reading

Create a free account to access the full content.

Related Posts