From Evaluation to Deployment — The Complete Fine-tuning Guide

Series: Part 1: LoRA Theory | Part 2: QLoRA + Korean | Part 3 (this post)
In Part 1 we covered LoRA fundamentals and ran our first fine-tuning. In Part 2 we tackled QLoRA and Korean dataset construction. Training is done. Now two questions remain:
- Did the model actually improve? (Evaluation)
- How do we serve it to users? (Deployment)
Part 3 walks through evaluation methodology, deployment options, and practical tips that tie the entire series together.
1. Evaluation Methodology
Evaluating a fine-tuned model breaks down into four axes.
Measuring Perplexity
Perplexity (PPL) is the most fundamental metric for language models. It measures "how well the model predicts the next token." Lower is better.
import torch
from torch.nn import CrossEntropyLoss
from transformers import AutoModelForCausalLM, AutoTokenizer
from datasets import load_dataset
def calculate_perplexity(model, tokenizer, dataset, max_length=1024):
    model.eval()
    total_loss = 0
    total_tokens = 0
    with torch.no_grad():
        for example in dataset:
            inputs = tokenizer(
                example["text"],
                return_tensors="pt",
                truncation=True,
                max_length=max_length,
            ).to(model.device)
            outputs = model(**inputs, labels=inputs["input_ids"])
            n_tokens = inputs["input_ids"].size(1)
            total_loss += outputs.loss.item() * n_tokens
            total_tokens += n_tokens
    avg_loss = total_loss / total_tokens
    return torch.exp(torch.tensor(avg_loss)).item()

# Usage example
eval_dataset = load_dataset("json", data_files="eval_data.jsonl", split="train")
ppl = calculate_perplexity(model, tokenizer, eval_dataset)
print(f"Perplexity: {ppl:.2f}")

An important caveat: PPL is only meaningful when compared on the same evaluation data. Measuring on training data will give an overfitted model artificially better numbers.
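One subtlety worth calling out: the function above weights each example's loss by its token count before exponentiating. Averaging per-example perplexities directly would overweight short sequences. A toy illustration in plain Python (the loss values are invented for the example):

```python
import math

# Toy (mean cross-entropy loss, token count) pairs -- illustrative values only
examples = [(2.0, 10), (3.0, 100)]

# Token-weighted average loss, as in calculate_perplexity above
total_loss = sum(loss * n for loss, n in examples)
total_tokens = sum(n for _, n in examples)
ppl = math.exp(total_loss / total_tokens)

# Naive per-example averaging overweights the short, easy example
naive_ppl = sum(math.exp(loss) for loss, _ in examples) / len(examples)

print(f"token-weighted PPL: {ppl:.2f}")   # ≈ 18.34
print(f"naive mean of PPLs: {naive_ppl:.2f}")  # ≈ 13.74
```

The long, harder example dominates the corpus-level PPL, as it should: it contributes 100 of the 110 tokens.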
KoBEST Benchmark (Korean Language Understanding)
KoBEST is the de facto standard for evaluating Korean language models. It consists of five tasks designed to test Korean-specific understanding.
from lm_eval import simple_evaluate
from lm_eval.models.huggingface import HFLM
lm = HFLM(pretrained=model, tokenizer=tokenizer, batch_size=8)
results = simple_evaluate(
    model=lm,
    tasks=["kobest_boolq", "kobest_copa", "kobest_wic",
           "kobest_hellaswag", "kobest_sentineg"],
    num_fewshot=5,
)
for task, metrics in results["results"].items():
    print(f"{task}: {metrics['acc,none']:.4f}")

Task-specific Evaluation
For domain-fine-tuned models, task-specific metrics matter more than general benchmarks.
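As a sanity check on what the rougeL score below actually measures: it is an F-measure over the longest common subsequence of tokens. A from-scratch toy sketch (illustration only; it assumes whitespace tokenization, while the rouge_score package handles real tokenization — and Korean text generally wants a morpheme-aware tokenizer):

```python
def lcs_len(a, b):
    # Dynamic-programming longest-common-subsequence length
    dp = [[0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i, x in enumerate(a, 1):
        for j, y in enumerate(b, 1):
            dp[i][j] = dp[i - 1][j - 1] + 1 if x == y else max(dp[i - 1][j], dp[i][j - 1])
    return dp[-1][-1]

def rouge_l_f1(reference, prediction):
    ref, pred = reference.split(), prediction.split()
    lcs = lcs_len(ref, pred)
    if lcs == 0:
        return 0.0
    precision, recall = lcs / len(pred), lcs / len(ref)
    return 2 * precision * recall / (precision + recall)

print(rouge_l_f1("the cat sat on the mat", "the cat is on the mat"))  # ≈ 0.833
```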
from rouge_score import rouge_scorer
scorer = rouge_scorer.RougeScorer(["rouge1", "rouge2", "rougeL"], use_stemmer=False)
def evaluate_summarization(model, tokenizer, eval_pairs):
    """eval_pairs: list of (input_text, reference_summary)"""
    scores = {"rouge1": [], "rouge2": [], "rougeL": []}
    for input_text, reference in eval_pairs:
        messages = [{"role": "user", "content": f"Summarize the following:\n{input_text}"}]
        inputs = tokenizer.apply_chat_template(
            messages, add_generation_prompt=True, return_tensors="pt"
        ).to("cuda")
        outputs = model.generate(inputs, max_new_tokens=256)
        # Decode only the newly generated tokens, not the prompt
        prediction = tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True)
        result = scorer.score(reference, prediction)
        for key in scores:
            scores[key].append(result[key].fmeasure)
    return {k: sum(v) / len(v) for k, v in scores.items()}

Human Evaluation Guidelines
Automated metrics only go so far. In particular, Korean naturalness, honorific consistency, and factual accuracy require human judgment. Recommendations for evaluation design:
- Number of evaluators: At least 3 (for inter-rater agreement)
- Rating scale: 1-5 Likert scale, scored per dimension (fluency / accuracy / relevance)
- Blind comparison: Present Base vs Fine-tuned outputs in randomized order
- Sample size: At least 50 examples (for statistical significance)
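To make the aggregation concrete, here is a minimal sketch of summarizing such ratings (the scores are invented for illustration; for reportable inter-rater agreement use Fleiss' kappa or Krippendorff's alpha rather than this crude within-1 check):

```python
from statistics import mean

# Toy ratings: per dimension, each inner list holds one example's
# scores from 3 raters on a 1-5 Likert scale (illustrative values)
ratings = {
    "fluency":  [[5, 4, 5], [3, 3, 4]],
    "accuracy": [[4, 4, 4], [2, 3, 2]],
}

for dim, per_example in ratings.items():
    avg = mean(mean(r) for r in per_example)
    # Crude agreement proxy: share of examples where all raters are within 1 point
    agree = mean(1.0 if max(r) - min(r) <= 1 else 0.0 for r in per_example)
    print(f"{dim}: mean={avg:.2f}, within-1 agreement={agree:.0%}")
```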
Before / After Comparison
Below are results from QLoRA training on Qwen 2.5 7B with 3,000 Korean customer service examples. Note that these examples demonstrate Korean language improvement specifically.
Example 1: Refund request
Example 2: Technical support
Example 3: Product inquiry
Note: The original Korean examples show a dramatic shift in quality. The base model produced either stiff, template-like Korean or defaulted to English entirely (Example 2). The fine-tuned model responded in natural, polite Korean with specific and actionable details.
Benchmark comparison:
KoBEST (general Korean understanding) barely changed, but domain tasks improved dramatically. This is the essence of fine-tuning.
2. Merging LoRA Weights
Once training is complete, you can merge the LoRA adapter into the base model.
merge_and_unload()
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

# 1. Load base model
base_model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen2.5-7B-Instruct",
    torch_dtype=torch.bfloat16,
    device_map="cpu",  # Merging on CPU is fine
)
# 2. Load LoRA adapter
model = PeftModel.from_pretrained(base_model, "./qwen25-qlora-ko-adapter")
# 3. Merge
model = model.merge_and_unload()
# 4. Save
model.save_pretrained("./qwen25-ko-merged")
tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-7B-Instruct")
tokenizer.save_pretrained("./qwen25-ko-merged")

merge_and_unload() computes $W' = W_0 + BA$ and folds everything into a single set of weights. The result is structurally identical to a standard HuggingFace model.
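Numerically, the merge just folds the low-rank product into the frozen weight. A toy sketch with a 2x2 weight and a rank-1 adapter (note that PEFT also applies the lora_alpha / r scaling, which is included here):

```python
# Toy merge: W' = W0 + (alpha / r) * B @ A, with a rank-1 adapter
r, alpha = 1, 2
W0 = [[1.0, 0.0],
      [0.0, 1.0]]
B = [[0.5], [0.25]]  # shape (2, r)
A = [[2.0, 4.0]]     # shape (r, 2)

scale = alpha / r
W_merged = [
    [W0[i][j] + scale * sum(B[i][k] * A[k][j] for k in range(r)) for j in range(2)]
    for i in range(2)
]
print(W_merged)  # [[3.0, 4.0], [1.0, 3.0]]
```

After merging, the B and A matrices are gone; inference pays no extra cost over the base model.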
Merged vs Adapter-separate
Recommendation: Merge for final deployment. Keep adapters separate during experimentation and A/B testing.
GGUF Conversion after Merging
Converting to GGUF — the llama.cpp-compatible format — lets you run the model directly in Ollama, LM Studio, and similar tools.
# Clone llama.cpp
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
pip install -r requirements.txt
# HuggingFace → GGUF conversion (to FP16 first; K-quants need a second step)
python convert_hf_to_gguf.py ../qwen25-ko-merged \
    --outtype f16 \
    --outfile qwen25-ko-F16.gguf
# Quantize to Q4_K_M with the llama-quantize binary built from llama.cpp
./llama-quantize qwen25-ko-F16.gguf qwen25-ko-Q4_K_M.gguf Q4_K_M

Quantization options — size vs quality tradeoff:
In most cases Q4_K_M offers the best balance between size and quality.
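A rough way to anticipate file sizes is parameters times bits per weight. The bits-per-weight figures below are approximate (assumed from llama.cpp's reported averages), and Qwen2.5-7B is taken as roughly 7.6B parameters:

```python
def gguf_size_gb(n_params_billion, bits_per_weight):
    """Rough GGUF file size: parameter count x bits per weight (ignores metadata)."""
    return n_params_billion * 1e9 * bits_per_weight / 8 / 1e9

# Approximate bits-per-weight for common llama.cpp quant types (assumed, rounded)
for name, bpw in [("Q8_0", 8.5), ("Q5_K_M", 5.7), ("Q4_K_M", 4.85), ("Q3_K_M", 3.9)]:
    print(f"{name}: ~{gguf_size_gb(7.6, bpw):.1f} GB")
```

So Q4_K_M lands near 4.6 GB for this model, which matches the sizes commonly seen for 7B-class GGUF files.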
3. Deployment Options
The model is ready — time to serve it. Here are the main approaches compared by use case.
Serving with vLLM
The most common approach for production environments. vLLM can load LoRA adapters directly without merging.
from vllm import LLM, SamplingParams
from vllm.lora.request import LoRARequest
# Load LoRA adapter directly
llm = LLM(
    model="Qwen/Qwen2.5-7B-Instruct",
    enable_lora=True,
    max_lora_rank=64,
)
sampling_params = SamplingParams(temperature=0.7, max_tokens=512)
# Run inference with the adapter
lora_request = LoRARequest("ko-cs", 1, "./qwen25-qlora-ko-adapter")
outputs = llm.generate(
    ["Dear customer, here is the information regarding your refund request."],
    sampling_params,
    lora_request=lora_request,
)
print(outputs[0].outputs[0].text)

The key advantage of vLLM is serving multiple LoRA adapters simultaneously. You can switch between customer service, technical documentation, and marketing adapters on a single server.
# Run as an OpenAI-compatible API server
vllm serve Qwen/Qwen2.5-7B-Instruct \
    --enable-lora \
    --lora-modules ko-cs=./qwen25-qlora-ko-adapter \
    --port 8000

Local Deployment with Ollama
The simplest option for personal use or internal team deployment. Requires a GGUF file.
# Modelfile
FROM ./qwen25-ko-Q4_K_M.gguf
TEMPLATE """{{ if .System }}<|im_start|>system
{{ .System }}<|im_end|>
{{ end }}<|im_start|>user
{{ .Prompt }}<|im_end|>
<|im_start|>assistant
"""
PARAMETER temperature 0.7
PARAMETER top_p 0.9
PARAMETER stop "<|im_end|>"
SYSTEM "You are a Korean customer service AI assistant. Always respond politely and with specific details."

# Create and run the model
ollama create qwen25-ko -f Modelfile
ollama run qwen25-ko "I'd like to check my delivery status"

HuggingFace Spaces (Gradio Demo)
Great for sharing prototypes or demoing to non-technical teams.
import gradio as gr
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline
model = AutoModelForCausalLM.from_pretrained(
    "./qwen25-ko-merged", torch_dtype="auto", device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained("./qwen25-ko-merged")
pipe = pipeline("text-generation", model=model, tokenizer=tokenizer)

def chat(message, history):
    # Rebuild the conversation in order: past turns first, current message last
    messages = []
    for user_msg, bot_msg in history:
        messages.append({"role": "user", "content": user_msg})
        messages.append({"role": "assistant", "content": bot_msg})
    messages.append({"role": "user", "content": message})
    prompt = tokenizer.apply_chat_template(
        messages, tokenize=False, add_generation_prompt=True
    )
    output = pipe(prompt, max_new_tokens=512, do_sample=True, temperature=0.7)
    # The pipeline echoes the prompt; return only the new assistant turn
    return output[0]["generated_text"][len(prompt):].strip()

demo = gr.ChatInterface(chat, title="Korean Customer Service AI")
demo.launch()

Deployment Options at a Glance
4. Practical Tips
To close out the series, here are recurring real-world issues and how to solve them.
Overfitting: Signs and Fixes
Overfitting happens with LoRA too. Be especially careful when data is scarce (under 1,000 examples).
Signs:
- Train loss keeps dropping while eval loss climbs
- Perfect on inputs similar to training data, but nonsensical on anything slightly different
- The model memorizes and regurgitates training sentences verbatim
Fixes:
- Increase lora_dropout to 0.1-0.15
- Reduce r (rank): 64 → 16
- Reduce training epochs: 3 → 1
- Increase data diversity (most effective)
- Apply early stopping
from transformers import EarlyStoppingCallback

# Early stopping requires training_args with eval_strategy="steps" (or "epoch"),
# load_best_model_at_end=True, and metric_for_best_model="eval_loss"
trainer = SFTTrainer(
    model=model,
    args=training_args,
    train_dataset=train_dataset,
    eval_dataset=eval_dataset,
    callbacks=[EarlyStoppingCallback(early_stopping_patience=3)],
)

Data Quality > Data Quantity
1,000 high-quality examples beat 10,000 noisy ones. Actual experimental results:
Data quality checklist:
- Are there any grammatical errors?
- Does the instruction precisely match the response?
- Are there duplicates? (More than 5% duplication degrades performance)
- Is response length consistent? (Mixing very short and very long answers causes instability)
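The duplicate check is easy to automate. A minimal sketch that counts exact duplicates after case and whitespace normalization (near-duplicate detection, e.g. MinHash, is out of scope here):

```python
import hashlib

def duplicate_rate(examples):
    """Fraction of examples whose normalized text has appeared earlier in the list."""
    seen, dups = set(), 0
    for text in examples:
        key = hashlib.md5(" ".join(text.lower().split()).encode()).hexdigest()
        if key in seen:
            dups += 1
        seen.add(key)
    return dups / len(examples)

data = ["Hello  world", "hello world", "Different example", "Another one"]
print(duplicate_rate(data))  # 0.25 -- "hello world" repeats after normalization
```

If the rate exceeds a few percent, deduplicate before training rather than hoping the model averages it out.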
Hyperparameter Guide
Recommended starting point: Begin with lr=2e-4, epochs=1, r=16, alpha=32, batch=16, warmup=0.1, then adjust based on eval loss.
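That starting point maps onto a config roughly like this (a sketch using the peft/trl names from Parts 1-2; realizing batch=16 as 4 x 4 gradient accumulation is an assumption, as are the Qwen target module names):

```python
from peft import LoraConfig
from trl import SFTConfig

peft_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
training_args = SFTConfig(
    output_dir="./qwen25-qlora-ko",
    learning_rate=2e-4,
    num_train_epochs=1,
    per_device_train_batch_size=4,
    gradient_accumulation_steps=4,  # effective batch size 16
    warmup_ratio=0.1,
    eval_strategy="steps",  # watch eval loss and adjust from here
)
```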
Multi-task LoRA: Adapter Switching
You can create multiple LoRA adapters for a single base model and switch between them per task.
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM

base_model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen2.5-7B-Instruct", torch_dtype=torch.bfloat16, device_map="auto"
)
# Adapter 1: Customer service
model = PeftModel.from_pretrained(base_model, "./adapter-customer-service")
# Run customer service inference ...

# Adapter 2: Switch to technical documentation
model.load_adapter("./adapter-tech-docs", adapter_name="tech")
model.set_adapter("tech")
# Run technical documentation inference ...

# Adapter 3: Switch to marketing copy
model.load_adapter("./adapter-marketing", adapter_name="marketing")
model.set_adapter("marketing")
# Run marketing copy inference ...

Base model 14 GB + adapter 52 MB x 3 = 14.15 GB total. Three specialized models running on a single GPU.
When NOT to Use LoRA
LoRA is not a silver bullet. In the following situations, consider alternatives.
5. Series Summary
The message running through all three parts: LoRA is the art of cost-efficiency. It cuts 99% of full fine-tuning costs while retaining 90-98% of the performance. You can fine-tune a 7B model on a free Colab GPU, and a single 52 MB adapter turns a general-purpose model into a domain expert.
Recommended path for those just getting started:
- First check whether prompt engineering alone is sufficient
- If not, build 1,000 high-quality training examples
- Fine-tune with QLoRA (see Part 2)
- Evaluate → augment data → retrain (iterate)
- Once satisfied, merge and deploy