PaperBanana: AI Now Generates Publication-Quality Academic Illustrations
PaperBanana from Google and Peking University is an agentic system that automatically generates publication-ready academic illustrations from paper text.

PaperBanana: AI Now Generates Publication-Quality Academic Illustrations

What's the most time-consuming part of writing a paper? Experiments? Writing? Many researchers point to "creating figures." Methodology diagrams, architecture schematics, result visualizations... each one can take hours or even days.
Researchers from Google and Peking University have released PaperBanana to solve this problem. It's an agentic system that automatically generates publication-ready academic illustrations from paper text alone.
Why Is This Needed?
We're in an era where LLMs help with paper writing, code generation, and experiment design. But figures? We still manually create them in PowerPoint, Figma, or TikZ.
- Half a day for one methodology diagram
- Start over when revision requests come in
- Maintaining consistent style is challenging
PaperBanana addresses this bottleneck.
How Does It Work?

PaperBanana uses four specialized agents working in collaboration:
1. Reference Retrieval Agent
- Searches figure styles from similar papers
- Identifies visual conventions in the field
- Collects reference layout patterns
2. Content Planning Agent
- Extracts key concepts from paper text
- Decides which elements to include
- Designs information hierarchy
3. Image Rendering Agent
- Leverages Vision-Language Models
- Generates actual images
- Places text, arrows, boxes, etc.
4. Iterative Refinement Agent
- Self-evaluates generated figures
- Identifies areas for improvement
- Iterates until quality criteria are met
Performance: PaperBananaBench

The team created a benchmark with 292 test cases collected from NeurIPS 2025 papers.
Evaluation criteria:
- Faithfulness: Does it match the paper content?
- Conciseness: Is it free of unnecessary elements?
- Readability: Is it easy to understand?
- Aesthetics: Does it look visually appealing?
PaperBanana outperformed existing methods across all criteria.
Extended Capabilities

Beyond methodology diagrams, it can also generate statistical plots:
- Experimental result graphs
- Comparison charts
- Data distribution visualizations
Limitations and Outlook
Current limitations:
- Complex 3D structures remain challenging
- Limited support for domain-specific notations
- Generation takes several minutes
However, it's significant as the final piece of AI research workflow automation. We're approaching an era where the entire paper writing process can be AI-assisted.
Resources
- Paper: PaperBanana: Automating Academic Illustration for AI Scientists
- Authors: Google Research, Peking University
Subscribe to Newsletter
Related Posts

Self-Evolving AI Agents — The New Paradigm of 2026
GenericAgent, Evolver, Open Agents — comparing 3 self-evolving agent frameworks that learn, adapt, and grow without human coding.

Paperclip — The Open-Source Framework for Running AI Agent Companies
30K GitHub stars in 3 weeks. An open-source multi-agent orchestration platform with org charts, budgets, and governance. Heartbeat scheduling, per-agent monthly budgets, and company templates.

MIRAGE — Do Multimodal AIs Actually "See" Images?
GPT-5.1, Gemini 3 Pro, and Claude Opus 4.5 retain 70-80% of benchmark scores without any image input. A 3B text-only model outperforms all multimodal models and radiologists on chest X-ray benchmarks. Stanford MIRAGE paper review.