SSkilltecabyclaudinhocode
Enviar skill
← Voltar para o catálogo

paperbanana

Documentos

Use quando o usuário precisar de diagramas acadêmicos, figuras de metodologia, gráficos estatísticos ou slides de apresentação a partir de descrições de texto ou arquivos de dados. Também use para avaliar figuras geradas em relação a referências.

31estrelas
Ver no GitHub ↗Autor: PlutoLeiLicença: MIT

PaperBanana - Academic Illustration Generator

Multi-agent pipeline (Retriever → Planner → Stylist → Visualizer → Critic) for publication-quality academic diagrams, statistical plots, and presentation slides.

API key: Set provider keys in PaperBanana project's .env file. Timeout: 300000 (5 min) for all generation commands.


Commands

All commands run from project root: cd <paperbanana_dir> && python -m paperbanana.cli <cmd>

Command Selection Decision Tree

Route user requests to the right subcommand before looking up parameters:

User intentSignal wordsSubcommand
方法论/架构/流程图 from text or PDF"method figure", "架构图", "流程图", "methodology", "pipeline diagram", "论文配图"generate
Statistical plot from data file"plot", "curve", "bar chart", "scatter", "heatmap", has CSV/JSONplot
Single presentation slide"slide", "一张幻灯片", "封面图", single prompt fileslide
Batch slide generation"all slides", "批量生成", "N 张幻灯片", prompts/ directoryslide-batch
Compare generated vs human reference"evaluate", "对比", "与参考图对比"evaluate
Manage reference dataset"download dataset", "清缓存"data
First-time provider config"setup", "配置 API key"setup

Ambiguous input: If user provides just a description with no subcommand signal, default to generate (see Argument Parsing table for details).

Out-of-scope: Pure code generation (matplotlib/seaborn script) is NOT paperbanana's job — those go to matplotlib / scientific-visualization skills. Paperbanana is for AI-driven image generation + critique loops.

Note (upstream sync pending): Upstream paperbanana CLI adds subcommands (plot-batch #123, sweep #118) and a claude_code VLM provider (#115) not yet reflected in this table. See the llmsresearch/paperbanana CHANGELOG for the authoritative CLI surface.

generate — Methodology Diagrams

python -m paperbanana.cli generate --input '<file>' --caption '<caption>' --optimize --verbose

When user provides inline text (no file): write to temp file, use as --input.

ParameterDefaultDescription
--input / -iPath to methodology text file or PDF (.pdf requires pip install 'paperbanana')
--caption / -cFigure caption / communicative intent
--output / -oautoOutput image path
--vlm-providergeminiVLM provider: gemini, anthropic, openai, bedrock, openrouter
--vlm-modelautoVLM model name
--image-providerautoImage gen provider: google_imagen, openai, bedrock, openrouter
--image-modelautoImage gen model name
--iterations / -n3Max critic rounds
--autooffLoop until critic is satisfied (safety cap via --max-iterations)
--max-iterations30Safety cap for --auto mode
--optimizeoffPreprocess inputs (parallel enrichment + caption sharpening)
--continueoffContinue from the latest run
--continue-runContinue from a specific run ID
--feedbackUser feedback for the critic when continuing a run
--aspect-ratio / -arautoTarget aspect ratio: 1:1, 2:3, 3:2, 3:4, 4:3, 9:16, 16:9, 21:9
--format / -fpngOutput format: png, jpeg, webp
--dry-runoffValidate inputs without making API calls
--exemplar-retrievaloffEnable external exemplar retrieval before planning
--seedRandom seed for reproducible generation
--verbose / -voffShow detailed agent progress and timing
--auto-download-dataoffAuto-download expanded reference set (~257MB) on first run
--venueAcademic venue style: neurips, icml, acl, ieee, custom
--pagesPage range for PDF input (e.g., 3-5)
--configPath to config YAML file

Venue styles: --venue neurips applies NeurIPS-specific methodology and plot style guides from data/guidelines/. Each venue has distinct color palettes, layout conventions, and typography expectations.

PDF input: --input paper.pdf --pages 3-5 extracts text from the specified pages as source context.

Exemplar advanced flags: --exemplar-retrieval enables retrieval; see generate --help for additional config flags (--exemplar-endpoint, --exemplar-mode, --exemplar-top-k, --exemplar-timeout, --exemplar-retries).

plot — Statistical Plots

python -m paperbanana.cli plot --data '<data.csv>' --intent '<intent>' --optimize --verbose
ParameterDefaultDescription
--data / -dPath to data file (CSV or JSON) [required]
--intentCommunicative intent for the plot [required]
--output / -oautoOutput image path
--vlm-providergeminiVLM provider
--iterations / -n3Refinement iterations
--format / -fpngOutput format
--aspect-ratio / -arautoTarget aspect ratio
--optimizeoffEnrich context and sharpen caption
--autooffLoop until critic satisfied
--verbose / -voffDetailed progress

slide — Presentation Slides

python -m paperbanana.cli slide --input '<prompt.md>' --resolution 4k
ParameterDefaultDescription
--input / -iPath to slide prompt markdown file [required]
--caption / -cautoSlide intent description
--output / -oautoOutput image path
--image-modelautoImage gen model
--vlm-modelautoVLM model name
--iterations / -n3Max critic rounds
--style / -sStyle preset name (see table below)
--list-stylesoffList all available style presets and exit
--resolution / -r4kOutput resolution: 1k, 2k, 4k
--configPath to config YAML file

slide-batch — Batch Slide Generation

python -m paperbanana.cli slide-batch --prompts-dir '<dir>' --resolution 4k
ParameterDefaultDescription
--prompts-dirDirectory containing slide prompt markdown files [required]
--output-dirautoOutput directory
--image-modelautoImage gen model
--style / -sStyle preset applied to all slides
--iterations / -n3Max critic rounds per slide
--resolution / -r4kOutput resolution

Style Presets (23 available)

Use --style <name> with slide or slide-batch. Use --list-styles to see all.

StyleSourceBest For
blueprintbaoyuArchitecture, system design, technical
chalkboardbaoyuClassroom, teaching, education
corporatebaoyuBusiness, investor, quarterly reports
minimalbaoyuExecutive briefings, clean/simple
sketch-notesbaoyuTutorials, guides, beginner content
watercolorbaoyuLifestyle, wellness, artistic
dark-atmosphericbaoyuEntertainment, gaming, cinematic
notionbaoyuSaaS, product, dashboards
bold-editorialbaoyuProduct launches, keynotes, marketing
editorial-infographicbaoyuScience communication, explainers
fantasy-animationbaoyuStorytelling, magical, children
intuition-machinebaoyuAcademic research, bilingual
pixel-artbaoyuGaming, retro, developer culture
scientificbaoyuBiology, chemistry, medical
vector-illustrationbaoyuCreative, children, flat design
vintagebaoyuHistorical, heritage, expedition
tech-keynoteelite-pptApple/Tesla premium minimalism
creative-boldelite-pptGoogle/Airbnb energetic innovation
`financial-e

Como adicionar

/plugin marketplace add PlutoLei/paperbanana-skill

O comando exato pode variar conforme o repositório. Confira o README no GitHub.

Comentários · Nenhum comentário

Entre para comentar. Entrar

  • Ainda não há comentários. Seja o primeiro.