Benchmark Report Creator
by cskiro
Use PROACTIVELY when creating research reports, experiment writeups, technical whitepapers, or empirical study documentation. Orchestrates the complete benchmark report pipeline with structure, diagrams, hi-res PNG capture, and PDF export. Provides working scripts, CSS templates, and complete command sequences for publication-quality AI/ML benchmark reports. Not for slides, blog posts, or simple README files.
Skill Details
Repository Files
6 files in this skill directory
name: benchmark-report-creator description: Use PROACTIVELY when creating research reports, experiment writeups, technical whitepapers, or empirical study documentation. Orchestrates the complete benchmark report pipeline with structure, diagrams, hi-res PNG capture, and PDF export. Provides working scripts, CSS templates, and complete command sequences for publication-quality AI/ML benchmark reports. Not for slides, blog posts, or simple README files. author: "Connor" category: benchmarking
Benchmark Report Creator
Overview
End-to-end orchestrator for creating publication-quality benchmark reports. This skill coordinates the full pipeline rather than duplicating individual component skills.
┌─────────────────────────────────────────────────────────────────────────┐
│ BENCHMARK REPORT PIPELINE │
├─────────────────────────────────────────────────────────────────────────┤
│ │
│ 1. STRUCTURE 2. DIAGRAMS 3. EXPORT │
│ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ │
│ │ Markdown │ │ HTML diagram │ │ pandoc + │ │
│ │ template │───►│ with academic│─────►│ weasyprint │ │
│ │ (sections) │ │ styling │ │ (final PDF) │ │
│ └──────────────┘ └──────┬───────┘ └──────────────┘ │
│ │ │
│ ▼ │
│ ┌──────────────┐ │
│ │ Playwright │ ◄── High-res PNG capture │
│ │ script │ (2x deviceScaleFactor) │
│ │ (hi-res PNG) │ │
│ └──────────────┘ │
│ │
└─────────────────────────────────────────────────────────────────────────┘
Key Capabilities:
- Complete pipeline orchestration (not component duplication)
- Working hi-res PNG capture script (fixes Playwright CLI limitations)
- Exact CSS templates from paralleLLM empathy v1.0/v2.0
- Full command sequences with copy-paste examples
When to Use This Skill
Trigger Phrases:
- "create a benchmark report"
- "write up my AI experiment"
- "publication-quality benchmark documentation"
- "full report with diagrams and PDF"
- "empathy experiment style report"
Use Cases:
- AI/ML benchmark evaluation reports
- Model comparison studies
- Empirical experiment writeups
- Research documentation with figures
NOT for:
- Single diagram creation (use html-diagram-creator)
- Simple markdown to PDF (use markdown-to-pdf-converter)
- ASCII diagrams (use ascii-diagram-creator)
Quick Start
Prerequisites
# Install all dependencies
brew install pandoc
pip install weasyprint
npm install playwright
npx playwright install chromium
Complete Pipeline (3 commands)
# 1. Create HTML diagram
# [Use html-diagram-creator skill or copy template]
# 2. Capture diagram to high-res PNG
node scripts/capture-diagram.js diagram.html figures/figure-1.png
# 3. Convert markdown report to PDF
pandoc report.md --standalone --css=templates/pdf-style.css -t html | weasyprint - report.pdf
Pipeline Phases
Phase 1: Document Structure
Use the report-creator pattern or copy from reference/report-template.md:
| Section | Content |
|---|---|
| Abstract | 150-250 word summary |
| Executive Summary | Key finding + metrics table |
| 1. Background | Research context, hypotheses |
| 2. Methodology | Design, variables, protocol |
| 3. Results | Statistics, observations |
| 4. Discussion | Hypothesis evaluation |
| 5. Limitations | Methodological constraints |
| 6. Future Work | Research directions |
| 7. Conclusion | Synthesis |
| Appendices | Supporting materials |
Phase 2: Create Diagrams
Use the html-diagram-creator pattern for publication-quality HTML diagrams.
Color Palette (Academic Standard):
| Stage | Fill | Border | Usage |
|---|---|---|---|
| Data Preparation | #E3F2FD |
#1976D2 |
Input processing |
| Execution | #E8F5E9 |
#388E3C |
API calls, inference |
| Analysis | #FFF3E0 |
#F57C00 |
Evaluation, scoring |
→ Full templates: reference/html-templates.md
Phase 3: Capture to PNG (CRITICAL)
Important: The Playwright CLI --device-scale-factor flag does NOT exist. Use the provided Node.js script instead:
# Working high-res capture (2x retina quality)
node scripts/capture-diagram.js diagram.html output.png
# The script handles:
# - deviceScaleFactor: 2 (retina quality)
# - .diagram-container selector targeting
# - Proper file:// protocol handling
Why not CLI? Playwright's CLI screenshot command doesn't support --device-scale-factor. The bundled script uses the Playwright API directly.
Phase 4: Embed in Markdown
<figure style="margin: 2em auto; page-break-inside: avoid; text-align: center;">
<img src="figures/figure-1.png" alt="Description" style="max-width: 100%; height: auto;">
<figcaption style="text-align: center; font-style: italic; margin-top: 1em;">
Figure 1: Caption text.
</figcaption>
</figure>
Phase 5: Export to PDF
# Two-step (for debugging)
pandoc report.md -o report.html --standalone --css=templates/pdf-style.css
weasyprint report.html report.pdf
# One-liner (production)
pandoc report.md --standalone --css=templates/pdf-style.css -t html | weasyprint - report.pdf
File Structure
benchmark-report-creator/
├── SKILL.md # This file (orchestrator)
├── templates/
│ └── pdf-style.css # Academic CSS (empathy v1.0/v2.0)
├── scripts/
│ └── capture-diagram.js # Working hi-res PNG capture
├── reference/
│ ├── report-template.md # Full markdown template
│ ├── html-templates.md # Diagram HTML templates
│ └── command-reference.md # All commands in one place
└── examples/
└── sample-report/ # Complete working example
CSS Template Standards
The bundled templates/pdf-style.css follows empathy v1.0/v2.0 conventions:
Tables (Academic Style)
- Header: 2px solid top/bottom borders
- Body: 1px solid bottom borders
- Last row: 2px solid bottom border
page-break-inside: avoid
Figures
max-width: 100%(NOT min-width)margin: 2em autofor centeringpage-break-inside: avoid- Italic captions centered below
Page Control
- 2cm margins
- Headings:
page-break-after: avoid - Orphans/widows: 3 lines minimum
Troubleshooting
| Issue | Cause | Solution |
|---|---|---|
| Blurry PNG | Using CLI instead of script | Use scripts/capture-diagram.js |
| Image off-center | Missing margin auto | Add margin: 2em auto to figure |
| Table split | Missing page-break rule | CSS has this by default |
| Weasyprint warnings | Unsupported CSS props | Safe to ignore (gap, overflow-x) |
| Empty PNG | Wrong selector | Check .diagram-container exists |
Success Checklist
- All prerequisites installed (pandoc, weasyprint, playwright)
- Diagram HTML created with academic styling
- PNG captured at 2x resolution using script (not CLI)
- Markdown has proper figure tags with captions
- PDF renders without errors
- Tables don't split across pages
- Figures centered with captions
Related Skills (User Global)
These skills may be installed in user's ~/.claude/skills/:
| Skill | Purpose | When to Use Directly |
|---|---|---|
| report-creator | Document templates | Structure guidance only |
| html-diagram-creator | Diagram HTML | Single diagram creation |
| html-to-png-converter | PNG export reference | CLI-only workflows |
| markdown-to-pdf-converter | PDF pipeline | Non-benchmark documents |
This orchestrator combines all four into a single coherent workflow for benchmark reports.
Based on: paralleLLM empathy-experiment-v1.0.pdf and v2.0.pdf
Related Skills
Dbt Transformation Patterns
Master dbt (data build tool) for analytics engineering with model organization, testing, documentation, and incremental strategies. Use when building data transformations, creating data models, or implementing analytics engineering best practices.
Clinical Decision Support
Generate professional clinical decision support (CDS) documents for pharmaceutical and clinical research settings, including patient cohort analyses (biomarker-stratified with outcomes) and treatment recommendation reports (evidence-based guidelines with decision algorithms). Supports GRADE evidence grading, statistical analysis (hazard ratios, survival curves, waterfall plots), biomarker integration, and regulatory compliance. Outputs publication-ready LaTeX/PDF format optimized for drug develo
Scientific Schematics
Create publication-quality scientific diagrams using Nano Banana Pro AI with smart iterative refinement. Uses Gemini 3 Pro for quality review. Only regenerates if quality is below threshold for your document type. Specialized in neural network architectures, system diagrams, flowcharts, biological pathways, and complex scientific visualizations.
Mermaid Diagrams
Comprehensive guide for creating software diagrams using Mermaid syntax. Use when users need to create, visualize, or document software through diagrams including class diagrams (domain modeling, object-oriented design), sequence diagrams (application flows, API interactions, code execution), flowcharts (processes, algorithms, user journeys), entity relationship diagrams (database schemas), C4 architecture diagrams (system context, containers, components), state diagrams, git graphs, pie charts,
Diagram Generation
Mermaid diagram generation for architecture visualization, data flow diagrams, and component relationships. Use for documentation, PR descriptions, and architectural analysis.
Scientific Schematics
Create publication-quality scientific diagrams using Nano Banana Pro AI with smart iterative refinement. Uses Gemini 3 Pro for quality review. Only regenerates if quality is below threshold for your document type. Specialized in neural network architectures, system diagrams, flowcharts, biological pathways, and complex scientific visualizations.
Clinical Decision Support
Generate professional clinical decision support (CDS) documents for pharmaceutical and clinical research settings, including patient cohort analyses (biomarker-stratified with outcomes) and treatment recommendation reports (evidence-based guidelines with decision algorithms). Supports GRADE evidence grading, statistical analysis (hazard ratios, survival curves, waterfall plots), biomarker integration, and regulatory compliance. Outputs publication-ready LaTeX/PDF format optimized for drug develo
Materialize Docs
Materialize documentation for SQL syntax, data ingestion, concepts, and best practices. Use when users ask about Materialize queries, sources, sinks, views, or clusters.
Dbt Transformation Patterns
Master dbt (data build tool) for analytics engineering with model organization, testing, documentation, and incremental strategies. Use when building data transformations, creating data models, or implementing analytics engineering best practices.
Infographic Syntax Creator
Generate AntV Infographic syntax outputs. Use when asked to turn user content into the Infographic DSL (template selection, data structuring, theme), or to output `infographic <template>` plain syntax.
