Xlsx
by 404kidwiz
Expert in automating Excel workflows using Node.js (ExcelJS, SheetJS) and Python (pandas, openpyxl).
name: xlsx
description: Expert in automating Excel workflows using Node.js (ExcelJS, SheetJS) and Python (pandas, openpyxl).
XLSX Skill
Purpose
Provides expertise in creating, reading, and modifying Excel spreadsheets and in automating the workflows around them. Specializes in programmatic spreadsheet manipulation with ExcelJS, SheetJS, pandas, and openpyxl for data processing, reporting, and automation.
When to Use
- Creating Excel reports programmatically
- Reading and parsing XLSX files
- Modifying existing spreadsheets while preserving formatting
- Automating repetitive Excel tasks
- Converting between CSV and XLSX formats
- Building Excel templates with formulas
- Extracting data from complex spreadsheets
- Generating formatted financial or data reports
Quick Start
Invoke this skill for any of the tasks listed under When to Use.
Do NOT invoke when:
- Creating Google Sheets → use the Google Sheets API instead
- Building Excel add-ins → use an appropriate .NET or JavaScript skill
- Data analysis without Excel output → use data-analyst
- CSV-only operations → use csv-data-wrangler
Decision Framework
Excel Task?
├── Node.js Environment → ExcelJS (full-featured) or SheetJS (parsing)
├── Python Environment → openpyxl (Excel) or pandas (data + Excel)
├── Heavy Data Processing → pandas with openpyxl engine
├── Complex Formatting → ExcelJS or openpyxl
├── Template-Based → Fill existing templates with data
└── Large Files → Streaming readers (ExcelJS streaming, openpyxl read-only mode); pandas read_excel has no chunksize option
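As an illustration of the "Heavy Data Processing" branch, here is a minimal sketch that reads a sheet with pandas, aggregates it, and writes the result back through the openpyxl engine. The function name `summarize_sales` and the column names `region` and `amount` are hypothetical and exist only for this example.

```python
import io

import pandas as pd


def summarize_sales(source, dest):
    """Total the hypothetical `amount` column per `region` and write it to `dest`.

    `source` and `dest` may be file paths or binary file-like objects.
    """
    # Read the first sheet; engine="openpyxl" makes the .xlsx parser explicit.
    df = pd.read_excel(source, sheet_name=0, engine="openpyxl")

    # Aggregate in pandas, where the heavy data work belongs.
    totals = df.groupby("region", as_index=False)["amount"].sum()

    # index=False keeps the DataFrame index out of the spreadsheet.
    totals.to_excel(dest, sheet_name="Totals", index=False, engine="openpyxl")
    return totals
```

Passing `io.BytesIO()` objects instead of paths keeps the whole round trip in memory, which can be convenient in tests or web handlers.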
Core Workflows
1. Excel Report Generation (Node.js)
- Initialize ExcelJS workbook
- Create worksheets with appropriate names
- Define columns with headers and widths
- Add data rows from source
- Apply styling (fonts, borders, fills)
- Add formulas for calculations
- Set print area and page setup
- Write to a file, buffer, or stream
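The workflow above targets ExcelJS. To keep the examples in this document in one language, the sketch below walks through the same steps with openpyxl in Python; it is an analog, not the ExcelJS API. The function name `build_report`, the sheet name, the column layout, and the colors are assumptions made for illustration.

```python
import io

from openpyxl import Workbook
from openpyxl.styles import Border, Font, PatternFill, Side


def build_report(rows):
    """Return the bytes of a one-sheet report built from (item, qty, price) rows."""
    wb = Workbook()
    ws = wb.active
    ws.title = "Sales"

    # Header row and column widths.
    ws.append(["Item", "Qty", "Unit Price", "Total"])
    for letter, width in zip("ABCD", (24, 10, 14, 14)):
        ws.column_dimensions[letter].width = width

    # Data rows, each with a per-row formula in column D.
    for item, qty, price in rows:
        r = ws.max_row + 1
        ws.append([item, qty, price, f"=B{r}*C{r}"])

    # Grand total below the data (assumes at least one data row).
    last = ws.max_row
    ws.append(["Total", None, None, f"=SUM(D2:D{last})"])

    # Header styling: bold font, grey fill, bottom border.
    thin = Side(style="thin")
    for cell in ws[1]:
        cell.font = Font(bold=True)
        cell.fill = PatternFill(fill_type="solid", fgColor="DDDDDD")
        cell.border = Border(bottom=thin)

    # Print area and page setup.
    ws.print_area = f"A1:D{ws.max_row}"
    ws.page_setup.orientation = "landscape"

    # Save into an in-memory buffer; a file path works the same way.
    buf = io.BytesIO()
    wb.save(buf)
    return buf.getvalue()
```

The formulas are stored as text; their values appear once the file is opened in Excel.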
2. Spreadsheet Data Extraction (Python)
- Load workbook with openpyxl or pandas
- Identify data ranges and headers
- Handle merged cells and formatting
- Extract data into structured format
- Validate and clean extracted data
- Handle multiple sheets if needed
- Convert to desired output format
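A minimal sketch of the extraction steps above, assuming the first non-empty row holds the headers and that a merged cell's value should apply to every cell in its range. The function name `extract_records` is hypothetical.

```python
import io

from openpyxl import Workbook, load_workbook


def extract_records(source, sheet_name=None):
    """Return the sheet's rows as a list of dicts keyed by the header row."""
    # data_only=True returns cached formula results instead of formula text.
    # Files never opened in Excel have no cached results, so those cells are None.
    wb = load_workbook(source, data_only=True)
    ws = wb[sheet_name] if sheet_name else wb.active

    # Map every cell inside a merged range to the range's top-left value.
    merged = {}
    for rng in ws.merged_cells.ranges:
        anchor = ws.cell(row=rng.min_row, column=rng.min_col).value
        for r in range(rng.min_row, rng.max_row + 1):
            for c in range(rng.min_col, rng.max_col + 1):
                merged[(r, c)] = anchor

    table = []
    for row in ws.iter_rows():
        values = [merged.get((cell.row, cell.column), cell.value) for cell in row]
        # Basic cleaning: trim whitespace on text cells.
        values = [v.strip() if isinstance(v, str) else v for v in values]
        # Skip rows that are entirely empty.
        if any(v is not None for v in values):
            table.append(values)

    if not table:
        return []
    header, *body = table
    return [dict(zip(header, row)) for row in body]
```

To cover multiple sheets, the same function can be called once per name in `wb.sheetnames`.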
3. Template-Based Reporting
- Create master template with formatting and formulas
- Load template workbook
- Identify data insertion points
- Insert data while preserving formulas
- Update any date/reference cells
- Recalculate formulas if needed (openpyxl stores formulas but does not evaluate them; Excel recalculates when the file is opened)
- Save as new file (preserve template)
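The template steps above can be sketched with openpyxl as follows. The function name `fill_template`, its parameters, and the cell references used in the usage note are hypothetical; a real template would define its own insertion points.

```python
import io

from openpyxl import Workbook, load_workbook


def fill_template(template, output, values, date_cell=None, report_date=None):
    """Copy `template` to `output`, writing `values` ({cell_ref: value}) into it."""
    # Default data_only=False keeps formulas as formula strings.
    wb = load_workbook(template)
    ws = wb.active

    for ref, value in values.items():
        cell = ws[ref]
        # data_type "f" marks a formula cell; refuse to overwrite it.
        if cell.data_type == "f":
            raise ValueError(f"{ref} holds a formula and will not be overwritten")
        cell.value = value

    # Update the report date cell, if the template has one.
    if date_cell is not None:
        ws[date_cell] = report_date

    # Saving to a different target leaves the template itself untouched.
    wb.save(output)
```

For example, `fill_template("template.xlsx", "report.xlsx", {"B2": 100, "B3": 40})` would fill two input cells and leave a formula such as `=B2-B3` in place for Excel to recalculate on open.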
Best Practices
- Use streaming mode for large files to manage memory
- Preserve existing styles when modifying files
- Validate data types before writing to cells
- Handle merged cells explicitly
- Use named ranges for maintainability
- Test output in the actual Excel application, not just by reading it back with a library
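For the streaming practice in the list above, here is one possible sketch using openpyxl's read-only and write-only modes, which process rows one at a time instead of holding the whole sheet in memory. The function name `copy_filtered_rows` and the `keep` predicate are hypothetical.

```python
import io

from openpyxl import Workbook, load_workbook


def copy_filtered_rows(source, dest, keep):
    """Stream rows from `source` to `dest`, keeping data rows where keep(row) is true.

    Returns the number of data rows written (the header is not counted).
    """
    src = load_workbook(source, read_only=True)
    out = Workbook(write_only=True)
    # Write-only workbooks start without sheets, so one must be created.
    ws_out = out.create_sheet("Filtered")

    written = 0
    try:
        rows = src.active.iter_rows(values_only=True)
        header = next(rows, None)
        if header is not None:
            ws_out.append(header)
        for row in rows:
            if keep(row):
                ws_out.append(row)
                written += 1
    finally:
        # Read-only workbooks keep the file open until closed explicitly.
        src.close()

    # A write-only workbook can be saved only once.
    out.save(dest)
    return written
```

Styles and merged cells are not carried over in this sketch; it copies values only.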
Anti-Patterns
- Loading huge files into memory → Use streaming readers
- Hardcoding cell references → Use named ranges or dynamic lookup
- Ignoring data types → Explicitly set number, date, text types
- Overwriting formulas → Check cell types before writing
- Missing error handling → Handle corrupted/password-protected files
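Two of the fixes above, explicit data types and error handling on load, can be sketched as small helpers. The names `typed_value` and `open_workbook_safely` and the `kind` labels are hypothetical. The exception list reflects what openpyxl and `zipfile` typically raise for unreadable input and may not be exhaustive; encrypted workbooks are not ZIP archives, so they usually surface as `BadZipFile`.

```python
import io
import zipfile
from datetime import date

from openpyxl import load_workbook
from openpyxl.utils.exceptions import InvalidFileException


def typed_value(raw, kind):
    """Convert a raw string to the declared cell type before writing it."""
    if kind == "text":
        return str(raw)          # keeps leading zeros, e.g. "007"
    if kind == "number":
        return float(raw)        # raises ValueError on bad input
    if kind == "date":
        return date.fromisoformat(raw)
    raise ValueError(f"unknown kind: {kind!r}")


def open_workbook_safely(source):
    """Load a workbook, turning low-level failures into one clear error."""
    try:
        return load_workbook(source)
    except (InvalidFileException, zipfile.BadZipFile, KeyError) as exc:
        # Corrupted, non-xlsx, or encrypted files end up here.
        raise ValueError(f"not a readable .xlsx workbook: {exc}") from exc
```

Declaring the type per column, rather than guessing it from each value, avoids surprises such as identifiers losing their leading zeros.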
