Customizing Vectors R

by jeremy-allen

skill

|

Skill Details

Repository Files

6 files in this skill directory


name: customizing-vectors-r description: | Type-stable vector operations and custom vector classes using vctrs. Use this skill when building R packages with custom types, need guaranteed output types regardless of input values, implementing consistent coercion/casting rules, or creating vector classes that work seamlessly with data frames. Covers when to use vctrs vs base R, building custom vector classes, coercion methods, and testing vctrs classes.

Customizing Vectors R

This skill covers type-stable operations and custom vector class design using the vctrs package.

Core Benefits

  • Type stability - Predictable output types regardless of input values
  • Size stability - Predictable output sizes from input sizes
  • Consistent coercion rules - Single set of rules applied everywhere
  • Robust class design - Proper S3 vector infrastructure

When to Use vctrs

Use vctrs when:

  1. Building custom vector classes - Data frame compatibility, subsetting
  2. Type-stable functions in packages - Guaranteed output types
  3. Consistent coercion/casting - Explicit rules, predictable behavior
  4. Size/length stability - Predictable sizing from inputs

Don't use vctrs when:

  • Simple one-off analyses - Base R is sufficient
  • No custom classes needed - Standard types work fine
  • Performance critical + simple operations - Base R may be faster
  • External API constraints - Must return base R types

vctrs vs Base R Decision Matrix

Use Case Base R vctrs When to Choose vctrs
Simple combining c() vec_c() Need type stability, consistent rules
Custom classes S3 manually new_vctr() Want data frame compatibility, subsetting
Type conversion as.*() vec_cast() Need explicit, safe casting
Finding common type Not available vec_ptype_common() Combining heterogeneous inputs
Size operations length() vec_size() Working with non-vector objects

Building Custom Vector Classes

See custom-vector-class.md for the complete pattern:

  • Constructor (low-level)
  • Helper (user-facing)
  • Format method

Type-Stable Functions

See type-stable-functions.md for:

  • Guaranteed output types with vec_cast()
  • Avoiding type-depends-on-data patterns

Coercion and Casting

See coercion-casting.md for:

  • Explicit casting with clear rules
  • Common type finding with vec_ptype_common()
  • Self-coercion methods
  • Cross-type coercion methods

Implementation Patterns

Coercion Methods

Define vec_ptype2.* methods for type compatibility:

vec_ptype2.pkg_percent.pkg_percent <- function(x, y, ...) new_percent()
vec_ptype2.pkg_percent.double <- function(x, y, ...) double()

Casting Methods

Define vec_cast.* methods for conversion:

vec_cast.pkg_percent.double <- function(x, to, ...) new_percent(x)
vec_cast.double.pkg_percent <- function(x, to, ...) vec_data(x)

See coercion-methods.md for complete examples.

Performance Considerations

When vctrs Adds Overhead

  • Simple operations - vec_c(1, 2) vs c(1, 2)
  • One-off scripts - Type safety less critical
  • Small vectors - Overhead may outweigh benefits

When vctrs Improves Performance

  • Package functions - Type stability prevents expensive re-computation
  • Complex classes - Consistent behavior reduces debugging
  • Data frame operations - Robust column type handling
  • Repeated operations - Predictable types enable optimization

Package Development

Exports and Dependencies

# DESCRIPTION
Imports: vctrs

# NAMESPACE - Import what you need
importFrom(vctrs, vec_assert, new_vctr, vec_cast, vec_ptype_common)

Testing vctrs Classes

See testing-vctrs.md for:

  • Type stability tests
  • Coercion tests

Key Insight

vctrs is most valuable in package development where type safety, consistency, and extensibility matter more than raw speed for simple operations.

source: Sarah Johnson's gist https://gist.github.com/sj-io/3828d64d0969f2a0f05297e59e6c15ad

Related Skills

Attack Tree Construction

Build comprehensive attack trees to visualize threat paths. Use when mapping attack scenarios, identifying defense gaps, or communicating security risks to stakeholders.

skill

Grafana Dashboards

Create and manage production Grafana dashboards for real-time visualization of system and application metrics. Use when building monitoring dashboards, visualizing metrics, or creating operational observability interfaces.

skill

Matplotlib

Foundational plotting library. Create line plots, scatter, bar, histograms, heatmaps, 3D, subplots, export PNG/PDF/SVG, for scientific visualization and publication figures.

skill

Scientific Visualization

Create publication figures with matplotlib/seaborn/plotly. Multi-panel layouts, error bars, significance markers, colorblind-safe, export PDF/EPS/TIFF, for journal-ready scientific plots.

skill

Seaborn

Statistical visualization. Scatter, box, violin, heatmaps, pair plots, regression, correlation matrices, KDE, faceted plots, for exploratory analysis and publication figures.

skill

Shap

Model interpretability and explainability using SHAP (SHapley Additive exPlanations). Use this skill when explaining machine learning model predictions, computing feature importance, generating SHAP plots (waterfall, beeswarm, bar, scatter, force, heatmap), debugging models, analyzing model bias or fairness, comparing models, or implementing explainable AI. Works with tree-based models (XGBoost, LightGBM, Random Forest), deep learning (TensorFlow, PyTorch), linear models, and any black-box model

skill

Pydeseq2

Differential gene expression analysis (Python DESeq2). Identify DE genes from bulk RNA-seq counts, Wald tests, FDR correction, volcano/MA plots, for RNA-seq analysis.

skill

Query Writing

For writing and executing SQL queries - from simple single-table queries to complex multi-table JOINs and aggregations

skill

Pydeseq2

Differential gene expression analysis (Python DESeq2). Identify DE genes from bulk RNA-seq counts, Wald tests, FDR correction, volcano/MA plots, for RNA-seq analysis.

skill

Scientific Visualization

Meta-skill for publication-ready figures. Use when creating journal submission figures requiring multi-panel layouts, significance annotations, error bars, colorblind-safe palettes, and specific journal formatting (Nature, Science, Cell). Orchestrates matplotlib/seaborn/plotly with publication styles. For quick exploration use seaborn or plotly directly.

skill

Skill Information

Category:Skill
Last Updated:1/11/2026