Data Analytics Engineering
by vasilyu1983
Analytics engineering for reliable metrics and BI readiness. Build transformation layers, dimensional models, semantic metrics, data quality tests, and documentation. Use when you need dbt or SQL transformation strategy, metrics definition, or analytics data modeling.
Skill Details
Repository Files
1 file in this skill directory
name: data-analytics-engineering description: Analytics engineering for reliable metrics and BI readiness. Build transformation layers, dimensional models, semantic metrics, data quality tests, and documentation. Use when you need dbt or SQL transformation strategy, metrics definition, or analytics data modeling.
Data Analytics Engineering
Scope
- Define metrics, grains, and dimensional models.
- Build transformation layers and semantic models.
- Implement data quality tests and observability.
- Document datasets, lineage, and ownership.
- Align analytics outputs with BI and product needs.
Ask For Inputs
- Business metrics and decision use cases.
- Source systems, data freshness, and latency needs.
- Existing warehouse, tooling, and orchestration.
- Expected data volumes and change cadence.
- Governance requirements and access controls.
Workflow
- Define metric dictionary and grains.
- Design staging, intermediate, and mart layers.
- Model dimensions and facts with clear keys.
- Build semantic layer and metric definitions.
- Add tests for freshness, nulls, ranges, and duplicates.
- Document lineage, owners, and SLAs.
- Plan rollout, backfills, and validation checks.
Outputs
- Metric dictionary and semantic model.
- Data model with schema and grain definitions.
- Transformation plan and dbt or SQLMesh structure.
- Data quality test suite and alerting plan.
- Documentation and ownership map.
Quality Checks
- Keep metric definitions stable and versioned.
- Treat metrics as APIs: document changes, deprecate safely, and backfill deliberately.
- Define data contracts for core tables (schema, freshness, keys) to control downstream breakage.
- Avoid mixed grains in a single model.
- Ensure tests cover critical joins and aggregates.
- Validate against source of truth and historical baselines.
Templates
assets/metric-dictionary.mdfor metric definitions and owners.assets/semantic-layer-spec.mdfor entities, measures, and dimensions.assets/data-quality-test-plan.mdfor test coverage planning.
Resources
references/modeling-patterns.mdfor modeling guidance and data quality patterns.references/tool-comparison-2026.mdfor dbt vs SQLMesh vs Coalesce decision matrix.data/sources.jsonfor curated vendor docs and trend-tracking sources (use as a WebSearch seed list).
Related Skills
- Use data-lake-platform for platform architecture.
- Use data-sql-optimization for query tuning.
- Use ai-ml-data-science for modeling and experiments.
Trend Awareness Protocol
IMPORTANT: When users ask recommendation questions about analytics engineering, data modeling, or BI, you MUST use WebSearch to check current trends before answering. If WebSearch is unavailable, use data/sources.json + web browsing and state what you verified vs assumed.
Trigger Conditions
- "What's the best tool for [analytics engineering/data modeling/BI]?"
- "What should I use for [transformation/semantic layer/metrics]?"
- "What's the latest in analytics engineering?"
- "Current best practices for [dbt/metrics layers/data quality]?"
- "Is [tool/approach] still relevant in 2026?"
- "[dbt] vs [SQLMesh] vs [other]?"
- "Best BI tool for [use case]?"
- "SQLMesh acquisition" or "Fivetran transformation"
- "Agentic analytics" or "AI data workflows"
- "Metric debt" or "metric governance"
Required Searches
- Search:
"analytics engineering best practices 2026" - Search:
"[dbt/SQLMesh/semantic layer] vs alternatives 2026" - Search:
"analytics engineering trends January 2026" - Search:
"[specific tool] new releases 2026" - Search:
"agentic analytics AI data 2026"(for AI-related queries)
What to Report
After searching, provide:
- Current landscape: What analytics tools/patterns are popular NOW
- Emerging trends: New tools, patterns, or standards gaining traction
- Deprecated/declining: Tools/approaches losing relevance or support
- Recommendation: Based on fresh data, not just static knowledge
Example Topics (verify with fresh search)
- Transformation tools (dbt, SQLMesh, Coalesce)
- Semantic layers (dbt Semantic Layer, Cube, AtScale, warehouse-native)
- Metrics stores and headless BI
- Data quality tools (dbt tests, Elementary, dbt-expectations/Metaplane)
- BI platforms (Metabase, Superset, Lightdash, Hex)
- Data modeling patterns (dimensional, wide tables, activity schema)
- Analytics engineering workflows and CI/CD
- Agentic AI workflows for analytics
- Data mesh and domain-owned data products
Related Skills
Xlsx
Comprehensive spreadsheet creation, editing, and analysis with support for formulas, formatting, data analysis, and visualization. When Claude needs to work with spreadsheets (.xlsx, .xlsm, .csv, .tsv, etc) for: (1) Creating new spreadsheets with formulas and formatting, (2) Reading or analyzing data, (3) Modify existing spreadsheets while preserving formulas, (4) Data analysis and visualization in spreadsheets, or (5) Recalculating formulas
Clickhouse Io
ClickHouse database patterns, query optimization, analytics, and data engineering best practices for high-performance analytical workloads.
Clickhouse Io
ClickHouse database patterns, query optimization, analytics, and data engineering best practices for high-performance analytical workloads.
Analyzing Financial Statements
This skill calculates key financial ratios and metrics from financial statement data for investment analysis
Data Storytelling
Transform data into compelling narratives using visualization, context, and persuasive structure. Use when presenting analytics to stakeholders, creating data reports, or building executive presentations.
Kpi Dashboard Design
Design effective KPI dashboards with metrics selection, visualization best practices, and real-time monitoring patterns. Use when building business dashboards, selecting metrics, or designing data visualization layouts.
Dbt Transformation Patterns
Master dbt (data build tool) for analytics engineering with model organization, testing, documentation, and incremental strategies. Use when building data transformations, creating data models, or implementing analytics engineering best practices.
Sql Optimization Patterns
Master SQL query optimization, indexing strategies, and EXPLAIN analysis to dramatically improve database performance and eliminate slow queries. Use when debugging slow queries, designing database schemas, or optimizing application performance.
Clinical Decision Support
Generate professional clinical decision support (CDS) documents for pharmaceutical and clinical research settings, including patient cohort analyses (biomarker-stratified with outcomes) and treatment recommendation reports (evidence-based guidelines with decision algorithms). Supports GRADE evidence grading, statistical analysis (hazard ratios, survival curves, waterfall plots), biomarker integration, and regulatory compliance. Outputs publication-ready LaTeX/PDF format optimized for drug develo
Anndata
This skill should be used when working with annotated data matrices in Python, particularly for single-cell genomics analysis, managing experimental measurements with metadata, or handling large-scale biological datasets. Use when tasks involve AnnData objects, h5ad files, single-cell RNA-seq data, or integration with scanpy/scverse tools.
