Hypothesis Test

by astoreyai

data

Guide selection and interpretation of statistical hypothesis tests. Use when: (1) Choosing appropriate test for research data, (2) Checking assumptions before analysis, (3) Interpreting test results correctly, (4) Reporting statistical findings, (5) Troubleshooting assumption violations.

Skill Details

Repository Files

1 file in this skill directory


name: hypothesis-test description: "Guide selection and interpretation of statistical hypothesis tests. Use when: (1) Choosing appropriate test for research data, (2) Checking assumptions before analysis, (3) Interpreting test results correctly, (4) Reporting statistical findings, (5) Troubleshooting assumption violations." allowed-tools: Read, Write version: 1.0.0

Hypothesis Testing Skill

Purpose

Guide appropriate selection and interpretation of statistical hypothesis tests for research data analysis.

Test Selection Decision Tree

Step 1: How many variables?

One variable:

  • Categorical → Chi-square goodness of fit
  • Continuous → One-sample t-test

Two variables:

  • Both categorical → Chi-square test of independence
  • One categorical, one continuous → T-test or ANOVA
  • Both continuous → Correlation or regression

Three+ variables:

  • Multiple predictors → Multiple regression or ANOVA
  • Complex designs → Mixed models or advanced methods

Step 2: Check assumptions

For t-tests:

  1. Independence of observations
  2. Normality (especially for small N)
  3. Homogeneity of variance

Violations?

  • Non-normal → Mann-Whitney U (non-parametric)
  • Unequal variance → Welch's t-test
  • Dependent observations → Paired t-test or mixed models

For ANOVA:

  1. Independence
  2. Normality
  3. Homogeneity of variance
  4. No outliers

Violations?

  • Non-normal → Kruskal-Wallis test
  • Unequal variance → Welch's ANOVA
  • Outliers → Robust methods or transformation

Step 3: Interpret results

Always report:

  1. Test statistic (t, F, χ²)
  2. Degrees of freedom
  3. p-value
  4. Effect size with CI
  5. Descriptive statistics

Example:

Independent samples t-test showed a significant difference between 
groups, t(98) = 3.45, p < .001, d = 0.69, 95% CI [0.29, 1.09]. 
The experimental group (M = 45.2, SD = 8.3) scored higher than 
control (M = 37.8, SD = 9.1).

Common Tests Reference

Research Question Test Assumptions
2 groups, continuous outcome Independent t-test Normality, equal variance
2 measurements, same people Paired t-test Normality of differences
3+ groups, one factor One-way ANOVA Normality, homogeneity
3+ groups, multiple factors Factorial ANOVA Normality, homogeneity
Relationship between variables Pearson correlation Linearity, normality
Predict continuous outcome Linear regression Linearity, normality of residuals
2 categorical variables Chi-square test Expected frequencies ≥5
Ordinal data, 2 groups Mann-Whitney U None (non-parametric)
Ordinal data, paired Wilcoxon signed-rank None (non-parametric)

Assumption Checking

Normality

Visual: Q-Q plot, histogram
Statistical: Shapiro-Wilk test (N < 50), Kolmogorov-Smirnov (N ≥ 50)
Guideline: Robust to moderate violations if N ≥ 30

Homogeneity of Variance

Visual: Box plots, residual plots
Statistical: Levene's test, Bartlett's test
Guideline: Ratio of largest/smallest variance < 4

Independence

Check: Research design, data collection
Red flags: Time series, clustered data, repeated measures
Solution: Use appropriate model (mixed effects, GEE)

Integration

Use with data-analyst agent for complete statistical analysis workflow and experiment-designer agent for planning appropriate analyses.


Version: 1.0.0

Related Skills

Xlsx

Comprehensive spreadsheet creation, editing, and analysis with support for formulas, formatting, data analysis, and visualization. When Claude needs to work with spreadsheets (.xlsx, .xlsm, .csv, .tsv, etc) for: (1) Creating new spreadsheets with formulas and formatting, (2) Reading or analyzing data, (3) Modify existing spreadsheets while preserving formulas, (4) Data analysis and visualization in spreadsheets, or (5) Recalculating formulas

data

Clickhouse Io

ClickHouse database patterns, query optimization, analytics, and data engineering best practices for high-performance analytical workloads.

datacli

Clickhouse Io

ClickHouse database patterns, query optimization, analytics, and data engineering best practices for high-performance analytical workloads.

datacli

Analyzing Financial Statements

This skill calculates key financial ratios and metrics from financial statement data for investment analysis

data

Data Storytelling

Transform data into compelling narratives using visualization, context, and persuasive structure. Use when presenting analytics to stakeholders, creating data reports, or building executive presentations.

data

Kpi Dashboard Design

Design effective KPI dashboards with metrics selection, visualization best practices, and real-time monitoring patterns. Use when building business dashboards, selecting metrics, or designing data visualization layouts.

designdata

Dbt Transformation Patterns

Master dbt (data build tool) for analytics engineering with model organization, testing, documentation, and incremental strategies. Use when building data transformations, creating data models, or implementing analytics engineering best practices.

testingdocumenttool

Sql Optimization Patterns

Master SQL query optimization, indexing strategies, and EXPLAIN analysis to dramatically improve database performance and eliminate slow queries. Use when debugging slow queries, designing database schemas, or optimizing application performance.

designdata

Anndata

This skill should be used when working with annotated data matrices in Python, particularly for single-cell genomics analysis, managing experimental measurements with metadata, or handling large-scale biological datasets. Use when tasks involve AnnData objects, h5ad files, single-cell RNA-seq data, or integration with scanpy/scverse tools.

arttooldata

Xlsx

Spreadsheet toolkit (.xlsx/.csv). Create/edit with formulas/formatting, analyze data, visualization, recalculate formulas, for spreadsheet processing and analysis.

tooldata

Skill Information

Category:Data
Version:1.0.0
Allowed Tools:Read, Write
Last Updated:11/11/2025