Hypothesis Test
by astoreyai
Guide selection and interpretation of statistical hypothesis tests. Use when: (1) Choosing appropriate test for research data, (2) Checking assumptions before analysis, (3) Interpreting test results correctly, (4) Reporting statistical findings, (5) Troubleshooting assumption violations.
Skill Details
Repository Files
1 file in this skill directory
name: hypothesis-test description: "Guide selection and interpretation of statistical hypothesis tests. Use when: (1) Choosing appropriate test for research data, (2) Checking assumptions before analysis, (3) Interpreting test results correctly, (4) Reporting statistical findings, (5) Troubleshooting assumption violations." allowed-tools: Read, Write version: 1.0.0
Hypothesis Testing Skill
Purpose
Guide appropriate selection and interpretation of statistical hypothesis tests for research data analysis.
Test Selection Decision Tree
Step 1: How many variables?
One variable:
- Categorical → Chi-square goodness of fit
- Continuous → One-sample t-test
Two variables:
- Both categorical → Chi-square test of independence
- One categorical, one continuous → T-test or ANOVA
- Both continuous → Correlation or regression
Three+ variables:
- Multiple predictors → Multiple regression or ANOVA
- Complex designs → Mixed models or advanced methods
Step 2: Check assumptions
For t-tests:
- Independence of observations
- Normality (especially for small N)
- Homogeneity of variance
Violations?
- Non-normal → Mann-Whitney U (non-parametric)
- Unequal variance → Welch's t-test
- Dependent observations → Paired t-test or mixed models
For ANOVA:
- Independence
- Normality
- Homogeneity of variance
- No outliers
Violations?
- Non-normal → Kruskal-Wallis test
- Unequal variance → Welch's ANOVA
- Outliers → Robust methods or transformation
Step 3: Interpret results
Always report:
- Test statistic (t, F, χ²)
- Degrees of freedom
- p-value
- Effect size with CI
- Descriptive statistics
Example:
Independent samples t-test showed a significant difference between
groups, t(98) = 3.45, p < .001, d = 0.69, 95% CI [0.29, 1.09].
The experimental group (M = 45.2, SD = 8.3) scored higher than
control (M = 37.8, SD = 9.1).
Common Tests Reference
| Research Question | Test | Assumptions |
|---|---|---|
| 2 groups, continuous outcome | Independent t-test | Normality, equal variance |
| 2 measurements, same people | Paired t-test | Normality of differences |
| 3+ groups, one factor | One-way ANOVA | Normality, homogeneity |
| 3+ groups, multiple factors | Factorial ANOVA | Normality, homogeneity |
| Relationship between variables | Pearson correlation | Linearity, normality |
| Predict continuous outcome | Linear regression | Linearity, normality of residuals |
| 2 categorical variables | Chi-square test | Expected frequencies ≥5 |
| Ordinal data, 2 groups | Mann-Whitney U | None (non-parametric) |
| Ordinal data, paired | Wilcoxon signed-rank | None (non-parametric) |
Assumption Checking
Normality
Visual: Q-Q plot, histogram
Statistical: Shapiro-Wilk test (N < 50), Kolmogorov-Smirnov (N ≥ 50)
Guideline: Robust to moderate violations if N ≥ 30
Homogeneity of Variance
Visual: Box plots, residual plots
Statistical: Levene's test, Bartlett's test
Guideline: Ratio of largest/smallest variance < 4
Independence
Check: Research design, data collection
Red flags: Time series, clustered data, repeated measures
Solution: Use appropriate model (mixed effects, GEE)
Integration
Use with data-analyst agent for complete statistical analysis workflow and experiment-designer agent for planning appropriate analyses.
Version: 1.0.0
Related Skills
Xlsx
Comprehensive spreadsheet creation, editing, and analysis with support for formulas, formatting, data analysis, and visualization. When Claude needs to work with spreadsheets (.xlsx, .xlsm, .csv, .tsv, etc) for: (1) Creating new spreadsheets with formulas and formatting, (2) Reading or analyzing data, (3) Modify existing spreadsheets while preserving formulas, (4) Data analysis and visualization in spreadsheets, or (5) Recalculating formulas
Clickhouse Io
ClickHouse database patterns, query optimization, analytics, and data engineering best practices for high-performance analytical workloads.
Clickhouse Io
ClickHouse database patterns, query optimization, analytics, and data engineering best practices for high-performance analytical workloads.
Analyzing Financial Statements
This skill calculates key financial ratios and metrics from financial statement data for investment analysis
Data Storytelling
Transform data into compelling narratives using visualization, context, and persuasive structure. Use when presenting analytics to stakeholders, creating data reports, or building executive presentations.
Kpi Dashboard Design
Design effective KPI dashboards with metrics selection, visualization best practices, and real-time monitoring patterns. Use when building business dashboards, selecting metrics, or designing data visualization layouts.
Dbt Transformation Patterns
Master dbt (data build tool) for analytics engineering with model organization, testing, documentation, and incremental strategies. Use when building data transformations, creating data models, or implementing analytics engineering best practices.
Sql Optimization Patterns
Master SQL query optimization, indexing strategies, and EXPLAIN analysis to dramatically improve database performance and eliminate slow queries. Use when debugging slow queries, designing database schemas, or optimizing application performance.
Anndata
This skill should be used when working with annotated data matrices in Python, particularly for single-cell genomics analysis, managing experimental measurements with metadata, or handling large-scale biological datasets. Use when tasks involve AnnData objects, h5ad files, single-cell RNA-seq data, or integration with scanpy/scverse tools.
Xlsx
Spreadsheet toolkit (.xlsx/.csv). Create/edit with formulas/formatting, analyze data, visualization, recalculate formulas, for spreadsheet processing and analysis.
