Performing Regression Analysis
by jeremylongshore
|
Skill Details
Repository Files
4 files in this skill directory
name: performing-regression-analysis description: | This skill empowers Claude to perform regression analysis and modeling using the regression-analysis-tool plugin. It analyzes datasets, generates appropriate regression models (linear, polynomial, etc.), validates the models, and provides performance metrics. Use this skill when the user explicitly requests regression analysis, prediction based on data, or mentions terms like "linear regression," "polynomial regression," "regression model," or "predictive modeling." This skill is also helpful when the user needs to understand the relationship between variables in a dataset. allowed-tools: Read, Write, Edit, Grep, Glob, Bash version: 1.0.0
Overview
This skill enables Claude to analyze data, build regression models, and provide insights into the relationships between variables. It leverages the regression-analysis-tool plugin to automate the process and ensure best practices are followed.
How It Works
- Data Analysis: Claude analyzes the provided data to understand its structure and identify potential relationships between variables.
- Model Generation: Based on the data, Claude generates appropriate regression models (e.g., linear, polynomial).
- Model Validation: Claude validates the generated models to ensure their accuracy and reliability.
- Performance Reporting: Claude provides performance metrics and insights into the model's effectiveness.
When to Use This Skill
This skill activates when you need to:
- Perform regression analysis on a given dataset.
- Predict future values based on existing data using regression models.
- Understand the relationship between independent and dependent variables.
- Evaluate the performance of a regression model.
Examples
Example 1: Predicting House Prices
User request: "Can you build a regression model to predict house prices based on square footage and number of bedrooms?"
The skill will:
- Analyze the provided data on house prices, square footage, and number of bedrooms.
- Generate a regression model (likely multiple to compare) to predict house prices.
- Provide performance metrics such as R-squared and RMSE.
Example 2: Analyzing Sales Trends
User request: "I need to analyze the sales data for the past year and identify any trends using regression analysis."
The skill will:
- Analyze the provided sales data.
- Generate a regression model to identify trends and patterns in the sales data.
- Visualize the trend and report the equation and R-squared value.
Best Practices
- Data Preparation: Ensure the data is clean and preprocessed before performing regression analysis.
- Model Selection: Choose the appropriate regression model based on the data and the problem.
- Validation: Always validate the model to ensure its accuracy and reliability.
Integration
This skill works independently using the regression-analysis-tool plugin. It can be used in conjunction with other data analysis and visualization tools to provide a comprehensive understanding of the data.
Related Skills
Attack Tree Construction
Build comprehensive attack trees to visualize threat paths. Use when mapping attack scenarios, identifying defense gaps, or communicating security risks to stakeholders.
Grafana Dashboards
Create and manage production Grafana dashboards for real-time visualization of system and application metrics. Use when building monitoring dashboards, visualizing metrics, or creating operational observability interfaces.
Matplotlib
Foundational plotting library. Create line plots, scatter, bar, histograms, heatmaps, 3D, subplots, export PNG/PDF/SVG, for scientific visualization and publication figures.
Scientific Visualization
Create publication figures with matplotlib/seaborn/plotly. Multi-panel layouts, error bars, significance markers, colorblind-safe, export PDF/EPS/TIFF, for journal-ready scientific plots.
Seaborn
Statistical visualization. Scatter, box, violin, heatmaps, pair plots, regression, correlation matrices, KDE, faceted plots, for exploratory analysis and publication figures.
Shap
Model interpretability and explainability using SHAP (SHapley Additive exPlanations). Use this skill when explaining machine learning model predictions, computing feature importance, generating SHAP plots (waterfall, beeswarm, bar, scatter, force, heatmap), debugging models, analyzing model bias or fairness, comparing models, or implementing explainable AI. Works with tree-based models (XGBoost, LightGBM, Random Forest), deep learning (TensorFlow, PyTorch), linear models, and any black-box model
Pydeseq2
Differential gene expression analysis (Python DESeq2). Identify DE genes from bulk RNA-seq counts, Wald tests, FDR correction, volcano/MA plots, for RNA-seq analysis.
Query Writing
For writing and executing SQL queries - from simple single-table queries to complex multi-table JOINs and aggregations
Pydeseq2
Differential gene expression analysis (Python DESeq2). Identify DE genes from bulk RNA-seq counts, Wald tests, FDR correction, volcano/MA plots, for RNA-seq analysis.
Scientific Visualization
Meta-skill for publication-ready figures. Use when creating journal submission figures requiring multi-panel layouts, significance annotations, error bars, colorblind-safe palettes, and specific journal formatting (Nature, Science, Cell). Orchestrates matplotlib/seaborn/plotly with publication styles. For quick exploration use seaborn or plotly directly.
