Hic Compartment Shift

by BIsnake2001

art

This skill performs A/B compartment shift analysis between two Hi-C samples.

Skill Details

Repository Files

1 file in this skill directory


name: hic-compartment-shift description: This skill performs A/B compartment shift analysis between two Hi-C samples.

Compartment shift Analysis


Overview

This skill performs A/B compartment shift analysis using PC1 eigenvector values extracted from Hi-C data, following the HOMER framework. It supports two conditions, each with two or more replicates, and uses the PC1 values (E1 column) from user-provided TSV files.

Major steps include:

  • Refer to Inputs & Outputs to verify necessary files.
  • Always prompt user for genome assembly used. Never decide by yourself.
  • Convert TSV (Chrom, start, end, weight, E1) into HOMER-compatible PC1 bedGraph files.
  • Generate a unified genomic bin list for annotatePeaks.
  • Extract PC1 values across all samples.
  • Perform differential PC1 analysis with replicate-aware limma statistics.
  • Produce differential compartment tables and stitched compartment-shift domains.

When to use this skill

Use this skill when you want to:

  • Detect compartment shifts between two conditions (e.g., cell type 1 vs cell type 2)
  • Identify statistically significant changes in PC1 values across genomic bins
  • Determine regions that flip between A and B compartments
  • Integrate compartment shift results with other genomic datasets

Inputs & Outputs

Inputs

Example input set:

  • CT1_rep1.tsv
  • CT1_rep2.tsv
  • CT2_rep1.tsv
  • CT2_rep2.tsv

Additional requirements:

  • All TSVs must share identical bins.

Outputs

compartments_shift_analysis/
    shift_regions/
        diff_PC1_CT2_vs_CT1.txt
        regions.*.txt # other region files output by the tools used.
    temp/
        bins_PC1.txt
        PC1_all_samples.txt
        *.bedGraph # other bedGraph file

Decision Tree

Step 1: Convert TSV files to PC1 bedGraph

awk 'BEGIN{OFS="	"} NR>1 && NF==5 {print $1, $2, $3, $5}' CT1_rep1.tsv > CT1_rep1.PC1.bedGraph

Step 2: Create a bin list for annotatePeaks

Use any one TSV as the template:

awk 'BEGIN{OFS="	"} NR>1 && NF==5 {print $1, $2, $3}' CT1_rep1.tsv > bins_PC1.txt

The resulting bins_PC1.txt defines genomic intervals for PC1 extraction.

Step 3: Compartment shift analysis

Call:

  • mcp_homer-tools__homer_differential_PC1

with:

  • bins_pc1_path: Path to the bins_PC1.txt file generated earlier,
  • genome: HOMER genome identifier, provided by user.
  • bedgraph_paths: List of PC1 bedGraph files in the exact replicate order (e.g., CT1_rep1, CT1_rep2, CT2_rep1, CT2_rep2).
  • experiment_labels: List of experiment group labels matching bedGraph order (e.g. ['CT1','CT1','CT2','CT2']).
  • merged_output_path: Output path for merged PC1 table. Empty → '<bins_pc1_path>.merged_PC1.txt'.
  • diff_output_path: Output path for differential PC1 table. Empty → 'diff_PC1.txt'.

Related Skills

Team Composition Analysis

This skill should be used when the user asks to "plan team structure", "determine hiring needs", "design org chart", "calculate compensation", "plan equity allocation", or requests organizational design and headcount planning for a startup.

artdesign

Startup Financial Modeling

This skill should be used when the user asks to "create financial projections", "build a financial model", "forecast revenue", "calculate burn rate", "estimate runway", "model cash flow", or requests 3-5 year financial planning for a startup.

art

Startup Metrics Framework

This skill should be used when the user asks about "key startup metrics", "SaaS metrics", "CAC and LTV", "unit economics", "burn multiple", "rule of 40", "marketplace metrics", or requests guidance on tracking and optimizing business performance metrics.

art

Market Sizing Analysis

This skill should be used when the user asks to "calculate TAM", "determine SAM", "estimate SOM", "size the market", "calculate market opportunity", "what's the total addressable market", or requests market sizing analysis for a startup or business opportunity.

art

Anndata

This skill should be used when working with annotated data matrices in Python, particularly for single-cell genomics analysis, managing experimental measurements with metadata, or handling large-scale biological datasets. Use when tasks involve AnnData objects, h5ad files, single-cell RNA-seq data, or integration with scanpy/scverse tools.

arttooldata

Geopandas

Python library for working with geospatial vector data including shapefiles, GeoJSON, and GeoPackage files. Use when working with geographic data for spatial analysis, geometric operations, coordinate transformations, spatial joins, overlay operations, choropleth mapping, or any task involving reading/writing/analyzing vector geographic data. Supports PostGIS databases, interactive maps, and integration with matplotlib/folium/cartopy. Use for tasks like buffer analysis, spatial joins between dat

artdatacli

Market Research Reports

Generate comprehensive market research reports (50+ pages) in the style of top consulting firms (McKinsey, BCG, Gartner). Features professional LaTeX formatting, extensive visual generation with scientific-schematics and generate-image, deep integration with research-lookup for data gathering, and multi-framework strategic analysis including Porter's Five Forces, PESTLE, SWOT, TAM/SAM/SOM, and BCG Matrix.

artdata

Plotly

Interactive scientific and statistical data visualization library for Python. Use when creating charts, plots, or visualizations including scatter plots, line charts, bar charts, heatmaps, 3D plots, geographic maps, statistical distributions, financial charts, and dashboards. Supports both quick visualizations (Plotly Express) and fine-grained customization (graph objects). Outputs interactive HTML or static images (PNG, PDF, SVG).

artdata

Excel Analysis

Analyze Excel spreadsheets, create pivot tables, generate charts, and perform data analysis. Use when analyzing Excel files, spreadsheets, tabular data, or .xlsx files.

artdata

Neurokit2

Comprehensive biosignal processing toolkit for analyzing physiological data including ECG, EEG, EDA, RSP, PPG, EMG, and EOG signals. Use this skill when processing cardiovascular signals, brain activity, electrodermal responses, respiratory patterns, muscle activity, or eye movements. Applicable for heart rate variability analysis, event-related potentials, complexity measures, autonomic nervous system assessment, psychophysiology research, and multi-modal physiological signal integration.

arttooldata

Skill Information

Category:Creative
Last Updated:11/23/2025