Altinity Expert ClickHouse Storage
by Altinity
Diagnose ClickHouse disk usage, compression efficiency, part sizes, and storage bottlenecks. Use for disk space issues and slow IO.
name: altinity-expert-clickhouse-storage
description: Diagnose ClickHouse disk usage, compression efficiency, part sizes, and storage bottlenecks. Use for disk space issues and slow IO.
Storage and Disk Usage Analysis
Diagnose disk usage, compression efficiency, part sizes, and storage bottlenecks.
Diagnostics
Run all queries from the file checks.sql and analyze the results.
Ad-Hoc Query Guidelines
Required Safeguards
-- Always limit results
limit 100
-- For part_log
where event_date >= today() - 1
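As a minimal sketch, the two safeguards above can be combined in one ad-hoc query. The example assumes `system.part_log` is enabled on the server; the choice of `MergeParts` events and the ordering by duration are illustrative, not taken from checks.sql.

```sql
-- Slowest recent merges from part_log, bounded in time and in row count.
select
    event_time,
    database,
    table,
    part_name,
    formatReadableSize(size_in_bytes) as size,
    duration_ms
from system.part_log
where event_date >= today() - 1      -- safeguard: bound the date range
  and event_type = 'MergeParts'      -- illustrative filter (assumption)
order by duration_ms desc
limit 100                            -- safeguard: always limit results
```

Keeping the `event_date` predicate first matters because part_log can grow large; the filter restricts the scan to the most recent data.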
Key Tables
- system.disks - Disk configuration
- system.parts - Part storage details
- system.columns - Column compression
- system.storage_policies - Tiered storage
- system.detached_parts - Orphaned parts
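The queries below are a hedged sketch of how some of these tables might be used for a first pass; they are not the contents of checks.sql. Column aliases such as `free_pct` and `ratio` are made up for illustration.

```sql
-- Free space per disk (system.disks).
select
    name,
    path,
    formatReadableSize(free_space) as free,
    formatReadableSize(total_space) as total,
    round(100 * free_space / total_space, 1) as free_pct
from system.disks
limit 100;

-- On-disk size and compression ratio per table (system.parts, active parts only).
select
    database,
    table,
    count() as parts,
    formatReadableSize(sum(bytes_on_disk)) as on_disk,
    round(sum(data_uncompressed_bytes) / sum(data_compressed_bytes), 2) as ratio
from system.parts
where active
group by database, table
order by sum(bytes_on_disk) desc
limit 100;

-- Detached parts grouped by reason (system.detached_parts).
select database, table, reason, count() as parts
from system.detached_parts
group by database, table, reason
order by parts desc
limit 100;
```

A low `ratio` on a large table is a hint to look at per-column figures in system.columns, which exposes the same compressed and uncompressed byte counters per column.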
Cross-Module Triggers
| Finding | Load Module | Reason |
|---|---|---|
| Poor compression | altinity-expert-clickhouse-schema | Codec recommendations |
| Many small parts | altinity-expert-clickhouse-merges | Merge backlog |
| High write IO | altinity-expert-clickhouse-ingestion | Batch sizing |
| System logs large | altinity-expert-clickhouse-logs | TTL configuration |
| Slow disk + merges | altinity-expert-clickhouse-merges | Merge optimization |
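One way to check for the "Many small parts" finding is sketched below. The threshold of 100 active parts per partition is a hypothetical cut-off chosen for illustration; a sensible value depends on the workload.

```sql
-- Partitions with a high count of active parts, with their average part size.
select
    database,
    table,
    partition_id,
    count() as parts,
    formatReadableSize(avg(bytes_on_disk)) as avg_part_size
from system.parts
where active
group by database, table, partition_id
having parts > 100        -- hypothetical threshold (assumption)
order by parts desc
limit 100
```

Many parts combined with a small `avg_part_size` suggests that merges are falling behind or that inserts arrive in small batches.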
Settings Reference
| Setting | Notes |
|---|---|
| min_bytes_for_wide_part | Threshold for Wide vs Compact parts |
| min_rows_for_wide_part | Row threshold for Wide parts |
| max_bytes_to_merge_at_max_space_in_pool | Max merge size |
| prefer_not_to_merge | Disable merges (emergency) |
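Current values can be inspected before changing anything. This is a minimal sketch, assuming the first three settings are read from system.merge_tree_settings and that prefer_not_to_merge refers to the per-volume storage policy option exposed in system.storage_policies.

```sql
-- Server-wide MergeTree defaults; changed = 1 means the value differs from the built-in default.
select name, value, changed
from system.merge_tree_settings
where name in (
    'min_bytes_for_wide_part',
    'min_rows_for_wide_part',
    'max_bytes_to_merge_at_max_space_in_pool'
);

-- Per-volume merge behaviour in tiered storage policies.
select policy_name, volume_name, disks, prefer_not_to_merge
from system.storage_policies
limit 100;
```

Individual tables can override the MergeTree defaults in their own `SETTINGS` clause, so the values shown here may not apply to every table.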
