Altinity Expert Clickhouse Memory
by Altinity
Diagnose ClickHouse RAM usage, OOM errors, memory pressure, and allocation patterns. Use for memory-related issues and out-of-memory errors.
Skill Details
Repository Files
2 files in this skill directory
name: altinity-expert-clickhouse-memory description: Diagnose ClickHouse RAM usage, OOM errors, memory pressure, and allocation patterns. Use for memory-related issues and out-of-memory errors.
Memory Usage and OOM Diagnostics
Diagnose RAM usage, memory pressure, OOM risks, and memory allocation patterns.
Diagnostics
Run all queries from the file checks.sql and analyze the results.
Problem Investigation
High Memory from Aggregations
Solutions:
- Add
max_bytes_before_external_group_by - Use
max_threadspragma to limit parallelism - Restructure query to reduce group by cardinality
High Memory from JOINs
Solutions:
- Use
max_bytes_in_join - Consider
join_algorithm = 'partial_merge'or'auto' - Ensure smaller table on right side
Ad-Hoc Query Guidelines
Required Safeguards
-- Always time-bound log queries
where event_date >= today() - 1
-- Limit results
limit 100
Memory-Related Metrics
MemoryTracking- current tracked memoryMemoryResident- RSSOSMemoryTotal,OSMemoryFreeWithoutCached- system memory
Cross-Module Triggers
| Finding | Load Module | Reason |
|---|---|---|
| High merge memory | altinity-expert-clickhouse-merges |
Analyze merge patterns |
| Large dictionaries | altinity-expert-clickhouse-dictionaries |
Dictionary optimization |
| Cache too large | altinity-expert-clickhouse-caches |
Cache sizing |
| PK memory high | altinity-expert-clickhouse-schema |
ORDER BY optimization |
| Query OOMs | altinity-expert-clickhouse-reporting |
Query optimization |
Settings Reference
| Setting | Scope | Notes |
|---|---|---|
max_memory_usage |
Query | Per-query limit |
max_memory_usage_for_user |
User | Per-user aggregate |
max_server_memory_usage |
Server | Global limit |
max_server_memory_usage_to_ram_ratio |
Server | Auto-limit as % of RAM |
max_bytes_before_external_group_by |
Query | Spill aggregation to disk |
max_bytes_in_join |
Query | Spill join to disk |
Related Skills
Clickhouse Io
ClickHouse database patterns, query optimization, analytics, and data engineering best practices for high-performance analytical workloads.
Clickhouse Io
ClickHouse database patterns, query optimization, analytics, and data engineering best practices for high-performance analytical workloads.
Clinical Decision Support
Generate professional clinical decision support (CDS) documents for pharmaceutical and clinical research settings, including patient cohort analyses (biomarker-stratified with outcomes) and treatment recommendation reports (evidence-based guidelines with decision algorithms). Supports GRADE evidence grading, statistical analysis (hazard ratios, survival curves, waterfall plots), biomarker integration, and regulatory compliance. Outputs publication-ready LaTeX/PDF format optimized for drug develo
Geopandas
Python library for working with geospatial vector data including shapefiles, GeoJSON, and GeoPackage files. Use when working with geographic data for spatial analysis, geometric operations, coordinate transformations, spatial joins, overlay operations, choropleth mapping, or any task involving reading/writing/analyzing vector geographic data. Supports PostGIS databases, interactive maps, and integration with matplotlib/folium/cartopy. Use for tasks like buffer analysis, spatial joins between dat
Datacommons Client
Work with Data Commons, a platform providing programmatic access to public statistical data from global sources. Use this skill when working with demographic data, economic indicators, health statistics, environmental data, or any public datasets available through Data Commons. Applicable for querying population statistics, GDP figures, unemployment rates, disease prevalence, geographic entity resolution, and exploring relationships between statistical entities.
Clickhouse Io
ClickHouse database patterns, query optimization, analytics, and data engineering best practices for high-performance analytical workloads.
Geopandas
Python library for working with geospatial vector data including shapefiles, GeoJSON, and GeoPackage files. Use when working with geographic data for spatial analysis, geometric operations, coordinate transformations, spatial joins, overlay operations, choropleth mapping, or any task involving reading/writing/analyzing vector geographic data. Supports PostGIS databases, interactive maps, and integration with matplotlib/folium/cartopy. Use for tasks like buffer analysis, spatial joins between dat
Datacommons Client
Work with Data Commons, a platform providing programmatic access to public statistical data from global sources. Use this skill when working with demographic data, economic indicators, health statistics, environmental data, or any public datasets available through Data Commons. Applicable for querying population statistics, GDP figures, unemployment rates, disease prevalence, geographic entity resolution, and exploring relationships between statistical entities.
Clinical Decision Support
Generate professional clinical decision support (CDS) documents for pharmaceutical and clinical research settings, including patient cohort analyses (biomarker-stratified with outcomes) and treatment recommendation reports (evidence-based guidelines with decision algorithms). Supports GRADE evidence grading, statistical analysis (hazard ratios, survival curves, waterfall plots), biomarker integration, and regulatory compliance. Outputs publication-ready LaTeX/PDF format optimized for drug develo
Clickhouse Query
Run ClickHouse queries for analytics, metrics analysis, and event data exploration. Use when you need to query ClickHouse directly, analyze metrics, check event tracking data, or test query performance. Read-only by default.
