Omicverse Visualization For Bulk Color Systems And Single Cell D

by Starlitnightly

art

Guide users through OmicVerse plotting utilities showcased in the bulk, color system, and single-cell visualization tutorials, including venn/volcano charts, palette selection, and advanced embedding layouts.

Skill Details

Repository Files

2 files in this skill directory


name: omicverse-visualization-for-bulk-color-systems-and-single-cell-d title: OmicVerse visualization for bulk, color systems, and single-cell data description: Guide users through OmicVerse plotting utilities showcased in the bulk, color system, and single-cell visualization tutorials, including venn/volcano charts, palette selection, and advanced embedding layouts.

OmicVerse visualization for bulk, color systems, and single-cell data

Overview

Leverage this skill when a user wants help recreating or adapting plots from the OmicVerse plotting tutorials:

It covers how to configure OmicVerse's plotting style, choose colors from the Forbidden City palette, and generate bulk as well as single-cell specific figures.

Instructions

  1. Set up the plotting environment
    • Import omicverse as ov, matplotlib.pyplot as plt, and other libraries required by the user's request (pandas, seaborn, scanpy, etc.).
    • Call ov.ov_plot_set() (or ov.plot_set() depending on the installed version) to apply OmicVerse's default styling before generating figures.
    • Load example data via ov.read(...)/ov.pp.preprocess(...) or instruct users to supply their own AnnData/CSV files.
  2. Bulk RNA-seq visuals (t_visualize_bulk)
    • Use ov.pl.venn(sets=..., palette=...) to display overlaps among DEG lists (no more than 4 groups). Encourage setting sets as a dictionary of set names → gene lists.
    • For volcano plots, load the DEG table (result = ov.read('...csv')) and call ov.pl.volcano(result, pval_name='qvalue', fc_name='log2FoldChange', ...). Explain optional keyword arguments such as sig_pvalue, sig_fc, palette, and label formatting.
    • To compare group distributions with box plots, gather long-form data (e.g., from seaborn.load_dataset('tips')) and invoke ov.pl.boxplot(data, x_value=..., y_value=..., hue=..., ax=ax, palette=...). Mention how to adjust figure size, legend placement, and significance annotations.
  3. Color management (t_visualize_colorsystem)
    • Introduce the color book via fb = ov.pl.ForbiddenCity() and demonstrate fb.get_color(name='凝夜紫') for specific hues.
    • Show how to pull predefined palettes (ov.pl.green_color, ov.pl.red_color, etc.) and build dicts mapping cell types/groups to color hex codes.
    • For segmented gradients, combine colors and call ov.pl.get_cmap_seg(colors, name='custom'), then pass the colormap into Matplotlib/Scanpy plotting functions.
    • Highlight using these palettes in embeddings: ov.pl.embedding(adata, basis='X_umap', color='clusters', palette=color_dict, ax=ax).
  4. Single-cell visualizations (t_visualize_single)
    • Remind users to preprocess AnnData if needed (adata = ov.pp.preprocess(adata, mode='shiftlog|pearson', n_HVGs=2000)).
    • IMPORTANT - Data validation: Before plotting, always verify that required data exists:
      # Before plotting by clustering or other categorical variable
      color_col = 'leiden'  # or 'clusters', 'celltype', etc.
      if color_col not in adata.obs.columns:
          raise ValueError(f"Column '{color_col}' not found in adata.obs. Available columns: {list(adata.obs.columns)}")
      
      # Before plotting embeddings
      basis = 'X_umap'  # or 'X_pca', 'X_tsne', etc.
      if basis not in adata.obsm.keys():
          raise ValueError(f"Embedding '{basis}' not found in adata.obsm. Available embeddings: {list(adata.obsm.keys())}")
      
    • For palette optimization, use ov.pl.optim_palette(adata, basis='X_umap', colors='clusters') to auto-generate color schemes when categories clash.
    • Reproduce stacked proportions with ov.pl.cellproportion(adata, groupby='clusters', celltype_clusters='celltype', ax=ax) and transform into stacked area charts by setting kind='area'.
    • Showcase compound embedding utilities:
      • ov.pl.embedding_celltype to place counts/proportions alongside UMAPs.
      • ov.pl.ConvexHull or ov.pl.contour for highlighting regions of interest.
      • ov.pl.embedding_adjust to reposition legends automatically.
      • ov.pl.embedding_density for density overlays, controlling smoothness with adjust.
    • For spatial gene density, describe the workflow: ov.pl.calculate_gene_density(adata, genes=[...], basis='spatial'), then overlay with ov.pl.embedding(..., layer='gene_density', cmap='...').
    • Cover additional charts like ov.pl.single_group_boxplot, ov.pl.bardotplot, ov.pl.dotplot, and ov.pl.marker_heatmap, emphasizing input formats (long-form DataFrame vs. AnnData with .obs annotations) and optional helpers such as ov.pl.add_palue for manual p-value annotations.
  5. Finishing touches and exports
    • Encourage adding titles, axis labels, and fig.tight_layout() to prevent clipping.
    • Suggest saving figures with fig.savefig('plot.png', dpi=300, bbox_inches='tight') and documenting color mappings for reproducibility.
    • Troubleshoot common issues:
      • Missing AnnData keys: Always validate adata.obs columns and adata.obsm embeddings exist before plotting
      • Palette names not found: Verify color dictionaries match actual category values
      • Matplotlib font rendering: When using Chinese characters, ensure appropriate fonts are installed
      • "Could not find X in adata.obs": Check that clustering or annotation has been performed before trying to visualize results. Use defensive checks to compute missing prerequisites on-the-fly.

Examples

  • "Plot a three-set Venn diagram of overlapping DEG lists and reuse Forbidden City colors for consistency."
  • "Load the dentate gyrus AnnData, color clusters with fb.get_color selections, and render an embedding with adjusted legend placement."
  • "Generate single-cell proportion bar/area plots plus gene-density overlays using OmicVerse helper functions."

References

Related Skills

Team Composition Analysis

This skill should be used when the user asks to "plan team structure", "determine hiring needs", "design org chart", "calculate compensation", "plan equity allocation", or requests organizational design and headcount planning for a startup.

artdesign

Startup Financial Modeling

This skill should be used when the user asks to "create financial projections", "build a financial model", "forecast revenue", "calculate burn rate", "estimate runway", "model cash flow", or requests 3-5 year financial planning for a startup.

art

Startup Metrics Framework

This skill should be used when the user asks about "key startup metrics", "SaaS metrics", "CAC and LTV", "unit economics", "burn multiple", "rule of 40", "marketplace metrics", or requests guidance on tracking and optimizing business performance metrics.

art

Market Sizing Analysis

This skill should be used when the user asks to "calculate TAM", "determine SAM", "estimate SOM", "size the market", "calculate market opportunity", "what's the total addressable market", or requests market sizing analysis for a startup or business opportunity.

art

Anndata

This skill should be used when working with annotated data matrices in Python, particularly for single-cell genomics analysis, managing experimental measurements with metadata, or handling large-scale biological datasets. Use when tasks involve AnnData objects, h5ad files, single-cell RNA-seq data, or integration with scanpy/scverse tools.

arttooldata

Geopandas

Python library for working with geospatial vector data including shapefiles, GeoJSON, and GeoPackage files. Use when working with geographic data for spatial analysis, geometric operations, coordinate transformations, spatial joins, overlay operations, choropleth mapping, or any task involving reading/writing/analyzing vector geographic data. Supports PostGIS databases, interactive maps, and integration with matplotlib/folium/cartopy. Use for tasks like buffer analysis, spatial joins between dat

artdatacli

Market Research Reports

Generate comprehensive market research reports (50+ pages) in the style of top consulting firms (McKinsey, BCG, Gartner). Features professional LaTeX formatting, extensive visual generation with scientific-schematics and generate-image, deep integration with research-lookup for data gathering, and multi-framework strategic analysis including Porter's Five Forces, PESTLE, SWOT, TAM/SAM/SOM, and BCG Matrix.

artdata

Plotly

Interactive scientific and statistical data visualization library for Python. Use when creating charts, plots, or visualizations including scatter plots, line charts, bar charts, heatmaps, 3D plots, geographic maps, statistical distributions, financial charts, and dashboards. Supports both quick visualizations (Plotly Express) and fine-grained customization (graph objects). Outputs interactive HTML or static images (PNG, PDF, SVG).

artdata

Excel Analysis

Analyze Excel spreadsheets, create pivot tables, generate charts, and perform data analysis. Use when analyzing Excel files, spreadsheets, tabular data, or .xlsx files.

artdata

Neurokit2

Comprehensive biosignal processing toolkit for analyzing physiological data including ECG, EEG, EDA, RSP, PPG, EMG, and EOG signals. Use this skill when processing cardiovascular signals, brain activity, electrodermal responses, respiratory patterns, muscle activity, or eye movements. Applicable for heart rate variability analysis, event-related potentials, complexity measures, autonomic nervous system assessment, psychophysiology research, and multi-modal physiological signal integration.

arttooldata

Skill Information

Category:Creative
Last Updated:11/4/2025