Scientific Skills
by oimiragieo
Comprehensive scientific research toolkit with 139 specialized skills for biology, chemistry, medicine, data science, and computational research. Transforms Claude into an AI research assistant with access to scientific databases, analysis tools, and domain-specific workflows.
Skill Details
Repository Files
1059 files in this skill directory
name: scientific-skills description: Comprehensive scientific research toolkit with 139 specialized skills for biology, chemistry, medicine, data science, and computational research. Transforms Claude into an AI research assistant with access to scientific databases, analysis tools, and domain-specific workflows. license: MIT version: 2.17.0 metadata: skill-author: K-Dense Inc. original-repo: https://github.com/K-Dense-AI/claude-scientific-skills categories: - scientific-databases - bioinformatics - cheminformatics - machine-learning - data-analysis - scientific-writing skill-count: 139
Claude Scientific Skills
Overview
A comprehensive collection of 139 ready-to-use scientific skills that transform Claude into an AI research assistant capable of executing complex multi-step scientific workflows across biology, chemistry, medicine, and related fields.
When to Use
Invoke this skill when:
- Working on scientific research tasks
- Need access to specialized databases (PubMed, ChEMBL, UniProt, etc.)
- Performing bioinformatics or cheminformatics analysis
- Creating literature reviews or scientific documents
- Analyzing single-cell RNA-seq, proteomics, or multi-omics data
- Drug discovery and molecular analysis workflows
- Statistical analysis and machine learning on scientific data
Quick Start
// Invoke the main skill catalog
Skill({ skill: 'scientific-skills' });
// Or invoke specific sub-skills directly
Skill({ skill: 'scientific-skills/rdkit' }); // Cheminformatics
Skill({ skill: 'scientific-skills/scanpy' }); // Single-cell analysis
Skill({ skill: 'scientific-skills/biopython' }); // Bioinformatics
Skill({ skill: 'scientific-skills/literature-review' }); // Literature review
Skill Categories
Scientific Databases (28+)
| Skill | Description |
|---|---|
pubchem |
Chemical compound database |
chembl-database |
Bioactivity database for drug discovery |
uniprot-database |
Protein sequence and function database |
pdb |
Protein Data Bank structures |
drugbank-database |
Drug and drug target information |
kegg |
Pathway and genome database |
clinvar-database |
Clinical variant interpretations |
cosmic-database |
Cancer mutation database |
ensembl-database |
Genome browser and annotations |
geo-database |
Gene expression data |
gwas-database |
Genome-wide association studies |
reactome-database |
Biological pathways |
string-database |
Protein-protein interactions |
alphafold-database |
Protein structure predictions |
biorxiv-database |
Preprint server for biology |
clinicaltrials-database |
Clinical trial registry |
ena-database |
European Nucleotide Archive |
fda-database |
FDA drug approvals and labels |
gene-database |
Gene information from NCBI |
zinc-database |
Commercially available compounds |
brenda-database |
Enzyme database |
clinpgx-database |
Pharmacogenomics annotations |
uspto-database |
Patent database |
Python Analysis Libraries (55+)
| Skill | Description |
|---|---|
rdkit |
Cheminformatics toolkit |
scanpy |
Single-cell RNA-seq analysis |
anndata |
Annotated data matrices |
biopython |
Computational biology tools |
pytorch-lightning |
Deep learning framework |
scikit-learn |
Machine learning library |
transformers |
NLP and deep learning models |
pandas / polars / vaex |
Data manipulation |
matplotlib / seaborn / plotly |
Visualization |
deepchem |
Deep learning for chemistry |
esm |
Evolutionary Scale Modeling |
datamol |
Molecular data processing |
pymatgen |
Materials science |
qiskit |
Quantum computing |
pymoo |
Multi-objective optimization |
statsmodels |
Statistical modeling |
sympy |
Symbolic mathematics |
networkx |
Network analysis |
geopandas |
Geospatial analysis |
shap |
Model explainability |
Bioinformatics & Genomics
| Skill | Description |
|---|---|
gget |
Gene and transcript information |
pysam |
SAM/BAM file manipulation |
deeptools |
NGS data analysis |
pydeseq2 |
Differential expression |
scvi-tools |
Deep learning for single-cell |
etetoolkit |
Phylogenetic analysis |
scikit-bio |
Bioinformatics algorithms |
bioservices |
Web services for biology |
cellxgene-census |
Cell atlas exploration |
Cheminformatics & Drug Discovery
| Skill | Description |
|---|---|
rdkit |
Molecular manipulation |
datamol |
Molecular data handling |
molfeat |
Molecular featurization |
diffdock |
Molecular docking |
torchdrug |
Drug discovery ML |
pytdc |
Therapeutics data commons |
cobrapy |
Metabolic modeling |
Scientific Communication
| Skill | Description |
|---|---|
literature-review |
Systematic literature reviews |
scientific-writing |
Academic writing assistance |
scientific-schematics |
AI-generated figures |
scientific-slides |
Presentation generation |
hypothesis-generation |
Hypothesis development |
venue-templates |
Journal-specific formatting |
citation-management |
Reference management |
Clinical & Medical
| Skill | Description |
|---|---|
clinical-decision-support |
Clinical reasoning |
clinical-reports |
Medical report generation |
treatment-plans |
Treatment planning |
pyhealth |
Healthcare ML |
pydicom |
Medical imaging |
Laboratory & Integration
| Skill | Description |
|---|---|
benchling-integration |
Lab informatics platform |
dnanexus-integration |
Genomics cloud platform |
pylabrobot |
Laboratory automation |
flowio |
Flow cytometry data |
omero-integration |
Bioimaging platform |
Core Workflows
Literature Review Workflow
# 7-phase systematic literature review
# 1. Planning with PICO framework
# 2. Multi-database search execution
# 3. Screening with PRISMA flow
# 4. Data extraction and quality assessment
# 5. Thematic synthesis
# 6. Citation verification
# 7. PDF generation
Drug Discovery Workflow
# Using RDKit + ChEMBL + datamol
from rdkit import Chem
from rdkit.Chem import Descriptors, AllChem
# 1. Query ChEMBL for bioactivity data
# 2. Calculate molecular properties
# 3. Filter by drug-likeness (Lipinski)
# 4. Similarity screening
# 5. Substructure analysis
Single-Cell Analysis Workflow
# Using scanpy + anndata
import scanpy as sc
# 1. Load and QC data
# 2. Normalization and feature selection
# 3. Dimensionality reduction (PCA, UMAP)
# 4. Clustering (Leiden algorithm)
# 5. Marker gene identification
# 6. Cell type annotation
Hypothesis Generation Workflow
# 8-step systematic process
# 1. Understand phenomenon
# 2. Literature search
# 3. Synthesize evidence
# 4. Generate competing hypotheses
# 5. Evaluate quality
# 6. Design experiments
# 7. Formulate predictions
# 8. Generate report
Sub-Skill Structure
Each sub-skill follows a consistent structure:
scientific-skills/
├── SKILL.md # This file (catalog/index)
├── skills/ # Individual skill directories
│ ├── rdkit/
│ │ ├── SKILL.md # Skill documentation
│ │ ├── references/ # API references, patterns
│ │ └── scripts/ # Example scripts
│ ├── scanpy/
│ ├── biopython/
│ └── ... (139 total)
Invoking Sub-Skills
Direct Invocation
// Invoke specific skill
Skill({ skill: 'scientific-skills/rdkit' });
Skill({ skill: 'scientific-skills/scanpy' });
Chained Workflows
// Multi-skill workflow
Skill({ skill: 'scientific-skills/literature-review' });
Skill({ skill: 'scientific-skills/hypothesis-generation' });
Skill({ skill: 'scientific-skills/scientific-schematics' });
Prerequisites
- Python 3.9+ (3.12+ recommended)
- uv package manager (recommended)
- Platform: macOS, Linux, or Windows with WSL2
Best Practices
- Start with the right skill: Use the category tables above to find appropriate skills
- Chain skills for complex workflows: Literature review → Hypothesis → Experiment design
- Use database skills for data access: Query databases before analysis
- Visualize results: Use matplotlib/seaborn/plotly skills for publication-quality figures
- Document findings: Use scientific-writing skill for formal documentation
Integration with Agent Framework
Recommended Agent Pairings
| Agent | Scientific Skills |
|---|---|
data-engineer |
polars, dask, vaex, zarr-python |
python-pro |
All Python-based skills |
database-architect |
Database skills for schema design |
technical-writer |
literature-review, scientific-writing |
Example Agent Spawn
Task({
subagent_type: 'python-pro',
description: 'Analyze molecular dataset with RDKit',
prompt: `You are the PYTHON-PRO agent with scientific research expertise.
## Task
Analyze the molecular dataset for drug-likeness properties.
## Skills to Invoke
1. Skill({ skill: "scientific-skills/rdkit" })
2. Skill({ skill: "scientific-skills/datamol" })
## Workflow
1. Load molecular data
2. Calculate descriptors
3. Apply Lipinski filters
4. Generate visualization
5. Report findings
`,
});
Resources
Bundled Documentation
skills/*/SKILL.md- Individual skill documentationskills/*/references/- API references and patternsskills/*/scripts/- Example scripts and templates
External Resources
Version History
- v2.17.0 - Current version with 139 skills
- Integrated from K-Dense-AI/claude-scientific-skills repository
License
MIT License - Open source and freely available for research and commercial use.
Related Skills
Xlsx
Comprehensive spreadsheet creation, editing, and analysis with support for formulas, formatting, data analysis, and visualization. When Claude needs to work with spreadsheets (.xlsx, .xlsm, .csv, .tsv, etc) for: (1) Creating new spreadsheets with formulas and formatting, (2) Reading or analyzing data, (3) Modify existing spreadsheets while preserving formulas, (4) Data analysis and visualization in spreadsheets, or (5) Recalculating formulas
Clickhouse Io
ClickHouse database patterns, query optimization, analytics, and data engineering best practices for high-performance analytical workloads.
Clickhouse Io
ClickHouse database patterns, query optimization, analytics, and data engineering best practices for high-performance analytical workloads.
Analyzing Financial Statements
This skill calculates key financial ratios and metrics from financial statement data for investment analysis
Data Storytelling
Transform data into compelling narratives using visualization, context, and persuasive structure. Use when presenting analytics to stakeholders, creating data reports, or building executive presentations.
Kpi Dashboard Design
Design effective KPI dashboards with metrics selection, visualization best practices, and real-time monitoring patterns. Use when building business dashboards, selecting metrics, or designing data visualization layouts.
Dbt Transformation Patterns
Master dbt (data build tool) for analytics engineering with model organization, testing, documentation, and incremental strategies. Use when building data transformations, creating data models, or implementing analytics engineering best practices.
Sql Optimization Patterns
Master SQL query optimization, indexing strategies, and EXPLAIN analysis to dramatically improve database performance and eliminate slow queries. Use when debugging slow queries, designing database schemas, or optimizing application performance.
Anndata
This skill should be used when working with annotated data matrices in Python, particularly for single-cell genomics analysis, managing experimental measurements with metadata, or handling large-scale biological datasets. Use when tasks involve AnnData objects, h5ad files, single-cell RNA-seq data, or integration with scanpy/scverse tools.
Xlsx
Spreadsheet toolkit (.xlsx/.csv). Create/edit with formulas/formatting, analyze data, visualization, recalculate formulas, for spreadsheet processing and analysis.
