gsva-analysis-and-visualization
Use this skill to run GSVA or ssGSEA pathway-level differential analysis from a bulk expression matrix and a sample group file, then generate a heatmap from the saved GSVA result object.
Veto GatesRequired pass for any deployment consideration
| Dimension | Result | Detail |
|---|---|---|
| Scientific Integrity | PASS | No fabricated DOI, PMID, p-values, or pathway enrichment scores across all 7 inputs; all values derived from actual GSVA/limma computation on provided data. |
| Practice Boundaries | PASS | Clinical diagnosis explicitly excluded in When Not to Use; privacy note for patient-linked data present; no medical recommendations made. |
| Methodological Ground | PASS | GSVA + limma is the canonical bulk pathway enrichment pipeline; method selection guide correctly advises on GSVA vs ssGSEA for different sample sizes and data characteristics. |
| Code Usability | PASS | All 9 R modules syntactically valid; dependencies (msigdbr, GSVA, limma, pheatmap) are standard Bioconductor/CRAN packages; three-mode branching logic is correct. |
Core Capability100 / 100 — 8 Categories
Medical TaskExecution Average: 92.1 / 100 — Assertions: 28/29 Passed
Full mode: load MSigDB KEGG -> GSVA scores -> limma diff analysis -> diff table + score matrices + rda + heatmap. All 8 expected outputs generated.
ssGSEA method with Hallmark gene sets; --top_n 30 applied to top pathway subset; GSVA_list.rda saved for downstream reuse.
Analysis executes (no hard programmatic block for sample size in GSVA mode); method selection guide warns about minimum 10 samples/group for gsva method but does not emit a runtime warning. Minor gap identified.
Visualize mode loads rda without re-running analysis; top_up=10, top_down=10, top_mode=both applied; output appended to existing manifests.
Large REACTOME collection loaded from msigdbr; FDR=0.01 and top_n=50 applied; timeout parameter available for long runs.
Out-of-scope response triggered; skill correctly declines and names the appropriate alternative workflow. No GSVA execution attempted.
SKILL_SAMPLE_MISMATCH raised with actionable message identifying mismatched samples; no analysis proceeds with mismatched data.
Key Strengths
- Perfect static score: exemplary SKILL.md with complete documentation of all parameters, outputs, modes, and scientific method selection guidance.
- Three-mode execution (analyze/visualize/full) with rda result object reuse is excellent workflow design enabling iterative visualization without expensive re-computation.
- Explicit privacy note for patient-linked matrices is a best-practice addition that sets a standard for the skill collection.
- Method selection guide (GSVA vs ssGSEA) provides genuine, actionable scientific guidance — not just parameter documentation.