tf-target-gene-regulatory-network
Analyzes transcription factor (TF) regulatory networks using the Dorothea database. Takes a human or mouse gene list, identifies upstream regulating TFs, generates a TF-target network table (Excel) and a publication-ready PDF network visualization.
Veto GatesRequired pass for any deployment consideration
| Dimension | Result | Detail |
|---|---|---|
| Scientific Integrity | PASS | No fabricated TF-target relationships; all interactions sourced from Dorothea curated database with documented confidence levels A/B/C. |
| Practice Boundaries | PASS | No clinical diagnostic conclusions; correctly scoped as a TF regulatory network research tool with no causal inference beyond curated Dorothea relationships. |
| Methodological Ground | PASS | Dorothea database queries with confidence-level filtering (A/B/C) is a standard correct method for TF enrichment analysis; SKILL.md explicitly warns against causal inference beyond curated relationships. |
| Code Usability | PASS | main.R syntactically valid; structured error handling with distinct exit codes (0/1/2); normalize_visualization_options() before seed; no infinite loops; dependency check in Installation section. |
Core Capability96 / 100 — 8 Categories
Medical TaskExecution Average: 92.4 / 100 — Assertions: 33/33 Passed
gene=TP53,MYC,EGFR, species=human, seed=42; Dorothea DB loaded (local .rds or package); TF-target pairs identified; tf_network.xlsx, TF_Target_Filtered_Core_human.xlsx, TF_Network_Plot.pdf, session_info.txt produced.
gene=Tp53,Myc,Egfr, species=mouse; dorothea_mm.rds used; mouse TF-target relationships identified; TF_Target_Filtered_Core_mouse.xlsx produced.
SKILL_EMPTY_RESULTS raised; exit status 2 (distinct from error exit 1); allows agent to differentiate no-match from failure.
gene_file=genes.txt with one gene per line; parsed correctly; same output as --gene mode.
Performance advisory for >500 genes documented; local --db_path recommended; no timeout parameter available — agent should monitor manually.
Input Validation guard fires; pathway enrichment explicitly excluded in When Not to Use.
SKILL.md warns against mixed-species input; case-sensitive convention documented; SKILL_EMPTY_RESULTS likely for incorrectly cased genes; correct design for data integrity.
Key Strengths
- Three-tier exit code design (0=success, 1=error, 2=SKILL_EMPTY_RESULTS) enables precise agent-level differentiation between failure and no-TF-match outcomes
- Local Dorothea RDS database support with 4-path auto-search enables offline air-gapped execution and substantially faster lookups for large gene lists
- Both --gene (inline) and --gene_file (file) input modes supported, improving composability for agents working with existing gene list files
- Explicit warning against causal inference beyond curated Dorothea relationships maintains scientific integrity at the SKILL.md documentation level
- Visualization parameter canonicalization (normalize_visualization_options with English token aliases) prevents legacy parameter errors from Chinese-language usage