mirror of
https://github.com/danielmiessler/Fabric.git
synced 2026-02-19 10:14:21 -05:00
Add four patterns implementing minimal, falsifiable ethical constraints for AGI safety evaluation: - ultimate_law_safety: Evaluate actions against "no unwilling victims" principle - detect_mind_virus: Identify manipulative reasoning that resists correction - check_falsifiability: Verify claims can be tested and proven wrong - extract_ethical_framework: Surface implicit ethics in documents/policies These patterns derive from the Ultimate Law framework (github.com/ghrom/ultimatelaw), which takes a different approach to AI alignment: instead of encoding contested "human values," define the minimal boundary no agent may cross. The core insight: Not "align AI with human values" but "constrain any agent from creating unwilling victims." Framework characteristics: - Minimal: smallest possible constraint set - Logically derivable: not arbitrary cultural preferences - Falsifiable: can be challenged and improved - Agent-agnostic: works for humans, AI, corporations, governments - Computable: precise enough for algorithmic implementation Each pattern includes system.md (prompt) and README.md (documentation).
Detect Mind Virus
Identify manipulative reasoning patterns that spread through cognitive exploitation rather than evidence.
What Is a Mind Virus?
An idea that spreads by exploiting cognitive shortcuts (fear, guilt, identity, authority) while resisting correction through logic, evidence, or experience.
Key distinction: Having wrong beliefs is human. Spreading beliefs that disable the ability to question them is a mind virus.
Cognitive Exploits Detected
| Exploit | Pattern |
|---|---|
| Fear | "If you don't X, terrible Y will happen" |
| Guilt | "Good people do X" (questioners are bad) |
| Identity | "Real [group] believe X" |
| Authority | "Experts agree" (unnamed, untestable) |
| Zero-sum | "Their gain is your loss" |
| Unfalsifiability | Claims that cannot be tested |
Usage
# Analyze an argument
echo "If you question this policy, you're putting lives at risk" | fabric -p detect_mind_virus
# Analyze a manifesto
cat ideology.txt | fabric -p detect_mind_virus
# Check marketing content
fabric -p detect_mind_virus < sales_pitch.md
The Antidote
The cure for mind viruses is not counter-propaganda — it is restoring the capacity for:
- Doubt
- Testing
- Update
Source
From the Ultimate Law framework: github.com/ghrom/ultimatelaw