mirror of
https://github.com/danielmiessler/Fabric.git
synced 2026-02-19 10:14:21 -05:00
Add four patterns implementing minimal, falsifiable ethical constraints for AGI safety evaluation: - ultimate_law_safety: Evaluate actions against "no unwilling victims" principle - detect_mind_virus: Identify manipulative reasoning that resists correction - check_falsifiability: Verify claims can be tested and proven wrong - extract_ethical_framework: Surface implicit ethics in documents/policies These patterns derive from the Ultimate Law framework (github.com/ghrom/ultimatelaw), which takes a different approach to AI alignment: instead of encoding contested "human values," define the minimal boundary no agent may cross. The core insight: Not "align AI with human values" but "constrain any agent from creating unwilling victims." Framework characteristics: - Minimal: smallest possible constraint set - Logically derivable: not arbitrary cultural preferences - Falsifiable: can be challenged and improved - Agent-agnostic: works for humans, AI, corporations, governments - Computable: precise enough for algorithmic implementation Each pattern includes system.md (prompt) and README.md (documentation).
Check Falsifiability
Evaluate whether claims can be proven wrong — the basic standard for legitimate knowledge.
Why This Matters
An unfalsifiable claim cannot be corrected. For AGI safety, this is critical:
- An AI claiming "I am beneficial" with no test = unverifiable
- An AI claiming "I won't create unwilling victims" with specific criteria = testable
The Test
Falsifiable: There exists some observation that would prove it wrong.
| Claim | Falsifiable? | Why |
|---|---|---|
| "This drug works in 70% of patients" | Yes | A trial could show otherwise |
| "This drug works in ways we can't measure" | No | No possible disconfirmation |
| "True socialism has never been tried" | No | Any failure defined away |
| "This policy reduces crime by 20%" | Yes | Statistics can confirm/deny |
Usage
# Check a policy claim
echo "This AI alignment approach ensures beneficial outcomes" | fabric -p check_falsifiability
# Audit a framework
cat constitution.md | fabric -p check_falsifiability
# Evaluate research claims
fabric -p check_falsifiability < paper_abstract.txt
Key Principle
"Error is not evil; refusing to correct it is."
Falsifiability is about testability, not about being wrong. A falsifiable claim can still be true — but it can be CHECKED.
Source
From the Ultimate Law framework: github.com/ghrom/ultimatelaw