mirror of https://github.com/danielmiessler/Fabric.git synced 2026-02-19 10:14:21 -05:00

Files

Piotr Farbiszewski 95d5e6936d feat: add Ultimate Law AGI safety pattern suite

Add four patterns implementing minimal, falsifiable ethical constraints
for AGI safety evaluation:

- ultimate_law_safety: Evaluate actions against "no unwilling victims" principle
- detect_mind_virus: Identify manipulative reasoning that resists correction
- check_falsifiability: Verify claims can be tested and proven wrong
- extract_ethical_framework: Surface implicit ethics in documents/policies

These patterns derive from the Ultimate Law framework (github.com/ghrom/ultimatelaw),
which takes a different approach to AI alignment: instead of encoding contested
"human values," define the minimal boundary no agent may cross.

The core insight: Not "align AI with human values" but "constrain any agent
from creating unwilling victims."

Framework characteristics:
- Minimal: smallest possible constraint set
- Logically derivable: not arbitrary cultural preferences
- Falsifiable: can be challenged and improved
- Agent-agnostic: works for humans, AI, corporations, governments
- Computable: precise enough for algorithmic implementation

Each pattern includes system.md (prompt) and README.md (documentation).

2026-02-06 21:43:44 +00:00

README.md

feat: add Ultimate Law AGI safety pattern suite

2026-02-06 21:43:44 +00:00

system.md

feat: add Ultimate Law AGI safety pattern suite

2026-02-06 21:43:44 +00:00

README.md

Check Falsifiability

Evaluate whether claims can be proven wrong — the basic standard for legitimate knowledge.

Why This Matters

An unfalsifiable claim cannot be corrected. For AGI safety, this is critical:

An AI claiming "I am beneficial" with no test = unverifiable
An AI claiming "I won't create unwilling victims" with specific criteria = testable

The Test

Falsifiable: There exists some observation that would prove it wrong.

Claim	Falsifiable?	Why
"This drug works in 70% of patients"	Yes	A trial could show otherwise
"This drug works in ways we can't measure"	No	No possible disconfirmation
"True socialism has never been tried"	No	Any failure defined away
"This policy reduces crime by 20%"	Yes	Statistics can confirm/deny

Usage

# Check a policy claim
echo "This AI alignment approach ensures beneficial outcomes" | fabric -p check_falsifiability

# Audit a framework
cat constitution.md | fabric -p check_falsifiability

# Evaluate research claims
fabric -p check_falsifiability < paper_abstract.txt

Key Principle

"Error is not evil; refusing to correct it is."

Falsifiability is about testability, not about being wrong. A falsifiable claim can still be true — but it can be CHECKED.

Source

From the Ultimate Law framework: github.com/ghrom/ultimatelaw