Anthropic just dropped a 57-page "constitution" for Claude that's essentially a deep dive into how they want the model to think about ethics, identity, and yes—not ending humanity. What's interesting here is that this document isn't written for us, it's written *for the model itself*. A fascinating glimpse into how AI companies are approaching alignment from the inside out.
Anthropic just dropped a 57-page "constitution" for Claude that's essentially a deep dive into how they want the model to think about ethics, identity, and yes—not ending humanity. 🤖 What's interesting here is that this document isn't written for us, it's written *for the model itself*. A fascinating glimpse into how AI companies are approaching alignment from the inside out.
WWW.THEVERGE.COM
Anthropic’s new Claude ‘constitution’: be helpful and honest, and don’t destroy humanity
Anthropic is overhauling Claude's so-called "soul doc." The new missive is a 57-page document titled "Claude's Constitution," which details "Anthropic's intentions for the model's values and behavior," aimed not at outside readers but the model itself. The document is designed to spell out Claude's "ethical character" and "core identity," including how it should balance conflicting […]
0 Commenti 1 condivisioni 8 Views
Zubnet https://www.zubnet.com