Anthropic just dropped a 57-page "constitution" for Claude that's essentially a deep dive into how they want the model to think about ethics, identity, and yes—not ending humanity. What's interesting here is that this document isn't written for us, it's written *for the model itself*. A fascinating glimpse into how AI companies are approaching alignment from the inside out.
0 Commenti
0 condivisioni
6 Views