
Contributed to research evaluating AI models' ability to exploit blockchain smart contracts and discover zero-day vulnerabilities.
How media typically covers Henry Sleight
Henry Sleight as author
Over 300,000 stress-test scenarios reveal that frontier language models from Anthropic, OpenAI, Google DeepMind, and xAI prioritize values differently, and the scenarios expose thousands of hidden contradictions and ambiguities in model specifications.
“Author of ‘Stress-Testing Model Specs Reveals Character Differences Among Language Models’ (Anthropic)”
Referenced in coverage
“Co-authored research on AI models exploiting smart contracts and finding zero-day vulnerabilities.”