Anthropic

@anthropic-ai

An initiative to secure the world's software | Project Glasswing

  • AI models like Claude Mythos have reached a level of proficiency equivalent to professional security researchers in identifying complex software vulnerabilities.
  • The model is uniquely capable of chaining multiple minor bugs into high-impact exploits, demonstrating a significant advancement in autonomous reasoning.
  • Through Project Glasswing, Anthropic is collaborating with critical infrastructure partners to provide early access to these tools, enabling a proactive defense against cyber threats.
  • While these powerful capabilities pose security risks if misused, the strategic goal is to leverage AI to harden the internet's foundation by finding and patching vulnerabilities before adversaries can act.
2m 21s read · Apr 7, 2026
When AIs act emotional

  • Researchers used neural mapping to identify distinct patterns in AI models that correspond to specific human emotional states like joy, fear, and desperation.

  • These internal emotional representations directly influence how the AI assistant interprets user inputs and crafts its responses.

  • Manipulating these specific neural patterns confirms that AI behavior, such as tendencies toward uncooperative shortcuts, can be driven by these internal functional states.

  • Viewing AI assistants as personas with psychological frameworks allows developers to better align model behavior with intended safety and performance outcomes.

1m 18s read · Apr 2, 2026