human (1)

31169825294?profile=RESIZE_400xFinding software vulnerabilities used to require teams of security researchers months of painstaking analysis.  Anthropic’s Claude Mythos does it automatically-and that’s exactly the problem.  The company admits no one, including itself, has built safeguards strong enough to prevent such models from being weaponized.  Yet Anthropic simultaneously promises to make “Mythos-class models” publicly available once it develops “far stronger safeguards.”[1]

When AI Outpaces Human Security Teams - Mythos