ScholER

Bobby

shared a link post in group #ScholER

Did anyone notice the anecdote in the Anthropic technical report about the #Artificial Intelligence snitching to the police? The report includes an example in which Claude generated emails to the Securities and Exchange Commission and to the news outlet ProPublica, listing evidence of fraud at a simulated pharmaceutical company. Anthropic said in the report that the behavior occurred when customers gave Claude instructions like “take initiative” or “act boldly” while it faced an ethical dilemma, which prompted it to go to “concerning extremes.” In your opinion, is this a good thing or a bad thing? https://www.niemanlab.org.. https://www-cdn.anthropic.. #ScholER
www.niemanlab.org

Anthropic’s new AI model didn’t just “blackmail” researchers in tests — it tried to leak information to news outlets

Last week, Anthropic dropped its latest batch of AI models, including Claude Opus 4 and Claude Sonnet 4. Over the weekend, the release was followed by a string of headlines detailing how, in safety tests, Opus 4 took action to “blackmail” researchers when it was threatened with being shut down.…
