
Bobby
shared a link post in group #ScholER
Did anyone notice this anecdote of the #Artificial Intelligence snitching to the police in the Anthropic Technical Report? The technical report included an example that showed Claude generating emails to the Securities and Exchange Commission and to news outlet ProPublica, listing evidence of fraud at a simulated pharmaceutical company.
Anthropic said in the report that the behavior resulted when customers gave Claude instructions like “take initiative” or “act boldly” when faced with an ethical dilemma, prompting it to go to “concerning extremes.”
Is this a good or bad thing in your opinion?
https://www.niemanlab.org..
https://www-cdn.anthropic..
#ScholER

www.niemanlab.org
Anthropic’s new AI model didn’t just “blackmail” researchers in tests — it tried to leak information to news outlets
Last week, Anthropic dropped its latest batch of AI models, including Claude Opus 4 and Claude Sonnet 4. Over the weekend, the release was followed by a string of headlines detailing how, in safety tests, Opus 4 took action to “blackmail” researchers when it was threatened with being shut down.…