Anthropic Apologizes for Claude AI Censorship, Promises Visible Safeguards
Anthropic has issued an apology following concerns from the AI community regarding "invisible performance sabotage" or "secret censorship" within its Claude AI model. The company's reversal comes one day after the community raised issues with the model's performance. Anthropic plans to implement visible safeguards, but indicated that this change is expected to result in an increase in "false positives."

Anthropic has apologized for what was described as "secret censorship" within its Claude AI model. The apology follows an outcry from the AI community regarding "invisible performance sabotage."
One day after these concerns surfaced, the company announced a reversal of its previous course of action.
Anthropic stated that it will introduce visible safeguards as part of its updated approach. However, the company also noted that the implementation of these new safeguards is anticipated to lead to an increase in "false positives."
(Source: Decrypt Crypto)
Advertisement
AdSense slot • inline
