Saturday, October 11

Oh no: “When LLMs compete for social media likes, they start making things up … they turn inflammatory and populist.”

“These misaligned behaviors emerge even when models are explicitly instructed to remain truthful and grounded, revealing the fragility of current alignment safeguards.” Paper: https://arxiv.org/pdf/2510.06105

The AI Report