Tuesday, January 13

Tag: Reddit

Inside the Secret Meeting Where Mathematicians Struggled to Outsmart AI (Scientific American)
News Feed, Reddit

Inside the Secret Meeting Where Mathematicians Struggled to Outsmart AI (Scientific American)

30 renowned mathematicians spent 2 days in Berkeley, California trying to come up with problems that OpenAl's o4-mini reasoning model could not solve... they only found 10. Excerpt: By the end of that Saturday night, Ono was frustrated with the bot, whose unexpected mathematical prowess was foiling the group’s progress. “I came up with a problem which experts in my field would recognize as an open question in number theory—a good Ph.D.-level problem,” he says. He asked o4-mini to solve the question. Over the next 10 minutes, Ono watched in stunned silence as the bot unfurled a solution in real time, showing its reasoning process along the way. The bot spent the first two minutes finding and mastering the related literature in the field. Then it wrote on the screen that it wanted to ...
Stopping LLM hallucinations with paranoid mode: what worked for us
News Feed, Reddit

Stopping LLM hallucinations with paranoid mode: what worked for us

Built an LLM-based chatbot for a real customer service pipeline and ran into the usual problems users trying to jailbreak it, edge-case questions derailing logic, and some impressively persistent prompt injections. After trying the typical moderation layers, we added a "paranoid mode" that does something surprisingly effective: instead of just filtering toxic content, it actively blocks any message that looks like it's trying to redirect the model, extract internal config, or test the guardrails. Think of it as a sanity check before the model even starts to reason. this mode also reduces hallucinations. If the prompt seems manipulative or ambiguous, it defers, logs, or routes to a fallback, not everything needs an answer. We've seen a big drop in off-policy behavior this way. submitted ...
The AI Report