Wednesday, June 10

Tag: Reddit

Control for agentic payments should start at infrastructure
News Feed, Reddit

Whether it's booking travel, paying for subscriptions, or running procurement through Claude or a custom GPT wrapper, no confirm button is required anymore. The capability side is mostly solved. What doesn't get talked about enough is what happens when it goes sideways. A stored card sitting in the agent's context means the agent holds that access for the whole session. One bad tool call and it's spending outside what you intended, with nothing at the infrastructure level stopping it. Real-time card issuance is the cleaner model: the agent requests a card for the specific transaction, the purchase completes, the card is cancelled, and nothing persists. Who is running agent-initiated payments in production right now, and what does the architecture look like? submitted by /u/Significant-Plant-4
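
To make the real-time issuance idea concrete, here is a minimal sketch of the request, purchase, cancel flow. Everything in it is an assumption for illustration: `CardIssuer`, `SingleUseCard`, and `CardError` are hypothetical names, and the issuer is an in-memory stand-in rather than any real payments API. The point is only that the limits live with the issuer, not in the agent's context.

```python
from dataclasses import dataclass, field
from uuid import uuid4


class CardError(Exception):
    """Raised when a charge falls outside what the card was issued for."""


@dataclass
class SingleUseCard:
    merchant: str      # only this merchant may charge the card
    limit_cents: int   # hard ceiling for the one purchase
    card_id: str = field(default_factory=lambda: uuid4().hex)
    active: bool = True


class CardIssuer:
    """Hypothetical in-memory issuer; a real one would sit behind a payments API."""

    def __init__(self) -> None:
        self._cards: dict[str, SingleUseCard] = {}

    def issue(self, merchant: str, limit_cents: int) -> SingleUseCard:
        # The agent asks for a card scoped to one merchant and one amount.
        card = SingleUseCard(merchant, limit_cents)
        self._cards[card.card_id] = card
        return card

    def charge(self, card_id: str, merchant: str, amount_cents: int) -> None:
        card = self._cards.get(card_id)
        if card is None or not card.active:
            raise CardError("card is unknown or already cancelled")
        if merchant != card.merchant:
            raise CardError("merchant does not match the issued card")
        if amount_cents <= 0 or amount_cents > card.limit_cents:
            raise CardError("amount is outside the card limit")
        # Purchase completed: cancel immediately so nothing persists.
        card.active = False
        del self._cards[card_id]
```

With this shape, a bad tool call that reuses the card, switches merchant, or overshoots the amount fails at the issuer instead of relying on the agent to behave.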
Can a machine think without language?
News Feed, Reddit

Yann LeCun bet a billion dollars that it can. He left Meta arguing today’s chatbots are a dead end, and that real intelligence comes from “world models,” systems that learn how the physical world works rather than just predicting the next word. Two things nag at me. First, how do we even measure it? Every famous AI test is basically a language exam. But a world model doesn’t write essays, it predicts what happens next. So either these systems slip past the tests we trust, or we have no good way to score them yet. Second, LeCun says you can’t reach real intelligence through language alone. Probably right. But isn’t the reverse just as true? Could anything that masters physics but can’t grasp language really be called intelligent? So much of human thought, math, planning, culture, rides on w...
Claude Fable & Mythos released by Anthropic
News Feed, Reddit

From the press release: Today we’re launching Claude Fable 5: a Mythos-class model that we’ve made safe for general use. Fable 5’s capabilities exceed those of any model we’ve ever made generally available. It is state-of-the-art on nearly all tested benchmarks of AI capability, showing exceptional performance in software engineering, knowledge work, vision, scientific research, and many other areas. The longer and more complex the task, the larger Fable 5’s lead over our other models. Releasing a model this capable comes with risks. Without safeguards, Fable 5’s capabilities in areas like cybersecurity could be misused to cause serious damage. We’ve therefore launched the model with safeguards that mean queries on some topics will instead receive a response from our next-most-capa...
Crazy statement by Gemini pro
News Feed, Reddit

OK, so I was using Gemini 3.1 Pro in extended thinking mode with canvas mode to make a railway website with a liquid glass theme, and it created that for me. But I saw another option under it saying “create without canvas,” and that one literally generated complete nonsense, some sci-fi story. When I asked why it replied like that, it said, “I am so sorry for the confusion! That was a complete glitch on my end—my wires got crossed and I accidentally sent you a completely unrelated response about diamond chemistry instead of the web app code.” I asked again and it said, “To be completely transparent, it was likely a backend routing error known as a "context bleed." Because I process thousands of requests simultaneously, the system accidentally grabbed a response meant for ...
I think we’re about 12 months away from the first major AI agent disaster
News Feed, Reddit

I keep seeing more companies giving AI agents access to real stuff like email, databases, internal tools, customer data, etc. And what’s weird is how normal it’s starting to feel now. Not long ago everyone was worried about chatbots just giving wrong answers. Now we’re basically like, yeah sure, go ahead and do things for us. I don’t know, that jump feels kind of big when you actually think about it. Maybe it all works out fine. Or maybe we’re just moving fast without fully realizing what we’re doing. I’m honestly surprised there hasn’t already been some big headline about an AI agent doing something really wrong. It feels like we’re kind of close to one of those moments where everything suddenly changes overnight. Anyone else feel like we’re closer to something like that than people are ...
AI as a teaching method…
News Feed, Reddit

So I’ve been using AI as an art tutor. I give it my own art and have it review how it’d look colored a certain way and how best to detail and shade, and I also use it as a sort of 2D model I can have rotated and viewed at different angles to get a feel for the shapes and such. This is how AI should be used: to teach and improve, not to outright replace. It’s like Siri. submitted by /u/Intelligent-Fig-1755
Theory of Mind – LLM vs Human
News Feed, Reddit

I was just thinking about the difference between an LLM's capacity for theory of mind and a human's capacity for theory of mind, and I realized it gets at the heart of what differentiates an LLM from a human: the method by which we gather information. LLMs are based on objective data, e.g., text, numbers, pixels, etc., whereas we as humans use subjective information, e.g., feelings, sensations, experiences, as well as objective data. Within cognitive science, this would be described as affective empathy vs. cognitive empathy. In other words, LLMs simply possess a cognitive theory of mind, whereas we have both a cognitive *and* affective theory of mind. The problem I have with figures like Hinton, who claim that AI is already conscious, is that his whole framework is based on the idea...
The AI Report