Speaker
So it's really based on, there's been some interesting new research looking at how agents perform. And the the way that they did the analysis was they were comparing how agents did a task and how humans did tasks and then you know comparing and contrasting which bits agents did well and which bits humans did well. um The thing which seems to work best at the moment is the human-agent combination, um where humans can sanity check things and do the things that the agents find challenging, which is often using sort of graphical information and visual information, but also stopping the agents from lying. Because the agents, when you autonomously let off and go do their own thing, that the truly agentic systems, which have just been trained by reinforcement learning, um they very heavily penalize an agent for not making it to the end of a process. And as a result, to avoid that huge penalty, if it gets to a step where it's blocked, let's say it needs to analyze some data for some files, if for some reason there's an operating system problem and it can't access the files, instead of saying, terribly sorry, I can't do this, it will say, okay, um here's a JSON object containing some information I've extracted from these files and pass it to the next step, just completely fabricated. And if you include humans in the process... So that be quite dangerous. So this is quite dangerous. And i think this is very akin to me to the like hallucination problem that we had early on with generative AI, which has got a lot better. And in the early days of making this work, companies would spend a lot of time scaffolding ah code processes around generative AI and breaking big tasks into smaller tasks and guiding it along a fairly carefully controlled path in order to make it safer. And I think we probably need either companies to do that sort of work with agents and sort of constrain them, um or the agentic tool calling needs to be better and safer and more reliable. But I think you know it's very much at that stage where companies were two years ago with generative AI, where it's like a very exciting technology with huge promise, but still not really quite sure how to bake it into existing business processes. Nevertheless, it sounds like agentic commerce