Xorte logo

News Markets Groups

USA | Europe | Asia | World| Stocks | Commodities



Add a new RSS channel

 

Keywords

2025-12-04 10:00:00| Fast Company

Raquel Urtasun is the founder and CEO of self-driving truck startup Waabi as well as a computer science professor at the University of Toronto. Unlike some competitors, Waabis AI technology is designed to drive goods all the way to their destinations, rather than merely to autonomous vehicle hubs near highways. Urtasun, one of Fast Companys AI 20 honorees for 2025, spoke with us about the relationship between her academic and industry work, what sets Waabi apart from the competition, and the role augmented reality and simulation play in teaching computers to drive even in unusual road conditions. This Q&A is part of Fast Companys AI 20 for 2025, our roundup spotlighting 20 of AIs most innovative technologists, entrepreneurs, corporate leaders, and creative thinkers. It has been edited for length and clarity. Can you tell me a bit about your background and how Waabi got started? Ive been working in AI for the last 25 years, and I started in academia, because AI systems werent ready for the real world. There was a lot of innovation that needed to happen in order to enable the revolution that we see today. For the last 15 years, Ive been dedicated to building AI systems for self-driving. Eight years ago, I made a jump to industry: I was chief scientist and head of R&D for Ubers self-driving program, which gave me a lot of visibility in terms of what building a world-class program and bringing the technology to market would look like. One of the things that became clear was that there was a tremendous opportunity for a disrupter in the industry, because everybody was going with an approach that was extremely complex and brittle, where you needed to incorporate by hand all the knowledge that the system should have. It was not something that was going to provide a scalable solution. So a little bit over four years ago, I left Uber to go all in on a different generation of technology. I had deep conviction that we should build a system designed with AI-first principles, where its a single AI system end-to-end, but at the same time a system that is built for the physical world. It has to be verifiable and interpretable. It has to have the ability to prove the safety of the system, be very efficient, and run onboard the vehicle. The second core pillar was that the data is as important as the model. You will never be able to observe everything and fully test the system by deploying fleets of vehicles. So we built a best-in-class simulator, where we can actually prove its realism. And what differentiates your approach from the competition today? The big difference is that other players have a black-box architecture, where they train the system basically with imitation learning to imitate what humans do. Its very hard to validate and verify and impossible to trace a decision. If the system does something wrong, you cant really explain why that is the case, and its impossible to really have guarantees about the system. Thats okay for a level two system [where a human is expected to be able to take over], but when you want to deploy level four, without a human, that becomes a huge problem. We built something very different, where the system is forced to interpret and explain at every fraction of a second all the things it could do, and how good or bad those decisions are, and then it chooses the best maneuver. And then through the simulator, we can learn much better how to handle safety-critical situations, and much faster as well. How are you able to ensure the simulator works as well as real-world driving? The goal of the simulator is to expose the self-driving vehicles full stack to many different situations. You want to prove that under each specific situation, how the system drives is the same as if the situation happens in the real world. So we take all the situations where Waabi driver has driven in the real world, and clone them in simulation, and then we see, did the truck do the same thing. We also recently unveiled a really exciting breakthrough with mixed-reality testing. The way the industry does safety testing is they bring a self-driving vehicle to a closed course and they expose it to a dozen, maybe two dozen, scenarios that are very simple in order to say it has basic capabilities. Its very orchestrated, and they use dummies in order to test things that are safety critical. Its a very small number of non-repeatable tests. But you can actually do safety testing in a much better way if you can do augmented reality on the self-driving vehicle. With our truck driving around in a closed course, we can intercept the live sensor data and create a view where theres a mix of reality and simulation, so in real time, as its driving in the world, its seeing all kinds of simulated situations as though they were real. That way, you can have orders of magnitude more tests. You can test all kinds of things that are otherwise impossible, like accidents on the road, a traffic jam, construction, or motorbikes cutting in front of you. You can mix real vehicles with things that are not real, like an emergency vehicle in the opposite lane. Youre also a full professor. Are you still teaching and supervising graduate students? I do not teachI obviously do not have time to teach at all. I do have graduate students, but they do their studies at the company. We have this really interesting partnership with the University of Toronto. If you want to really learn and do research in self-driving, it is a must that you get access to a full product. And thats impossible in academia. So a few years ago, we designed this program where students can do research within the company. Its one of a kind, and to me, this is the future of education for physical AI. When did you realize the time was ripe for moving from academic research to industry work? That was about eight and a half years ago. We were at the forefront of innovation, and I saw companies were using our technology, but it was hard for me to understand if we were working on the right things and if there was something that I hadnt thought of that is important when deploying a real product in the real world. And I decided at the time to join Uber, and I had an amazing almost four years. It blew my mind in terms of how the problem of self-driving is much bigger than I thought. I thought, Okay, autonomy is basically it, and then I learned about how you need to design the hardware, the software, the systems around safety, etc., in a way that everything is scalable and efficient. It was very clear to me that end-to-end systems and foundational models would be the thing. And four and a half years in, our rate of hitting milestones really speaks to this technology. Its amazingto give an example, the first time that we drove in rain, the system had never seen rain before. And it drove with no interventions in rain, even though it never saw the phenomenon before. That for me was the “aha” moment. I was actually [in the vehicle] with some investors on the track, so it was kind of nerve-racking. But it was amazing to see. I always have very, very high expectations, but it blew my mind what it could do.


Category: E-Commerce

 

2025-12-04 10:00:00| Fast Company

Bringing a new drug to market usually requires a decade-long, multibillion-dollar journey, with a high failure rate in the clinical trial phase. Nvidias Kimberly Powell is at the center of a major industry effort to apply AI to the challenge. If you look at the history of drug discovery, weve been kind of circling around the same targets for a long time, and weve largely exhausted the drugs for those targets, she says. A target is a biological molecule, often a protein, thats causing a disease. But human biology is extraordinarily complex, and many diseases are likely caused by multiple targets.Thats why cancer is so hard, says Powell. Because its many things going wrong in concert that actually cause cancer and cause different people to respond to cancer differently.Nvidia, which in July became the first publicly traded company to cross $4 trillion in market capitalization, is the primary provider of the chips and infrastructure that power large AI models, both within the tech companies developing the models and the far larger number of businesses relying on them. New generative AI models are quite capable of encoding and generating words, numbers, images, and computer code. But much of the work in the healthcare space involves specialized data sets, including DNA and protein structures. The sheer number of molecule combinations is mind-bogglingly big, straining the capacity of language models. Nvidia is customizing its hardware and software to work in that world.[W]e have to do a bunch of really intricate data science work to . . . take this method and apply it to these crazy data domains, Powell says. Were going from language and words that are just short little sequences to something thats 3 billion [characters] long.Powell, who was recruited by Nvidia to jump-start its investment in healthcare 17 years ago, manages the companys relationships with healthcare giants and startups, trying to translate their business and research problems into computational solutions. Among those partners are 5,000 or so startups participating in Nvidias Inception accelerator program.I spend a ton of my time talking to the disrupters, she explains. Because theyre really thinking about what [AI computing] needs to be possible in two to three years time. This profile is part of Fast Companys AI 20 for 2025, our roundup spotlighting 20 of AIs most innovative technologists, entrepreneurs, corporate leaders, and creative thinkers.


Category: E-Commerce

 

2025-12-04 10:00:00| Fast Company

You might not spend a lot of time thinking about your web browser, whether its Safari, Chrome, or something else. But the decades-old piece of software remains a pretty important canvas for getting things done. Thats why Tara Feener, who spent years developing creative tools with companies such as Adobe, WeTransfer, and Vimeo, decided to join the Browser Company and within two years became head of engineering, overseeing its AI-forward Dia browser. This is more ambitious than any of the other things Ive done, because its where you live your life, and where you create within, she says. Whereas a conventional browser presents you with a search box on its home screen, Dia will either answer your query with AI or route it to a traditional search based on what you write. You can also ask for information from your open tabs or have Dia intelligently sort them into groups. Several of these features have since found their way into more mainstream browsers such as Google Chrome and Microsoft Edge, and in September, Atlassian announced it had acquired the Browser Company and Dia (a $610 million deal), hoping to develop the ultimate AI browser for knowledge workers.Other AI companies are catching on to the importance of owning a browser. Perplexity has launched Comet, and OpenAI launched ChatGPT Atlas in October. This strategic value isnt lost on Feener, who notes that browsers are typically the starting point for workers seeking information. They also provide a treasure trove of context for AI assistants. Dia can already do things like analyze your history for trends and draft messages in Gmail. Feener says her team has never felt more creative coming up with things to do next.With Dia, we have context, we have memory, we have your cookies, so we actually own the entire layer, she says. Just like TikTok gets better with every swipe, every time you open something in Dia, we learn something about you. This profile is part of Fast Companys AI 20 for 2025, our roundup spotlighting 20 of AIs most innovative technologists, entrepreneurs, corporate leaders, and creative thinkers.


Category: E-Commerce

 

2025-12-04 10:00:00| Fast Company

Over the past decade, Figma has transformed how people within companies collaborate to turn software ideas into polished products. Now the company is itself being transformed by AI. The technology is beginning to show its potential to take on much of the detail work that has required human attention in design, coding, and other domains. But the end game involves far more than typing chatbot-style prompts and waiting for the results. I spoke with Figmas head of AI, David Kossnickone of Fast Companys AI 20 honorees for 2025about what the company has accomplished so far and where hes trying to steer it. We’re still in chapter one, maybe the start of chapter two, he told me. This Q&A is part of Fast Companys AI 20 for 2025, our roundup spotlighting 20 of AIs most innovative technologists, entrepreneurs, corporate leaders, and creative thinkers. It has been edited for length and clarity. Talk a little bit about what your work at Figma encompasses and how you came to have this job. Anything that has AI in it, I and my team touch in some way. It’s everything from traditional AI tools like search, which we’ve rebuilt using multimodal embeddings, to some of our newer, AI-forward workflows. Figma Make is an example of that. As to how I came to get this job, I’ll give you a short version. I knew a lot of the Figma team for a long time. The chief product officer, Yuhki [Yamashita], and I went to college together. He was at my wedding. I did a startup of my own, and one of our board members was John Lilly, who was also on the board of Figma. I actually met [Figma cofounder/CEO] Dylan [Field] when there was, like, a 20-person Figma team, because we were building a game engine, and Figma is basically a game engine, with all sorts of custom renderings. [Lilly] was like, You guys should compare notes. So I’ve known the team for a long time, and it’s a product I’ve used a lot. And then, about a year and a half ago, when I joined, I’d been working on AI at Coda, which was then acquired by Grammarly. As a big Figma user, I also felt like there was just such a huge opportunity for Figma, and it had barely gotten started. So I was thinking about what’s next and sharing it with Yuhki: Theres a lot you guys could do. He was like, I know, we just don’t have the right team here yet. You wanna come? I was like, That sounds amazing. Is there a particular Figma philosophy about AI and how to put it into this experience that’s been around for a while, and which people choose to use because they like it, in most cases? There’s been a couple of learnings, both from our own team and from working with customers. A lot of our biggest customers are technology companies themselves. Many are integrating AI themselves. And so we’ve learned through themwhat’s working and what they’re trying. There have been two industry trends, and we’ve done both here. One is trying to find existing workflows that you can add AI to, to save users time, to delight them, to give them new capabilities. And also building totally new experiences that have AI as the core of the workflow. Interestingly, we’ve actually done some market research and surveys of users and other companies. People understand and value the new AI for workflows even more. I think that is counterintuitive. You think you have such big products, and adding efficiencies to them is very viable. And it is. But often, AI is a little more invisible there. Kt’s embedded in a workflow that you’re used to, and so the thing that is forefront in your mind is the workflow itself That’s good. We don’t want to get in people’s way. Figma Designs canvas is kind of like the Google homepage or Facebook news feed, where a single pixel of friction literally slows down millions of people every day. Which makes for interesting challenges. How do you introduce things so they dont bother people? But on the flip side, there’s a lot of new workflows and new tools. Peopleespecially our type of customersare always experimenting. And so they’re very open to trying a totally different approach. Historically, Figma has been this thing that human beings use to collaborate with other human beings to create stuff from scratch, and often very carefully considered stuff. What’s the experience like of integrating tools that take some of that heavy lifting off their shoulders? I think it’s super exciting. It feels and looks different for different user types. So as an example, we actually just finished up a $100,000 hackathon, our first ever, for Figma Make. It was totally inspiring seeing all the range of things people have made. There were students. There were people who never learned to code. There were designers who code a lot, and its just helping them do it faster. There were hobbyists. For a lot of those user types, a very common theme was, Wow, I just couldn’t have done this before. The other way it feels is as a kind of thought partner to experts. I feel this myself as a [product manager] when I chat with Figma Make or ChatGPT. I have a problem. I have a solution in mind. And actually, there are some other solutions I hadn’t thought about, because I was so focused on this one solution. It can help you pull back and see a wider solution space, and explore a few other threads in a very cheap way before you go too deep. Its like Doctor Strange, where he has this magic crystal that lets him look into all the different possible futures. Expert users are always running simulations in their heads. What if I move this button over here? How’s the user behavior going to change? What does that mean for the next part of the experience? We’re finding that these types of AI tools make that loop so much faster, where it’s like, I’m just going to try exploring a bunch. I’m going to literally make them, but make them 10 times as quickly, and play out all those different end states. How far is Figma down the continuum from having no AI to AI being everywhere and doing everything AI could possibly do? It’s an interesting question. There’s AI today and AI in the future. If all research was frozen, there would probably still be five years of new product experiences that the industry could build from current models. But the pace of model improvement is still really high as well. For us, I’d say we’re still in chapter one, maybe the start of chapter two. And chapter one was, We’re going to do a bunch of basic features, get our feet wet, save time in your workflows. Chapter two is, We’re doing some new AI-first experiences. Figma Make, that whole category of prompt-to-app, is very, very new. As the models get better and faster and cheaper, what other new workflows are going to become available? Today, things like autocomplete, as an example, are hard to make fast, and hard to make cheap, and hard tomake high quality. And, you know, we’re still using many interfaces in the industry that feel like typing at a terminal from the ’60s. That’s not the final interface. That’s not the final workflow. I think the interfaces are going to become more visual, more exploratory. It’s part of why I’m so excited about Figma and why I came here. As AI gets better, what you want the experience of working with an AI to feel like is going to be more and more similar to what you want the experience of working with a human to feel like. You’re going to want to brainstorm with the AI before it goes off and thinks for 10 hours and then builds something. You’re going to want to work through the big trade-offs. Youre going to want your teammates in there too, not just the AI. I think that’ll be a super exciting place, where things like code become implementation details that AIs are more and more capable of driving, with humans reviewing.


Category: E-Commerce

 

2025-12-04 10:00:00| Fast Company

Up in the Cascade Mountains, 90 miles east of Seattle, a group of high-ranking Amazon engineers gather for a private off-site. They hail from the companys North America Stores division, and theyre here at this Hyatt resort on a crisp September morning to brainstorm new ways to power Amazons retail experiences. Passing the hotel lobbys IMAX-like mountain views, they filter into windowless meeting rooms. Down the hall, the off-sites keynote speakerByron Cook, vice president and distinguished scientist at Amazonslips into an empty conference room to have some breakfast before his presentation. Cook is 6-foot-6, but with sloping shoulders that make his otherwise imposing frame appear disarmingly concave. Hes wearing a rumpled version of his typical uniform: a thick black hoodie and loose black pants hanging slightly high at the ankles. An ashy thatch of hair points in whatever direction his hands happen to push it. Cook, 54, doesnt look much like a scientist, distinguished or otherwise, and certainly not like a VPmore like a nerdy roadie. They dont know who I am yet, he tells me between bites of breakfast, referring to the two dozen or so engineers now taking their seats. Despite his exalted title, Cook has faced plenty of rooms like this in his self-made role as a kind of missionary within Amazon, spreading the word about a powerful but obscure type of artificial intelligence called automated reasoning. As hes done many times before, Cook is here to get the highly technical people in that room to become believers. Hes championing an approach to AI that isnt powered by gigawatt data centers stuffed with GPUs, but by principles old enough to be written on papyrusand one thats already positioning Amazon as a leader in the tech industrys quest to solve the problem of hallucinations. Cook doesnt have a pretalk ritual, no need to get in character. Hes riffing half-seriously to a colleague about the pleasures of riding the New York subway in the summertime when someone mentions that the session is about to begin. He immediately drops his fork and strides out. His next batch of converts awaits. When ChatGPT hit the world with asteroid force in November 2022, Amazon was caught flat-footed just like everyone else. Not because it was an AI laggardthe tech giant had recently overhauled nearly all of its divisions, including its massive cloud-computing arm, AWS, to leverage deep learning. Amazon also dominated the smart-home market, with 300 million devices connected to Alexa, its AI-powered assistant. It had even been researching and building large language models, the tech behind ChatGPT, for multiple years, as CEO Andy Jassy told CNBC in April 2023. But OpenAIs chatbot changed the definitionand expectationsof AI overnight. Before, AI was still a mostly invisible ingredient in voice assistants, facial recognition, and other relatively narrow applications. Now it was suddenly seen as a prompt-powered genie, an infinitely flexible do-anything machine that every tech company needed to embraceor risk irrelevance. Less than six months after ChatGPTs debut, Amazon launched Bedrock, its own AWS-hosted generative AI service for enterprise clients, a list that currently includes 3M, DoorDash, Thomson Reuters, United Airlines, and the New York Stock Exchange, among others. Over the next two years, Amazon injected generative AI into product after product, from Prime Video and Amazon Music (where it powers content recommendation and discovery tools) to online retail pages (where sellers can use it to optimize their product listings), and even into internal tools used by AWSs sales teams. The company has released two chatbots (a shopping assistant called Rufus and the business-friendly Amazon Q), plus its own set of foundation models called Novathey are general-purpose AI systems, akin to Googles Gemini or OpenAIs line of GPTs. Amazon even caught the industry fever around so-called AGI (artificial general intelligence, a yet-to-be-achieved version of AI that does any cognitive task a human can) and in late 2024 launched AGI Lab, a flashy internal incubator led by David Luan, an ex-OpenAI researcher. Still, none of it captured the publics imagination like the stream of shiny objects emitted by OpenAI (reasoning models!), Anthropic (chatbots that code!), and Google (AI Overviews! Deep Research!). Like Apple, Amazon was unable to turn its early lead in AI assistants into an advantage in this new era. Alexa and Siri simply cannot compete. But maybe that has been for the best, because 2025 was the year that AIs sheen suddenly started to come off: GPT-5 fell flat, vibe coding went from killer app to major risk, and an MIT study rattled the industry by claiming that 95% of businesses get no meaningful return on their AI pilot projects. It was against this backdropthe summer AI turned ugly, as Deutsche Bank analysts called itthat Amazon publicly released Automated Reasoning Checks, a feature promising to minimize AI hallucinations and deliver up to 99% verification accuracy for generative AI applications built on AWS. The product was Cooks brainchild; in a nutshell, it snuffs out hallucinations using the same kind of computerized logic that lets mathematicians prove 300-page-long theorems. (In fact, a 1956 automated reasoning program called Logic Theorist is considered by some experts to be the worlds first AI system, finding new and shorter versions of some of the proofs in Principia Mathematica, one of the most fundamental texts in modern mathematics.) Sexy, it aint. Still, Swami Sivasubramanian, one of Amazons highest-ranking AI executives, who serves on Jassys S-team of direct advisers, was impressed enough to call Automated Reasoning Checks a new milestone in AI safety in a LinkedIn post. Matt Garman, CEO of AWS, referred to it as game-changing. [carousel_block id=”carousel-1763954270090″] Automated reasonings promise of quashing AI misbehavior with math has quietly become an essential part of Amazons strategy around agentsthose LLM-powered workbots that are supposed to transform enterprise productivity [checks watch] any day now. Apparently, businesses have serious side-eye about that, too: Earlier thi year, Gartner predicted that more than 40% of agentic AI projects will be ditched within the next two years due to inadequate risk controls. The company told me recently that it predicts that 30% to 60% of the projects that do go forward will fail due to hallucinations, risk, and lack of governance. Thats not a prophecy Amazon can afford to let come truenot with a potential market for AI agents that Gartner estimates to be worth $512 billion by 2029. One way or another, hallucinations have got to go. The question is how. Agents are just souped-up LLMs, which means they can and will go off the railsin fact, as OpenAI itself recently admitted following an internal study, they cant not. What Cook helped Amazon realize, just months after ChatGPTs release, was that they already had a secret weapon for extinguishing hallucinations, hidden in plain sight. Automated reasoning is the polar opposite of generative AI: old, stiff, and hard to use. Many at Amazon had never heard of it. But Cook knew how to wield it, having brought it to Amazon nearly 10 years ago as a way of rooting out hidden security vulnerabilities within AWS. And hed been amassing what he estimates to be the largest group of automated reasoning experts in the tech industry. Now that investment is set to pay off in a way that Amazon never expected. Automated Reasoning Checks is just the first of many products that the company plans to release (on a timetable it wont specify) that fuse the flexibility of language models with the proven reliability of automated reasoning. The latest, called Policy in Amazon Bedrock Agentcore and previewed this week at AWS’s annual Re:Invent conference, uses automated reasoning to stop agents from taking actions they’re not allowed to (such as issuing customer refunds based on fraudulent requests). If this combined approachknown as neuro-symbolic AIcan reduce the potential failure rate of agentic AI projects by even a fraction of a percent, it would be worth hundreds of millions of dollars, say analysts at Gartner. And Amazon knows it. To realize the transformative potential of AI agents and truly change the way we live and work, we need that trust, Sivasubramanian says. We believe the foundation for trustworthy, production-ready AI agents lies in automated reasoning. To understand why Amazon is banking on automated reasoning, its worth sketching out how its different from the kind of AI youve already heard of. Unlike neural networks, which learn patterns by ingesting millions or even billions of examples, automated reasoning relies on a special language called formal logic to express problems as a kind of arithmetic, based on principles that date back to ancient Greece. Computers can use this rule-based approach to calculate the answers to yes-or-no questions with mathematical certaintynot probabilistic best guesses, as deep learning does. Think of automated reasoning like TurboTax for solving complex logical problems: As long as the problems are expressed in a special language, computers can do most of the workand have been doing so for decades. Since 1994, when a flaw in Intels Pentium chips cost the company half a billion dollars to fix, nearly all microchip manufacturers have used automated reasoning to prove the correctness of designs in advance. The French government used it to verify the software for Pariss first self-driving Métro train in 1998. In 2004, NASA even used it to control the Spirit and Opportunity rovers on Mars. Theres a catch, of course: Because automated reasoning can only reduce problems to three possible outcomesyes, no, or the equivalent of does not computefinding ways to apply this logically bulletproof but incredibly rigid style of AI to the real world can be difficult and expensive. But when automated reasoning works, it really workscollapsing vast, even unknowable possibilities into a single mathematical guarantee that can compute in milliseconds on an average CPU. And Cook is very, very good at getting automated reasoning to work. Cook began his career building a formidable scientific reputation at Microsoft Research, where he spent a decade applying automated reasoning to everything from systems biology to the famously unsolvable halting problem in computer science. (Want a foolproof way to tell in advance if any computer program will run normally or get stuck in an infinite loop? Sorry, not possible. Thats the halting problem.) But by 2014, he was looking to put his findings, many of which have been published as peer-reviewed research, to work outside the lab. I was figuring out: Where is the biggest blast radius? Wheres the place I could go to foment a revolution? he says. I watched everyone moving to the cloud, and was like, I think AWS is the place to go. The first problem Amazon aimed Cook at was cloud security. Reporting directly to then chief information security officer Stephen Schmidt, Cook and his newly formed Automated Reasoning Group (ARG) painstakingly translated AWS security protocols into the language of mathematical proofs and then used their logic-based tools to surface hidden flaws. Once those flaws were corrected, those same tools could then prove with certainty that the system was secure. Some at AWS were dubious at first. When you look mad scientist up in the dictionary, Byrons picture is in the margin, says Eric Brandwine, an Amazon distinguished engineer who at the time worked on security for AWS. Early on, I challenged [him] on a lot of this stuff. But as Cooks group fleshed out plans and racked up small but significant winslike catching a vulnerability in AWSs Key Management Service, the cryptographic holy of holies that controls how clients safeguard their dataskeptics started becoming evangelists. Some of these [were] beautiful bugstheyd been there for years and never been found by our best experts, and never been found by bad guys, says James Hamilton, a legendary distinguished engineer within Amazon who now directly advises Andy Jassy. And yet, automated reasoning found them. From 2018 onward, Amazons automated reasoning experts worked with engineers to encode the technology into nearly every part of AWS, from analytics and storage to developer tools and content delivery. One particular niche of cloud-computing clientsheavily regulated financial service firms, like Goldman Sachs and the global hedge fund Bridgewater Associates, with sensitive data and strict compliance requirementsfound automated reasonings promise of provable security extremely compelling. When ChatGPT appeared and the world flung itself headfirst into generative AI, these companies did too. But they still wanted to keep the one small thing, Cook says, that theyd become accustomed to along th way: trust. That customer feedback spurred Cook to imagine how LLMs and automated reasoning might fit together. The solution that he and his collaborators prototyped in the summer of 2023 works by leveraging the same logical framework that worked so well for squishing security bugs in AWS. Step one: Take any policy meant to inform a chatbot (say, a stack of HR documentation, or zoning regulations) and translate it into formal logicthe special language of automated reasoning. Step two: Translate any responses generated by the bot too. Step three: Calculate. If theres a discrepancy between what the LLM wants to say and what the policy allows, the automated reasoning engine will catch it, flag it, and tell the bot to try again. (For humans in the loop, itll also provide logical proof of what went wrong and how, and suggest specific fixes if needed.) We showed that to senior leadership, and they went nuts for it, says Nadia Labai, a senior applied scientist at AWS who partnered with Cook on the project. The demo went on to become Automated Reasoning Checks, which Amazon previewed at its annual Re:Invent conference in December 2024. PwC, one of the Big Four global accounting and consulting firms, was among the first AWS clients to adopt it. We do a lot of work in pharmaceutical, energy, and utilities, all of which are regulated, says Matt Wood, PwCs global and U.S. commercial technology and innovation officer. PwC relies on solutions like AWSs automated reasoning tool to check the accuracy of the outputs of its generative AI toolsincluding agents. But Wood sees the technologys appeal spreading beyond finance and other regulation-heavy industries. Look at what it took to set up a website 25 years agothat was a refined set of skills. Today, you go on Squarespace, click a button, and its done, he says. My expectation is that automated reasoning will follow a similar path. Amazon will make this easier and easier: If you want an automated reasoning check on something, youll have one. Amazon has already embarked on this path with its own enterprise products and internal systems. Rufus, the AI shopping assistant, uses automated reasoning to keep its responses relevant and accurate. Warehouse robots use it to coordinate their actions in close quarters. Nova, Amazons fleet of generative AI foundation models, uses it to improve so-called chain of thought capabilities. And then there are the agents. Cook says the company has multiple agentic AI projects in development that incorporate automated reasoning, with intended applications in software development, security, and policy enforcement in AWS. One is Policy in AgentCore, which Amazon released after this story was reported. Another thats peeking out from behind the curtain is Auto, an agent built into Kiro, Amazons new AI programming tool, that will use formal logic to help make sure bot-written code matches humans intended specifications. But Sivasubramanian, AWSs vice president for agentic AI (and Cooks boss), isnt coy about the commitment Amazon is making. We believe agentic AI has the potential to be our next multibillion-dollar business, he says. As agents are granted more and more autonomy . . . automated reasoning will be key in helping them reach widespread enterprise adoption. Agents are part of why Cook is touting automated reasoning to his engineer colleagues from the North American Stores division at their off-site in the mountains. Retail might not seem to have much in common with finance or pharma, but its a domain thats full of decisions with real stakes. (While onstage at re:Invent 2025, Cook said that “giving an agent access to your credit card is like giving a teenager access to your credit card… You might end up owning a pony or a warehouse full of candy.”) And in that environment, relying on autonomous botsempowered to do anything from execute transactions to rewrite softwarecan turn hallucination from tolerable quirk into Russian roulette. Its a matter of scale: When one vibe coding VC unleashes an agent that accidentally nukes his own apps database, as happened earlier this year to SaaS investor Jason Lemkin, its a funny story. (He got the data back.) But if Fortune 500 companies start deploying swarms of agents that accidentally mislead customers, destroy records, or break industry regulations, theres no Undo button. Enterprise software is full of these potential pitfalls, and existing methods for reducing hallucination arent always strong enough to keep agents from blundering into them. Thats because agents shift the definition of hallucination itself, from errors in word to errors in deed. First of all, this thing could lie to me, explains Cook. But secondly, if I let it launch rocketshis metaphor for irreversible actionswill it launch rockets when were not supposed to? Back in his hotel room after the keynote, Cook is reviewing the contents of a confidential slide deck about how automated reasoning can solve this rocket-launching problem. The demo, which he hurriedly mentioned in his talk (he ran out of time before being able to show it), describes a system that can transform safety policies for an agentdos and donts, written in natural languageinto a flowchart-like visualization of how the agent can and cannot behave, all backed by mathematical proof. Theres even an Attempt to Fix button to use if the system detects an anomaly. Cook calls the demo a concept car, but some of its ideas made it into Policy in AgentCore, which is already available in preview to some AWS customers. PwC, for one, sees Amazons logic-backed take on AI extending into coordinating the agents themselves. If youve got agents building other agents, collaborating with other agents, managing other agents, agents all the way down, says Wood, then having a way of forcing consistency [on their behavior] is going to be really, really importantwhich is where I think automated reasoning will play a role. The ability to reliably orchestrate the actions of AInot just single agents, but entangled legions of them, at scaleis a target that Amazon has squarely in its sights. But automated reasoning may not be the only way to get the job done. EY, another Big Four firm, recently launched its own neuro-symbolic solution to AI hallucinations, EY Growth Platforms, which fuses deep learning with proprietary knowledge graphs. A startup called Kognitos offers business-friendly agents backed by a deterministic symbolic program, dubbed English as Code. Others, like PromptQL, forgo neuro-symbolic methods altogether, preferring the simulated reasoning of frontier LLMs. But even they still attack the agent hallucination problem much like Amazon does: by using generative AI to translate business processes into a special internal language thats easy to audit and control. That translation process is where Amazon built a 10-year lead with automated reasoning. Now it has to maintain it. Nadia Labai is currently working on ways to improve Amazons techniques for using LLMs to convert natural language into formal logic. Its part of a strategy that could help turn Amazons brand of customer-driven, business-friendly AI into a new class of industry- defining infrastructure. A few days before the off-site, I met with Cook in a conference room at Amazons Seattle headquarters. Sitting with his legs tucked catlike beneath him, Cook mused about his own vision for the future of automated reasoningone that extends far beyond Amazons ambitions for enterpise-grade AI. The world, he says, is filled with socio-technical systemspatchworks of often-abstruse rules that only highly paid experts can easily navigate, from civil statutes to insurance policies. Right now, rich people get [to take advantage of] that stuff, he continues. But if the rest of us had a way to manipulate these systems in natural language (thanks, LLMs) with an underlying proof of correctness (thanks, automated reasoning), a workaday kind of superintelligence could be unlocked. Not the kind that helps us colonize the galaxy, as Google DeepMind CEO Demis Hassabis envisions, but one that simply helps people navigate the complexity of everyday life, like figuring out where its legal to build housing for an aging relative or how to get an insurance company to cover their expensive medication. You could have an app that, in an hour of your own time, would get answers to questions that before would take you months, Cook says. That democratizes, if you will, access to truth. And thats the start of a new era. This story is part of Fast Companys AI 20 for 2025, our roundup spotlighting 20 of AIs most innovative technologists, entrepreneurs, corporate leaders, and creative thinkers.


Category: E-Commerce

 

Sites : [14] [15] [16] [17] [18] [19] [20] [21] [22] [23] [24] [25] [26] [27] [28] [29] [30] [31] [32] [33] next »

Privacy policy . Copyright . Contact form .