How We Built Rufus, Amazon’s AI-Powered Shopping Assistant

Claudio Ctin7 hours ago5 hours ago9 mins

“What do I need for cold weather golf?”

“What are the differences between trail shoes and running shoes?”

“What are the best dinosaur toys for a five year old?”

These are some of the open-ended questions customers might ask a helpful sales associate in a brick-and-mortar store. But how can customers get answers to similar questions while shopping online?

Amazon’s answer is Rufus, a shopping assistant powered by generative AI. Rufus helps Amazon customers make more informed shopping decisions by answering a wide range of questions within the Amazon app. Users can get product details, compare options, and receive product recommendations.

I lead the team of scientists and engineers that built the large language model (LLM) that powers Rufus. To build a helpful conversational shopping assistant, we used innovative techniques across multiple aspects of generative AI. We built a custom LLM specialized for shopping; employed retrieval-augmented generation with a variety of novel evidence sources; leveraged reinforcement learning to improve responses; made advances in high-performance computing to improve inference efficiency and reduce latency; and implemented a new streaming architecture to get shoppers their answers faster.

How Rufus Gets Answers

Most LLMs are first trained on a broad dataset that informs the model’s overall knowledge and capabilities, and then are customized for a particular domain. That wouldn’t work for Rufus, since our aim was to train it on shopping data from the very beginning—the entire Amazon catalog, for starters, as well as customer reviews and information from community Q&A posts. So our scientists built a custom LLM that was trained on these data sources along with public information on the web.

But to be prepared to answer the vast span of questions that could possibly be asked, Rufus must be empowered to go beyond its initial training data and bring in fresh information. For example, to answer the question, “Is this pan dishwasher-safe?” the LLM first parses the question, then it figures out which retrieval sources will help it generate the answer.

Our LLM uses retrieval-augmented generation (RAG) to pull in information from sources known to be reliable, such as the product catalog, customer reviews, and community Q&A posts; it can also call relevant Amazon Stores APIs. Our RAG system is enormously complex, both because of the variety of data sources used and the differing relevance of each one, depending on the question.

Every LLM, and every use of generative AI, is a work in progress. For Rufus to get better over time, it needs to learn which responses are helpful and which can be improved. Customers are the best source of that information. Amazon encourages customers to give Rufus feedback, letting the model know if they liked or disliked the answer, and those responses are used in a reinforcement learning process. Over time, Rufus learns from customer feedback and improves its responses.

Special Chips and Handling Techniques for Rufus

Rufus needs to be able to engage with millions of customers simultaneously without any noticeable delay. This is particularly challenging since generative AI applications are very compute-intensive, especially at Amazon’s scale.

To minimize delay in generating responses while also maximizing the number of responses that our system could handle, we turned to Amazon’s specialized AI chips, Trainium and Inferentia, which are integrated with core Amazon Web Services (AWS). We collaborated with AWS on optimizations that improve model inference efficiency, which were then made available to all AWS customers.

But standard methods of processing user requests in batches will cause latency and throughput problems because it’s difficult to predict how many tokens (in this case, units of text) an LLM will generate as it composes each response. Our scientists worked with AWS to enable Rufus to use continuous batching, a novel LLM technique that enables the model to start serving new requests as soon as the first request in the batch finishes, rather than waiting for all requests in a batch to finish. This technique improves the computational efficiency of AI chips and allows shoppers to get their answers quickly.

We want Rufus to provide the most relevant and helpful answer to any given question. Sometimes that means a long-form text answer, but sometimes it’s short-form text, or a clickable link to navigate the store. And we had to make sure the presented information follows a logical flow. If we don’t group and format things correctly, we could end up with a confusing response that’s not very helpful to the customer.

That’s why Rufus uses an advanced streaming architecture for delivering responses. Customers don’t need to wait for a long answer to be fully generated—instead, they get the first part of the answer while the rest is being generated. Rufus populates the streaming response with the right data (a process called hydration) by making queries to internal systems. In addition to generating the content for the response, it also generates formatting instructions that specify how various answer elements should be displayed.

Even though Amazon has been using AI for more than 25 years to improve the customer experience, generative AI represents something new and transformative. We’re proud of Rufus, and the new capabilities it provides to our customers.

Please follow and like us:

Stiri similare

Crypto Market Rebounds Despite SEC’s XRP Appeal as Crypto All-Stars ICO Continues to Grow

Claudio Ctin1 hour ago51 mins ago

The crypto market has bounced back – even with the SEC throwing a curveball in the Ripple lawsuit. Meanwhile, the new Crypto All-Stars (STARS) project continues to gain traction in its ICO thanks to its unique take on meme coin staking. SEC vs. Ripple – Round Two Could Shake Up Crypto Regulation The SEC is…

Why Was PewDiePie Banned From Twitch?

Claudio Ctin1 hour ago56 mins ago

Photo Credit: @Pewdiepie|YouTube The fans of Felix Kjellberg, popularly known as PewDiePie were recently left puzzled after the YouTuber was banned from the livestreaming platform, Twitch. This isn’t the first time PewDiePie has faced a ban on the platform. However, what intrigued the curiosity of his fans more was the reason behind the ban. Here…

Moongate Launches New Rewards Program and NFT Collection

Claudio Ctin1 hour ago51 mins ago

PRESS RELEASE – Hong Kong, Hong Kong, October 4th, 2024] Moongate Protocol, an attention asset protocol disrupting the $1 trillion-plus attention economy, announces significant milestones in its expansion, the launch of its Moon Odyssey community points program, and the upcoming release of its first NFT collection, the Moongate Voyager Pass. “Moongate Protocol is an attention…

The Beast in Me: Brittany Snow, Natalie Morales, & more join Claire Danes in Netflix mystery thriller

Claudio Ctin1 hour ago56 mins ago

Claire Danes (Homeland) may have turned down the role of Pamela Voorhees in Peacock’s upcoming Friday the 13th streaming series Crystal Lake, but back in March she did sign on to star in the mystery thriller limited series The Beast in Me for Netflix. Matthew Rhys (Perry Mason) joined Danes in the cast back in…

Interview: Kate Siegel and Alanah Pearce on Making V/H/S/Beyond’s Stowaway

Claudio Ctin2 hours ago56 mins ago

Photo Credit: Shudder ComingSoon Senior Editor Brandon Schreur spoke to Kate Siegel and Alanah Pearce about their segment, Stowaway, in V/H/S/Beyond. The two of them discussed Siegel making her directorial debut in the horror anthology movie, their love for found footage, how Pearce filmed some of the segment’s most intense scenes, and more. “V/H/S/Beyond, the…

Analyst Says PEPE Bearish Continuation Is Possible For A 50% Price Crash

Claudio Ctin2 hours ago51 mins ago

The PEPE price could be in trouble from here after failing to maintain its upward momentum. This has led to a restart of the bearish momentum, and this could continue if bulls fail to pull up the price. In the event that bears do win out in this situation and maintain control, the PEPE price…

NYFF Review: Nickel Boys Finds Miraculous Beauty in the Horrors of the World

Claudio Ctin2 hours ago56 mins ago

Nickel Boys, RaMell Ross’ narrative feature debut, is the story of a stubborn world, resisting change. Adapted from Colson Whitehead’s The Nickel Boys, it’s an experimental rendition shooting mainly through POV. We meet our protagonist not by looking at him, but by observing the world as he sees it. Elwood (Ethan Herisse) is the kind…

What Time Will Superman & Lois Season 4 Release on The CW?

Claudio Ctin2 hours ago56 mins ago

Photo Credit: cbs Have the creators announced when Superman & Lois Season 4 will premiere on CW? This captivating series, inspired by DC Comics characters, has captured the hearts of viewers. The duo navigates the challenges of juggling their careers, the pursuit of justice, and the demands of parenthood in the modern world. Fans are…

Marty Supreme Cast Adds Fran Drescher as Timothée Chalamet’s Mom

Claudio Ctin2 hours ago55 mins ago

(Photo by Rodin Eckenroth/Getty Images) The Marty Supreme cast continues to grow, with Fran Drescher joining the upcoming A24 project as the mother of Timothée Chalamet’s character. Drescher, who is best known for her role in the hit CBS sitcom The Nanny, which ran for six years in the 1990s, will join the project that…

US Trailer for Mikael Håfström’s ‘Stockholm Bloodbath’ Revenge Film

Claudio Ctin2 hours ago2 hours ago

“Today’s the perfect day for bloodbath.” Brainstorm Media has revealed the official US trailer for an action epic about a real moment in history titled Stockholm Bloodbath, yet another new film made by Swedish director Mikael Håfström. He also directed this year’s Slingshot, but this film was finished before that one and premiered in Sweden…

Five Nights at Freddy’s 2 Update Given by Josh Hutcherson, Sequel Will Be Scarier Than 1st Movie

Claudio Ctin2 hours ago2 hours ago

Photo Credit: Universal Pictures Josh Hutcherson has given a Five Nights at Freddy’s 2 update. Based on the popular video game series created by Scott Cawthon, Universal Pictures’ Five Nights at Freddy’s was released in United States theaters in October 2023. The film, directed by Emma Tammi, stars Hutcherson as Mike Schmidt, while Piper Rubio…

Bitcoin Enters Positive Seasonality Period, But There’s a Catch: CryptoQuant

Claudio Ctin2 hours ago51 mins ago

Bitcoin (BTC) usually performs well in the fourth quarter of bull cycle years, especially after halving events. CryptoQuant analysts say this year will be no different; however, things are not adding up. According to a CryptoQuant report, Bitcoin’s apparent demand growth is still slow, and it will need to grow at faster rates to propel…

Interview: Kevin Smith talks about The 4:30 Movie, fake trailers, coming of age and more

Claudio Ctin2 hours ago2 hours ago

In the four decades he has released films, director Kevin Smith has done everything from romantic comedies to epic fantasy comedies to animated comedies and even horror comedies. While Smith certainly has a niche audience that has stuck with him since the 1990s, the director has never shied away from making movies he would have…

Primate: 10 join Troy Kotsur and Johnny Sequoyah in the cast of horror film

Claudio Ctin2 hours ago2 hours ago

Back in July, it was announced that Oscar-winner Troy Kotsur (CODA) had signed on to star in the horror film Primate. Since then, Johnny Sequoyah (pictured above) of Dexter: New Blood has signed on to star in the film with Kostur – and now, as the project draws closer to production, Deadline has revealed that…