June 2024
This month’s newsletter features Amazon’s research at ICML 2024, CVPR 2024, and NAACL 2024, awards and recognitions in our science community, and several generative AI updates.
Interpretable ensemble models improve product retrieval: New information retrieval models are constantly being released, but evaluating them takes time. At The Web Conference, Amazon scientists proposed adding new models to an ensemble and then using Shapley value analysis to determine whether to keep them.
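The paper's exact procedure isn't reproduced here, but the core idea of scoring ensemble members by Shapley value can be sketched. In this minimal example, the member names, per-model scores, and the toy ensemble value function (best member's score, standing in for a real fusion metric such as NDCG of the fused ranking) are all hypothetical:

```python
from itertools import combinations
from math import factorial

def shapley_values(members, value):
    """Exact Shapley value of each ensemble member.
    `value` maps a frozenset of members to an ensemble score."""
    n = len(members)
    phi = {}
    for m in members:
        others = [x for x in members if x != m]
        total = 0.0
        for k in range(n):
            for S in combinations(others, k):
                # Standard Shapley weight |S|! (n - |S| - 1)! / n!
                w = factorial(k) * factorial(n - k - 1) / factorial(n)
                total += w * (value(frozenset(S) | {m}) - value(frozenset(S)))
        phi[m] = total
    return phi

# Hypothetical per-model retrieval scores on a validation set.
scores = {"bm25": 0.61, "dense": 0.68, "new_model": 0.62}
# Toy ensemble value: the best member's score.
value = lambda S: max((scores[m] for m in S), default=0.0)

phi = shapley_values(list(scores), value)
```

A candidate model whose Shapley value is near zero contributes little beyond the existing members and can be rejected without a full re-evaluation of every ensemble configuration.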
Five ways Amazon is preparing for the energy demands of the future: From investing in new carbon-free energy projects to advocating for grid modernization and collaborating with key stakeholders around the world, learn how Amazon is working toward a cleaner energy future.
Automated evaluation of RAG pipelines with exam generation: Retrieval-augmented generation (RAG) is a leading way to curb "hallucination" in large language models (LLMs), and at this year’s International Conference on Machine Learning (ICML 2024), Amazon researchers will show how to leverage item response theory to automatically generate "exams" for evaluating RAG approaches.
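The researchers' item-response-theory machinery is more involved than can be shown here, but one classical IRT-adjacent step, keeping only exam questions that discriminate between stronger and weaker pipelines, can be sketched. The function name, the response matrix, and the pipeline labels below are all illustrative assumptions, not the paper's implementation:

```python
def _pearson(xs, ys):
    """Plain Pearson correlation of two equal-length sequences."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)

def discriminative_items(responses, min_corr=0.2):
    """Filter exam questions by discrimination.
    `responses[p][q]` is 1 if pipeline p answered question q correctly;
    all pipelines are assumed to answer the same question set. Uses the
    point-biserial correlation of each item with the total score, a
    classical proxy for IRT item discrimination."""
    pipelines = list(responses)
    totals = [sum(responses[p].values()) for p in pipelines]
    keep = []
    for q in responses[pipelines[0]]:
        col = [responses[p][q] for p in pipelines]
        # Skip items every pipeline answers identically: zero information.
        if len(set(col)) > 1 and len(set(totals)) > 1:
            if _pearson(col, totals) >= min_corr:
                keep.append(q)
    return keep

# Toy exam: "q1" is answered correctly by every pipeline, so only "q2"
# separates the stronger pipelines from the weaker one.
responses = {
    "rag_a": {"q1": 1, "q2": 1},
    "rag_b": {"q1": 1, "q2": 1},
    "rag_c": {"q1": 1, "q2": 0},
}
keep = discriminative_items(responses)
```

Pruning uninformative questions this way lets an automatically generated exam rank RAG pipelines with far fewer items.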
Amazon researchers receive Best Paper Award at the Symposium on Foundations of Responsible Computing (FORC 2024): The paper, co-authored by Tiffany (Siqi) Deng, AGI applied science manager, Emily Diana, research assistant professor at the Toyota Technological Institute at Chicago, and Michael Kearns and Aaron Roth, Amazon Scholars and UPenn professors, addresses the challenge of creating a balanced dataset when sensitive-group information is unavailable at deployment time. The researchers propose using a small labeled dataset to train a proxy function that assigns sampling probabilities based on the proxy classification, without revealing significantly more about the group membership of any individual sample than can be ascertained from base rates alone.
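The paper's formal privacy guarantees aside, the basic sampling mechanism it describes, accepting each unlabeled item with a probability that depends only on its proxy-predicted group, can be sketched. The function, the toy proxy, and the 80/20 group split below are hypothetical illustrations:

```python
import random
from collections import Counter

def balanced_sample(pool, proxy, target_frac, rng=random.Random(0)):
    """Sample from `pool` so proxy-predicted group proportions
    approach `target_frac`. `proxy` is a classifier trained on a
    small labeled set; no item's true group label is consulted."""
    preds = {x: proxy(x) for x in pool}
    counts = Counter(preds.values())
    # Acceptance rate per group: target share over observed share.
    rates = {g: target_frac[g] / (counts[g] / len(pool)) for g in counts}
    scale = max(rates.values())  # normalize so every rate is <= 1
    return [x for x in pool if rng.random() < rates[preds[x]] / scale]

# Toy pool that a (hypothetical) proxy splits 80% "A" / 20% "B";
# we resample toward a 50/50 balance.
pool = list(range(10_000))
proxy = lambda x: "A" if x % 10 < 8 else "B"
sample = balanced_sample(pool, proxy, {"A": 0.5, "B": 0.5})
```

Because the acceptance probability is a function of the proxy's output alone, the sample reveals little more about any individual's group membership than the base rates already do, which is the property the paper formalizes.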
A quick guide to Amazon’s papers at NAACL 2024: Unsurprisingly, work involving LLMs, either as a subject of inquiry themselves or as tools for other natural-language-processing applications, predominates at this year’s conference. This paper guide sorts Amazon’s papers into those that deal explicitly with LLMs and those that don’t — although in many cases, the ones that don’t present general techniques or datasets that could be used with either LLMs or more-traditional models.
A quick guide to Amazon’s papers at CVPR 2024: A plurality of the papers deal with vision-language models, while a number of others concern related topics such as visual question answering, hallucination mitigation, and retrieval-aided generation. At the same time, however, classical computer vision topics such as 3-D reconstruction, object tracking, and pose estimation remain well represented.
Amazon Research Award-funded paper receives Best Paper Award: With the support of an Amazon Research Award, a team from Imperial College London and Amazon Web Services (AWS) received an Industry Track Best Paper Award at this year’s International Conference on Software Testing, Verification and Validation (ICST 2024). Their paper presents two new tools, fuzz-d and DafnyFuzz, which improve Dafny compiler testing. The researchers found 24 critical bugs, including 9 soundness issues, surpassing XDsmith, and their testing campaign led to improvements in the Dafny language specification, addressing ambiguous or under-documented language features.
Amazon Scholar honored with IEEE Photonics Society Quantum Electronics Award: Alexey Gorshkov, a Joint Quantum Institute Fellow, NIST researcher, adjunct associate professor in the Department of Physics at UMIACS, and Amazon Scholar at the AWS Center for Quantum Computing, received the award for his research contributions to understanding, designing, and controlling interacting quantum systems.
Amazon Visiting Academic receives NSF Early Career Award: Nanyun Peng, Amazon Visiting Academic in Amazon’s AGI organization and assistant professor of computer science at the UCLA Samueli School of Engineering, received the award from the National Science Foundation (NSF) to support her research in AI, including the development of a new category of generative language models. The award, the agency’s highest honor for faculty members in the early stages of their careers, includes a five-year, $586,000 grant to fund her research and teaching efforts.
Anthropic’s Claude 3.5 Sonnet model now available in Amazon Bedrock: Claude 3.5 Sonnet raises the industry bar for intelligence, outperforming other generative AI models, including Anthropic’s previously most intelligent model, Claude 3 Opus, on a wide range of evaluations. Learn more about the model’s strengths and key improvements.
Amazon's new AI-powered tools help advertisers easily create engaging and vibrant images: The growing suite of generative AI tools from Amazon Ads is helping brands to quickly and easily create lifestyle images around their products, elevating the customer discovery experience. Hear from Jason (Jay) Richman, vice president of product and technology for Amazon Ads, about the new aspect ratio capability and what other features his team is planning to release this year.
How small businesses can boost productivity using generative AI: Nashlie Sephus, Ph.D., Amazon Web Services principal applied scientist and founder of the nonprofit education organization Bean Path, shares her tips for using free generative AI tools to help grow businesses. Sephus’s videos show small-business owners and entrepreneurs how to use AWS PartyRock, a free generative AI app-building tool built on Amazon Bedrock, to address common pain points.
Upcoming conferences
SIGIR 2024, July 14 - 18
ICML 2024, July 21 - 27
ACL 2024, August 11 - 17
KDD 2024, August 25 - 29
New publications
A shocking amount of the web is machine translated: Insights from multi-way parallelism
An efficient self-learning framework for interactive spoken dialog systems
Automated evaluation of retrieval-augmented language models with task-specific exam generation
Bayesian prompt ensembles: Model uncertainty estimation for black-box large language models
Bifurcated attention for single-context large-batch sampling
But where are you going?! Motion is what is most important for real-world co-present mobile robots
Can your model tell a negation from an implicature? Unravelling challenges with intent encoders
CERET: Cost-effective extrinsic refinement for text generation
Efficient continual pre-training for building domain specific large language models
Eliciting better multilingual structured reasoning from LLMs through code
EMC2: Efficient MCMC negative sampling for contrastive learning with global convergence
ErgoReality: A virtual reality simulations software for ergonomic analysis of workstation design
Exploring ordinality in text classification: A comparative study of explicit and implicit techniques
Extreme miscalibration and the illusion of adversarial robustness
Factual confidence of LLMs: On reliability and robustness of current estimators
Fine-tuned machine translation metrics struggle in unseen domains
Finite-time convergence and sample complexity of actor-critic multi-objective reinforcement learning
Frogs into princes: A generative model to understand the success of product descriptions
Gradual fine-tuning with graph routing for multi-source unsupervised domain adaptation
GRAM: Generative retrieval augmented matching of data schemas in the context of data security
GraphStorm: All-in-one graph machine learning framework for industry applications
II-MMR: Identifying and improving multi-modal multi-hop reasoning in visual question answering
Impacts of misspelled queries on translation and product search
Interpretable measures of conceptual similarity by complexity-constrained descriptive auto-encoding
Large language models (LLMs) on tabular data: Prediction, generation, and understanding - a survey
Large language models as recommender systems: A study of popularity bias
MADA: Meta-adaptive optimizers through hyper-gradient descent
MAML-en-LLM: Model agnostic meta-training of LLMs for improved in-context learning
MATTER: Memory-augmented transformer using heterogeneous knowledge sources
Membership inference attacks on diffusion models via quantile regression
MinPrompt: Graph-based minimal prompt data augmentation for few-shot question answering
Multi-modal retrieval for large language model based speech recognition
Multimodal learning with online text cleaning for e-commerce product search
Near-optimal regret in linear MDPs with aggregate bandit feedback
Online adaptive anomaly thresholding with confidence sequences
Parametric constraints for Bayesian knowledge tracing from first principles
Prompting foundational models for omni-supervised instance segmentation
Reasoning and planning with large language models in code development (survey for KDD 2024 tutorial)
REPOFORMER: Selective retrieval for repository-level code completion
Sequential editing for lifelong training of speech recognition models
SODA: An adaptive bitrate controller for consistent high-quality video streaming
SpeechGuard: Exploring the adversarial robustness of multimodal large language models
Synthesizing conversations from unlabeled documents using automatic response segmentation
The fine-tuning paradox: Boosting translation quality without sacrificing LLM abilities
Token alignment via character matching for subword completion
Tokenization matters: Navigating data-scarce tokenization for gender inclusive language technologies
Transferring knowledge from large foundation models to small downstream models
Understanding inter-session intentions via complex logical reasoning
Using uncertainty quantification to characterize and improve out-of-domain learning for PDEs
Valuing an engagement surface using a large scale dynamic causal model
Where can I park my robot? Modeling out-of-the-way parking spots in the home using room geometry
LinkedIn | X/Twitter | Facebook | Instagram | GitHub | RSS
© 1996-2024 Amazon.com, Inc. or its affiliates | Privacy | Conditions of Use