-
Multi-Task End-to-End Training Improves Conversational Recommendation
Authors:
Naveen Ram,
Dima Kuzmin,
Ellie Ka In Chio,
Moustafa Farid Alzantot,
Santiago Ontanon,
Ambarish Jash,
Judith Yue Li
Abstract:
In this paper, we analyze the performance of a multitask end-to-end transformer model on the task of conversational recommendations, which aim to provide recommendations based on a user's explicit preferences expressed in dialogue. While previous works in this area adopt complex multi-component approaches where the dialogue management and entity recommendation tasks are handled by separate compone…
▽ More
In this paper, we analyze the performance of a multitask end-to-end transformer model on the task of conversational recommendations, which aim to provide recommendations based on a user's explicit preferences expressed in dialogue. While previous works in this area adopt complex multi-component approaches where the dialogue management and entity recommendation tasks are handled by separate components, we show that a unified transformer model, based on the T5 text-to-text transformer model, can perform competitively in both recommending relevant items and generating conversation dialogue. We fine-tune our model on the ReDIAL conversational movie recommendation dataset, and create additional training tasks derived from MovieLens (such as the prediction of movie attributes and related movies based on an input movie), in a multitask learning setting. Using a series of probe studies, we demonstrate that the learned knowledge in the additional tasks is transferred to the conversational setting, where each task leads to a 9%-52% increase in its related probe score.
△ Less
Submitted 8 May, 2023;
originally announced May 2023.
-
Mondegreen: A Post-Processing Solution to Speech Recognition Error Correction for Voice Search Queries
Authors:
Sukhdeep S. Sodhi,
Ellie Ka-In Chio,
Ambarish Jash,
Santiago Ontañón,
Ajit Apte,
Ankit Kumar,
Ayooluwakunmi Jeje,
Dima Kuzmin,
Harry Fung,
Heng-Tze Cheng,
Jon Effrat,
Tarush Bali,
Nitin Jindal,
Pei Cao,
Sarvjeet Singh,
Senqiang Zhou,
Tameen Khan,
Amol Wankhede,
Moustafa Alzantot,
Allen Wu,
Tushar Chandra
Abstract:
As more and more online search queries come from voice, automatic speech recognition becomes a key component to deliver relevant search results. Errors introduced by automatic speech recognition (ASR) lead to irrelevant search results returned to the user, thus causing user dissatisfaction. In this paper, we introduce an approach, Mondegreen, to correct voice queries in text space without dependin…
▽ More
As more and more online search queries come from voice, automatic speech recognition becomes a key component to deliver relevant search results. Errors introduced by automatic speech recognition (ASR) lead to irrelevant search results returned to the user, thus causing user dissatisfaction. In this paper, we introduce an approach, Mondegreen, to correct voice queries in text space without depending on audio signals, which may not always be available due to system constraints or privacy or bandwidth (for example, some ASR systems run on-device) considerations. We focus on voice queries transcribed via several proprietary commercial ASR systems. These queries come from users making internet, or online service search queries. We first present an analysis showing how different the language distribution coming from user voice queries is from that in traditional text corpora used to train off-the-shelf ASR systems. We then demonstrate that Mondegreen can achieve significant improvements in increased user interaction by correcting user voice queries in one of the largest search systems in Google. Finally, we see Mondegreen as complementing existing highly-optimized production ASR systems, which may not be frequently retrained and thus lag behind due to vocabulary drifts.
△ Less
Submitted 20 May, 2021;
originally announced May 2021.
-
Zero-Shot Heterogeneous Transfer Learning from Recommender Systems to Cold-Start Search Retrieval
Authors:
Tao Wu,
Ellie Ka-In Chio,
Heng-Tze Cheng,
Yu Du,
Steffen Rendle,
Dima Kuzmin,
Ritesh Agarwal,
Li Zhang,
John Anderson,
Sarvjeet Singh,
Tushar Chandra,
Ed H. Chi,
Wen Li,
Ankit Kumar,
Xiang Ma,
Alex Soares,
Nitin Jindal,
Pei Cao
Abstract:
Many recent advances in neural information retrieval models, which predict top-K items given a query, learn directly from a large training set of (query, item) pairs. However, they are often insufficient when there are many previously unseen (query, item) combinations, often referred to as the cold start problem. Furthermore, the search system can be biased towards items that are frequently shown…
▽ More
Many recent advances in neural information retrieval models, which predict top-K items given a query, learn directly from a large training set of (query, item) pairs. However, they are often insufficient when there are many previously unseen (query, item) combinations, often referred to as the cold start problem. Furthermore, the search system can be biased towards items that are frequently shown to a query previously, also known as the 'rich get richer' (a.k.a. feedback loop) problem. In light of these problems, we observed that most online content platforms have both a search and a recommender system that, while having heterogeneous input spaces, can be connected through their common output item space and a shared semantic representation. In this paper, we propose a new Zero-Shot Heterogeneous Transfer Learning framework that transfers learned knowledge from the recommender system component to improve the search component of a content platform. First, it learns representations of items and their natural-language features by predicting (item, item) correlation graphs derived from the recommender system as an auxiliary task. Then, the learned representations are transferred to solve the target search retrieval task, performing query-to-item prediction without having seen any (query, item) pairs in training. We conduct online and offline experiments on one of the world's largest search and recommender systems from Google, and present the results and lessons learned. We demonstrate that the proposed approach can achieve high performance on offline search retrieval tasks, and more importantly, achieved significant improvements on relevance and user interactions over the highly-optimized production system in online experiments.
△ Less
Submitted 18 August, 2020; v1 submitted 6 August, 2020;
originally announced August 2020.
-
Modeling Information Need of Users in Search Sessions
Authors:
Kishaloy Halder,
Heng-Tze Cheng,
Ellie Ka In Chio,
Georgios Roumpos,
Tao Wu,
Ritesh Agarwal
Abstract:
Users issue queries to Search Engines, and try to find the desired information in the results produced. They repeat this process if their information need is not met at the first place. It is crucial to identify the important words in a query that depict the actual information need of the user and will determine the course of a search session. To this end, we propose a sequence-to-sequence based n…
▽ More
Users issue queries to Search Engines, and try to find the desired information in the results produced. They repeat this process if their information need is not met at the first place. It is crucial to identify the important words in a query that depict the actual information need of the user and will determine the course of a search session. To this end, we propose a sequence-to-sequence based neural architecture that leverages the set of past queries issued by users, and results that were explored by them. Firstly, we employ our model for predicting the words in the current query that are important and would be retained in the next query. Additionally, as a downstream application of our model, we evaluate it on the widely popular task of next query suggestion. We show that our intuitive strategy of capturing information need can yield superior performance at these tasks on two large real-world search log datasets.
△ Less
Submitted 3 January, 2020;
originally announced January 2020.