Skip to main content

Showing 1–11 of 11 results for author: Singh, S S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.10886  [pdf, other

    cs.CL cs.LG

    Distilling Opinions at Scale: Incremental Opinion Summarization using XL-OPSUMM

    Authors: Sri Raghava Muddu, Rupasai Rangaraju, Tejpalsingh Siledar, Swaroop Nath, Pushpak Bhattacharyya, Swaprava Nath, Suman Banerjee, Amey Patil, Muthusamy Chelliah, Sudhanshu Shekhar Singh, Nikesh Garera

    Abstract: Opinion summarization in e-commerce encapsulates the collective views of numerous users about a product based on their reviews. Typically, a product on an e-commerce platform has thousands of reviews, each review comprising around 10-15 words. While Large Language Models (LLMs) have shown proficiency in summarization tasks, they struggle to handle such a large volume of reviews due to context limi… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  2. arXiv:2404.05243  [pdf, other

    cs.CL cs.AI

    Product Description and QA Assisted Self-Supervised Opinion Summarization

    Authors: Tejpalsingh Siledar, Rupasai Rangaraju, Sankara Sri Raghava Ravindra Muddu, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera, Swaprava Nath, Pushpak Bhattacharyya

    Abstract: In e-commerce, opinion summarization is the process of summarizing the consensus opinions found in product reviews. However, the potential of additional sources such as product description and question-answers (QA) has been considered less often. Moreover, the absence of any supervised training data makes this task challenging. To address this, we propose a novel synthetic dataset creation (SDC) s… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  3. arXiv:2402.15473  [pdf, other

    cs.CL cs.LG

    Leveraging Domain Knowledge for Efficient Reward Modelling in RLHF: A Case-Study in E-Commerce Opinion Summarization

    Authors: Swaroop Nath, Tejpalsingh Siledar, Sankara Sri Raghava Ravindra Muddu, Rupasai Rangaraju, Harshad Khadilkar, Pushpak Bhattacharyya, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera

    Abstract: Reinforcement Learning from Human Feedback (RLHF) has become a dominating strategy in aligning Language Models (LMs) with human values/goals. The key to the strategy is learning a reward model ($\varphi$), which can reflect the latent reward model of humans. While this strategy has proven effective, the training methodology requires a lot of human preference annotation (usually in the order of ten… ▽ More

    Submitted 18 April, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: 19 pages, 6 figures, 21 tables

  4. arXiv:2402.11683  [pdf, other

    cs.CL

    One Prompt To Rule Them All: LLMs for Opinion Summary Evaluation

    Authors: Tejpalsingh Siledar, Swaroop Nath, Sankara Sri Raghava Ravindra Muddu, Rupasai Rangaraju, Swaprava Nath, Pushpak Bhattacharyya, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera

    Abstract: Evaluation of opinion summaries using conventional reference-based metrics rarely provides a holistic evaluation and has been shown to have a relatively low correlation with human judgments. Recent studies suggest using Large Language Models (LLMs) as reference-free metrics for NLG evaluation, however, they remain unexplored for opinion summary evaluation. Moreover, limited opinion summary evaluat… ▽ More

    Submitted 9 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  5. arXiv:2308.10874  [pdf, other

    cs.LG cs.AI cs.CL cs.NE

    Analyzing Transformer Dynamics as Movement through Embedding Space

    Authors: Sumeet S. Singh

    Abstract: Transformer based language models exhibit intelligent behaviors such as understanding natural language, recognizing patterns, acquiring knowledge, reasoning, planning, reflecting and using tools. This paper explores how their underlying mechanics give rise to intelligent behaviors. Towards that end, we propose framing Transformer dynamics as movement through embedding space. Examining Transformers… ▽ More

    Submitted 14 November, 2023; v1 submitted 21 August, 2023; originally announced August 2023.

    Comments: V2. Rewrote abstract. Rewrote / re-organized the entire paper into a more formal proposition/argument/result format. To shorten main paper length: Wrote more compact text in general, moved "negative self bias" and "encoder v/s decoder walks" sections to the appendix and packed figures. Styled as TMLR

  6. arXiv:2103.06450  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Full Page Handwriting Recognition via Image to Sequence Extraction

    Authors: Sumeet S. Singh, Sergey Karayev

    Abstract: We present a Neural Network based Handwritten Text Recognition (HTR) model architecture that can be trained to recognize full pages of handwritten or printed text without image segmentation. Being based on Image to Sequence architecture, it can extract text present in an image and then sequence it correctly without imposing any constraints regarding orientation, layout and size of text and non-tex… ▽ More

    Submitted 26 June, 2022; v1 submitted 10 March, 2021; originally announced March 2021.

    Comments: Appeared in ICDAR 2021

  7. arXiv:1802.05415  [pdf, other

    cs.LG cs.CL cs.CV cs.NE

    Teaching Machines to Code: Neural Markup Generation with Visual Attention

    Authors: Sumeet S. Singh

    Abstract: We present a neural transducer model with visual attention that learns to generate LaTeX markup of a real-world math formula given its image. Applying sequence modeling and transduction techniques that have been very successful across modalities such as natural language, image, handwriting, speech and audio; we construct an image-to-markup model that learns to produce syntactically and semanticall… ▽ More

    Submitted 15 June, 2018; v1 submitted 15 February, 2018; originally announced February 2018.

    Comments: For datasets, visualizations and ancillary material see: https://untrix.github.io/i2l . For source code go to: https://github.com/untrix/im2latex

  8. arXiv:1603.05522  [pdf, ps, other

    stat.AP cs.CV stat.CO

    Tracking multiple moving objects in images using Markov Chain Monte Carlo

    Authors: Lan Jiang, Sumeetpal S. Singh

    Abstract: A new Bayesian state and parameter learning algorithm for multiple target tracking (MTT) models with image observations is proposed. Specifically, a Markov chain Monte Carlo algorithm is designed to sample from the posterior distribution of the unknown number of targets, their birth and death times, states and model parameters, which constitutes the complete solution to the tracking problem. The c… ▽ More

    Submitted 17 March, 2016; originally announced March 2016.

  9. arXiv:1401.2490  [pdf, ps, other

    cs.LG stat.CO stat.ML

    An Online Expectation-Maximisation Algorithm for Nonnegative Matrix Factorisation Models

    Authors: Sinan Yildirim, A. Taylan Cemgil, Sumeetpal S. Singh

    Abstract: In this paper we formulate the nonnegative matrix factorisation (NMF) problem as a maximum likelihood estimation problem for hidden Markov models and propose online expectation-maximisation (EM) algorithms to estimate the NMF and the other unknown static parameters. We also propose a sequential Monte Carlo approximation of our online EM algorithm. We show the performance of the proposed method wit… ▽ More

    Submitted 10 January, 2014; originally announced January 2014.

    Comments: 6 pages, 3 figures

    Journal ref: 16th IFAC Symposium on System Identification, 2012, Volume 16, Part 1,

  10. arXiv:1211.5901  [pdf, ps, other

    stat.ML cs.LG stat.CO

    Bayesian learning of noisy Markov decision processes

    Authors: Sumeetpal S. Singh, Nicolas Chopin, Nick Whiteley

    Abstract: We consider the inverse reinforcement learning problem, that is, the problem of learning from, and then predicting or mimicking a controller based on state/action data. We propose a statistical model for such data, derived from the structure of a Markov decision process. Adopting a Bayesian approach to inference, we show how latent variables of the model can be estimated, and how predictions about… ▽ More

    Submitted 26 November, 2012; originally announced November 2012.

  11. arXiv:1206.4221  [pdf, ps, other

    math.OC cs.DC eess.SY stat.AP

    Distributed Maximum Likelihood for Simultaneous Self-localization and Tracking in Sensor Networks

    Authors: Nikolas Kantas, Sumeetpal S. Singh, Arnaud Doucet

    Abstract: We show that the sensor self-localization problem can be cast as a static parameter estimation problem for Hidden Markov Models and we implement fully decentralized versions of the Recursive Maximum Likelihood and on-line Expectation-Maximization algorithms to localize the sensor network simultaneously with target tracking. For linear Gaussian models, our algorithms can be implemented exactly usin… ▽ More

    Submitted 19 June, 2012; originally announced June 2012.

    Comments: shorter version is about to appear in IEEE Transactions of Signal Processing; 22 pages, 15 figures