Zum Hauptinhalt springen

Showing 1–50 of 309 results for author: Goyal, A

.
  1. arXiv:2408.14774  [pdf, other

    cs.LG cs.CL

    Instruct-SkillMix: A Powerful Pipeline for LLM Instruction Tuning

    Authors: Simran Kaur, Simon Park, Anirudh Goyal, Sanjeev Arora

    Abstract: We introduce Instruct-SkillMix, an automated approach for creating diverse, high quality SFT data. The Instruct-SkillMix pipeline involves two stages, each leveraging an existing powerful LLM: (1) Skill extraction: uses the LLM to extract core "skills" for instruction-following, either from existing datasets, or by directly prompting the model; (2) Data generation: uses the powerful LLM to generat… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

  2. arXiv:2408.13347  [pdf, other

    cs.CR

    ORCHID: Streaming Threat Detection over Versioned Provenance Graphs

    Authors: Akul Goyal, Jason Liu, Adam Bates, Gang Wang

    Abstract: While Endpoint Detection and Response (EDR) are able to efficiently monitor threats by comparing static rules to the event stream, their inability to incorporate past system context leads to high rates of false alarms. Recent work has demonstrated Provenance-based Intrusion Detection Systems (Prov-IDS) that can examine the causal relationships between abnormal behaviors to improve threat classific… ▽ More

    Submitted 23 August, 2024; originally announced August 2024.

  3. arXiv:2408.10629  [pdf, other

    astro-ph.EP

    The GAPS Programme at TNG. LIX. A characterisation study of the $\sim$300 Myr old multi-planetary system orbiting the star BD+40 2790 (TOI-2076)

    Authors: M. Damasso, D. Locci, S. Benatti, A. Maggio, M. Baratella, S. Desidera, K. Biazzo, E. Palle, S. Wang, D. Nardiello, L. Borsato, A. S. Bonomo, S. Messina, G. Nowak, A. Goyal, V. J. S. Bejar, A. Bignamini, L. Cabona, I. Carleo, R. Claudi, R. Cosentino, S. Filomeno, C. Knapic, N. Lodieu, V. Lorenzi , et al. (13 additional authors not shown)

    Abstract: We collected more than 300 high-resolution spectra of the 300 Myr old star BD+40 2790 (TOI-2076) over ~3 years. This star hosts three transiting planets discovered by TESS, with orbital periods ~10, 21, and 35 days. BD+40 2790 shows an activity-induced scatter larger than 30 m/s in the radial velocities. We employed different methods to measure the stellar radial velocities and several models to f… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: Accepted for publication on A&A. Abstract abridged

  4. arXiv:2408.09310  [pdf, other

    cs.LG

    Narrowing the Focus: Learned Optimizers for Pretrained Models

    Authors: Gus Kristiansen, Mark Sandler, Andrey Zhmoginov, Nolan Miller, Anirudh Goyal, Jihwan Lee, Max Vladymyrov

    Abstract: In modern deep learning, the models are learned by applying gradient updates using an optimizer, which transforms the updates based on various statistics. Optimizers are often hand-designed and tuning their hyperparameters is a big part of the training process. Learned optimizers have shown some initial promise, but are generally unsuccessful as a general optimization mechanism applicable to every… ▽ More

    Submitted 21 August, 2024; v1 submitted 17 August, 2024; originally announced August 2024.

  5. arXiv:2408.09162  [pdf, other

    cs.CV cs.LG

    Zero-Shot Object-Centric Representation Learning

    Authors: Aniket Didolkar, Andrii Zadaianchuk, Anirudh Goyal, Mike Mozer, Yoshua Bengio, Georg Martius, Maximilian Seitzer

    Abstract: The goal of object-centric representation learning is to decompose visual scenes into a structured representation that isolates the entities. Recent successes have shown that object-centric representation learning can be scaled to real-world scenes by utilizing pre-trained self-supervised features. However, so far, object-centric methods have mostly been applied in-distribution, with models traine… ▽ More

    Submitted 17 August, 2024; originally announced August 2024.

  6. arXiv:2407.21783  [pdf, other

    cs.AI cs.CL cs.CV

    The Llama 3 Herd of Models

    Authors: Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Amy Yang, Angela Fan, Anirudh Goyal, Anthony Hartshorn, Aobo Yang, Archi Mitra, Archie Sravankumar, Artem Korenev, Arthur Hinsvark, Arun Rao, Aston Zhang, Aurelien Rodriguez, Austen Gregerson, Ava Spataru, Baptiste Roziere, Bethany Biron, Binh Tang , et al. (510 additional authors not shown)

    Abstract: Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models that natively support multilinguality, coding, reasoning, and tool usage. Our largest model is a dense Transformer with 405B parameters and a context window of up to 128K tokens. This paper presents an extensive empirical… ▽ More

    Submitted 15 August, 2024; v1 submitted 31 July, 2024; originally announced July 2024.

  7. arXiv:2407.21009  [pdf, other

    cs.AI cs.LG

    AI-Assisted Generation of Difficult Math Questions

    Authors: Vedant Shah, Dingli Yu, Kaifeng Lyu, Simon Park, Nan Rosemary Ke, Michael Mozer, Yoshua Bengio, Sanjeev Arora, Anirudh Goyal

    Abstract: Current LLM training positions mathematical reasoning as a core capability. With publicly available sources fully tapped, there is unmet demand for diverse and challenging math questions. Relying solely on human experts is both time-consuming and costly, while LLM-generated questions often lack the requisite diversity and difficulty. We present a design framework that combines the strengths of LLM… ▽ More

    Submitted 2 September, 2024; v1 submitted 30 July, 2024; originally announced July 2024.

  8. arXiv:2407.00181  [pdf, other

    hep-ph

    $W-$mass and Muon $g-2$ in Inert 2HDM Extended by Singlet Complex Scalar

    Authors: Hrishabh Bharadwaj, Mamta Dahiya, Sukanta Dutta, Ashok Goyal

    Abstract: The deviations of the recent measurements of the muon magnetic moment and the $W-$boson mass from their SM predictions hint to new physics beyond the SM. In this article, we address the observed discrepancies in the $W$-boson mass and muon anomalous magnetic moment in the Inert Two Higgs Doublet Model (I2HDM) extended by a complex scalar field singlet under the SM gauge group. The model is constra… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

    Comments: 15 pages, 5 figures

  9. arXiv:2406.18158  [pdf, other

    cs.RO cs.CV

    3D-MVP: 3D Multiview Pretraining for Robotic Manipulation

    Authors: Shengyi Qian, Kaichun Mo, Valts Blukis, David F. Fouhey, Dieter Fox, Ankit Goyal

    Abstract: Recent works have shown that visual pretraining on egocentric datasets using masked autoencoders (MAE) can improve generalization for downstream robotics tasks. However, these approaches pretrain only on 2D images, while many robotics applications require 3D scene understanding. In this work, we propose 3D-MVP, a novel approach for 3D multi-view pretraining using masked autoencoders. We leverage R… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  10. arXiv:2406.17232  [pdf, other

    cs.CL

    Beyond Demographics: Aligning Role-playing LLM-based Agents Using Human Belief Networks

    Authors: Yun-Shiuan Chuang, Zach Studdiford, Krirk Nirunwiroj, Agam Goyal, Vincent V. Frigo, Sijia Yang, Dhavan Shah, Junjie Hu, Timothy T. Rogers

    Abstract: Creating human-like large language model (LLM) agents is crucial for faithful social simulation. Having LLMs role-play based on demographic information sometimes improves human likeness but often does not. This study assessed whether LLM alignment with human behavior can be improved by integrating information from empirically-derived human belief networks. Using data from a human survey, we estima… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  11. arXiv:2406.13046  [pdf, other

    cs.AI

    Bayesian-LoRA: LoRA based Parameter Efficient Fine-Tuning using Optimal Quantization levels and Rank Values trough Differentiable Bayesian Gates

    Authors: Cristian Meo, Ksenia Sycheva, Anirudh Goyal, Justin Dauwels

    Abstract: It is a common practice in natural language processing to pre-train a single model on a general domain and then fine-tune it for downstream tasks. However, when it comes to Large Language Models, fine-tuning the entire model can be computationally expensive, resulting in very intensive energy consumption. As a result, several Parameter Efficient Fine-Tuning (PEFT) approaches were recently proposed… ▽ More

    Submitted 9 July, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  12. arXiv:2406.08545  [pdf, other

    cs.RO cs.AI cs.CV

    RVT-2: Learning Precise Manipulation from Few Demonstrations

    Authors: Ankit Goyal, Valts Blukis, Jie Xu, Yijie Guo, Yu-Wei Chao, Dieter Fox

    Abstract: In this work, we study how to build a robotic system that can solve multiple 3D manipulation tasks given language instructions. To be useful in industrial and household domains, such a system should be capable of learning new tasks with few demonstrations and solving them precisely. Prior works, like PerAct and RVT, have studied this problem, however, they often struggle with tasks requiring high… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted to RSS 2024

  13. arXiv:2405.15485  [pdf, other

    cs.AI cs.CL cs.LG

    Learning Beyond Pattern Matching? Assaying Mathematical Understanding in LLMs

    Authors: Siyuan Guo, Aniket Didolkar, Nan Rosemary Ke, Anirudh Goyal, Ferenc Huszár, Bernhard Schölkopf

    Abstract: We are beginning to see progress in language model assisted scientific discovery. Motivated by the use of LLMs as a general scientific assistant, this paper assesses the domain knowledge of LLMs through its understanding of different mathematical skills required to solve problems. In particular, we look at not just what the pre-trained model already knows, but how it learned to learn from informat… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  14. arXiv:2405.14091  [pdf, other

    astro-ph.EP

    Peas-in-a-Pod Across the Radius Valley: Rocky Systems are Less Uniform in Mass but More Uniform in Size and Spacing

    Authors: Armaan V. Goyal, Songhu Wang

    Abstract: The ubiquity of "peas-in-a-pod" architectural patterns and the existence of the radius valley each present a striking population-level trend for planets with $R_{p} \leq 4 R_{\oplus}$ that serves to place powerful constraints on the formation and evolution of these subgiant worlds. As it has yet to be determined whether the strength of this peas-in-a-pod uniformity differs on either side of the ra… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: Accepted to ApJ Letters (May 2024). 17 pages (including 3 for Appendix), 4 figures, 3 tables

  15. arXiv:2405.12205  [pdf, other

    cs.AI cs.LG

    Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving

    Authors: Aniket Didolkar, Anirudh Goyal, Nan Rosemary Ke, Siyuan Guo, Michal Valko, Timothy Lillicrap, Danilo Rezende, Yoshua Bengio, Michael Mozer, Sanjeev Arora

    Abstract: Metacognitive knowledge refers to humans' intuitive knowledge of their own thinking and reasoning processes. Today's best LLMs clearly possess some reasoning processes. The paper gives evidence that they also have metacognitive knowledge, including ability to name skills and procedures to apply given a task. We explore this primarily in context of math reasoning, developing a prompt-guided interac… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: Preprint. Under review

  16. arXiv:2405.04324  [pdf, other

    cs.AI cs.CL cs.SE

    Granite Code Models: A Family of Open Foundation Models for Code Intelligence

    Authors: Mayank Mishra, Matt Stallone, Gaoyuan Zhang, Yikang Shen, Aditya Prasad, Adriana Meza Soria, Michele Merler, Parameswaran Selvam, Saptha Surendran, Shivdeep Singh, Manish Sethi, Xuan-Hong Dang, Pengyuan Li, Kun-Lung Wu, Syed Zawad, Andrew Coleman, Matthew White, Mark Lewis, Raju Pavuluri, Yan Koyfman, Boris Lublinsky, Maximilien de Bayser, Ibrahim Abdelaziz, Kinjal Basu, Mayank Agarwal , et al. (21 additional authors not shown)

    Abstract: Large Language Models (LLMs) trained on code are revolutionizing the software development process. Increasingly, code LLMs are being integrated into software development environments to improve the productivity of human programmers, and LLM-based agents are beginning to show promise for handling complex tasks autonomously. Realizing the full potential of code LLMs requires a wide range of capabili… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Corresponding Authors: Rameswar Panda, Ruchir Puri; Equal Contributors: Mayank Mishra, Matt Stallone, Gaoyuan Zhang

  17. arXiv:2405.00451  [pdf, other

    cs.AI cs.LG

    Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning

    Authors: Yuxi Xie, Anirudh Goyal, Wenyue Zheng, Min-Yen Kan, Timothy P. Lillicrap, Kenji Kawaguchi, Michael Shieh

    Abstract: We introduce an approach aimed at enhancing the reasoning capabilities of Large Language Models (LLMs) through an iterative preference learning process inspired by the successful strategy employed by AlphaZero. Our work leverages Monte Carlo Tree Search (MCTS) to iteratively collect preference data, utilizing its look-ahead ability to break down instance-level rewards into more granular step-level… ▽ More

    Submitted 17 June, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

    Comments: 10 pages, 4 figures, 4 tables (24 pages, 9 figures, 9 tables including references and appendices)

  18. arXiv:2404.18963  [pdf, other

    cs.LG cs.CL

    RE-GrievanceAssist: Enhancing Customer Experience through ML-Powered Complaint Management

    Authors: Venkatesh C, Harshit Oberoi, Anurag Kumar Pandey, Anil Goyal, Nikhil Sikka

    Abstract: In recent years, digital platform companies have faced increasing challenges in managing customer complaints, driven by widespread consumer adoption. This paper introduces an end-to-end pipeline, named RE-GrievanceAssist, designed specifically for real estate customer complaint management. The pipeline consists of three key components: i) response/no-response ML model using TF-IDF vectorization an… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  19. arXiv:2404.17177  [pdf, other

    cs.LG cs.IR

    RE-RFME: Real-Estate RFME Model for customer segmentation

    Authors: Anurag Kumar Pandey, Anil Goyal, Nikhil Sikka

    Abstract: Marketing is one of the high-cost activities for any online platform. With the increase in the number of customers, it is crucial to understand customers based on their dynamic behaviors to design effective marketing strategies. Customer segmentation is a widely used approach to group customers into different categories and design the marketing strategy targeting each group individually. Therefore… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  20. RE-RecSys: An End-to-End system for recommending properties in Real-Estate domain

    Authors: Venkatesh C, Harshit Oberoi, Anil Goyal, Nikhil Sikka

    Abstract: We propose an end-to-end real-estate recommendation system, RE-RecSys, which has been productionized in real-world industry setting. We categorize any user into 4 categories based on available historical data: i) cold-start users; ii) short-term users; iii) long-term users; and iv) short-long term users. For cold-start users, we propose a novel rule-based engine that is based on the popularity of… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  21. arXiv:2404.14146  [pdf

    cond-mat.mtrl-sci cs.LG

    Physics-based reward driven image analysis in microscopy

    Authors: Kamyar Barakati, Hui Yuan, Amit Goyal, Sergei V. Kalinin

    Abstract: The rise of electron microscopy has expanded our ability to acquire nanometer and atomically resolved images of complex materials. The resulting vast datasets are typically analyzed by human operators, an intrinsically challenging process due to the multiple possible analysis steps and the corresponding need to build and optimize complex analysis workflows. We present a methodology based on the co… ▽ More

    Submitted 5 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: 12 pages, 4 figures

  22. arXiv:2404.07428  [pdf, other

    cs.RO cs.LG

    AdaDemo: Data-Efficient Demonstration Expansion for Generalist Robotic Agent

    Authors: Tongzhou Mu, Yijie Guo, Jie Xu, Ankit Goyal, Hao Su, Dieter Fox, Animesh Garg

    Abstract: Encouraged by the remarkable achievements of language and vision foundation models, developing generalist robotic agents through imitation learning, using large demonstration datasets, has become a prominent area of interest in robot learning. The efficacy of imitation learning is heavily reliant on the quantity and quality of the demonstration datasets. In this study, we aim to scale up demonstra… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  23. arXiv:2404.03183  [pdf, other

    cs.CV

    BodyMAP -- Jointly Predicting Body Mesh and 3D Applied Pressure Map for People in Bed

    Authors: Abhishek Tandon, Anujraaj Goyal, Henry M. Clever, Zackory Erickson

    Abstract: Accurately predicting the 3D human posture and the pressure exerted on the body for people resting in bed, visualized as a body mesh (3D pose & shape) with a 3D pressure map, holds significant promise for healthcare applications, particularly, in the prevention of pressure ulcers. Current methods focus on singular facets of the problem -- predicting only 2D/3D poses, generating 2D pressure images,… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Accepted at CVPR 2024 Project Website: https://bodymap3d.github.io/ Code: https://github.com/RCHI-Lab/BodyMAP

  24. arXiv:2403.18614  [pdf, other

    q-bio.PE physics.bio-ph

    Energy-ordered resource stratification as an agnostic signature of life

    Authors: Akshit Goyal, Mikhail Tikhonov

    Abstract: The search for extraterrestrial life hinges on identifying biosignatures, often focusing on gaseous metabolic byproducts as indicators. However, most such biosignatures require assuming specific metabolic processes. It is widely recognized that life on other planets may not resemble that of Earth, but identifying biosignatures ``agnostic'' to such assumptions has remained a challenge. Here, we pro… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 5 pages, 3 figures

  25. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1110 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 8 August, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  26. arXiv:2403.01276  [pdf, other

    q-bio.PE cond-mat.dis-nn cond-mat.stat-mech

    A universal niche geometry governs the response of ecosystems to environmental perturbations

    Authors: Akshit Goyal, Jason W. Rocks, Pankaj Mehta

    Abstract: How ecosystems respond to environmental perturbations is a fundamental question in ecology, made especially challenging due to the strong coupling between species and their environment. Here, we introduce a theoretical framework for calculating the linear response of ecosystems to environmental perturbations in generalized consumer-resource models. Our construction is applicable to a wide class of… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: 13 pages, 5 figures

  27. arXiv:2403.01251  [pdf, other

    cs.CL

    Accelerating Greedy Coordinate Gradient via Probe Sampling

    Authors: Yiran Zhao, Wenyue Zheng, Tianle Cai, Xuan Long Do, Kenji Kawaguchi, Anirudh Goyal, Michael Shieh

    Abstract: Safety of Large Language Models (LLMs) has become a critical issue given their rapid progresses. Greedy Coordinate Gradient (GCG) is shown to be effective in constructing adversarial prompts to break the aligned LLMs, but optimization of GCG is time-consuming. To reduce the time cost of GCG and enable more comprehensive studies of LLM safety, in this work, we study a new algorithm called… ▽ More

    Submitted 27 May, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

  28. arXiv:2402.18540  [pdf, other

    cs.LG cs.AI cs.CL

    Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates

    Authors: Kaifeng Lyu, Haoyu Zhao, Xinran Gu, Dingli Yu, Anirudh Goyal, Sanjeev Arora

    Abstract: Public LLMs such as the Llama 2-Chat have driven huge activity in LLM research. These models underwent alignment training and were considered safe. Recently Qi et al. (2023) reported that even benign fine-tuning (e.g., on seemingly safe datasets) can give rise to unsafe behaviors in the models. The current paper is about methods and best practices to mitigate such loss of alignment. Through extens… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 20 pages

  29. Radio-only and Radio-to-far-ultraviolet Spectral Energy Distribution Modeling of 14 ULIRGs: Insights into the Global Properties of Infrared Bright Galaxies

    Authors: Subhrata Dey, Arti Goyal, Katarzyna Małek, Tanio Díaz-Santos

    Abstract: We present detailed spectral energy distribution (SED) modeling of 14 local ultraluminous infrared galaxies (ULIRGs) with outstanding photometric data from the literature covering the ultraviolet--infrared (FIR) and radio bands ($\sim$50 MHz to $\sim$30 GHz). We employ the CIGALE SED fitting code to model the ultraviolet--FIR--radio SED. For the radio-only SED modeling, we use the UltraNest packag… ▽ More

    Submitted 7 May, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: Published in the ApJ

    Journal ref: ApJ 966 61 (2024)

  30. arXiv:2401.01623  [pdf, other

    cs.AI cs.CL

    Can AI Be as Creative as Humans?

    Authors: Haonan Wang, James Zou, Michael Mozer, Anirudh Goyal, Alex Lamb, Linjun Zhang, Weijie J Su, Zhun Deng, Michael Qizhe Xie, Hannah Brown, Kenji Kawaguchi

    Abstract: Creativity serves as a cornerstone for societal progress and innovation. With the rise of advanced generative AI models capable of tasks once reserved for human creativity, the study of AI's creative potential becomes imperative for its responsible development and application. In this paper, we prove in theory that AI can be as creative as humans under the condition that it can properly fit the da… ▽ More

    Submitted 25 January, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

    Comments: The paper examines AI's creativity, introducing Relative and Statistical Creativity for theoretical and practical analysis, along with practical training guidelines. Project Page: ai-relative-creativity.github.io

  31. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  32. arXiv:2312.09191  [pdf, other

    astro-ph.SR astro-ph.HE

    Solar flare catalog from 3 years of Chandrayaan-2 XSM observations

    Authors: Aravind Bharathi Valluvan, Ashwin Goyal, Devansh Jain, Abhinna Sundar Samantaray, Abhilash Sarwade, Kasiviswanathan Sankarasubramanian

    Abstract: We present a catalog of 6266 solar flares detected by the X-Ray Solar Monitor onboard the Chandrayaan-2 lunar orbiter between 1.55 and 12.4 keV (1 and 8 Å) from 2019 September 12 to 2022 November 4, including 1469 type A flares. The catalog represents the first large sample, including both type A, hot thermal flares, and type B, impulsive flares, with a sub-A class sensitive instrument. We also de… ▽ More

    Submitted 8 January, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 29 pages, 15 figures, 5 tables

  33. arXiv:2312.08807  [pdf, other

    hep-ph

    Constraining 2HDM+S model through W-boson mass measurements

    Authors: Anza-Tshilidzi Mulaudzi, Mukesh Kumar, Ashok Goyal, Bruce Mellado

    Abstract: Following a discussion on $W$-boson mass observed at the CDF and ATLAS, we explore the parameter space allowed in the 2HDM+$S$ model. Further, the model parameter space is constrained through vector-like leptons via muon $g-2$ measurements. We show our results for additional scalar mass fixed to $m_S \approx 95$ and $150$~GeV keeping the standard Higgs-boson mass at 125~GeV in all four types of 2H… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 6 pages, 2 figures, Talk presented at SAIP2023

  34. arXiv:2311.15268  [pdf, other

    cs.LG cs.AI

    Unlearning via Sparse Representations

    Authors: Vedant Shah, Frederik Träuble, Ashish Malik, Hugo Larochelle, Michael Mozer, Sanjeev Arora, Yoshua Bengio, Anirudh Goyal

    Abstract: Machine \emph{unlearning}, which involves erasing knowledge about a \emph{forget set} from a trained model, can prove to be costly and infeasible by existing techniques. We propose a nearly compute-free zero-shot unlearning technique based on a discrete representational bottleneck. We show that the proposed technique efficiently unlearns the forget set and incurs negligible damage to the model's p… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

  35. arXiv:2311.14910  [pdf, other

    math.DS cs.LG stat.ML

    A latent linear model for nonlinear coupled oscillators on graphs

    Authors: Agam Goyal, Zhaoxing Wu, Richard P. Yim, Binhao Chen, Zihong Xu, Hanbaek Lyu

    Abstract: A system of coupled oscillators on an arbitrary graph is locally driven by the tendency to mutual synchronization between nearby oscillators, but can and often exhibit nonlinear behavior on the whole graph. Understanding such nonlinear behavior has been a key challenge in predicting whether all oscillators in such a system will eventually synchronize. In this paper, we demonstrate that, surprising… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: 23 pages, 14 figures

  36. arXiv:2311.13577  [pdf, other

    cs.AI

    Physical Reasoning and Object Planning for Household Embodied Agents

    Authors: Ayush Agrawal, Raghav Prabhakar, Anirudh Goyal, Dianbo Liu

    Abstract: In this study, we explore the sophisticated domain of task planning for robust household embodied agents, with a particular emphasis on the intricate task of selecting substitute objects. We introduce the CommonSense Object Affordance Task (COAT), a novel framework designed to analyze reasoning capabilities in commonsense scenarios. This approach is centered on understanding how these agents can e… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: Total: 32 pages ( 16 pages main content, 11 Figures)

  37. arXiv:2311.09665  [pdf, other

    cs.CL

    The Wisdom of Partisan Crowds: Comparing Collective Intelligence in Humans and LLM-based Agents

    Authors: Yun-Shiuan Chuang, Siddharth Suresh, Nikunj Harlalka, Agam Goyal, Robert Hawkins, Sijia Yang, Dhavan Shah, Junjie Hu, Timothy T. Rogers

    Abstract: Human groups are able to converge on more accurate beliefs through deliberation, even in the presence of polarization and partisan bias -- a phenomenon known as the "wisdom of partisan crowds." Generated agents powered by Large Language Models (LLMs) are increasingly used to simulate human collective behavior, yet few benchmarks exist for evaluating their dynamics against the behavior of human gro… ▽ More

    Submitted 16 February, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  38. arXiv:2311.09618  [pdf, other

    physics.soc-ph cs.CL

    Simulating Opinion Dynamics with Networks of LLM-based Agents

    Authors: Yun-Shiuan Chuang, Agam Goyal, Nikunj Harlalka, Siddharth Suresh, Robert Hawkins, Sijia Yang, Dhavan Shah, Junjie Hu, Timothy T. Rogers

    Abstract: Accurately simulating human opinion dynamics is crucial for understanding a variety of societal phenomena, including polarization and the spread of misinformation. However, the agent-based models (ABMs) commonly used for such simulations often over-simplify human behavior. We propose a new approach to simulating opinion dynamics based on populations of Large Language Models (LLMs). Our findings re… ▽ More

    Submitted 31 March, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  39. arXiv:2310.17567  [pdf, other

    cs.CL cs.AI cs.LG cs.NE

    Skill-Mix: a Flexible and Expandable Family of Evaluations for AI models

    Authors: Dingli Yu, Simran Kaur, Arushi Gupta, Jonah Brown-Cohen, Anirudh Goyal, Sanjeev Arora

    Abstract: With LLMs shifting their role from statistical modeling of language to serving as general-purpose AI agents, how should LLM evaluations change? Arguably, a key ability of an AI agent is to flexibly combine, as needed, the basic skills it has learned. The capability to combine skills plays an important role in (human) pedagogy and also in a paper on emergence phenomena (Arora & Goyal, 2023). This… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

  40. arXiv:2310.10928  [pdf, ps, other

    cs.HC cs.AI cs.LG

    Using Audio Data to Facilitate Depression Risk Assessment in Primary Health Care

    Authors: Adam Valen Levinson, Abhay Goyal, Roger Ho Chun Man, Roy Ka-Wei Lee, Koustuv Saha, Nimay Parekh, Frederick L. Altice, Lam Yin Cheung, Munmun De Choudhury, Navin Kumar

    Abstract: Telehealth is a valuable tool for primary health care (PHC), where depression is a common condition. PHC is the first point of contact for most people with depression, but about 25% of diagnoses made by PHC physicians are inaccurate. Many other barriers also hinder depression detection and treatment in PHC. Artificial intelligence (AI) may help reduce depression misdiagnosis in PHC and improve ove… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  41. arXiv:2310.03739  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Aligning Text-to-Image Diffusion Models with Reward Backpropagation

    Authors: Mihir Prabhudesai, Anirudh Goyal, Deepak Pathak, Katerina Fragkiadaki

    Abstract: Text-to-image diffusion models have recently emerged at the forefront of image generation, powered by very large-scale unsupervised or weakly supervised text-to-image training datasets. Due to their unsupervised training, controlling their behavior in downstream tasks, such as maximizing human-perceived image quality, image-text alignment, or ethical image generation, is difficult. Recent works fi… ▽ More

    Submitted 22 June, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: Code available at https://align-prop.github.io/

  42. arXiv:2309.13502  [pdf, ps, other

    math.OC

    Globally Solving a Class of Bilevel Programs with Spatial Price Equilibrium Constraints

    Authors: Akshit Goyal, Jean-Philippe P. Richard

    Abstract: Bilevel programs with spatial price equilibrium constraints are strategic models that consider a price competition at the lower level. These models find application in facility location-price models, optimal bidding in power networks, and integration of renewable energy sources in distribution networks. In this paper, for the case where the equilibrium at the lower level can be formulated as an op… ▽ More

    Submitted 23 June, 2024; v1 submitted 23 September, 2023; originally announced September 2023.

  43. The LOFAR Two-Metre Sky Survey (LoTSS): VI. Optical identifications for the second data release

    Authors: M. J. Hardcastle, M. A. Horton, W. L. Williams, K. J. Duncan, L. Alegre, B. Barkus, J. H. Croston, H. Dickinson, E. Osinga, H. J. A. Röttgering, J. Sabater, T. W. Shimwell, D. J. B. Smith, P. N. Best, A. Botteon, M. Brüggen, A. Drabent, F. de Gasperin, G. Gürkan, M. Hajduk, C. L. Hale, M. Hoeft, M. Jamrozy, M. Kunert-Bajraszewska, R. Kondapally , et al. (27 additional authors not shown)

    Abstract: The second data release of the LOFAR Two-Metre Sky Survey (LoTSS) covers 27% of the northern sky, with a total area of $\sim 5,700$ deg$^2$. The high angular resolution of LOFAR with Dutch baselines (6 arcsec) allows us to carry out optical identifications of a large fraction of the detected radio sources without further radio followup; however, the process is made more challenging by the many ext… ▽ More

    Submitted 31 August, 2023; originally announced September 2023.

    Comments: 29 pages. Accepted by A&A; data products available at https://lofar-surveys.org/dr2_release.html

    Journal ref: A&A 678, A151 (2023)

  44. arXiv:2308.14969  [pdf, other

    cs.LG cs.CV

    Uncovering the Hidden Cost of Model Compression

    Authors: Diganta Misra, Muawiz Chaudhary, Agam Goyal, Bharat Runwal, Pin Yu Chen

    Abstract: In an age dominated by resource-intensive foundation models, the ability to efficiently adapt to downstream tasks is crucial. Visual Prompting (VP), drawing inspiration from the prompting techniques employed in Large Language Models (LLMs), has emerged as a pivotal method for transfer learning in the realm of computer vision. As the importance of efficiency continues to rise, research into model c… ▽ More

    Submitted 15 March, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: Preprint

  45. arXiv:2307.15936  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    A Theory for Emergence of Complex Skills in Language Models

    Authors: Sanjeev Arora, Anirudh Goyal

    Abstract: A major driver of AI products today is the fact that new skills emerge in language models when their parameter set and training corpora are scaled up. This phenomenon is poorly understood, and a mechanistic explanation via mathematical analysis of gradient-based training seems difficult. The current paper takes a different approach, analysing emergence using the famous (and empirical) Scaling Laws… ▽ More

    Submitted 5 November, 2023; v1 submitted 29 July, 2023; originally announced July 2023.

  46. Enhanced Size Uniformity for Near-resonant Planets

    Authors: Armaan V. Goyal, Fei Dai, Songhu Wang

    Abstract: Super-Earths within the same close-in, compact planetary system tend to exhibit a striking degree of uniformity in their radius, mass, and orbital spacing, and this 'peas-in-a-pod' phenomenon itself serves to provide one of the strongest constrains on planet formation at large. While it has been recently demonstrated from independent samples that such planetary uniformity occurs for both configura… ▽ More

    Submitted 15 August, 2023; v1 submitted 28 July, 2023; originally announced July 2023.

    Comments: 15 pages, 6 figures. Accepted to ApJ July 2023

  47. arXiv:2307.12402  [pdf, ps, other

    cs.CL

    ChatGPT and Bard Responses to Polarizing Questions

    Authors: Abhay Goyal, Muhammad Siddique, Nimay Parekh, Zach Schwitzky, Clara Broekaert, Connor Michelotti, Allie Wong, Lam Yin Cheung, Robin O Hanlon, Lam Yin Cheung, Munmun De Choudhury, Roy Ka-Wei Lee, Navin Kumar

    Abstract: Recent developments in natural language processing have demonstrated the potential of large language models (LLMs) to improve a range of educational and learning outcomes. Of recent chatbots based on LLMs, ChatGPT and Bard have made it clear that artificial intelligence (AI) technology will have significant implications on the way we obtain and search for information. However, these tools sometime… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

  48. arXiv:2307.04751  [pdf, other

    cs.RO cs.CV cs.LG

    Shelving, Stacking, Hanging: Relational Pose Diffusion for Multi-modal Rearrangement

    Authors: Anthony Simeonov, Ankit Goyal, Lucas Manuelli, Lin Yen-Chen, Alina Sarmiento, Alberto Rodriguez, Pulkit Agrawal, Dieter Fox

    Abstract: We propose a system for rearranging objects in a scene to achieve a desired object-scene placing relationship, such as a book inserted in an open slot of a bookshelf. The pipeline generalizes to novel geometries, poses, and layouts of both scenes and objects, and is trained from demonstrations to operate directly on 3D point clouds. Our system overcomes challenges associated with the existence of… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: Project page: https://anthonysimeonov.github.io/rpdiff-multi-modal/

  49. arXiv:2307.04053  [pdf, other

    cs.CL

    How is Fatherhood Framed Online in Singapore?

    Authors: Tran Hien Van, Abhay Goyal, Muhammad Siddique, Lam Yin Cheung, Nimay Parekh, Jonathan Y Huang, Keri McCrickerd, Edson C Tandoc Jr., Gerard Chung, Navin Kumar

    Abstract: The proliferation of discussion about fatherhood in Singapore attests to its significance, indicating the need for an exploration of how fatherhood is framed, aiding policy-making around fatherhood in Singapore. Sound and holistic policy around fatherhood in Singapore may reduce stigma and apprehension around being a parent, critical to improving the nations flagging birth rate. We analyzed 15,705… ▽ More

    Submitted 8 July, 2023; originally announced July 2023.

  50. arXiv:2307.03083  [pdf, other

    cs.CE

    Predicting Opioid Use Outcomes in Minoritized Communities

    Authors: Abhay Goyal, Nimay Parekh, Lam Yin Cheung, Koustuv Saha, Frederick L Altice, Robin O'hanlon, Roger Ho Chun Man, Christian Poellabauer, Honoria Guarino, Pedro Mateu Gelabert, Navin Kumar

    Abstract: Machine learning algorithms can sometimes exacerbate health disparities based on ethnicity, gender, and other factors. There has been limited work at exploring potential biases within algorithms deployed on a small scale, and/or within minoritized communities. Understanding the nature of potential biases may improve the prediction of various health outcomes. As a case study, we used data from a sa… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.