Zum Hauptinhalt springen

Showing 1–22 of 22 results for author: Sachdeva, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16715  [pdf, other

    cs.LG

    GC-Bench: A Benchmark Framework for Graph Condensation with New Insights

    Authors: Shengbo Gong, Juntong Ni, Noveen Sachdeva, Carl Yang, Wei Jin

    Abstract: Graph condensation (GC) is an emerging technique designed to learn a significantly smaller graph that retains the essential information of the original graph. This condensed graph has shown promise in accelerating graph neural networks while preserving performance comparable to those achieved with the original, larger graphs. Additionally, this technique facilitates downstream applications such as… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 9 pages

  2. arXiv:2402.09668  [pdf, other

    cs.LG cs.AI cs.CL

    How to Train Data-Efficient LLMs

    Authors: Noveen Sachdeva, Benjamin Coleman, Wang-Cheng Kang, Jianmo Ni, Lichan Hong, Ed H. Chi, James Caverlee, Julian McAuley, Derek Zhiyuan Cheng

    Abstract: The training of large language models (LLMs) is expensive. In this paper, we study data-efficient approaches for pre-training LLMs, i.e., techniques that aim to optimize the Pareto frontier of model quality and training resource/data consumption. We seek to understand the tradeoffs associated with data selection routines based on (i) expensive-to-compute data-quality estimates, and (ii) maximizati… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: Under review. 44 pages, 30 figures

  3. arXiv:2310.15433  [pdf, other

    cs.LG cs.IR

    Off-Policy Evaluation for Large Action Spaces via Policy Convolution

    Authors: Noveen Sachdeva, Lequn Wang, Dawen Liang, Nathan Kallus, Julian McAuley

    Abstract: Developing accurate off-policy estimators is crucial for both evaluating and optimizing for new policies. The main challenge in off-policy estimation is the distribution shift between the logging policy that generates data and the target policy that we aim to evaluate. Typically, techniques for correcting distribution shift involve some form of importance sampling. This approach results in unbiase… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Under review. 36 pages, 31 figures

  4. arXiv:2310.11266  [pdf

    cs.CL cs.AI cs.NE

    Emulating Human Cognitive Processes for Expert-Level Medical Question-Answering with Large Language Models

    Authors: Khushboo Verma, Marina Moore, Stephanie Wottrich, Karla Robles López, Nishant Aggarwal, Zeel Bhatt, Aagamjit Singh, Bradford Unroe, Salah Basheer, Nitish Sachdeva, Prinka Arora, Harmanjeet Kaur, Tanupreet Kaur, Tevon Hood, Anahi Marquez, Tushar Varshney, Nanfu Deng, Azaan Ramani, Pawanraj Ishwara, Maimoona Saeed, Tatiana López Velarde Peña, Bryan Barksdale, Sushovan Guha, Satwant Kumar

    Abstract: In response to the pressing need for advanced clinical problem-solving tools in healthcare, we introduce BooksMed, a novel framework based on a Large Language Model (LLM). BooksMed uniquely emulates human cognitive processes to deliver evidence-based and reliable responses, utilizing the GRADE (Grading of Recommendations, Assessment, Development, and Evaluations) framework to effectively quantify… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  5. arXiv:2310.09983  [pdf, other

    cs.LG cs.AI cs.CL cs.IR

    Farzi Data: Autoregressive Data Distillation

    Authors: Noveen Sachdeva, Zexue He, Wang-Cheng Kang, Jianmo Ni, Derek Zhiyuan Cheng, Julian McAuley

    Abstract: We study data distillation for auto-regressive machine learning tasks, where the input and output have a strict left-to-right causal structure. More specifically, we propose Farzi, which summarizes an event sequence dataset into a small number of synthetic sequences -- Farzi Data -- which are optimized to maintain (if not improve) model performance compared to training on the full dataset. Under t… ▽ More

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: Under review. 23 pages, 9 figures

  6. arXiv:2301.04272  [pdf, other

    cs.LG cs.CV cs.IR

    Data Distillation: A Survey

    Authors: Noveen Sachdeva, Julian McAuley

    Abstract: The popularity of deep learning has led to the curation of a vast number of massive and multifarious datasets. Despite having close-to-human performance on individual tasks, training parameter-hungry models on large datasets poses multi-faceted problems such as (a) high model-training time; (b) slow research iteration; and (c) poor eco-sustainability. As an alternative, data distillation approache… ▽ More

    Submitted 26 September, 2023; v1 submitted 10 January, 2023; originally announced January 2023.

    Comments: Accepted at TMLR '23. 21 pages, 4 figures

  7. arXiv:2206.02626  [pdf, other

    cs.IR cs.LG

    Infinite Recommendation Networks: A Data-Centric Approach

    Authors: Noveen Sachdeva, Mehak Preet Dhaliwal, Carole-Jean Wu, Julian McAuley

    Abstract: We leverage the Neural Tangent Kernel and its equivalence to training infinitely-wide neural networks to devise $\infty$-AE: an autoencoder with infinitely-wide bottleneck layers. The outcome is a highly expressive yet simplistic recommendation model with a single hyper-parameter and a closed-form solution. Leveraging $\infty$-AE's simplicity, we also develop Distill-CF for synthesizing tiny, high… ▽ More

    Submitted 12 October, 2022; v1 submitted 2 June, 2022; originally announced June 2022.

    Comments: Published at NeurIPS '22. $\infty$-AE code available at https://github.com/noveens/infinite_ae_cf and Distill-CF code available at https://github.com/noveens/distill_cf

  8. On Sampling Collaborative Filtering Datasets

    Authors: Noveen Sachdeva, Carole-Jean Wu, Julian McAuley

    Abstract: We study the practical consequences of dataset sampling strategies on the ranking performance of recommendation algorithms. Recommender systems are generally trained and evaluated on samples of larger datasets. Samples are often taken in a naive or ad-hoc fashion: e.g. by sampling a dataset randomly or by selecting users or items with many interactions. As we demonstrate, commonly-used data sampli… ▽ More

    Submitted 12 January, 2022; originally announced January 2022.

    Comments: 9 pages, 4 figures, accepted for publication at WSDM '22. arXiv admin note: substantial text overlap with arXiv:2107.04984

  9. arXiv:2108.00261  [pdf, other

    cs.CL cs.IR cs.LG

    ECLARE: Extreme Classification with Label Graph Correlations

    Authors: Anshul Mittal, Noveen Sachdeva, Sheshansh Agrawal, Sumeet Agarwal, Purushottam Kar, Manik Varma

    Abstract: Deep extreme classification (XC) seeks to train deep architectures that can tag a data point with its most relevant subset of labels from an extremely large label set. The core utility of XC comes from predicting labels that are rarely seen during training. Such rare labels hold the key to personalized recommendations that can delight and surprise a user. However, the large number of rare labels a… ▽ More

    Submitted 31 July, 2021; originally announced August 2021.

    ACM Class: F.2.2; I.2.7

    Journal ref: The Web Conference 2021

  10. arXiv:2107.04984  [pdf, other

    cs.IR

    SVP-CF: Selection via Proxy for Collaborative Filtering Data

    Authors: Noveen Sachdeva, Carole-Jean Wu, Julian McAuley

    Abstract: We study the practical consequences of dataset sampling strategies on the performance of recommendation algorithms. Recommender systems are generally trained and evaluated on samples of larger datasets. Samples are often taken in a naive or ad-hoc fashion: e.g. by sampling a dataset randomly or by selecting users or items with many interactions. As we demonstrate, commonly-used data sampling schem… ▽ More

    Submitted 11 July, 2021; originally announced July 2021.

    Comments: 11 pages, 3 figures, accepted at the SubSetML workshop at ICML '21 (Link: https://sites.google.com/view/icml-2021-subsetml/home)

  11. arXiv:2010.11704  [pdf, other

    cs.CV cs.AI cs.RO eess.IV

    Using Conditional Generative Adversarial Networks to Reduce the Effects of Latency in Robotic Telesurgery

    Authors: Neil Sachdeva, Misha Klopukh, Rachel St. Clair, William Hahn

    Abstract: The introduction of surgical robots brought about advancements in surgical procedures. The applications of remote telesurgery range from building medical clinics in underprivileged areas, to placing robots abroad in military hot-spots where accessibility and diversity of medical experience may be limited. Poor wireless connectivity may result in a prolonged delay, referred to as latency, between a… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: 6 pages with 5 figures and 1 table. J Robotic Surg (2020)

    ACM Class: I.4.6; I.2.6; J.3

  12. arXiv:2006.09438  [pdf, other

    cs.LG cs.IR stat.ML

    Off-policy Bandits with Deficient Support

    Authors: Noveen Sachdeva, Yi Su, Thorsten Joachims

    Abstract: Learning effective contextual-bandit policies from past actions of a deployed system is highly desirable in many settings (e.g. voice assistants, recommendation, search), since it enables the reuse of large amounts of log data. State-of-the-art methods for such off-policy learning, however, are based on inverse propensity score (IPS) weighting. A key theoretical requirement of IPS weighting is tha… ▽ More

    Submitted 16 June, 2020; originally announced June 2020.

    Comments: 11 pages, 6 figures. Accepted for publication at KDD '20 (Research track)

  13. arXiv:2005.12210  [pdf, other

    cs.IR cs.LG cs.SI

    How Useful are Reviews for Recommendation? A Critical Review and Potential Improvements

    Authors: Noveen Sachdeva, Julian McAuley

    Abstract: We investigate a growing body of work that seeks to improve recommender systems through the use of review text. Generally, these papers argue that since reviews 'explain' users' opinions, they ought to be useful to infer the underlying dimensions that predict ratings or purchases. Schemes to incorporate reviews range from simple regularizers to neural network approaches. Our initial findings revea… ▽ More

    Submitted 25 May, 2020; originally announced May 2020.

    Comments: 4 pages, 3 figures. Accepted for publication at SIGIR '20

  14. EDUQA: Educational Domain Question Answering System using Conceptual Network Mapping

    Authors: Abhishek Agarwal, Nikhil Sachdeva, Raj Kamal Yadav, Vishaal Udandarao, Vrinda Mittal, Anubha Gupta, Abhinav Mathur

    Abstract: Most of the existing question answering models can be largely compiled into two categories: i) open domain question answering models that answer generic questions and use large-scale knowledge base along with the targeted web-corpus retrieval and ii) closed domain question answering models that address focused questioning area and use complex deep learning models. Both the above models derive answ… ▽ More

    Submitted 12 November, 2019; originally announced November 2019.

    Comments: Published in the 44th International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2019

    Journal ref: IEEE ICASSP (2019) 8137-8141

  15. arXiv:1811.09975  [pdf, other

    cs.LG cs.IR stat.ML

    Sequential Variational Autoencoders for Collaborative Filtering

    Authors: Noveen Sachdeva, Giuseppe Manco, Ettore Ritacco, Vikram Pudi

    Abstract: Variational autoencoders were proven successful in domains such as computer vision and speech processing. Their adoption for modeling user preferences is still unexplored, although recently it is starting to gain attention in the current literature. In this work, we propose a model which extends variational autoencoders by exploiting the rich information present in the past preference history. We… ▽ More

    Submitted 25 November, 2018; originally announced November 2018.

    Comments: 9 pages, 6 figures, 2 tables, WSDM2019

    MSC Class: 68T05

  16. Attentive Neural Architecture Incorporating Song Features For Music Recommendation

    Authors: Noveen Sachdeva, Kartik Gupta, Vikram Pudi

    Abstract: Recommender Systems are an integral part of music sharing platforms. Often the aim of these systems is to increase the time, the user spends on the platform and hence having a high commercial value. The systems which aim at increasing the average time a user spends on the platform often need to recommend songs which the user might want to listen to next at each point in time. This is different fro… ▽ More

    Submitted 20 November, 2018; originally announced November 2018.

    Comments: Accepted as a paper at the 12th ACM Conference on Recommender Systems (RecSys 18)

    Journal ref: 12th ACM Conference on Recommender Systems (RecSys '18). ACM (2018) 417-421

  17. arXiv:1608.00905  [pdf, other

    cs.MM cs.CV

    PicHunt: Social Media Image Retrieval for Improved Law Enforcement

    Authors: Sonal Goel, Niharika Sachdeva, Ponnurangam Kumaraguru, A V Subramanyam, Divam Gupta

    Abstract: First responders are increasingly using social media to identify and reduce crime for well-being and safety of the society. Images shared on social media hurting religious, political, communal and other sentiments of people, often instigate violence and create law & order situations in society. This results in the need for first responders to inspect the spread of such images and users propagating… ▽ More

    Submitted 15 September, 2016; v1 submitted 2 August, 2016; originally announced August 2016.

  18. arXiv:1509.08205  [pdf, other

    cs.CY

    Characterising Behavior and Emotions on Social Media for Safety: Exploring Online Communication between Police and Citizens

    Authors: Niharika Sachdeva, Ponnurangam Kumaraguru

    Abstract: Increased use of social media by police to connect with citizens has encouraged researchers to study different aspects of information exchange (e.g. type of information, credibility and propagation) during emergency and crisis situation. Research studies lack understanding of human behavior such as engagement, emotions and social interaction between citizen and police department on social media. S… ▽ More

    Submitted 28 September, 2015; originally announced September 2015.

    ACM Class: H.5.3

  19. arXiv:1410.3942  [pdf, other

    cs.CY

    Privacy4ICTD in India: Exploring Perceptions, Attitudes and Awareness about ICT Use

    Authors: Ponnurangam Kumaraguru, Niharika Sachdeva

    Abstract: Several ICT studies give anecdotal evidences showing privacy to be an area of concern that can influence adoption of technology in the developing world. However, in-depth understanding of end users' privacy attitudes and awareness is largely unexplored in developing countries such as India. We conducted a survey with 10,427 Indian citizens to bring forth various insights on privacy expectations an… ▽ More

    Submitted 15 October, 2014; originally announced October 2014.

  20. arXiv:1403.2042  [pdf, other

    cs.CY cs.HC

    Online Social Media and Police in India: Behavior, Perceptions, Challenges

    Authors: Niharika Sachdeva, Ponnurangam Kumaraguru

    Abstract: Police agencies across the globe are increasingly using Online Social Media (OSM) to acquire intelligence and connect with citizens. Developed nations have well thought of strategies to use OSM for policing. However, developing nations like India are exploring and evolving OSM as a policing solution. India, in recent years, experienced many events where rumors and fake content on OSM instigated co… ▽ More

    Submitted 9 March, 2014; originally announced March 2014.

  21. arXiv:1310.1540  [pdf, other

    cs.CR cs.HC

    Three-Way Dissection of a Game-CAPTCHA: Automated Attacks, Relay Attacks, and Usability

    Authors: Manar Mohamed, Niharika Sachdeva, Michael Georgescu, Song Gao, Nitesh Saxena, Chengcui Zhang, Ponnurangam Kumaraguru, Paul C. van Oorschot, Wei-Bang Chen

    Abstract: Existing captcha solutions on the Internet are a major source of user frustration. Game captchas are an interesting and, to date, little-studied approach claiming to make captcha solving a fun activity for the users. One broad form of such captchas -- called Dynamic Cognitive Game (DCG) captchas -- challenge the user to perform a game-like cognitive task interacting with a series of dynamic images… ▽ More

    Submitted 6 October, 2013; originally announced October 2013.

    Comments: 16 pages, 10 figures

  22. arXiv:1306.0195  [pdf, other

    cs.CY

    ChaMAILeon: Exploring the Usability of a Privacy Preserving Email Sharing System

    Authors: Prateek Dewan, Niharika Sachdeva, Mayank Gupta, Ponnurangam Kumaraguru

    Abstract: While passwords, by definition, are meant to be secret, recent trends have witnessed an increasing number of people sharing their email passwords with friends, colleagues, and significant others. However, leading websites like Google advise their users not to share their passwords with anyone, to avoid security and privacy breaches. To understand users' general password sharing behavior and practi… ▽ More

    Submitted 2 June, 2013; originally announced June 2013.

    Comments: 12 pages without references and appendices