Zum Hauptinhalt springen

Showing 1–9 of 9 results for author: Kishore, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.04220  [pdf, other

    cs.CL cs.LG

    Diffusion Guided Language Modeling

    Authors: Justin Lovelace, Varsha Kishore, Yiwei Chen, Kilian Q. Weinberger

    Abstract: Current language models demonstrate remarkable proficiency in text generation. However, for many applications it is desirable to control attributes, such as sentiment, or toxicity, of the generated language -- ideally tailored towards each specific use case and target audience. For auto-regressive language models, existing guidance methods are prone to decoding errors that cascade during generatio… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: ACL Findings 2024

  2. arXiv:2310.16176  [pdf, other

    cs.CL cs.AI

    Correction with Backtracking Reduces Hallucination in Summarization

    Authors: Zhenzhen Liu, Chao Wan, Varsha Kishore, Jin Peng Zhou, Minmin Chen, Kilian Q. Weinberger

    Abstract: Abstractive summarization aims at generating natural language summaries of a source document that are succinct while preserving the important elements. Despite recent advances, neural text summarization models are known to be susceptible to hallucinating (or more correctly confabulating), that is to produce summaries with details that are not grounded in the source document. In this paper, we intr… ▽ More

    Submitted 31 October, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

  3. arXiv:2307.10323  [pdf, other

    cs.IR cs.CL cs.LG

    IncDSI: Incrementally Updatable Document Retrieval

    Authors: Varsha Kishore, Chao Wan, Justin Lovelace, Yoav Artzi, Kilian Q. Weinberger

    Abstract: Differentiable Search Index is a recently proposed paradigm for document retrieval, that encodes information about a corpus of documents within the parameters of a neural network and directly maps queries to corresponding documents. These models have achieved state-of-the-art performances for document retrieval across many benchmarks. These kinds of models have a significant limitation: it is not… ▽ More

    Submitted 19 August, 2024; v1 submitted 19 July, 2023; originally announced July 2023.

  4. arXiv:2303.16206  [pdf, other

    eess.IV cs.CV cs.MM

    Learning Iterative Neural Optimizers for Image Steganography

    Authors: Xiangyu Chen, Varsha Kishore, Kilian Q Weinberger

    Abstract: Image steganography is the process of concealing secret information in images through imperceptible changes. Recent work has formulated this task as a classic constrained optimization problem. In this paper, we argue that image steganography is inherently performed on the (elusive) manifold of natural images, and propose an iterative neural network trained to perform the optimization steps. In con… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: International Conference on Learning Representations (ICLR) 2023

  5. arXiv:2212.09462  [pdf, other

    cs.CL cs.LG

    Latent Diffusion for Language Generation

    Authors: Justin Lovelace, Varsha Kishore, Chao Wan, Eliot Shekhtman, Kilian Q. Weinberger

    Abstract: Diffusion models have achieved great success in modeling continuous data modalities such as images, audio, and video, but have seen limited use in discrete domains such as language. Recent attempts to adapt diffusion to language have presented diffusion as an alternative to existing pretrained language models. We view diffusion and existing language models as complementary. We demonstrate that enc… ▽ More

    Submitted 7 November, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: NeurIPS 2023

  6. arXiv:1904.09675  [pdf, other

    cs.CL

    BERTScore: Evaluating Text Generation with BERT

    Authors: Tianyi Zhang, Varsha Kishore, Felix Wu, Kilian Q. Weinberger, Yoav Artzi

    Abstract: We propose BERTScore, an automatic evaluation metric for text generation. Analogously to common metrics, BERTScore computes a similarity score for each token in the candidate sentence with each token in the reference sentence. However, instead of exact matches, we compute token similarity using contextual embeddings. We evaluate using the outputs of 363 machine translation and image captioning sys… ▽ More

    Submitted 24 February, 2020; v1 submitted 21 April, 2019; originally announced April 2019.

    Comments: Code available at https://github.com/Tiiiger/bert_score; To appear in ICLR2020

  7. arXiv:1602.04257  [pdf, other

    cs.AI cs.CY

    Identifying Diabetic Patients with High Risk of Readmission

    Authors: Malladihalli S Bhuvan, Ankit Kumar, Adil Zafar, Vinith Kishore

    Abstract: Hospital readmissions are expensive and reflect the inadequacies in healthcare system. In the United States alone, treatment of readmitted diabetic patients exceeds 250 million dollars per year. Early identification of patients facing a high risk of readmission can enable healthcare providers to to conduct additional investigations and possibly prevent future readmissions. This not only improves t… ▽ More

    Submitted 12 February, 2016; originally announced February 2016.

    Comments: 10 pages, 5 figures, 7 tables

    ACM Class: J.3; H.2.8

  8. arXiv:1112.2112  [pdf, ps, other

    cond-mat.stat-mech cs.SI physics.soc-ph

    Extreme events and event size fluctuations in biased random walks on networks

    Authors: Vimal Kishore, M. S. Santhanam, R. E. Amritkar

    Abstract: Random walk on discrete lattice models is important to understand various types of transport processes. The extreme events, defined as exceedences of the flux of walkers above a prescribed threshold, have been studied recently in the context of complex networks. This was motivated by the occurrence of rare events such as traffic jams, floods, and power black-outs which take place on networks. In t… ▽ More

    Submitted 30 May, 2012; v1 submitted 9 December, 2011; originally announced December 2011.

    Journal ref: Phys. Rev. E 85, 056120 (2012)

  9. arXiv:1102.1789  [pdf, ps, other

    cond-mat.stat-mech cs.SI physics.soc-ph

    Extreme events on complex networks

    Authors: Vimal Kishore, M. S. Santhanam, R. E. Amritkar

    Abstract: We study the extreme events taking place on complex networks. The transport on networks is modelled using random walks and we compute the probability for the occurance and recurrence of extreme events on the network. We show that the nodes with smaller number of links are more prone to extreme events than the ones with larger number of links. We obtain analytical estimates and verify them with num… ▽ More

    Submitted 9 February, 2011; originally announced February 2011.

    Comments: 5 pages, 4 figures

    Journal ref: Phys. Rev. Lett. 106, 188701 (2011)