Showing 1–22 of 22 results for author: Bryan, C

  1. arXiv:2408.03576

    cs.HC

    Mind Drifts, Data Shifts: Utilizing Mind Wandering to Track the Evolution of User Experience with Data Visualizations

    Authors: Anjana Arunkumar, Lace Padilla, Chris Bryan

    Abstract: User experience in data visualization is typically assessed through post-viewing self-reports, but these overlook the dynamic cognitive processes during interaction. This study explores the use of mind wandering -- a phenomenon where attention spontaneously shifts from a primary task to internal, task-related thoughts or unrelated distractions -- as a dynamic measure during visualization explorati…

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: 11 pages, 11 figures, 2 tables. IEEE Vis 2024 Full Paper

  2. arXiv:2407.19364

    cs.HC cs.CR

    Defogger: A Visual Analysis Approach for Data Exploration of Sensitive Data Protected by Differential Privacy

    Authors: Xumeng Wang, Shuangcheng Jiao, Chris Bryan

    Abstract: Differential privacy ensures the security of individual privacy but poses challenges to data exploration processes because the limited privacy budget incapacitates the flexibility of exploration and the noisy feedback of data requests leads to confusing uncertainty. In this study, we take the lead in describing corresponding exploration scenarios, including underlying requirements and available ex…

    Submitted 27 July, 2024; originally announced July 2024.

    Comments: 11 pages, 8 figures

  3. arXiv:2406.17838

    cs.LG cs.AI cs.HC

    InFiConD: Interactive No-code Fine-tuning with Concept-based Knowledge Distillation

    Authors: Jinbin Huang, Wenbin He, Liang Gou, Liu Ren, Chris Bryan

    Abstract: The emergence of large-scale pre-trained models has heightened their application in various downstream tasks, yet deployment is a challenge in environments with limited computational resources. Knowledge distillation has emerged as a solution in such scenarios, whereby knowledge from large teacher models is transferred into smaller 'student' models, but this is a non-trivial process that traditiona…

    Submitted 25 June, 2024; originally announced June 2024.

  4. arXiv:2404.02990

    cs.CV cs.AI cs.HC

    ASAP: Interpretable Analysis and Summarization of AI-generated Image Patterns at Scale

    Authors: Jinbin Huang, Chen Chen, Aditi Mishra, Bum Chul Kwon, Zhicheng Liu, Chris Bryan

    Abstract: Generative image models have emerged as a promising technology to produce realistic images. Despite potential benefits, concerns grow about its misuse, particularly in generating deceptive images that could raise significant ethical, legal, and societal issues. Consequently, there is growing demand to empower users to effectively discern and comprehend patterns of AI-generated images. To this end,…

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 9 pages, 6 figures

  5. arXiv:2311.03547   

    cs.AI cs.CV cs.HC cs.LG

    InterVLS: Interactive Model Understanding and Improvement with Vision-Language Surrogates

    Authors: Jinbin Huang, Wenbin He, Liang Gou, Liu Ren, Chris Bryan

    Abstract: Deep learning models are widely used in critical applications, highlighting the need for pre-deployment model understanding and improvement. Visual concept-based methods, while increasingly used for this purpose, face challenges: (1) most concepts lack interpretability, (2) existing methods require model knowledge, often unavailable at run time. Additionally, (3) there lacks a no-code method for p…

    Submitted 25 June, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: The paper has gone through a major update. The newer version will be titled "InFiConD: Interactive No-code Finetuning with Concept-based Knowledge Distillation"

  6. arXiv:2307.10571

    cs.HC

    Image or Information? Examining the Nature and Impact of Visualization Perceptual Classification

    Authors: Anjana Arunkumar, Lace Padilla, Gi-Yeul Bae, Chris Bryan

    Abstract: How do people internalize visualizations: as images or information? In this study, we investigate the nature of internalization for visualizations (i.e., how the mind encodes visualizations in memory) and how memory encoding affects its retrieval. This exploratory work examines the influence of various design elements on a user's perception of a chart. Specifically, which design elements lead to p…

    Submitted 21 July, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: 11 pages, 10 figures, 3 tables, accepted at IEEE Vis 2023

  7. arXiv:2304.06184

    cs.HC cs.CL

    LINGO : Visually Debiasing Natural Language Instructions to Support Task Diversity

    Authors: Anjana Arunkumar, Shubham Sharma, Rakhi Agrawal, Sriram Chandrasekaran, Chris Bryan

    Abstract: Cross-task generalization is a significant outcome that defines mastery in natural language understanding. Humans show a remarkable aptitude for this, and can solve many different types of tasks, given definitions in the form of textual instructions and a small set of examples. Recent work with pre-trained language models mimics this learning style: users can define and exemplify a task for the mo…

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: 13 pages, 6 figures, Eurovis 2023

  8. arXiv:2304.01964

    cs.HC

    PromptAid: Prompt Exploration, Perturbation, Testing and Iteration using Visual Analytics for Large Language Models

    Authors: Aditi Mishra, Utkarsh Soni, Anjana Arunkumar, Jinbin Huang, Bum Chul Kwon, Chris Bryan

    Abstract: Large Language Models (LLMs) have gained widespread popularity due to their ability to perform ad-hoc Natural Language Processing (NLP) tasks with a simple natural language prompt. Part of the appeal for LLMs is their approachability to the general public, including individuals with no prior technical experience in NLP techniques. However, natural language prompts can vary significantly in terms o…

    Submitted 8 April, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

  9. arXiv:2302.04434

    cs.CL cs.AI cs.HC cs.LG

    Real-Time Visual Feedback to Guide Benchmark Creation: A Human-and-Metric-in-the-Loop Workflow

    Authors: Anjana Arunkumar, Swaroop Mishra, Bhavdeep Sachdeva, Chitta Baral, Chris Bryan

    Abstract: Recent research has shown that language models exploit 'artifacts' in benchmarks to solve tasks, rather than truly learning them, leading to inflated model performance. In pursuit of creating better benchmarks, we propose VAIDA, a novel benchmark creation paradigm for NLP, that focuses on guiding crowdworkers, an under-explored facet of addressing benchmark idiosyncrasies. VAIDA facilitates sample…

    Submitted 8 February, 2023; originally announced February 2023.

    Comments: EACL 2023

  10. arXiv:2210.07631

    cs.CL cs.CV

    Hardness of Samples Need to be Quantified for a Reliable Evaluation System: Exploring Potential Opportunities with a New Task

    Authors: Swaroop Mishra, Anjana Arunkumar, Chris Bryan, Chitta Baral

    Abstract: Evaluation of models on benchmarks is unreliable without knowing the degree of sample hardness; this subsequently overestimates the capability of AI systems and limits their adoption in real world applications. We propose a Data Scoring task that requires assignment of each unannotated sample in a benchmark a score between 0 and 1, where 0 signifies easy and 1 signifies hard. Use of unannotated sam…

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: arXiv admin note: text overlap with arXiv:2007.06898

  11. arXiv:2210.07566

    cs.CL cs.CV

    A Survey of Parameters Associated with the Quality of Benchmarks in NLP

    Authors: Swaroop Mishra, Anjana Arunkumar, Chris Bryan, Chitta Baral

    Abstract: Several benchmarks have been built with heavy investment in resources to track our progress in NLP. Thousands of papers published in response to those benchmarks have competed to top leaderboards, with models often surpassing human performance. However, recent studies have shown that models triumph over several popular benchmarks just by overfitting on spurious biases, without truly learning the d…

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: arXiv admin note: text overlap with arXiv:2005.00816

  12. arXiv:2209.03514

    cs.HC eess.SY

    PMU Tracker: A Visualization Platform for Epicentric Event Propagation Analysis in the Power Grid

    Authors: Anjana Arunkumar, Andrea Pinceti, Lalitha Sankar, Chris Bryan

    Abstract: The electrical power grid is a critical infrastructure, with disruptions in transmission having severe repercussions on daily activities, across multiple sectors. To identify, prevent, and mitigate such events, power grids are being refurbished as 'smart' systems that include the widespread deployment of GPS-enabled phasor measurement units (PMUs). PMUs provide fast, precise, and time-synchronized…

    Submitted 7 September, 2022; originally announced September 2022.

    Comments: 10 pages, 5 figures, IEEE VIS 2022 Paper to appear in IEEE TVCG; conference encourages arXiv submission for accessibility

  13. arXiv:2204.01888

    cs.HC

    ConceptExplainer: Interactive Explanation for Deep Neural Networks from a Concept Perspective

    Authors: Jinbin Huang, Aditi Mishra, Bum Chul Kwon, Chris Bryan

    Abstract: Traditional deep learning interpretability methods which are suitable for model users cannot explain network behaviors at the global level and are inflexible at providing fine-grained explanations. As a solution, concept-based explanations are gaining attention due to their human intuitiveness and their flexibility to describe both global and local model behaviors. Concepts are groups of similarly…

    Submitted 24 October, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

    Comments: 9 pages, 6 figures

  14. arXiv:2108.06023

    cs.HC

    Bayesian Modelling of Alluvial Diagram Complexity

    Authors: Anjana Arunkumar, Shashank Ginjpalli, Chris Bryan

    Abstract: Alluvial diagrams are a popular technique for visualizing flow and relational data. However, successfully reading and interpreting the data shown in an alluvial diagram is likely influenced by factors such as data volume, complexity, and chart layout. To understand how alluvial diagram consumption is impacted by its visual features, we conduct two crowdsourced user studies with a set of alluvial d…

    Submitted 12 August, 2021; originally announced August 2021.

    Comments: To be published in IEEE VIS 2021, Short Paper

  15. arXiv:2105.03839

    cs.HC

    News Kaleidoscope: Visual Investigation of Coverage Diversity in News Event Reporting

    Authors: Aditi Mishra, Shashank Ginjpalli, Chris Bryan

    Abstract: We develop a visual analytics system, NewsKaleidoscope, to investigate how news reporting of events varies. NewsKaleidoscope combines several backend text language processing techniques with a coordinated visualization interface tailored for visualization non-expert users. To robustly evaluate NewsKaleidoscope, we conduct a trio of user studies. (1) A usability study with news novices assesses…

    Submitted 12 April, 2022; v1 submitted 9 May, 2021; originally announced May 2021.

  16. arXiv:2104.02818

    cs.HC

    Why? Why not? When? Visual Explanations of Agent Behavior in Reinforcement Learning

    Authors: Aditi Mishra, Utkarsh Soni, Jinbin Huang, Chris Bryan

    Abstract: Reinforcement learning (RL) is used in many domains, including autonomous driving, robotics, stock trading, and video games. Unfortunately, the black box nature of RL agents, combined with legal and ethical considerations, makes it increasingly important that humans (including those who are not experts in RL) understand the reasoning behind the actions taken by an RL agent, particularly in safety-…

    Submitted 1 November, 2021; v1 submitted 6 April, 2021; originally announced April 2021.

  17. arXiv:2103.03996

    cs.HC

    ChartStory: Automated Partitioning, Layout, and Captioning of Charts into Comic-Style Narratives

    Authors: Jian Zhao, Shenyu Xu, Senthil Chandrasegaran, Chris Bryan, Fan Du, Aditi Mishra, Xin Qian, Yiran Li, Kwan-Liu Ma

    Abstract: Visual data storytelling is gaining importance as a means of presenting data-driven information or analysis results, especially to the general public. This has resulted in design principles being proposed for data-driven storytelling, and new authoring tools being created to aid such storytelling. However, data analysts typically lack sufficient background in design and storytelling to make effect…

    Submitted 13 May, 2021; v1 submitted 5 March, 2021; originally announced March 2021.

  18. arXiv:2008.03964

    cs.CL cs.CV cs.LG eess.SY

    DQI: A Guide to Benchmark Evaluation

    Authors: Swaroop Mishra, Anjana Arunkumar, Bhavdeep Sachdeva, Chris Bryan, Chitta Baral

    Abstract: A 'state of the art' model A surpasses humans in a benchmark B, but fails on similar benchmarks C, D, and E. What does B have that the other benchmarks do not? Recent research provides the answer: spurious bias. However, developing A to solve benchmarks B through E does not guarantee that it will solve future benchmarks. To progress towards a model that 'truly learns' an underlying task, we need t…

    Submitted 10 August, 2020; originally announced August 2020.

    Comments: ICML UDL 2020

  19. arXiv:2007.06898

    cs.CL cs.AI cs.CV cs.LG

    Our Evaluation Metric Needs an Update to Encourage Generalization

    Authors: Swaroop Mishra, Anjana Arunkumar, Chris Bryan, Chitta Baral

    Abstract: Models that surpass human performance on several popular benchmarks display significant degradation in performance on exposure to Out of Distribution (OOD) data. Recent research has shown that models overfit to spurious biases and 'hack' datasets, in lieu of learning generalizable features like humans. In order to stop the inflation in model performance -- and thus overestimation in AI systems' ca…

    Submitted 14 July, 2020; originally announced July 2020.

    Comments: Accepted to ICML UDL 2020

  20. arXiv:2005.00816

    cs.CL

    DQI: Measuring Data Quality in NLP

    Authors: Swaroop Mishra, Anjana Arunkumar, Bhavdeep Sachdeva, Chris Bryan, Chitta Baral

    Abstract: Neural language models have achieved human level performance across several NLP datasets. However, recent studies have shown that these models are not truly learning the desired task; rather, their high performance is attributed to overfitting using spurious biases, which suggests that the capabilities of AI systems have been over-estimated. We introduce a generic formula for Data Quality Index (D…

    Submitted 2 May, 2020; originally announced May 2020.

    Comments: 63 pages

  21. arXiv:1707.01466

    cs.SE

    Functional Requirements-Based Automated Testing for Avionics

    Authors: Youcheng Sun, Martin Brain, Daniel Kroening, Andrew Hawthorn, Thomas Wilson, Florian Schanda, Francisco Javier Guzman Jimenez, Simon Daniel, Chris Bryan, Ian Broster

    Abstract: We propose and demonstrate a method for the reduction of testing effort in safety-critical software development using DO-178 guidance. We achieve this through the application of Bounded Model Checking (BMC) to formal low-level requirements, in order to generate tests automatically that are good enough to replace existing labor-intensive test writing procedures while maintaining independence from i…

    Submitted 5 July, 2017; originally announced July 2017.

  22. arXiv:1701.08229

    cs.IR cs.CL cs.CY cs.SI

    Feature Studies to Inform the Classification of Depressive Symptoms from Twitter Data for Population Health

    Authors: Danielle Mowery, Craig Bryan, Mike Conway

    Abstract: The utility of Twitter data as a medium to support population-level mental health monitoring is not well understood. In an effort to better understand the predictive power of supervised machine learning classifiers and the influence of feature sets for efficiently classifying depression-related tweets on a large scale, we conducted two feature study experiments. In the first experiment, we assesse…

    Submitted 27 January, 2017; originally announced January 2017.