Zum Hauptinhalt springen

Showing 1–10 of 10 results for author: Fazli, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.17128  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    OSCaR: Object State Captioning and State Change Representation

    Authors: Nguyen Nguyen, Jing Bi, Ali Vosoughi, Yapeng Tian, Pooyan Fazli, Chenliang Xu

    Abstract: The capability of intelligent models to extrapolate and comprehend changes in object states is a crucial yet demanding aspect of AI research, particularly through the lens of human interaction in real-world settings. This task involves describing complex visual environments, identifying active objects, and interpreting their changes as conveyed through language. Traditional methods, which isolate… ▽ More

    Submitted 2 April, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: NAACL 2024

  2. arXiv:2311.04480  [pdf, other

    cs.CV cs.CL cs.LG

    CLearViD: Curriculum Learning for Video Description

    Authors: Cheng-Yu Chuang, Pooyan Fazli

    Abstract: Video description entails automatically generating coherent natural language sentences that narrate the content of a given video. We introduce CLearViD, a transformer-based model for video description generation that leverages curriculum learning to accomplish this task. In particular, we investigate two curriculum strategies: (1) progressively exposing the model to more challenging samples by gra… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 15 pages, 4 figures

  3. arXiv:2304.01334  [pdf, other

    cs.HC cs.RO

    Clustering Social Touch Gestures for Human-Robot Interaction

    Authors: Ramzi Abou Chahine, Steven Vasquez, Pooyan Fazli, Hasti Seifi

    Abstract: Social touch provides a rich non-verbal communication channel between humans and robots. Prior work has identified a set of touch gestures for human-robot interaction and described them with natural language labels (e.g., stroking, patting). Yet, no data exists on the semantic relationships between the touch gestures in users' minds. To endow robots with touch intelligence, we investigated how peo… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: 8 pages

  4. arXiv:2211.09397  [pdf, other

    cs.RO cs.HC

    Charting Visual Impression of Robot Hands

    Authors: Hasti Seifi, Steven A. Vasquez, Hyunyoung Kim, Pooyan Fazli

    Abstract: A wide variety of robotic hands have been designed to date. Yet, we do not know how users perceive these hands and feel about interacting with them. To inform hand design for social robots, we compiled a dataset of 73 robot hands and ran an online study, in which 160 users rated their impressions of the hands using 17 rating scales. Next, we developed 17 regression models that can predict user rat… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: 8 pages

  5. arXiv:2111.03994   

    cs.HC cs.CV cs.LG

    NarrationBot and InfoBot: A Hybrid System for Automated Video Description

    Authors: Shasta Ihorn, Yue-Ting Siu, Aditya Bodi, Lothar Narins, Jose M. Castanon, Yash Kant, Abhishek Das, Ilmi Yoon, Pooyan Fazli

    Abstract: Video accessibility is crucial for blind and low vision users for equitable engagements in education, employment, and entertainment. Despite the availability of professional and amateur services and tools, most human-generated descriptions are expensive and time consuming. Moreover, the rate of human-generated descriptions cannot match the speed of video production. To overcome the increasing gaps… ▽ More

    Submitted 11 January, 2022; v1 submitted 7 November, 2021; originally announced November 2021.

    Comments: arXiv admin note: This article has been withdrawn by arXiv administration due to an unresolvable authorship dispute

  6. arXiv:1912.12630  [pdf, other

    cs.LG cs.AI stat.ML

    Real-time Policy Distillation in Deep Reinforcement Learning

    Authors: Yuxiang Sun, Pooyan Fazli

    Abstract: Policy distillation in deep reinforcement learning provides an effective way to transfer control policies from a larger network to a smaller untrained network without a significant degradation in performance. However, policy distillation is underexplored in deep reinforcement learning, and existing approaches are computationally inefficient, resulting in a long distillation time. In addition, the… ▽ More

    Submitted 29 December, 2019; originally announced December 2019.

    Comments: In Proceedings of the Workshop on ML for Systems, Thirty-third Conference on Neural Information Processing Systems (NeurIPS), 2019

  7. arXiv:1803.03719  [pdf, other

    cs.RO cs.AI cs.LG stat.ML

    DeepMoTIon: Learning to Navigate Like Humans

    Authors: Mahmoud Hamandi, Mike D'Arcy, Pooyan Fazli

    Abstract: We present a novel human-aware navigation approach, where the robot learns to mimic humans to navigate safely in crowds. The presented model, referred to as DeepMoTIon, is trained with pedestrian surveillance data to predict human velocity in the environment. The robot processes LiDAR scans via the trained network to navigate to the target location. We conduct extensive experiments to assess the c… ▽ More

    Submitted 1 August, 2019; v1 submitted 9 March, 2018; originally announced March 2018.

    Comments: 7 pages, In Proceedings of the IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2019

  8. arXiv:1710.06831  [pdf, other

    cs.RO

    Setting Up the Beam for Human-Centered Service Tasks

    Authors: Utkarsh Patel, Emre Hatay, Mike D'Arcy, Ghazal Zand, Pooyan Fazli

    Abstract: We introduce the Beam, a collaborative autonomous mobile service robot, based on SuitableTech's Beam telepresence system. We present a set of enhancements to the telepresence system, including autonomy, human awareness, increased computation and sensing capabilities, and integration with the popular Robot Operating System (ROS) framework. Together, our improvements transform the Beam into a low-co… ▽ More

    Submitted 18 October, 2017; originally announced October 2017.

    Comments: 10 pages

  9. arXiv:0908.2661  [pdf

    cs.MA cs.CY cs.HC

    Human-Robot Teams in Entertainment and Other Everyday Scenarios

    Authors: Pooyan Fazli, Alan K. Mackworth

    Abstract: A new and relatively unexplored research direction in robotics systems is the coordination of humans and robots working as a team. In this paper, we focus upon problem domains and tasks in which multiple robots, humans and other agents are cooperating through coordination to satisfy a set of goals or to maximize utility. We are primarily interested in applications of human robot coordination in… ▽ More

    Submitted 18 August, 2009; originally announced August 2009.

  10. arXiv:0908.2656  [pdf, ps, other

    cs.CV cs.RO

    Semantic Robot Vision Challenge: Current State and Future Directions

    Authors: Scott Helmer, David Meger, Pooja Viswanathan, Sancho McCann, Matthew Dockrey, Pooyan Fazli, Tristram Southey, Marius Muja, Michael Joya, Jim Little, David Lowe, Alan Mackworth

    Abstract: The Semantic Robot Vision Competition provided an excellent opportunity for our research lab to integrate our many ideas under one umbrella, inspiring both collaboration and new research. The task, visual search for an unknown object, is relevant to both the vision and robotics communities. Moreover, since the interplay of robotics and vision is sometimes ignored, the competition provides a venu… ▽ More

    Submitted 18 August, 2009; originally announced August 2009.

    Comments: The IJCAI-09 Workshop on Competitions in Artificial Intelligence and Robotics, Pasadena, California, USA, July 11-17, 2009