Showing 1–8 of 8 results for author: Kazi, R H

  1. arXiv:2405.18614

    cs.HC cs.CV cs.LG

    Augmented Physics: Creating Interactive and Embedded Physics Simulations from Static Textbook Diagrams

    Authors: Aditya Gunturu, Yi Wen, Nandi Zhang, Jarin Thundathil, Rubaiat Habib Kazi, Ryo Suzuki

    Abstract: We introduce Augmented Physics, a machine learning-integrated authoring tool designed for creating embedded interactive physics simulations from static textbook diagrams. Leveraging recent advancements in computer vision, such as Segment Anything and Multi-modal LLMs, our web-based system enables users to semi-automatically extract diagrams from physics textbooks and generate interactive simulatio…

    Submitted 10 August, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: UIST 2024

  2. arXiv:2405.18565

    cs.HC

    Video2MR: Automatically Generating Mixed Reality 3D Instructions by Augmenting Extracted Motion from 2D Videos

    Authors: Keiichi Ihara, Kyzyl Monteiro, Mehrad Faridan, Rubaiat Habib Kazi, Ryo Suzuki

    Abstract: This paper introduces Video2MR, a mixed reality system that automatically generates 3D sports and exercise instructions from 2D videos. Mixed reality instructions have great potential for physical training, but existing works require substantial time and cost to create these 3D experiences. Video2MR overcomes this limitation by transforming arbitrary instructional videos available online into MR 3…

    Submitted 28 May, 2024; originally announced May 2024.

  3. arXiv:2405.07065

    cs.HC

    LogoMotion: Visually Grounded Code Generation for Content-Aware Animation

    Authors: Vivian Liu, Rubaiat Habib Kazi, Li-Yi Wei, Matthew Fisher, Timothy Langlois, Seth Walker, Lydia Chilton

    Abstract: Animated logos are a compelling and ubiquitous way individuals and brands represent themselves online. Manually authoring these logos can require significant artistic skill and effort. To help novice designers animate logos, design tools currently offer templates and animation presets. However, these solutions can be limited in their expressive range. Large language models have the potential to he…

    Submitted 11 May, 2024; originally announced May 2024.

  4. arXiv:2401.05631

    cs.HC cs.AI cs.CL cs.ET cs.GR

    DrawTalking: Building Interactive Worlds by Sketching and Speaking

    Authors: Karl Toby Rosenberg, Rubaiat Habib Kazi, Li-Yi Wei, Haijun Xia, Ken Perlin

    Abstract: We introduce DrawTalking, an approach to building and controlling interactive worlds by sketching and speaking while telling stories. It emphasizes user control and flexibility, and gives programming-like capability without requiring code. An early open-ended study with our prototype shows that the mechanics resonate and are applicable to many creative-exploratory use cases, with the potential to…

    Submitted 4 August, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: 25 pages, 27 figures; Matching version accepted at UIST 2024

    ACM Class: H.5.2; D.2.2; I.2.7; D.1.7; H.5.1

  5. arXiv:2308.14922

    cs.HC cs.CV cs.GR

    Automated Conversion of Music Videos into Lyric Videos

    Authors: Jiaju Ma, Anyi Rao, Li-Yi Wei, Rubaiat Habib Kazi, Hijung Valentina Shin, Maneesh Agrawala

    Abstract: Musicians and fans often produce lyric videos, a form of music videos that showcase the song's lyrics, for their favorite songs. However, making such videos can be challenging and time-consuming as the lyrics need to be added in synchrony and visual harmony with the video. Informed by prior work and close examination of existing lyric videos, we propose a set of design guidelines to help creators…

    Submitted 28 August, 2023; originally announced August 2023.

  6. RealityTalk: Real-Time Speech-Driven Augmented Presentation for AR Live Storytelling

    Authors: Jian Liao, Adnan Karim, Shivesh Jadon, Rubaiat Habib Kazi, Ryo Suzuki

    Abstract: We present RealityTalk, a system that augments real-time live presentations with speech-driven interactive virtual elements. Augmented presentations leverage embedded visuals and animation for engaging and expressive storytelling. However, existing tools for live presentations often lack interactivity and improvisation, while creating such effects in video editing tools require significant time an…

    Submitted 12 August, 2022; originally announced August 2022.

    Comments: UIST 2022; For the interactive gallery, see https://ilab.ucalgary.ca/realitytalk/

  7. PoseCoach: A Customizable Analysis and Visualization System for Video-based Running Coaching

    Authors: Jingyuan Liu, Nazmus Saquib, Zhutian Chen, Rubaiat Habib Kazi, Li-Yi Wei, Hongbo Fu, Chiew-Lan Tai

    Abstract: Videos are an accessible form of media for analyzing sports postures and providing feedback to athletes. Existing sport-specific systems embed bespoke human pose attributes and thus can be hard to scale for new attributes, especially for users without programming experiences. Some systems retain scalability by directly showing the differences between two poses, but they might not clearly visualize…

    Submitted 27 February, 2023; v1 submitted 19 April, 2022; originally announced April 2022.

  8. RealitySketch: Embedding Responsive Graphics and Visualizations in AR through Dynamic Sketching

    Authors: Ryo Suzuki, Rubaiat Habib Kazi, Li-Yi Wei, Stephen DiVerdi, Wilmot Li, Daniel Leithinger

    Abstract: We present RealitySketch, an augmented reality interface for sketching interactive graphics and visualizations. In recent years, an increasing number of AR sketching tools enable users to draw and embed sketches in the real world. However, with the current tools, sketched contents are inherently static, floating in mid air without responding to the real world. This paper introduces a new way to em…

    Submitted 19 August, 2020; originally announced August 2020.

    Comments: UIST 2020