Zum Hauptinhalt springen

Showing 1–49 of 49 results for author: Ranjan, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00592  [pdf

    cs.CV

    Unveiling Glitches: A Deep Dive into Image Encoding Bugs within CLIP

    Authors: Ayush Ranjan, Daniel Wen, Karthik Bhat

    Abstract: Understanding the limitations and weaknesses of state-of-the-art models in artificial intelligence is crucial for their improvement and responsible application. In this research, we focus on CLIP, a model renowned for its integration of vision and language processing. Our objective is to uncover recurring problems and blind spots in CLIP's image comprehension. By delving into both the commonalitie… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    ACM Class: F.2.2; I.2.7

  2. arXiv:2405.10183  [pdf, other

    cs.NE q-bio.PE

    A Guide to Tracking Phylogenies in Parallel and Distributed Agent-based Evolution Models

    Authors: Matthew Andres Moreno, Anika Ranjan, Emily Dolson, Luis Zaman

    Abstract: Computer simulations are an important tool for studying the mechanics of biological evolution. In particular, in silico work with agent-based models provides an opportunity to collect high-quality records of ancestry relationships among simulated agents. Such phylogenies can provide insight into evolutionary dynamics within these simulations. Existing work generally tracks lineages directly, yield… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  3. arXiv:2403.16247  [pdf, other

    cs.CL cs.LG cs.NE

    Improving Sequence-to-Sequence Models for Abstractive Text Summarization Using Meta Heuristic Approaches

    Authors: Aditya Saxena, Ashutosh Ranjan

    Abstract: As human society transitions into the information age, reduction in our attention span is a contingency, and people who spend time reading lengthy news articles are decreasing rapidly and the need for succinct information is higher than ever before. Therefore, it is essential to provide a quick overview of important news by concisely summarizing the top news article and the most intuitive headline… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  4. arXiv:2403.01410  [pdf, other

    cs.RO

    Barrier Functions Inspired Reward Shaping for Reinforcement Learning

    Authors: Nilaksh Nilaksh, Abhishek Ranjan, Shreenabh Agrawal, Aayush Jain, Pushpak Jagtap, Shishir Kolathaya

    Abstract: Reinforcement Learning (RL) has progressed from simple control tasks to complex real-world challenges with large state spaces. While RL excels in these tasks, training time remains a limitation. Reward shaping is a popular solution, but existing methods often rely on value functions, which face scalability issues. This paper presents a novel safety-oriented reward-shaping framework inspired by bar… ▽ More

    Submitted 1 April, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

    Comments: 7 pages, 10 figures, Accepted as contributed paper at ICRA 2024

    ACM Class: I.2.9

  5. arXiv:2312.11537  [pdf, other

    cs.CV cs.GR

    FastSR-NeRF: Improving NeRF Efficiency on Consumer Devices with A Simple Super-Resolution Pipeline

    Authors: Chien-Yu Lin, Qichen Fu, Thomas Merth, Karren Yang, Anurag Ranjan

    Abstract: Super-resolution (SR) techniques have recently been proposed to upscale the outputs of neural radiance fields (NeRF) and generate high-quality images with enhanced inference speeds. However, existing NeRF+SR methods increase training overhead by using extra input features, loss functions, and/or expensive training procedures such as knowledge distillation. In this paper, we aim to leverage SR for… ▽ More

    Submitted 20 December, 2023; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: WACV 2024 (Oral)

  6. arXiv:2311.18168  [pdf, other

    cs.CV cs.LG eess.AS

    Probabilistic Speech-Driven 3D Facial Motion Synthesis: New Benchmarks, Methods, and Applications

    Authors: Karren D. Yang, Anurag Ranjan, Jen-Hao Rick Chang, Raviteja Vemulapalli, Oncel Tuzel

    Abstract: We consider the task of animating 3D facial geometry from speech signal. Existing works are primarily deterministic, focusing on learning a one-to-one mapping from speech signal to 3D face meshes on small datasets with limited speakers. While these models can achieve high-quality lip articulation for speakers in the training set, they are unable to capture the full and diverse distribution of 3D f… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  7. arXiv:2311.17910  [pdf, other

    cs.CV cs.GR

    HUGS: Human Gaussian Splats

    Authors: Muhammed Kocabas, Jen-Hao Rick Chang, James Gabriel, Oncel Tuzel, Anurag Ranjan

    Abstract: Recent advances in neural rendering have improved both training and rendering times by orders of magnitude. While these methods demonstrate state-of-the-art quality and speed, they are designed for photogrammetry of static scenes and do not generalize well to freely moving humans in the environment. In this work, we introduce Human Gaussian Splats (HUGS) that represents an animatable human togethe… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  8. arXiv:2310.15130  [pdf, other

    cs.SD cs.CV eess.AS

    Novel-View Acoustic Synthesis from 3D Reconstructed Rooms

    Authors: Byeongjoo Ahn, Karren Yang, Brian Hamilton, Jonathan Sheaffer, Anurag Ranjan, Miguel Sarabia, Oncel Tuzel, Jen-Hao Rick Chang

    Abstract: We investigate the benefit of combining blind audio recordings with 3D scene information for novel-view acoustic synthesis. Given audio recordings from 2-4 microphones and the 3D geometry and material of a scene containing multiple unknown sound sources, we estimate the sound anywhere in the scene. We identify the main challenges of novel-view acoustic synthesis as sound source localization, separ… ▽ More

    Submitted 15 August, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: Interspeech 2024

  9. arXiv:2310.00831  [pdf, other

    cs.CV cs.AI

    Action Recognition Utilizing YGAR Dataset

    Authors: Shuo Wang, Amiya Ranjan, Lawrence Jiang

    Abstract: The scarcity of high quality actions video data is a bottleneck in the research and application of action recognition. Although significant effort has been made in this area, there still exist gaps in the range of available data types a more flexible and comprehensive data set could help bridge. In this paper, we present a new 3D actions data simulation engine and generate 3 sets of sample data to… ▽ More

    Submitted 1 October, 2023; originally announced October 2023.

    Comments: 10 pages, 18 figures

  10. arXiv:2309.15259  [pdf, other

    quant-ph cs.CV eess.IV

    SLIQ: Quantum Image Similarity Networks on Noisy Quantum Computers

    Authors: Daniel Silver, Tirthak Patel, Aditya Ranjan, Harshitta Gandhi, William Cutler, Devesh Tiwari

    Abstract: Exploration into quantum machine learning has grown tremendously in recent years due to the ability of quantum computers to speed up classical programs. However, these efforts have yet to solve unsupervised similarity detection tasks due to the challenge of porting them to run on quantum computers. To overcome this challenge, we propose SLIQ, the first open-sourced work for resource-efficient quan… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    Journal ref: Vol. 37 No. 8: AAAI-2023 Technical Tracks 8

  11. arXiv:2309.07164  [pdf, other

    eess.AS cs.AI cs.SD

    Hybrid ASR for Resource-Constrained Robots: HMM - Deep Learning Fusion

    Authors: Anshul Ranjan, Kaushik Jegadeesan

    Abstract: This paper presents a novel hybrid Automatic Speech Recognition (ASR) system designed specifically for resource-constrained robots. The proposed approach combines Hidden Markov Models (HMMs) with deep learning models and leverages socket programming to distribute processing tasks effectively. In this architecture, the HMM-based processing takes place within the robot, while a separate PC handles t… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: To be published in IEEE Access, 9 pages, 14 figures, Received valuable support from CCBD PESU, for associated code, see https://github.com/AnshulRanjan2004/PyHMM

    MSC Class: 62M09 (Primary) 62F10; 62F12 (Secondary) ACM Class: I.2.7; I.2.9

  12. arXiv:2308.11096  [pdf, other

    quant-ph cs.AR cs.CV

    MosaiQ: Quantum Generative Adversarial Networks for Image Generation on NISQ Computers

    Authors: Daniel Silver, Tirthak Patel, William Cutler, Aditya Ranjan, Harshitta Gandhi, Devesh Tiwari

    Abstract: Quantum machine learning and vision have come to the fore recently, with hardware advances enabling rapid advancement in the capabilities of quantum machines. Recently, quantum image generation has been explored with many potential advantages over non-quantum techniques; however, previous techniques have suffered from poor quality and robustness. To address these problems, we introduce, MosaiQ, a… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted to appear at ICCV'23

  13. arXiv:2307.16799  [pdf, other

    quant-ph cs.AR cs.DC cs.ET

    Toward Privacy in Quantum Program Execution On Untrusted Quantum Cloud Computing Machines for Business-sensitive Quantum Needs

    Authors: Tirthak Patel, Daniel Silver, Aditya Ranjan, Harshitta Gandhi, William Cutler, Devesh Tiwari

    Abstract: Quantum computing is an emerging paradigm that has shown great promise in accelerating large-scale scientific, optimization, and machine-learning workloads. With most quantum computing solutions being offered over the cloud, it has become imperative to protect confidential and proprietary quantum code from being accessed by untrusted and/or adversarial agents. In response to this challenge, we pro… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

  14. arXiv:2306.11177  [pdf, other

    cs.DC cs.PF

    Pipit: Scripting the analysis of parallel execution traces

    Authors: Abhinav Bhatele, Rakrish Dhakal, Alexander Movsesyan, Aditya K. Ranjan, Onur Cankur

    Abstract: Performance analysis is a critical step in the oft-repeated, iterative process of performance tuning of parallel programs. Per-process, per-thread traces (detailed logs of events with timestamps) enable in-depth analysis of parallel program execution to identify different kinds of performance issues. Often times, trace collection tools provide a graphical tool to analyze the trace output. However,… ▽ More

    Submitted 14 May, 2024; v1 submitted 19 June, 2023; originally announced June 2023.

  15. arXiv:2305.13525  [pdf, other

    cs.LG cs.AI cs.DC cs.PF

    A 4D Hybrid Algorithm to Scale Parallel Training to Thousands of GPUs

    Authors: Siddharth Singh, Prajwal Singhania, Aditya K. Ranjan, Zack Sating, Abhinav Bhatele

    Abstract: Heavy communication, in particular, collective operations, can become a critical performance bottleneck in scaling the training of billion-parameter neural networks to large-scale parallel systems. This paper introduces a four-dimensional (4D) approach to optimize communication in parallel training. This 4D approach is a hybrid of 3D tensor and data parallelism, and is implemented in the AxoNN fra… ▽ More

    Submitted 14 May, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

  16. arXiv:2304.12390  [pdf, other

    cs.CV cs.GR

    Pointersect: Neural Rendering with Cloud-Ray Intersection

    Authors: Jen-Hao Rick Chang, Wei-Yu Chen, Anurag Ranjan, Kwang Moo Yi, Oncel Tuzel

    Abstract: We propose a novel method that renders point clouds as if they are surfaces. The proposed method is differentiable and requires no scene-specific optimization. This unique capability enables, out-of-the-box, surface normal estimation, rendering room-scale point clouds, inverse rendering, and ray tracing with global illumination. Unlike existing work that focuses on converting point clouds to other… ▽ More

    Submitted 24 April, 2023; originally announced April 2023.

    Comments: CVPR 2023

  17. arXiv:2304.01480  [pdf, other

    cs.CV

    FineRecon: Depth-aware Feed-forward Network for Detailed 3D Reconstruction

    Authors: Noah Stier, Anurag Ranjan, Alex Colburn, Yajie Yan, Liang Yang, Fangchang Ma, Baptiste Angles

    Abstract: Recent works on 3D reconstruction from posed images have demonstrated that direct inference of scene-level 3D geometry without test-time optimization is feasible using deep neural networks, showing remarkable promise and high efficiency. However, the reconstructed geometry, typically represented as a 3D truncated signed distance function (TSDF), is often coarse without fine geometric details. To a… ▽ More

    Submitted 18 August, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: ICCV 2023

  18. arXiv:2303.15437  [pdf, other

    cs.CV

    FaceLit: Neural 3D Relightable Faces

    Authors: Anurag Ranjan, Kwang Moo Yi, Jen-Hao Rick Chang, Oncel Tuzel

    Abstract: We propose a generative framework, FaceLit, capable of generating a 3D face that can be rendered at various user-defined lighting conditions and views, learned purely from 2D images in-the-wild without any manual annotation. Unlike existing works that require careful capture setup or human labor, we rely on off-the-shelf pose and illumination estimators. With these estimates, we incorporate the Ph… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: CVPR 2023

  19. arXiv:2303.14189  [pdf, other

    cs.CV

    FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization

    Authors: Pavan Kumar Anasosalu Vasu, James Gabriel, Jeff Zhu, Oncel Tuzel, Anurag Ranjan

    Abstract: The recent amalgamation of transformer and convolutional designs has led to steady improvements in accuracy and efficiency of the models. In this work, we introduce FastViT, a hybrid vision transformer architecture that obtains the state-of-the-art latency-accuracy trade-off. To this end, we introduce a novel token mixing operator, RepMixer, a building block of FastViT, that uses structural repara… ▽ More

    Submitted 17 August, 2023; v1 submitted 24 March, 2023; originally announced March 2023.

    Comments: ICCV 2023

  20. arXiv:2210.14800  [pdf, other

    eess.AS cs.HC cs.SD

    Naturalistic Head Motion Generation from Speech

    Authors: Trisha Mittal, Zakaria Aldeneh, Masha Fedzechkina, Anurag Ranjan, Barry-John Theobald

    Abstract: Synthesizing natural head motion to accompany speech for an embodied conversational agent is necessary for providing a rich interactive experience. Most prior works assess the quality of generated head motion by comparing them against a single ground-truth using an objective metric. Yet there are many plausible head motion sequences to accompany a speech utterance. In this work, we study the varia… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: Submitted to ICASSP 2023

  21. arXiv:2207.10237  [pdf, other

    cs.CV

    SPIN: An Empirical Evaluation on Sharing Parameters of Isotropic Networks

    Authors: Chien-Yu Lin, Anish Prabhu, Thomas Merth, Sachin Mehta, Anurag Ranjan, Maxwell Horton, Mohammad Rastegari

    Abstract: Recent isotropic networks, such as ConvMixer and vision transformers, have found significant success across visual recognition tasks, matching or outperforming non-isotropic convolutional neural networks (CNNs). Isotropic architectures are particularly well-suited to cross-layer weight sharing, an effective neural network compression technique. In this paper, we perform an empirical evaluation on… ▽ More

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: Accepted at ECCV 2022

  22. arXiv:2206.04040  [pdf, other

    cs.CV

    MobileOne: An Improved One millisecond Mobile Backbone

    Authors: Pavan Kumar Anasosalu Vasu, James Gabriel, Jeff Zhu, Oncel Tuzel, Anurag Ranjan

    Abstract: Efficient neural network backbones for mobile devices are often optimized for metrics such as FLOPs or parameter count. However, these metrics may not correlate well with latency of the network when deployed on a mobile device. Therefore, we perform extensive analysis of different metrics by deploying several mobile-friendly networks on a mobile device. We identify and analyze architectural and op… ▽ More

    Submitted 28 March, 2023; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: Accepted at CVPR 2023

  23. arXiv:2204.03618  [pdf

    eess.IV cs.CV

    Pneumonia Detection in Chest X-Rays using Neural Networks

    Authors: Narayana Darapaneni, Ashish Ranjan, Dany Bright, Devendra Trivedi, Ketul Kumar, Vivek Kumar, Anwesh Reddy Paduri

    Abstract: With the advancement in AI, deep learning techniques are widely used to design robust classification models in several areas such as medical diagnosis tasks in which it achieves good performance. In this paper, we have proposed the CNN model (Convolutional Neural Network) for the classification of Chest X-ray images for Radiological Society of North America Pneumonia (RSNA) datasets. The study als… ▽ More

    Submitted 7 April, 2022; originally announced April 2022.

  24. arXiv:2203.12575  [pdf, other

    cs.CV

    NeuMan: Neural Human Radiance Field from a Single Video

    Authors: Wei Jiang, Kwang Moo Yi, Golnoosh Samei, Oncel Tuzel, Anurag Ranjan

    Abstract: Photorealistic rendering and reposing of humans is important for enabling augmented reality experiences. We propose a novel framework to reconstruct the human and the scene that can be rendered with novel human poses and views from just a single in-the-wild video. Given a video captured by a moving camera, we train two NeRF models: a human NeRF model and a scene NeRF model. To train these models,… ▽ More

    Submitted 21 September, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

  25. λ-Scaled-Attention: A Novel Fast Attention Mechanism for Efficient Modeling of Protein Sequences

    Authors: Ashish Ranjan, Md Shah Fahad, Akshay Deepak

    Abstract: Attention-based deep networks have been successfully applied on textual data in the field of NLP. However, their application on protein sequences poses additional challenges due to the weak semantics of the protein words, unlike the plain text words. These unexplored challenges faced by the standard attention technique include (i) vanishing attention score problem and (ii) high variations in the a… ▽ More

    Submitted 8 January, 2022; originally announced January 2022.

    Journal ref: Information Sciences, 2022

  26. arXiv:2110.04252  [pdf, other

    cs.LG cs.CV

    LCS: Learning Compressible Subspaces for Adaptive Network Compression at Inference Time

    Authors: Elvis Nunez, Maxwell Horton, Anish Prabhu, Anurag Ranjan, Ali Farhadi, Mohammad Rastegari

    Abstract: When deploying deep learning models to a device, it is traditionally assumed that available computational resources (compute, memory, and power) remain static. However, real-world computing systems do not always provide stable resource guarantees. Computational resources need to be conserved when load from other processes is high or battery power is low. Inspired by recent works on neural network… ▽ More

    Submitted 8 October, 2021; originally announced October 2021.

  27. arXiv:2110.03860  [pdf, other

    cs.CV cs.LG

    Token Pooling in Vision Transformers

    Authors: Dmitrii Marin, Jen-Hao Rick Chang, Anurag Ranjan, Anish Prabhu, Mohammad Rastegari, Oncel Tuzel

    Abstract: Despite the recent success in many applications, the high computational requirements of vision transformers limit their use in resource-constrained settings. While many existing methods improve the quadratic complexity of attention, in most vision transformers, self-attention is not the major computation bottleneck, e.g., more than 80% of the computation is spent on fully-connected layers. To impr… ▽ More

    Submitted 11 October, 2021; v1 submitted 7 October, 2021; originally announced October 2021.

    Journal ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2023

  28. arXiv:2012.05225  [pdf, other

    cs.CV cs.AI cs.CY cs.LG

    MorphGAN: One-Shot Face Synthesis GAN for Detecting Recognition Bias

    Authors: Nataniel Ruiz, Barry-John Theobald, Anurag Ranjan, Ahmed Hussein Abdelaziz, Nicholas Apostoloff

    Abstract: To detect bias in face recognition networks, it can be useful to probe a network under test using samples in which only specific attributes vary in some controlled way. However, capturing a sufficiently large dataset with specific control over the attributes of interest is difficult. In this work, we describe a simulator that applies specific head pose and facial expression adjustments to images o… ▽ More

    Submitted 10 December, 2020; v1 submitted 9 December, 2020; originally announced December 2020.

  29. arXiv:2011.02523  [pdf, other

    cs.CV cs.GR

    Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding

    Authors: Mike Roberts, Jason Ramapuram, Anurag Ranjan, Atulit Kumar, Miguel Angel Bautista, Nathan Paczan, Russ Webb, Joshua M. Susskind

    Abstract: For many fundamental scene understanding tasks, it is difficult or impossible to obtain per-pixel ground truth labels from real images. We address this challenge by introducing Hypersim, a photorealistic synthetic dataset for holistic indoor scene understanding. To create our dataset, we leverage a large repository of synthetic scenes created by professional artists, and we generate 77,400 images… ▽ More

    Submitted 17 August, 2021; v1 submitted 4 November, 2020; originally announced November 2020.

    Comments: Accepted for publication at the International Conference on Computer Vision (ICCV) 2021

  30. arXiv:2011.00773  [pdf, other

    cs.SD cs.LG cs.MM eess.AS

    Using a Bi-directional LSTM Model with Attention Mechanism trained on MIDI Data for Generating Unique Music

    Authors: Ashish Ranjan, Varun Nagesh Jolly Behera, Motahar Reza

    Abstract: Generating music is an interesting and challenging problem in the field of machine learning. Mimicking human creativity has been popular in recent years, especially in the field of computer vision and image processing. With the advent of GANs, it is possible to generate new similar images, based on trained data. But this cannot be done for music similarly, as music has an extra temporal dimension.… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

  31. arXiv:2011.00443  [pdf, other

    cs.CV cs.AI cs.DC

    A Parallel Approach for Real-Time Face Recognition from a Large Database

    Authors: Ashish Ranjan, Varun Nagesh Jolly Behera, Motahar Reza

    Abstract: We present a new facial recognition system, capable of identifying a person, provided their likeness has been previously stored in the system, in real time. The system is based on storing and comparing facial embeddings of the subject, and identifying them later within a live video feed. This system is highly accurate, and is able to tag people with their ID in real time. It is able to do so, even… ▽ More

    Submitted 1 November, 2020; originally announced November 2020.

  32. arXiv:2011.00414  [pdf, other

    cs.SI cs.CY cs.DM physics.soc-ph

    Graph based Clustering Algorithm for Social Community Transmission Prediction of COVID-19

    Authors: Varun Nagesh Jolly Behera, Ashish Ranjan, Motahar Reza

    Abstract: A system to model the spread of COVID-19 cases after lockdown has been proposed, to define new preventive measures based on hotspots, using the graph clustering algorithm. This method allows for more lenient measures in areas less prone to the virus spread. There exist methods to model the spread of the virus, by predicting the number of confirmed cases. But the proposed system focuses more on the… ▽ More

    Submitted 31 October, 2020; originally announced November 2020.

  33. arXiv:2009.00149  [pdf, other

    cs.CV cs.AI cs.GR cs.LG stat.AP

    GIF: Generative Interpretable Faces

    Authors: Partha Ghosh, Pravir Singh Gupta, Roy Uziel, Anurag Ranjan, Michael Black, Timo Bolkart

    Abstract: Photo-realistic visualization and animation of expressive human faces have been a long standing challenge. 3D face modeling methods provide parametric control but generates unrealistic images, on the other hand, generative 2D models like GANs (Generative Adversarial Networks) output photo-realistic face images, but lack explicit control. Recent methods gain partial control, either by attempting to… ▽ More

    Submitted 25 November, 2020; v1 submitted 31 August, 2020; originally announced September 2020.

    Comments: International Conference on 3D Vision (3DV) 2020

  34. arXiv:2006.01897  [pdf, other

    eess.IV cs.CV

    Automatic Differentiation for All Photons Imaging to See Inside Volumetric Scattering Media

    Authors: Tomohiro Maeda, Ankit Ranjan, Ramesh Raskar

    Abstract: Imaging through dense scattering media - such as biological tissue, fog, and smoke - has applications in the medical and robotics fields. We propose a new framework using automatic differentiation for All Photons Imaging through homogeneous scattering media with unknown optical properties for non-invasive sensing and diagnostics. We overcome the need for the imaging target to be visible to the ill… ▽ More

    Submitted 2 June, 2020; originally announced June 2020.

  35. Learning Multi-Human Optical Flow

    Authors: Anurag Ranjan, David T. Hoffmann, Dimitrios Tzionas, Siyu Tang, Javier Romero, Michael J. Black

    Abstract: The optical flow of humans is well known to be useful for the analysis of human action. Recent optical flow methods focus on training deep networks to approach the problem. However, the training data used by them does not cover the domain of human motion. Therefore, we develop a dataset of multi-human optical flow and train optical flow networks on this dataset. We use a 3D model of the human body… ▽ More

    Submitted 4 December, 2019; v1 submitted 24 October, 2019; originally announced October 2019.

    Comments: arXiv admin note: text overlap with arXiv:1806.05666

    Report number: 2019

    Journal ref: International Journal of Computer Vision (IJCV) 2019

  36. arXiv:1910.10053  [pdf, other

    cs.CV cs.LG eess.IV

    Attacking Optical Flow

    Authors: Anurag Ranjan, Joel Janai, Andreas Geiger, Michael J. Black

    Abstract: Deep neural nets achieve state-of-the-art performance on the problem of optical flow estimation. Since optical flow is used in several safety-critical applications like self-driving cars, it is important to gain insights into the robustness of those techniques. Recently, it has been shown that adversarial attacks easily fool deep neural networks to misclassify objects. The robustness of optical fl… ▽ More

    Submitted 22 October, 2019; originally announced October 2019.

    Comments: ICCV 2019

  37. arXiv:1907.13615  [pdf, other

    cs.CV cs.GR

    Learning to Dress 3D People in Generative Clothing

    Authors: Qianli Ma, Jinlong Yang, Anurag Ranjan, Sergi Pujades, Gerard Pons-Moll, Siyu Tang, Michael J. Black

    Abstract: Three-dimensional human body models are widely used in the analysis of human pose and motion. Existing models, however, are learned from minimally-clothed 3D scans and thus do not generalize to the complexity of dressed people in common images and videos. Additionally, current models lack the expressive power needed to represent the complex non-linear geometry of pose-dependent clothing shapes. To… ▽ More

    Submitted 22 May, 2020; v1 submitted 31 July, 2019; originally announced July 2019.

    Comments: CVPR-2020 camera ready. Code and data are available at https://cape.is.tue.mpg.de

  38. arXiv:1905.03079  [pdf, other

    cs.CV

    Capture, Learning, and Synthesis of 3D Speaking Styles

    Authors: Daniel Cudeiro, Timo Bolkart, Cassidy Laidlaw, Anurag Ranjan, Michael J. Black

    Abstract: Audio-driven 3D facial animation has been widely explored, but achieving realistic, human-like performance is still unsolved. This is due to the lack of available 3D datasets, models, and standard evaluation metrics. To address this, we introduce a unique 4D face dataset with about 29 minutes of 4D scans captured at 60 fps and synchronized audio from 12 speakers. We then train a neural network on… ▽ More

    Submitted 8 May, 2019; originally announced May 2019.

    Comments: To appear in CVPR 2019

  39. arXiv:1811.01338  [pdf, other

    cs.LG q-bio.BM stat.ML

    Deep Robust Framework for Protein Function Prediction using Variable-Length Protein Sequences

    Authors: Ashish Ranjan, Md Shah Fahad, David Fernandez-Baca, Akshay Deepak, Sudhakar Tripathi

    Abstract: Amino acid sequence portrays most intrinsic form of a protein and expresses primary structure of protein. The order of amino acids in a sequence enables a protein to acquire a particular stable conformation that is responsible for the functions of the protein. This relationship between a sequence and its function motivates the need to analyse the sequences for predicting protein functions. Early g… ▽ More

    Submitted 19 June, 2019; v1 submitted 4 November, 2018; originally announced November 2018.

    Journal ref: IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2019

  40. arXiv:1807.10267  [pdf, other

    cs.CV

    Generating 3D faces using Convolutional Mesh Autoencoders

    Authors: Anurag Ranjan, Timo Bolkart, Soubhik Sanyal, Michael J. Black

    Abstract: Learned 3D representations of human faces are useful for computer vision problems such as 3D face tracking and reconstruction from images, as well as graphics applications such as character generation and animation. Traditional models learn a latent representation of a face using linear subspaces or higher-order tensor generalizations. Due to this linearity, they can not capture extreme deformatio… ▽ More

    Submitted 31 July, 2018; v1 submitted 26 July, 2018; originally announced July 2018.

    Journal ref: European Conference on Computer Vision 2018

  41. arXiv:1806.05666  [pdf, other

    cs.CV

    Learning Human Optical Flow

    Authors: Anurag Ranjan, Javier Romero, Michael J. Black

    Abstract: The optical flow of humans is well known to be useful for the analysis of human action. Given this, we devise an optical flow algorithm specifically for human motion and show that it is superior to generic flow methods. Designing a method by hand is impractical, so we develop a new training database of image sequences with ground truth optical flow. For this we use a 3D model of the human body and… ▽ More

    Submitted 22 July, 2018; v1 submitted 14 June, 2018; originally announced June 2018.

    Comments: British Machine Vision Conference 2018 (Oral)

  42. arXiv:1805.09806  [pdf, other

    cs.CV

    Competitive Collaboration: Joint Unsupervised Learning of Depth, Camera Motion, Optical Flow and Motion Segmentation

    Authors: Anurag Ranjan, Varun Jampani, Lukas Balles, Kihwan Kim, Deqing Sun, Jonas Wulff, Michael J. Black

    Abstract: We address the unsupervised learning of several interconnected problems in low-level vision: single view depth prediction, camera motion estimation, optical flow, and segmentation of a video into the static scene and moving regions. Our key insight is that these four fundamental vision problems are coupled through geometric constraints. Consequently, learning to solve them together simplifies the… ▽ More

    Submitted 11 March, 2019; v1 submitted 24 May, 2018; originally announced May 2018.

    Comments: CVPR 2019

  43. arXiv:1703.02118  [pdf, other

    cs.ET

    Computing in Memory with Spin-Transfer Torque Magnetic RAM

    Authors: Shubham Jain, Ashish Ranjan, Kaushik Roy, Anand Raghunathan

    Abstract: In-memory computing is a promising approach to addressing the processor-memory data transfer bottleneck in computing systems. We propose Spin-Transfer Torque Compute-in-Memory (STT-CiM), a design for in-memory computing with Spin-Transfer Torque Magnetic RAM (STT-MRAM). The unique properties of spintronic memory allow multiple wordlines within an array to be simultaneously enabled, opening up the… ▽ More

    Submitted 20 November, 2017; v1 submitted 6 March, 2017; originally announced March 2017.

  44. arXiv:1611.00850  [pdf, other

    cs.CV

    Optical Flow Estimation using a Spatial Pyramid Network

    Authors: Anurag Ranjan, Michael J. Black

    Abstract: We learn to compute optical flow by combining a classical spatial-pyramid formulation with deep learning. This estimates large motions in a coarse-to-fine approach by warping one image of a pair at each pyramid level by the current flow estimate and computing an update to the flow. Instead of the standard minimization of an objective function at each pyramid level, we train one deep network per le… ▽ More

    Submitted 21 November, 2016; v1 submitted 2 November, 2016; originally announced November 2016.

    Comments: 10 pages

  45. arXiv:1607.01254  [pdf, ps, other

    cs.AI

    An extended MABAC for multi-attribute decision making using trapezoidal interval type-2 fuzzy numbers

    Authors: Jagannath Roy, Ananta Ranjan, Animesh Debnath, Samarjit Kar

    Abstract: In this paper, we attempt to extend Multi Attributive Border Approximation area Comparison (MABAC) approach for multi-attribute decision making (MADM) problems based on type-2 fuzzy sets (IT2FSs). As a special case of IT2FSs interval type-2 trapezoidal fuzzy numbers (IT2TrFNs) are adopted here to deal with uncertainties present in many practical evaluation and selection problems. A systematic desc… ▽ More

    Submitted 2 December, 2016; v1 submitted 5 July, 2016; originally announced July 2016.

    Comments: 14 pages

  46. arXiv:1505.05269  [pdf, other

    cs.OS

    A Survey Report on Operating Systems for Tiny Networked Sensors

    Authors: Alok Ranjan, H. B. Sahu, Prasant Misra

    Abstract: Wireless sensor network (WSN) has attracted researchers worldwide to explore the research opportunities, with application mainly in health monitoring, industry automation, battlefields, home automation and environmental monitoring. A WSN is highly resource constrained in terms of energy, computation and memory. WSNs deployment ranges from the normal working environment up to hostile and hazardous… ▽ More

    Submitted 20 May, 2015; originally announced May 2015.

    Comments: 12 pages, Submitted to Journal

    Journal ref: Journal of Advanced Research in Networking and Communication Engineering, Vol(1) issue 1, 2014

  47. arXiv:1310.0519  [pdf

    q-bio.NC cs.HC

    Evidence that Cross-Domain Re-interpretations of Creative Ideas are Recognizable

    Authors: Apara Ranjan, Liane Gabora, Brian O'Connor

    Abstract: The goal of this study was to investigate the translate-ability of creative works into other domains. We tested whether people were able to recognize which works of art were inspired by which pieces of music. Three expert painters created four paintings, each of which was the artist's interpretation of one of four different pieces of instrumental music. Participants were able to identify which pai… ▽ More

    Submitted 9 July, 2019; v1 submitted 1 October, 2013; originally announced October 2013.

    Comments: 6 pages. arXiv admin note: substantial text overlap with arXiv:1308.4706

    Journal ref: In G. Stojanov & B. Indurkhya (Co-Chairs), Creativity and (early) cognitive development. Symposium conducted at the meeting of Association for the Advancement of Artificial Intelligence (AAAI), Palo Alto, CA. (2013)

  48. How Insight Emerges in a Distributed, Content-addressable Memory

    Authors: Liane Gabora, Apara Ranjan

    Abstract: We begin this chapter with the bold claim that it provides a neuroscientific explanation of the magic of creativity. Creativity presents a formidable challenge for neuroscience. Neuroscience generally involves studying what happens in the brain when someone engages in a task that involves responding to a stimulus, or retrieving information from memory and using it the right way, or at the right ti… ▽ More

    Submitted 5 July, 2019; v1 submitted 17 June, 2011; originally announced June 2011.

    Comments: 17 pages; 2 figures

    Journal ref: In A. Bristol, O. Vartanian, & J. Kaufman (Eds.), The neuroscience of creativity (pp. 19-43). Cambridge, MA: MIT Press (2013)

  49. arXiv:1003.1814  [pdf

    cs.IR

    An Analytical Approach to Document Clustering Based on Internal Criterion Function

    Authors: Alok Ranjan, Harish Verma, Eatesh Kandpal, Joydip Dhar

    Abstract: Fast and high quality document clustering is an important task in organizing information, search engine results obtaining from user query, enhancing web crawling and information retrieval. With the large amount of data available and with a goal of creating good quality clusters, a variety of algorithms have been developed having quality-complexity trade-offs. Among these, some algorithms seek to m… ▽ More

    Submitted 9 March, 2010; originally announced March 2010.

    Comments: Pages IEEE format, International Journal of Computer Science and Information Security, IJCSIS, Vol. 7 No. 2, February 2010, USA. ISSN 1947 5500, http://sites.google.com/site/ijcsis/