Zum Hauptinhalt springen

Showing 1–50 of 81 results for author: Sridhar, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.20592  [pdf, other

    cs.CV cs.MM cs.SD eess.AS

    EgoSonics: Generating Synchronized Audio for Silent Egocentric Videos

    Authors: Aashish Rai, Srinath Sridhar

    Abstract: We introduce EgoSonics, a method to generate semantically meaningful and synchronized audio tracks conditioned on silent egocentric videos. Generating audio for silent egocentric videos could open new applications in virtual reality, assistive technologies, or for augmenting existing datasets. Existing work has been limited to domains like speech, music, or impact sounds and cannot easily capture… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: preprint

  2. arXiv:2406.05059  [pdf, other

    cs.CV

    GenHeld: Generating and Editing Handheld Objects

    Authors: Chaerin Min, Srinath Sridhar

    Abstract: Grasping is an important human activity that has long been studied in robotics, computer vision, and cognitive science. Most existing works study grasping from the perspective of synthesizing hand poses conditioned on 3D or 2D object representations. We propose GenHeld to address the inverse problem of synthesizing held objects conditioned on 3D hand model or 2D image. Given a 3D model of hand, Ge… ▽ More

    Submitted 14 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

  3. arXiv:2405.18377  [pdf, other

    cs.AI

    LLaMA-NAS: Efficient Neural Architecture Search for Large Language Models

    Authors: Anthony Sarah, Sharath Nittur Sridhar, Maciej Szankin, Sairam Sundaresan

    Abstract: The abilities of modern large language models (LLMs) in solving natural language processing, complex reasoning, sentiment analysis and other tasks have been extraordinary which has prompted their extensive adoption. Unfortunately, these abilities come with very high memory and computational costs which precludes the use of LLMs on most hardware platforms. To mitigate this, we propose an effective… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  4. arXiv:2405.01808  [pdf, other

    cs.IT cs.NI

    GRAND Massive Parallel Decoding Framework for Low Latency in Beyond 5G

    Authors: Danilo Gligoroski, Sahana Sridhar, Katina Kralevska

    Abstract: We propose a massive parallel decoding GRAND framework. The framework introduces two novelties: 1. A likelihood function for $M$-QAM demodulated signals that effectively reduces the symbol error pattern space from $\mathcal{O}(5^{N/\log_2 M})$ down to $\mathcal{O}(4^{N/\log_2 M})$; and 2. A massively parallel matrix-vector multiplication for matrices of size $K\times N$ ($K \leq N$) that performs… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: Accepted at 15th International Conference on Ubiquitous and Future Networks (ICUFN 2024)

  5. arXiv:2404.14403  [pdf, other

    cs.CV

    GeoDiffuser: Geometry-Based Image Editing with Diffusion Models

    Authors: Rahul Sajnani, Jeroen Vanbaar, Jie Min, Kapil Katyal, Srinath Sridhar

    Abstract: The success of image generative models has enabled us to build methods that can edit images based on text or other user input. However, these methods are bespoke, imprecise, require additional information, or are limited to only 2D image edits. We present GeoDiffuser, a zero-shot optimization-based method that unifies common 2D and 3D image-based object editing capabilities into a single method. O… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  6. arXiv:2404.06246  [pdf, other

    cs.CV cs.AI

    GHNeRF: Learning Generalizable Human Features with Efficient Neural Radiance Fields

    Authors: Arnab Dey, Di Yang, Rohith Agaram, Antitza Dantcheva, Andrew I. Comport, Srinath Sridhar, Jean Martinet

    Abstract: Recent advances in Neural Radiance Fields (NeRF) have demonstrated promising results in 3D scene representations, including 3D human representations. However, these representations often lack crucial information on the underlying human pose and structure, which is crucial for AR/VR applications and games. In this paper, we introduce a novel approach, termed GHNeRF, designed to address these limita… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  7. arXiv:2404.04643  [pdf, other

    cs.RO cs.CV

    Constrained 6-DoF Grasp Generation on Complex Shapes for Improved Dual-Arm Manipulation

    Authors: Gaurav Singh, Sanket Kalwar, Md Faizal Karim, Bipasha Sen, Nagamanikandan Govindan, Srinath Sridhar, K Madhava Krishna

    Abstract: Efficiently generating grasp poses tailored to specific regions of an object is vital for various robotic manipulation tasks, especially in a dual-arm setup. This scenario presents a significant challenge due to the complex geometries involved, requiring a deep understanding of the local geometry to generate grasps efficiently on the specified constrained regions. Existing methods only explore set… ▽ More

    Submitted 15 July, 2024; v1 submitted 6 April, 2024; originally announced April 2024.

    Comments: Project Page: https://constrained-grasp-diffusion.github.io/

  8. arXiv:2402.18386  [pdf, other

    cs.CR cs.DC

    TrustRate: A Decentralized Platform for Hijack-Resistant Anonymous Reviews

    Authors: Rohit Dwivedula, Sriram Sridhar, Sambhav Satija, Muthian Sivathanu, Nishanth Chandran, Divya Gupta, Satya Lokam

    Abstract: Reviews and ratings by users form a central component in several widely used products today (e.g., product reviews, ratings of online content, etc.), but today's platforms for managing such reviews are ad-hoc and vulnerable to various forms of tampering and hijack by fake reviews either by bots or motivated paid workers. We define a new metric called 'hijack-resistance' for such review platforms,… ▽ More

    Submitted 20 July, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: 23 pages. Poster at The 24th Privacy Enhancing Technologies Symposium, 2024, Bristol, United Kingdom

  9. arXiv:2401.05342  [pdf, other

    q-bio.NC cs.AI cs.LG

    Most discriminative stimuli for functional cell type clustering

    Authors: Max F. Burg, Thomas Zenkel, Michaela Vystrčilová, Jonathan Oesterle, Larissa Höfling, Konstantin F. Willeke, Jan Lause, Sarah Müller, Paul G. Fahey, Zhiwei Ding, Kelli Restivo, Shashwat Sridhar, Tim Gollisch, Philipp Berens, Andreas S. Tolias, Thomas Euler, Matthias Bethge, Alexander S. Ecker

    Abstract: Identifying cell types and understanding their functional properties is crucial for unraveling the mechanisms underlying perception and cognition. In the retina, functional types can be identified by carefully selected stimuli, but this requires expert domain knowledge and biases the procedure towards previously known cell types. In the visual cortex, it is still unknown what functional types exis… ▽ More

    Submitted 14 March, 2024; v1 submitted 29 November, 2023; originally announced January 2024.

  10. arXiv:2312.13301  [pdf, other

    cs.LG cs.AI

    SimQ-NAS: Simultaneous Quantization Policy and Neural Architecture Search

    Authors: Sharath Nittur Sridhar, Maciej Szankin, Fang Chen, Sairam Sundaresan, Anthony Sarah

    Abstract: Recent one-shot Neural Architecture Search algorithms rely on training a hardware-agnostic super-network tailored to a specific task and then extracting efficient sub-networks for different hardware platforms. Popular approaches separate the training of super-networks from the search for sub-networks, often employing predictors to alleviate the computational overhead associated with search. Additi… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  11. arXiv:2312.08356  [pdf, other

    cs.DB cs.DC

    CUTTANA: Scalable Graph Partitioning for Faster Distributed Graph Databases and Analytics

    Authors: Milad Rezaei Hajidehi, Sraavan Sridhar, Margo Seltzer

    Abstract: Graph partitioning plays a pivotal role in various distributed graph processing applications, including graph analytics, graph neural network training, and distributed graph databases. Graphs that require distributed settings are often too large to fit in the main memory of a single machine. This challenge renders traditional in-memory graph partitioners infeasible, leading to the emergence of str… ▽ More

    Submitted 30 March, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: Preprint version, Under-review, Code available after reviews

  12. arXiv:2312.06644  [pdf, other

    cs.CV cs.AI cs.GR

    AnyHome: Open-Vocabulary Generation of Structured and Textured 3D Homes

    Authors: Rao Fu, Zehao Wen, Zichen Liu, Srinath Sridhar

    Abstract: Inspired by cognitive theories, we introduce AnyHome, a framework that translates any text into well-structured and textured indoor scenes at a house-scale. By prompting Large Language Models (LLMs) with designed templates, our approach converts provided textual narratives into amodal structured representations. These representations guarantee consistent and realistic spatial layouts by directing… ▽ More

    Submitted 28 July, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: accepted by ECCV 2024

  13. arXiv:2312.02137  [pdf, other

    cs.CV

    MANUS: Markerless Grasp Capture using Articulated 3D Gaussians

    Authors: Chandradeep Pokhariya, Ishaan N Shah, Angela Xing, Zekun Li, Kefan Chen, Avinash Sharma, Srinath Sridhar

    Abstract: Understanding how we grasp objects with our hands has important applications in areas like robotics and mixed reality. However, this challenging problem requires accurate modeling of the contact between hands and objects. To capture grasps, existing methods use skeletons, meshes, or parametric models that does not represent hand shape accurately resulting in inaccurate contacts. We present MANUS,… ▽ More

    Submitted 28 March, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR) 2024

  14. arXiv:2310.13759  [pdf, other

    cs.SD eess.AS

    Multi-label Open-set Audio Classification

    Authors: Sripathi Sridhar, Mark Cartwright

    Abstract: Current audio classification models have small class vocabularies relative to the large number of sound event classes of interest in the real world. Thus, they provide a limited view of the world that may miss important yet unexpected or unknown sound events. To address this issue, open-set audio classification techniques have been developed to detect sound events from unknown classes. Although th… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Published at the Workshop on Detection and Classification of Acoustic Scenes and Events, 2023 (DCASE 2023)

  15. arXiv:2310.06338  [pdf, other

    cs.CR cs.DC

    Better Safe than Sorry: Recovering after Adversarial Majority

    Authors: Srivatsan Sridhar, Dionysis Zindros, David Tse

    Abstract: The security of blockchain protocols is a combination of two properties: safety and liveness. It is well known that no blockchain protocol can provide both to sleepy (intermittently online) clients under adversarial majority. However, safety is more critical in that a single safety violation can cause users to lose money. At the same time, liveness must not be lost forever. We show that, in a sync… ▽ More

    Submitted 3 November, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

  16. arXiv:2308.15609  [pdf, other

    cs.LG cs.AI

    InstaTune: Instantaneous Neural Architecture Search During Fine-Tuning

    Authors: Sharath Nittur Sridhar, Souvik Kundu, Sairam Sundaresan, Maciej Szankin, Anthony Sarah

    Abstract: One-Shot Neural Architecture Search (NAS) algorithms often rely on training a hardware agnostic super-network for a domain specific task. Optimal sub-networks are then extracted from the trained super-network for different hardware platforms. However, training super-networks from scratch can be extremely time consuming and compute intensive especially for large models that rely on a two-stage trai… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

  17. arXiv:2308.10337  [pdf, other

    cs.CV

    Strata-NeRF : Neural Radiance Fields for Stratified Scenes

    Authors: Ankit Dhiman, Srinath R, Harsh Rangwani, Rishubh Parihar, Lokesh R Boregowda, Srinath Sridhar, R Venkatesh Babu

    Abstract: Neural Radiance Field (NeRF) approaches learn the underlying 3D representation of a scene and generate photo-realistic novel views with high fidelity. However, most proposed settings concentrate on modelling a single object or a single level of a scene. However, in the real world, we may capture a scene at multiple levels, resulting in a layered capture. For example, tourists usually capture a mon… ▽ More

    Submitted 20 August, 2023; originally announced August 2023.

    Comments: ICCV 2023, Project Page: https://ankitatiisc.github.io/Strata-NeRF/

  18. arXiv:2308.05096  [pdf, other

    cs.DC cs.CR

    Optimal Flexible Consensus and its Application to Ethereum

    Authors: Joachim Neu, Srivatsan Sridhar, Lei Yang, David Tse

    Abstract: Classic BFT consensus protocols guarantee safety and liveness for all clients if fewer than one-third of replicas are faulty. However, in applications such as high-value payments, some clients may want to prioritize safety over liveness. Flexible consensus allows each client to opt for a higher safety resilience, albeit at the expense of reduced liveness resilience. We present the first constructi… ▽ More

    Submitted 3 December, 2023; v1 submitted 9 August, 2023; originally announced August 2023.

    Comments: To be published at the IEEE Symposium on Security & Privacy 2024

  19. arXiv:2307.16897  [pdf, other

    cs.CV cs.AI

    DiVa-360: The Dynamic Visual Dataset for Immersive Neural Fields

    Authors: Cheng-You Lu, Peisen Zhou, Angela Xing, Chandradeep Pokhariya, Arnab Dey, Ishaan Shah, Rugved Mavidipalli, Dylan Hu, Andrew Comport, Kefan Chen, Srinath Sridhar

    Abstract: Advances in neural fields are enabling high-fidelity capture of the shape and appearance of dynamic 3D scenes. However, their capabilities lag behind those offered by conventional representations such as 2D videos because of algorithmic challenges and the lack of large-scale multi-view real-world datasets. We address the dataset limitation with DiVa-360, a real-world 360 dynamic visual dataset tha… ▽ More

    Submitted 26 March, 2024; v1 submitted 31 July, 2023; originally announced July 2023.

  20. arXiv:2307.12212  [pdf, other

    cs.CR cs.DC

    Content Censorship in the InterPlanetary File System

    Authors: Srivatsan Sridhar, Onur Ascigil, Navin Keizer, François Genon, Sébastien Pierre, Yiannis Psaras, Etienne Rivière, Michał Król

    Abstract: The InterPlanetary File System (IPFS) is currently the largest decentralized storage solution in operation, with thousands of active participants and millions of daily content transfers. IPFS is used as remote data storage for numerous blockchain-based smart contracts, Non-Fungible Tokens (NFT), and decentralized applications. We present a content censorship attack that can be executed with mini… ▽ More

    Submitted 4 December, 2023; v1 submitted 22 July, 2023; originally announced July 2023.

    Comments: 17 pages (including references and appendices), 15 figures. Accepted to be published at the Network and Distributed System Security (NDSS) Symposium 2024

  21. arXiv:2307.11764  [pdf, other

    cs.CL

    Sensi-BERT: Towards Sensitivity Driven Fine-Tuning for Parameter-Efficient BERT

    Authors: Souvik Kundu, Sharath Nittur Sridhar, Maciej Szankin, Sairam Sundaresan

    Abstract: Large pre-trained language models have recently gained significant traction due to their improved performance on various down-stream tasks like text classification and question answering, requiring only few epochs of fine-tuning. However, their large model sizes often prohibit their applications on resource-constrained edge devices. Existing solutions of yielding parameter-efficient BERT models la… ▽ More

    Submitted 31 August, 2023; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: 6 pages, 5 figures, 2 tables

  22. arXiv:2306.06093  [pdf, other

    cs.CV

    HyP-NeRF: Learning Improved NeRF Priors using a HyperNetwork

    Authors: Bipasha Sen, Gaurav Singh, Aditya Agarwal, Rohith Agaram, K Madhava Krishna, Srinath Sridhar

    Abstract: Neural Radiance Fields (NeRF) have become an increasingly popular representation to capture high-quality appearance and shape of scenes and objects. However, learning generalizable NeRF priors over categories of scenes or objects has been challenging due to the high dimensionality of network weight space. To address the limitations of existing work on generalization, multi-view consistency and to… ▽ More

    Submitted 23 December, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: Project Page: https://hyp-nerf.github.io

  23. Improving accuracy of GPT-3/4 results on biomedical data using a retrieval-augmented language model

    Authors: David Soong, Sriram Sridhar, Han Si, Jan-Samuel Wagner, Ana Caroline Costa Sá, Christina Y Yu, Kubra Karagoz, Meijian Guan, Hisham Hamadeh, Brandon W Higgs

    Abstract: Large language models (LLMs) have made significant advancements in natural language processing (NLP). Broad corpora capture diverse patterns but can introduce irrelevance, while focused corpora enhance reliability by reducing misleading information. Training LLMs on focused corpora poses computational challenges. An alternative approach is to use a retrieval-augmentation (RetA) method tested in a… ▽ More

    Submitted 30 May, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Report number: 2305.17116

    Journal ref: PLOS Digit Health, 3(8) , 2024

  24. arXiv:2304.03280  [pdf, other

    cs.CV

    LANe: Lighting-Aware Neural Fields for Compositional Scene Synthesis

    Authors: Akshay Krishnan, Amit Raj, Xianling Zhang, Alexandra Carlson, Nathan Tseng, Sandhya Sridhar, Nikita Jaipuria, James Hays

    Abstract: Neural fields have recently enjoyed great success in representing and rendering 3D scenes. However, most state-of-the-art implicit representations model static or dynamic scenes as a whole, with minor variations. Existing work on learning disentangled world and object neural fields do not consider the problem of composing objects into different world neural fields in a lighting-aware manner. We pr… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: Project website: https://lane-composition.github.io

  25. Nakamoto Consensus under Bounded Processing Capacity

    Authors: Lucianna Kiffer, Joachim Neu, Srivatsan Sridhar, Aviv Zohar, David Tse

    Abstract: For Nakamoto's longest-chain consensus protocol, whose proof-of-work (PoW) and proof-of-stake (PoS) variants power major blockchains such as Bitcoin and Cardano, we revisit the classic problem of the security-performance tradeoff: Given a network of nodes with finite communication- and computation-resources, against what fraction of adversary power is Nakamoto consensus (NC) secure for a given blo… ▽ More

    Submitted 24 June, 2024; v1 submitted 16 March, 2023; originally announced March 2023.

    Comments: ACM Conference on Computer and Communications Security (CCS) 2024

  26. arXiv:2303.01526  [pdf, other

    cs.CV

    Semantic Attention Flow Fields for Monocular Dynamic Scene Decomposition

    Authors: Yiqing Liang, Eliot Laidlaw, Alexander Meyerowitz, Srinath Sridhar, James Tompkin

    Abstract: From video, we reconstruct a neural volume that captures time-varying color, density, scene flow, semantics, and attention information. The semantics and attention let us identify salient foreground objects separately from the background across spacetime. To mitigate low resolution semantic and attention features, we compute pyramids that trade detail with whole-image context. After optimization,… ▽ More

    Submitted 28 September, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: International Conference on Computer Vision (ICCV) 2023; 10 pages, 8 figures, 3 tables

  27. arXiv:2302.03523  [pdf, other

    cs.CV

    Sparse Mixture Once-for-all Adversarial Training for Efficient In-Situ Trade-Off Between Accuracy and Robustness of DNNs

    Authors: Souvik Kundu, Sairam Sundaresan, Sharath Nittur Sridhar, Shunlin Lu, Han Tang, Peter A. Beerel

    Abstract: Existing deep neural networks (DNNs) that achieve state-of-the-art (SOTA) performance on both clean and adversarially-perturbed images rely on either activation or weight conditioned convolution operations. However, such conditional learning costs additional multiply-accumulate (MAC) or addition operations, increasing inference memory and compute costs. To that end, we present a sparse mixture onc… ▽ More

    Submitted 27 December, 2022; originally announced February 2023.

    Comments: 5 pages, 5 figures, 2 tables

  28. arXiv:2301.09629  [pdf, other

    cs.CV

    LEGO-Net: Learning Regular Rearrangements of Objects in Rooms

    Authors: Qiuhong Anna Wei, Sijie Ding, Jeong Joon Park, Rahul Sajnani, Adrien Poulenard, Srinath Sridhar, Leonidas Guibas

    Abstract: Humans universally dislike the task of cleaning up a messy room. If machines were to help us with this task, they must understand human criteria for regular arrangements, such as several types of symmetry, co-linearity or co-circularity, spacing uniformity in linear or circular patterns, and further inter-object relationships that relate to style and functionality. Previous approaches for this tas… ▽ More

    Submitted 24 March, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: Project page: https://ivl.cs.brown.edu/projects/lego-net

  29. arXiv:2301.07213  [pdf, other

    cs.CV cs.RO

    SCARP: 3D Shape Completion in ARbitrary Poses for Improved Grasping

    Authors: Bipasha Sen, Aditya Agarwal, Gaurav Singh, Brojeshwar B., Srinath Sridhar, Madhava Krishna

    Abstract: Recovering full 3D shapes from partial observations is a challenging task that has been extensively addressed in the computer vision community. Many deep learning methods tackle this problem by training 3D shape generation networks to learn a prior over the full 3D shapes. In this training regime, the methods expect the inputs to be in a fixed canonical form, without which they fail to learn a val… ▽ More

    Submitted 17 January, 2023; originally announced January 2023.

    Comments: Accepted at ICRA 2023

  30. arXiv:2212.02493  [pdf, other

    cs.CV

    Canonical Fields: Self-Supervised Learning of Pose-Canonicalized Neural Fields

    Authors: Rohith Agaram, Shaurya Dewan, Rahul Sajnani, Adrien Poulenard, Madhava Krishna, Srinath Sridhar

    Abstract: Coordinate-based implicit neural networks, or neural fields, have emerged as useful representations of shape and appearance in 3D computer vision. Despite advances, however, it remains challenging to build neural fields for categories of objects without datasets like ShapeNet that provide "canonicalized" object instances that are consistently aligned for their 3D position and orientation (pose). W… ▽ More

    Submitted 17 May, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

  31. arXiv:2211.01427  [pdf, other

    cs.CV cs.AI

    CLIP-Sculptor: Zero-Shot Generation of High-Fidelity and Diverse Shapes from Natural Language

    Authors: Aditya Sanghi, Rao Fu, Vivian Liu, Karl Willis, Hooman Shayani, Amir Hosein Khasahmadi, Srinath Sridhar, Daniel Ritchie

    Abstract: Recent works have demonstrated that natural language can be used to generate and edit 3D shapes. However, these methods generate shapes with limited fidelity and diversity. We introduce CLIP-Sculptor, a method to address these constraints by producing high-fidelity and diverse 3D shapes without the need for (text, shape) pairs during training. CLIP-Sculptor achieves this in a multi-resolution appr… ▽ More

    Submitted 24 May, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: Accepted at Conference on Computer Vision and Pattern Recognition 2023(CVPR2023)

  32. arXiv:2207.09446  [pdf, other

    cs.CV cs.AI

    ShapeCrafter: A Recursive Text-Conditioned 3D Shape Generation Model

    Authors: Rao Fu, Xiao Zhan, Yiwen Chen, Daniel Ritchie, Srinath Sridhar

    Abstract: We present ShapeCrafter, a neural network for recursive text-conditioned 3D shape generation. Existing methods to generate text-conditioned 3D shapes consume an entire text prompt to generate a 3D shape in a single step. However, humans tend to describe shapes recursively-we may start with an initial description and progressively add details based on intermediate results. To capture this recursive… ▽ More

    Submitted 8 April, 2023; v1 submitted 19 July, 2022; originally announced July 2022.

    Comments: Presented at the Advances in Neural Information Processing Systems (NeurIPS) 2022

  33. arXiv:2206.08497  [pdf, other

    cs.GR cs.CV

    Unsupervised Kinematic Motion Detection for Part-segmented 3D Shape Collections

    Authors: Xianghao Xu, Yifan Ruan, Srinath Sridhar, Daniel Ritchie

    Abstract: 3D models of manufactured objects are important for populating virtual worlds and for synthetic data generation for vision and robotics. To be most useful, such objects should be articulated: their parts should move when interacted with. While articulated object datasets exist, creating them is labor-intensive. Learning-based prediction of part motions can help, but all existing methods require an… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: SIGGRAPH 2022

  34. arXiv:2206.05837  [pdf, other

    cs.CV

    NeuralODF: Learning Omnidirectional Distance Fields for 3D Shape Representation

    Authors: Trevor Houchens, Cheng-You Lu, Shivam Duggal, Rao Fu, Srinath Sridhar

    Abstract: In visual computing, 3D geometry is represented in many different forms including meshes, point clouds, voxel grids, level sets, and depth images. Each representation is suited for different tasks thus making the transformation of one representation into another (forward map) an important and common problem. We propose Omnidirectional Distance Fields (ODFs), a new 3D shape representation that enco… ▽ More

    Submitted 31 August, 2022; v1 submitted 12 June, 2022; originally announced June 2022.

  35. arXiv:2205.10358  [pdf, other

    cs.LG cs.NE

    A Hardware-Aware Framework for Accelerating Neural Architecture Search Across Modalities

    Authors: Daniel Cummings, Anthony Sarah, Sharath Nittur Sridhar, Maciej Szankin, Juan Pablo Munoz, Sairam Sundaresan

    Abstract: Recent advances in Neural Architecture Search (NAS) such as one-shot NAS offer the ability to extract specialized hardware-aware sub-network configurations from a task-specific super-network. While considerable effort has been employed towards improving the first stage, namely, the training of the super-network, the search for derivative high-performing sub-networks is still under-explored. Popula… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

  36. arXiv:2202.12954  [pdf, other

    cs.AI

    A Hardware-Aware System for Accelerating Deep Neural Network Optimization

    Authors: Anthony Sarah, Daniel Cummings, Sharath Nittur Sridhar, Sairam Sundaresan, Maciej Szankin, Tristan Webb, J. Pablo Munoz

    Abstract: Recent advances in Neural Architecture Search (NAS) which extract specialized hardware-aware configurations (a.k.a. "sub-networks") from a hardware-agnostic "super-network" have become increasingly popular. While considerable effort has been employed towards improving the first stage, namely, the training of the super-network, the search for derivative high-performing sub-networks is still largely… ▽ More

    Submitted 25 February, 2022; originally announced February 2022.

  37. arXiv:2202.12934  [pdf, other

    cs.NE

    Accelerating Neural Architecture Exploration Across Modalities Using Genetic Algorithms

    Authors: Daniel Cummings, Sharath Nittur Sridhar, Anthony Sarah, Maciej Szankin

    Abstract: Neural architecture search (NAS), the study of automating the discovery of optimal deep neural network architectures for tasks in domains such as computer vision and natural language processing, has seen rapid growth in the machine learning research community. While there have been many recent advancements in NAS, there is still a significant focus on reducing the computational cost incurred when… ▽ More

    Submitted 25 February, 2022; originally announced February 2022.

  38. arXiv:2202.12411  [pdf, other

    cs.CL

    TrimBERT: Tailoring BERT for Trade-offs

    Authors: Sharath Nittur Sridhar, Anthony Sarah, Sairam Sundaresan

    Abstract: Models based on BERT have been extremely successful in solving a variety of natural language processing (NLP) tasks. Unfortunately, many of these large models require a great deal of computational resources and/or time for pre-training and fine-tuning which limits wider adoptability. While self-attention layers have been well-studied, a strong justification for inclusion of the intermediate layers… ▽ More

    Submitted 24 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2012.11881

  39. arXiv:2201.07788  [pdf, other

    cs.CV cs.AI cs.LG

    ConDor: Self-Supervised Canonicalization of 3D Pose for Partial Shapes

    Authors: Rahul Sajnani, Adrien Poulenard, Jivitesh Jain, Radhika Dua, Leonidas J. Guibas, Srinath Sridhar

    Abstract: Progress in 3D object understanding has relied on manually canonicalized shape datasets that contain instances with consistent position and orientation (3D pose). This has made it hard to generalize these methods to in-the-wild shapes, eg., from internet model collections or depth sensors. ConDor is a self-supervised method that learns to Canonicalize the 3D orientation and position for full and p… ▽ More

    Submitted 14 April, 2022; v1 submitted 19 January, 2022; originally announced January 2022.

    Comments: Accepted to CVPR 2022, New Orleans, Louisiana. For project page and code, see https://ivl.cs.brown.edu/ConDor/

  40. arXiv:2112.07022  [pdf, other

    cs.GR cs.CV cs.LG

    Learning Body-Aware 3D Shape Generative Models

    Authors: Bryce Blinn, Alexander Ding, R. Kenny Jones, Manolis Savva, Srinath Sridhar, Daniel Ritchie

    Abstract: The shape of many objects in the built environment is dictated by their relationships to the human body: how will a person interact with this object? Existing data-driven generative models of 3D shapes produce plausible objects but do not reason about the relationship of those objects to the human body. In this paper, we learn body-aware generative models of 3D shapes. Specifically, we train gener… ▽ More

    Submitted 20 January, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: 11 pages, 8 figures

  41. arXiv:2111.12332  [pdf, other

    cs.CR cs.DC

    Longest Chain Consensus Under Bandwidth Constraint

    Authors: Joachim Neu, Srivatsan Sridhar, Lei Yang, David Tse, Mohammad Alizadeh

    Abstract: Spamming attacks are a serious concern for consensus protocols, as witnessed by recent outages of a major blockchain, Solana. They cause congestion and excessive message delays in a real network due to its bandwidth constraints. In contrast, longest chain (LC), an important family of consensus protocols, has previously only been proven secure assuming an idealized network model in which all messag… ▽ More

    Submitted 17 May, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

  42. arXiv:2111.11426  [pdf, other

    cs.CV cs.GR cs.LG

    Neural Fields in Visual Computing and Beyond

    Authors: Yiheng Xie, Towaki Takikawa, Shunsuke Saito, Or Litany, Shiqin Yan, Numair Khan, Federico Tombari, James Tompkin, Vincent Sitzmann, Srinath Sridhar

    Abstract: Recent advances in machine learning have created increasing interest in solving visual computing problems using a class of coordinate-based neural networks that parametrize physical properties of scenes or objects across space and time. These methods, which we call neural fields, have seen successful application in the synthesis of 3D shapes and image, animation of human bodies, 3D reconstruction,… ▽ More

    Submitted 5 April, 2022; v1 submitted 22 November, 2021; originally announced November 2021.

    Comments: Equal advising: Vincent Sitzmann and Srinath Sridhar

  43. arXiv:2110.04753  [pdf, other

    cs.GT cs.MA cs.SI math.DS

    Transaction Fees on a Honeymoon: Ethereum's EIP-1559 One Month Later

    Authors: Daniël Reijsbergen, Shyam Sridhar, Barnabé Monnot, Stefanos Leonardos, Stratis Skoulakis, Georgios Piliouras

    Abstract: Ethereum Improvement Proposal (EIP) 1559 was recently implemented to transform Ethereum's transaction fee market. EIP-1559 utilizes an algorithmic update rule with a constant learning rate to estimate a base fee. The base fee reflects prevailing network conditions and hence provides a more reliable oracle for current gas prices. Using on-chain data from the period after its launch, we evaluate t… ▽ More

    Submitted 18 April, 2022; v1 submitted 10 October, 2021; originally announced October 2021.

    Comments: IEEE Blockchain-2021, The 4th IEEE International Conference on Blockchain, Melbourne, Australia | 06-08 December 2021

    MSC Class: 91A80; 91-10; 91B26

  44. arXiv:2106.12332  [pdf, other

    cs.GT cs.DC cs.MA econ.TH math.DS

    From Griefing to Stability in Blockchain Mining Economies

    Authors: Yun Kuen Cheung, Stefanos Leonardos, Georgios Piliouras, Shyam Sridhar

    Abstract: We study a game-theoretic model of blockchain mining economies and show that griefing, a practice according to which participants harm other participants at some lesser cost to themselves, is a prevalent threat at its Nash equilibria. The proof relies on a generalization of evolutionary stability to non-homogeneous populations via griefing factors (ratios that measure network losses relative to de… ▽ More

    Submitted 23 June, 2021; originally announced June 2021.

    MSC Class: 91B54; 91B55; 91A22; 91A26; 91-10;

  45. arXiv:2105.08016  [pdf, other

    cs.CV cs.CG

    StrobeNet: Category-Level Multiview Reconstruction of Articulated Objects

    Authors: Ge Zhang, Or Litany, Srinath Sridhar, Leonidas Guibas

    Abstract: We present StrobeNet, a method for category-level 3D reconstruction of articulating objects from one or more unposed RGB images. Reconstructing general articulating object categories % has important applications, but is challenging since objects can have wide variation in shape, articulation, appearance and topology. We address this by building on the idea of category-level articulation canonicali… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: preprint

  46. arXiv:2105.04668  [pdf, other

    cs.CV cs.LG

    HuMoR: 3D Human Motion Model for Robust Pose Estimation

    Authors: Davis Rempe, Tolga Birdal, Aaron Hertzmann, Jimei Yang, Srinath Sridhar, Leonidas J. Guibas

    Abstract: We introduce HuMoR: a 3D Human Motion Model for Robust Estimation of temporal pose and shape. Though substantial progress has been made in estimating 3D human motion and shape from dynamic observations, recovering plausible pose sequences in the presence of noise and occlusions remains a challenge. For this purpose, we propose an expressive generative model in the form of a conditional variational… ▽ More

    Submitted 18 August, 2021; v1 submitted 10 May, 2021; originally announced May 2021.

    Comments: ICCV 2021 camera ready

  47. arXiv:2012.11881  [pdf, other

    cs.CL cs.AI

    Undivided Attention: Are Intermediate Layers Necessary for BERT?

    Authors: Sharath Nittur Sridhar, Anthony Sarah

    Abstract: In recent times, BERT-based models have been extremely successful in solving a variety of natural language processing (NLP) tasks such as reading comprehension, natural language inference, sentiment analysis, etc. All BERT-based architectures have a self-attention block followed by a block of intermediate layers as the basic building component. However, a strong justification for the inclusion of… ▽ More

    Submitted 4 April, 2023; v1 submitted 22 December, 2020; originally announced December 2020.

  48. arXiv:2012.09904  [pdf, other

    cs.CV cs.LG

    Attention-based Image Upsampling

    Authors: Souvik Kundu, Hesham Mostafa, Sharath Nittur Sridhar, Sairam Sundaresan

    Abstract: Convolutional layers are an integral part of many deep neural network solutions in computer vision. Recent work shows that replacing the standard convolution operation with mechanisms based on self-attention leads to improved performance on image classification and object detection tasks. In this work, we show how attention mechanisms can be used to replace another canonical operation: strided tra… ▽ More

    Submitted 17 December, 2020; originally announced December 2020.

  49. arXiv:2011.12912  [pdf, other

    cs.CV cs.AI cs.RO

    DRACO: Weakly Supervised Dense Reconstruction And Canonicalization of Objects

    Authors: Rahul Sajnani, AadilMehdi Sanchawala, Krishna Murthy Jatavallabhula, Srinath Sridhar, K. Madhava Krishna

    Abstract: We present DRACO, a method for Dense Reconstruction And Canonicalization of Object shape from one or more RGB images. Canonical shape reconstruction, estimating 3D object shape in a coordinate space canonicalized for scale, rotation, and translation parameters, is an emerging paradigm that holds promise for a multitude of robotic applications. Prior approaches either rely on painstakingly gathered… ▽ More

    Submitted 25 November, 2020; originally announced November 2020.

    Comments: Preprint. For project page and code, see https://aadilmehdis.github.io/DRACO-Project-Page/

  50. arXiv:2010.00673  [pdf, other

    eess.AS cs.SD

    Helicality: An Isomap-based Measure of Octave Equivalence in Audio Data

    Authors: Sripathi Sridhar, Vincent Lostanlen

    Abstract: Octave equivalence serves as domain-knowledge in MIR systems, including chromagram, spiral convolutional networks, and harmonic CQT. Prior work has applied the Isomap manifold learning algorithm to unlabeled audio data to embed frequency sub-bands in 3-D space where the Euclidean distances are inversely proportional to the strength of their Pearson correlations. However, discovering octave equival… ▽ More

    Submitted 1 October, 2020; originally announced October 2020.

    Comments: 3 pages, 3 figures. To be presented at the 21st International Society for Music Information Retrieval (ISMIR) Conference. Montreal, Canada, October 2020