Zum Hauptinhalt springen

Showing 1–50 of 85 results for author: Singh, S P

.
  1. arXiv:2408.15921  [pdf, other

    cond-mat.soft

    Structural transitions of a Semi-Flexible Polyampholyte

    Authors: Rakesh Palariya, Sunil P. Singh

    Abstract: Polyampholytes (PA) are charged polymers composed of positively and negatively charged monomers along their backbone. The sequence of the charged monomers and the bending of the chain significantly influence the conformation and dynamical behavior of the PA. Using coarse-grained molecular dynamics simulations, we comprehensively study the structural and dynamical properties of flexible and semi-fl… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

  2. arXiv:2407.16611  [pdf, other

    cs.LG cs.AI

    Local vs Global continual learning

    Authors: Giulia Lanzillotta, Sidak Pal Singh, Benjamin F. Grewe, Thomas Hofmann

    Abstract: Continual learning is the problem of integrating new information in a model while retaining the knowledge acquired in the past. Despite the tangible improvements achieved in recent years, the problem of continual learning is still an open one. A better understanding of the mechanisms behind the successes and failures of existing continual learning algorithms can unlock the development of new succe… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: (10 pages, Will appear in the proceedings of CoLLAs 2024)

  3. arXiv:2407.02860  [pdf, other

    cond-mat.soft cond-mat.stat-mech physics.bio-ph

    Active Polar Ring Polymer in Shear Flow -- An Analytical Study

    Authors: Roland G. Winkler, Sunil P. Singh

    Abstract: We theoretically study the conformational and dynamical properties of semiflexible active polar ring polymers under linear shear flow. A ring is described as a continuous Gaussian polymer with a tangential active force of a constant density along its contour. The linear but non-Hermitian equation of motion is solved using an eigenfunction expansion, which yields activity-independent, but shear-rat… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 7 figures

  4. arXiv:2406.16300  [pdf, other

    cs.LG

    Landscaping Linear Mode Connectivity

    Authors: Sidak Pal Singh, Linara Adilova, Michael Kamp, Asja Fischer, Bernhard Schölkopf, Thomas Hofmann

    Abstract: The presence of linear paths in parameter space between two different network solutions in certain cases, i.e., linear mode connectivity (LMC), has garnered interest from both theoretical and practical fronts. There has been significant research that either practically designs algorithms catered for connecting networks by adjusting for the permutation symmetries as well as some others that more th… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: ICML 2024 HiLD workshop paper

  5. arXiv:2406.07164  [pdf, other

    cond-mat.soft cond-mat.stat-mech

    Collective dynamics of active dumbbells near a circular obstacle

    Authors: Chandranshu Tiwari, Sunil P. Singh

    Abstract: In this article, we present the collective dynamics of active dumbbells in the presence of a static circular obstacle using Brownian dynamics simulation. The active dumbbells aggregate on the surface of a circular obstacle beyond a critical radius. The aggregation is non-uniform along the circumference, and the aggregate size increases with the activity and the curvature radius. The dense aggregat… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 14 pages, 23 figures

    Journal ref: Soft Matter, 10/06/2024

  6. arXiv:2405.10880  [pdf

    cs.CR

    The MESA Security Model 2.0: A Dynamic Framework for Mitigating Stealth Data Exfiltration

    Authors: Sanjeev Pratap Singh, Naveed Afzal

    Abstract: The rising complexity of cyber threats calls for a comprehensive reassessment of current security frameworks in business environments. This research focuses on Stealth Data Exfiltration, a significant cyber threat characterized by covert infiltration, extended undetectability, and unauthorized dissemination of confidential data. Our findings reveal that conventional defense-in-depth strategies oft… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Journal ref: International Journal of Network Security & Its Applications (IJNSA) 2024

  7. arXiv:2403.19299  [pdf, other

    cs.CR quant-ph

    Post Quantum Cryptography and its Comparison with Classical Cryptography

    Authors: Tanmay Tripathi, Abhinav Awasthi, Shaurya Pratap Singh, Atul Chaturvedi

    Abstract: Cryptography plays a pivotal role in safeguarding sensitive information and facilitating secure communication. Classical cryptography relies on mathematical computations, whereas quantum cryptography operates on the principles of quantum mechanics, offering a new frontier in secure communication. Quantum cryptographic systems introduce novel dimensions to security, capable of detecting and thwarti… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  8. arXiv:2403.11685  [pdf, other

    cond-mat.supr-con cond-mat.mes-hall cond-mat.stat-mech

    Berezinskii-Kosterlitz-Thouless to BCS-like superconducting transition crossover driven by weak magnetic fields in ultra-thin NbN films

    Authors: Meenakshi Sharma, Sergio Caprara, Andrea Perali, Surinder P. Singh, Sandeep Singh, Matteo Fretto, Natascia De Leo, Nicola Pinto

    Abstract: The Berezinskii-Kosterlitz-Thouless (BKT) transition in ultra-thin NbN films is investigated in the presence of weak perpendicular magnetic fields. A jump in the phase stiffness at the BKT transition is detected up to 5 G, while the BKT features are smeared between 5 G and 50 G, disappearing altogether at 100 G, where conventional current-voltage behaviour is observed. Our findings demonstrate tha… ▽ More

    Submitted 16 July, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: 10 pages, 13 figures

  9. arXiv:2403.10901  [pdf, other

    astro-ph.GA

    $Herschel$ investigation of cores and filamentary structures in L1251 located in the Cepheus flare

    Authors: Divyansh Dewan, Archana Soam, Guo-Yin Zhang, Akhil Lasrado, Saikhom Pravash Singh, Chang Won Lee

    Abstract: Context: Molecular clouds are the prime locations of star formation. These clouds contain filamentary structures and cores which are crucial in the formation of young stars. Aims: In this work, we aim to quantify the physical properties of structural characteristics within the molecular cloud L1251 to better understand the initial conditions for star formation. Methods: We applied the getsf algori… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 15 pages, 20 figures, 2 tables, accepted for publication in JAA

  10. arXiv:2403.07379  [pdf, other

    cs.LG cs.CL stat.ML

    Hallmarks of Optimization Trajectories in Neural Networks: Directional Exploration and Redundancy

    Authors: Sidak Pal Singh, Bobby He, Thomas Hofmann, Bernhard Schölkopf

    Abstract: We propose a fresh take on understanding the mechanisms of neural networks by analyzing the rich directional structure of optimization trajectories, represented by their pointwise parameters. Towards this end, we introduce some natural notions of the complexity of optimization trajectories, both qualitative and quantitative, which hallmark the directional nature of optimization in neural networks:… ▽ More

    Submitted 24 June, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: Preprint, 57 pages

  11. arXiv:2403.06656  [pdf, other

    physics.ao-ph

    Investigation of the Thermal Structure in the Atmospheric Boundary Layer During Evening Transition and the Impact of Aerosols on Radiative Cooling

    Authors: Suryadev Pratap Singh, Mohammad Rafiuddin, Subham Banerjee, Sreenivas K R

    Abstract: We have explored the evening transition using data from eighty days of observations across two fog seasons at the Kempegowda International Airport, Bengaluru (KIAB). Through field experiments and simulations integrating aerosol interaction in a radiation-conduction model, we elucidate the impact of aerosols on longwave cooling of the Atmospheric Boundary Layer (ABL). Field observations indicate th… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  12. arXiv:2402.07839  [pdf, other

    cs.CV cs.LG

    Towards Meta-Pruning via Optimal Transport

    Authors: Alexander Theus, Olin Geimer, Friedrich Wicke, Thomas Hofmann, Sotiris Anagnostidis, Sidak Pal Singh

    Abstract: Structural pruning of neural networks conventionally relies on identifying and discarding less important neurons, a practice often resulting in significant accuracy loss that necessitates subsequent fine-tuning efforts. This paper introduces a novel approach named Intra-Fusion, challenging this prevailing pruning paradigm. Unlike existing methods that focus on designing meaningful neuron importanc… ▽ More

    Submitted 13 February, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted as a Spotlight (top 5% of submissions) at the International Conference on Learning Representations (ICLR) 2024

  13. arXiv:2401.09335  [pdf, other

    physics.app-ph cond-mat.mtrl-sci

    Formation of nano and micro scale hierarchical structures in MgO and ZnO quantum dots doped LC media: The role of competitive forces

    Authors: A. K. Singh, S. P. Singh

    Abstract: In this paper, we have studied the effect of doping of ZnO and MgO nanoparticles (NPs) in 4-(trans-4-n-hexylcyclo-hexyl) isothiocyanatobenzoate. A thorough comparison of dielectric properties, optoelectronic properties, and calorimetric phase transition properties has been done for MgO and ZnO NP doped LC. We prepare their homogenous mixture of MgO and ZnO NPs in toluene and transfer into cells ma… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: 22 pages, 20 figures, 5 tables

    Journal ref: Condensed Matter Physics, 2023, vol. 26, No. 4, 43602

  14. arXiv:2311.11885  [pdf, other

    cond-mat.soft

    Characteristic features of self-avoiding active Brownian polymers under linear shear flow

    Authors: Arindam Panda, Roland G. Winkler, Sunil P. Singh

    Abstract: We present Brownian dynamics simulation results of a flexible linear polymer with excluded-volume interactions under shear flow in the presence of active noise. The active noise strongly affects the polymer's conformational and dynamical properties, such as the stretching in the flow direction and compression in the gradient direction, shear-induced alignment, and shear viscosity. In the asymptoti… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: 13 pages, 13 figures

    Journal ref: Soft Matter, 2023, 19, 8577-8586

  15. arXiv:2311.10642  [pdf, other

    cs.CL cs.LG

    Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers

    Authors: Vukasin Bozic, Danilo Dordevic, Daniele Coppola, Joseph Thommes, Sidak Pal Singh

    Abstract: This work presents an analysis of the effectiveness of using standard shallow feed-forward networks to mimic the behavior of the attention mechanism in the original Transformer model, a state-of-the-art architecture for sequence-to-sequence tasks. We substitute key elements of the attention mechanism in the Transformer with simple feed-forward networks, trained using the original components via kn… ▽ More

    Submitted 4 February, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

    Comments: Accepted at AAAI24(https://aaai.org/aaai-conference/)

  16. arXiv:2311.03988  [pdf

    hep-ph

    P wave mesons emitting weak decays of bottom mesons

    Authors: Maninder Kaur, Supreet Pal Singh, R C Verma

    Abstract: This paper is the extension of our previous work entitled Searching a systematics for nonfactorizable contributions to and hadronic decays. Obtaining the factorizable contributions from the spectator quark model for a systematics has been identified among the isospin reduced amplitudes for the nonfactorizable terms among decay modes. This systematics helps us to derive a generic formula which assi… ▽ More

    Submitted 5 December, 2023; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: 23 pages, 8 Figures

  17. arXiv:2310.20687  [pdf, other

    physics.atom-ph quant-ph

    Controlled dissipation for Rydberg atom experiments

    Authors: Bleuenn Bégoc, Giovanni Cichelli, Sukhjit P. Singh, Francesco Perciavalle, Davide Rossini, Luigi Amico, Oliver Morsch

    Abstract: We demonstrate a simple technique for adding controlled dissipation to Rydberg atom experiments. In our experiments we excite cold rubidium atoms in a magneto-optical trap to $70$-S Rydberg states whilst simultaneously inducing forced dissipation by resonantly coupling the Rydberg state to a hyperfine level of the short-lived $6$-P state. The resulting effective dissipation can be varied in streng… ▽ More

    Submitted 1 November, 2023; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: 3 pages, 3 figures

  18. arXiv:2310.15096  [pdf, other

    physics.soc-ph

    Strategy Revision Phase with Payoff Threshold in the Public Goods Game

    Authors: Marco Alberto Javarone, Shaurya Pratap Singh

    Abstract: Commonly, the strategy revision phase in evolutionary games relies on payoff comparison. Namely, agents compare their payoff with the opponent, assessing whether changing strategy can be potentially convenient. Even tiny payoff differences can be crucial in this decision process. In this work, we study the dynamics of cooperation in the Public Goods Game, introducing a threshold $ε$ in the strat… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: 7 pages, 4 figures, 1 table

  19. arXiv:2310.05719  [pdf, other

    cs.LG stat.ML

    Transformer Fusion with Optimal Transport

    Authors: Moritz Imfeld, Jacopo Graldi, Marco Giordano, Thomas Hofmann, Sotiris Anagnostidis, Sidak Pal Singh

    Abstract: Fusion is a technique for merging multiple independently-trained neural networks in order to combine their capabilities. Past attempts have been restricted to the case of fully-connected, convolutional, and residual networks. This paper presents a systematic approach for fusing two or more transformer-based networks exploiting Optimal Transport to (soft-)align the various architectural components.… ▽ More

    Submitted 22 April, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: Appears at International Conference on Learning Representations (ICLR), 2024. M. Imfeld, J. Graldi, and M. Giordano are the first authors and contributed equally to this work

  20. arXiv:2310.01165  [pdf, other

    cs.LG cs.AI

    Towards guarantees for parameter isolation in continual learning

    Authors: Giulia Lanzillotta, Sidak Pal Singh, Benjamin F. Grewe, Thomas Hofmann

    Abstract: Deep learning has proved to be a successful paradigm for solving many challenges in machine learning. However, deep neural networks fail when trained sequentially on multiple tasks, a shortcoming known as catastrophic forgetting in the continual learning literature. Despite a recent flourish of learning algorithms successfully addressing this problem, we find that provable guarantees against catas… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: 10 pages, 3 figures

  21. arXiv:2307.04719  [pdf, other

    cs.LG

    On the curvature of the loss landscape

    Authors: Alison Pouplin, Hrittik Roy, Sidak Pal Singh, Georgios Arvanitidis

    Abstract: One of the main challenges in modern deep learning is to understand why such over-parameterized models perform so well when trained on finite data. A way to analyze this generalization concept is through the properties of the associated loss landscape. In this work, we consider the loss landscape as an embedded Riemannian manifold and show that the differential geometric properties of the manifold… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: 12 pages, 5 figures, preliminary work

  22. arXiv:2305.09088  [pdf, other

    cs.LG stat.ML

    The Hessian perspective into the Nature of Convolutional Neural Networks

    Authors: Sidak Pal Singh, Thomas Hofmann, Bernhard Schölkopf

    Abstract: While Convolutional Neural Networks (CNNs) have long been investigated and applied, as well as theorized, we aim to provide a slightly different perspective into their nature -- through the perspective of their Hessian maps. The reason is that the loss Hessian captures the pairwise interaction of parameters and therefore forms a natural ground to probe how the architectural aspects of CNN get mani… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: ICML 2023 conference proceedings

  23. arXiv:2304.14484  [pdf, other

    cs.CV

    OriCon3D: Effective 3D Object Detection using Orientation and Confidence

    Authors: Dhyey Manish Rajani, Surya Pratap Singh, Rahul Kashyap Swayampakula

    Abstract: In this paper, we propose an advanced methodology for the detection of 3D objects and precise estimation of their spatial positions from a single image. Unlike conventional frameworks that rely solely on center-point and dimension predictions, our research leverages a deep convolutional neural network-based 3D object weighted orientation regression paradigm. These estimates are then seamlessly int… ▽ More

    Submitted 3 January, 2024; v1 submitted 27 April, 2023; originally announced April 2023.

  24. arXiv:2304.11310  [pdf, other

    cs.RO

    Twilight SLAM: Navigating Low-Light Environments

    Authors: Surya Pratap Singh, Billy Mazotti, Dhyey Manish Rajani, Sarvesh Mayilvahanan, Guoyuan Li, Maani Ghaffari

    Abstract: This paper presents a detailed examination of low-light visual Simultaneous Localization and Mapping (SLAM) pipelines, focusing on the integration of state-of-the-art (SOTA) low-light image enhancement algorithms with standard and contemporary SLAM frameworks. The primary objective of our work is to address a pivotal question: Does illuminating visual input significantly improve localization accur… ▽ More

    Submitted 24 December, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

  25. arXiv:2304.00192   

    cs.AI cs.LG

    Leveraging Neo4j and deep learning for traffic congestion simulation & optimization

    Authors: Shyam Pratap Singh, Arshad Ali Khan, Riad Souissi, Syed Adnan Yusuf

    Abstract: Traffic congestion has been a major challenge in many urban road networks. Extensive research studies have been conducted to highlight traffic-related congestion and address the issue using data-driven approaches. Currently, most traffic congestion analyses are done using simulation software that offers limited insight due to the limitations in the tools and utilities being used to render various… ▽ More

    Submitted 9 December, 2023; v1 submitted 31 March, 2023; originally announced April 2023.

    Comments: The paper was rejected by a journal publisher and we have advanced the research so need to re-write and re-publish in light of reviewers' comments and revised scope of research

  26. Accelerating cosmological models in $f(Q)$ gravity and the phase space analysis

    Authors: S. A. Narawade, Shashank P. Singh, B. Mishra

    Abstract: The dynamical aspect of accelerating cosmological model has been studied in this paper in the context of modified symmetric teleparallel gravity, the $f(Q)$ gravity. Initially, we have derived the dynamical parameters for two well known forms of $f(Q)$ such as: (i) log-square-root form and (ii) exponential form. The equation of state (EoS) parameter for the dark energy in the $f(Q)$ gravity in bot… ▽ More

    Submitted 17 July, 2023; v1 submitted 11 March, 2023; originally announced March 2023.

    Comments: 13 pages, 9 figures

    Journal ref: Physics of the Dark Universe 42 (2023) 101282

  27. arXiv:2302.10886  [pdf, other

    cs.LG stat.ML

    Some Fundamental Aspects about Lipschitz Continuity of Neural Networks

    Authors: Grigory Khromov, Sidak Pal Singh

    Abstract: Lipschitz continuity is a crucial functional property of any predictive model, that naturally governs its robustness, generalisation, as well as adversarial vulnerability. Contrary to other works that focus on obtaining tighter bounds and developing different practical strategies to enforce certain Lipschitz properties, we aim to thoroughly examine and characterise the Lipschitz behaviour of Neura… ▽ More

    Submitted 14 May, 2024; v1 submitted 21 February, 2023; originally announced February 2023.

  28. arXiv:2211.09836  [pdf, ps, other

    cond-mat.supr-con cond-mat.mes-hall

    Complex phase-fluctuation effects correlated with granularity in superconducting NbN nanofilms

    Authors: Meenakshi Sharma, Manju Singh, Rajib K. Rakshit, Surinder P. Singh, Matteo Fretto, Natascia De Leo, Andrea Perali, Nicola Pinto

    Abstract: Superconducting nanofilms are tunable systems that can host a 3D-2D dimensional crossover, leading to the Berezinskii-Kosterlitz-Thouless (BKT) superconducting transition approaching the 2D regime. Reducing further the dimensionality, from 2D to quasi-1D, superconducting nanostructures with disorder can generate quantum and thermal phase slips (PS) of the order parameter. Both BKT and PS are compl… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: 22 pages, 14 figures

  29. arXiv:2208.11580  [pdf, other

    cs.LG

    Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning

    Authors: Elias Frantar, Sidak Pal Singh, Dan Alistarh

    Abstract: We consider the problem of model compression for deep neural networks (DNNs) in the challenging one-shot/post-training setting, in which we are given an accurate trained model, and must compress it without any retraining, based only on a small amount of calibration input data. This problem has become popular in view of the emerging software and hardware support for executing models compressed via… ▽ More

    Submitted 8 January, 2023; v1 submitted 24 August, 2022; originally announced August 2022.

    Comments: Published at NeurIPS 2022

  30. arXiv:2207.05016  [pdf, other

    cs.GT math.OC

    Capacity Management in a Pandemic with Endogenous Patient Choices and Flows

    Authors: Sanyukta Deshpande, Lavanya Marla, Alan Scheller-Wolf, Siddharth Prakash Singh

    Abstract: Motivated by the experiences of a healthcare service provider during the Covid-19 pandemic, we aim to study the decisions of a provider that operates both an Emergency Department (ED) and a medical Clinic. Patients contact the provider through a phone call or may present directly at the ED: patients can be COVID (suspected/confirmed) or non-COVID, and have different severities. Depending on the se… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

  31. arXiv:2206.03126  [pdf, other

    cs.LG

    Signal Propagation in Transformers: Theoretical Perspectives and the Role of Rank Collapse

    Authors: Lorenzo Noci, Sotiris Anagnostidis, Luca Biggio, Antonio Orvieto, Sidak Pal Singh, Aurelien Lucchi

    Abstract: Transformers have achieved remarkable success in several domains, ranging from natural language processing to computer vision. Nevertheless, it has been recently shown that stacking self-attention layers - the distinctive architectural component of Transformers - can result in rank collapse of the tokens' representations at initialization. The question of if and how rank collapse affects training… ▽ More

    Submitted 7 June, 2022; originally announced June 2022.

  32. Compression of a confined semiflexible polymer under direct and oscillating fields

    Authors: Keerthi Radhakrishnan, Sunil P. Singh

    Abstract: The folding transition of biopolymers from the coil to compact structures has attracted wide research interest in the past and is well studied in polymer physics. Recent seminal works on DNA in confined devices have shown that these long biopolymers tend to collapse under an external field, contrary to the previously reported stretching. These long folded structures have a tendency to form knots t… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

  33. arXiv:2203.07337  [pdf, other

    stat.ML cs.LG

    Phenomenology of Double Descent in Finite-Width Neural Networks

    Authors: Sidak Pal Singh, Aurelien Lucchi, Thomas Hofmann, Bernhard Schölkopf

    Abstract: `Double descent' delineates the generalization behaviour of models depending on the regime they belong to: under- or over-parameterized. The current theoretical understanding behind the occurrence of this phenomenon is primarily based on linear and kernel regression models -- with informal parallels to neural networks via the Neural Tangent Kernel. Therefore such analyses do not adequately capture… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

    Comments: Published at ICLR 2022

  34. arXiv:2201.09952  [pdf

    eess.IV cs.CV cs.LG

    A Deep Learning Approach for the Detection of COVID-19 from Chest X-Ray Images using Convolutional Neural Networks

    Authors: Aditya Saxena, Shamsheer Pal Singh

    Abstract: The COVID-19 (coronavirus) is an ongoing pandemic caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). The virus was first identified in mid-December 2019 in the Hubei province of Wuhan, China and by now has spread throughout the planet with more than 75.5 million confirmed cases and more than 1.67 million deaths. With limited number of COVID-19 test kits available in medical fa… ▽ More

    Submitted 24 January, 2022; originally announced January 2022.

  35. arXiv:2112.11115  [pdf, other

    cs.LG

    Soft Actor-Critic with Cross-Entropy Policy Optimization

    Authors: Zhenyang Shi, Surya P. N. Singh

    Abstract: Soft Actor-Critic (SAC) is one of the state-of-the-art off-policy reinforcement learning (RL) algorithms that is within the maximum entropy based RL framework. SAC is demonstrated to perform very well in a list of continous control tasks with good stability and robustness. SAC learns a stochastic Gaussian policy that can maximize a trade-off between total expected reward and the policy entropy. To… ▽ More

    Submitted 21 December, 2021; originally announced December 2021.

  36. The explicit characterization of counterion dynamics around a flexible polyelectrolyte

    Authors: Keerthi Radhakrishnan, Sunil P. Singh

    Abstract: The article presents a comprehensive study of counterion dynamics around a generic linear polyelectrolyte (PE) chain with the help of coarse-grained computer simulations. The ion-chain coupling is discussed in the form of binding time, mean-square-displacement (MSD) relative to the chain, local ion transport coefficient, and spatio-temporal correlations in the effective charge. We have shown that… ▽ More

    Submitted 16 December, 2021; originally announced December 2021.

    Comments: 14 pages, 12 figures, SI material

  37. arXiv:2111.00243  [pdf, other

    cs.LG cs.SI

    The CAT SET on the MAT: Cross Attention for Set Matching in Bipartite Hypergraphs

    Authors: Govind Sharma, Swyam Prakash Singh, V. Susheela Devi, M. Narasimha Murty

    Abstract: Usual relations between entities could be captured using graphs; but those of a higher-order -- more so between two different types of entities (which we term "left" and "right") -- calls for a "bipartite hypergraph". For example, given a left set of symptoms and right set of diseases, the relation between a set subset of symptoms (that a patient experiences at a given point of time) and a subset… ▽ More

    Submitted 30 October, 2021; originally announced November 2021.

    Comments: 18 pages, 9 figures, under review

  38. Searching a systematics for nonfactorizable contribution to B-and B0 mesons

    Authors: Maninder Kaur, Supreet Pal Singh, R. C. Verma

    Abstract: Two-body weak decays / and are examined under isospin analysis to study nonfactorizable contributions. After extracting the strong phases and obtaining the factorizable contributions from spectator-quark diagrams for Nc=3, we determine nonfactorizable isospin amplitudes from the experimental data for these modes. Our results support the universality of ratio of nonfactorizable isospin reduced ampl… ▽ More

    Submitted 9 October, 2021; v1 submitted 6 August, 2021; originally announced August 2021.

    Journal ref: Chinese Physics C, Vol. 46, No. 7 (2022) 073105

  39. arXiv:2106.16225  [pdf, other

    cs.LG cs.NE math.ST stat.ML

    Analytic Insights into Structure and Rank of Neural Network Hessian Maps

    Authors: Sidak Pal Singh, Gregor Bachmann, Thomas Hofmann

    Abstract: The Hessian of a neural network captures parameter interactions through second-order derivatives of the loss. It is a fundamental object of study, closely tied to various problems in deep learning, including model design, optimization, and generalization. Most prior work has been empirical, typically focusing on low-rank approximations and heuristics that are blind to the network structure. In con… ▽ More

    Submitted 1 July, 2021; v1 submitted 30 June, 2021; originally announced June 2021.

  40. arXiv:2106.01777  [pdf, other

    cs.LG cs.AI cs.RO

    LiMIIRL: Lightweight Multiple-Intent Inverse Reinforcement Learning

    Authors: Aaron J. Snoswell, Surya P. N. Singh, Nan Ye

    Abstract: Multiple-Intent Inverse Reinforcement Learning (MI-IRL) seeks to find a reward function ensemble to rationalize demonstrations of different but unlabelled intents. Within the popular expectation maximization (EM) framework for learning probabilistic MI-IRL models, we present a warm-start strategy based on up-front clustering of the demonstrations in feature space. Our theoretical analysis shows th… ▽ More

    Submitted 3 June, 2021; originally announced June 2021.

    Comments: Under review for NeurIPS 2021

  41. arXiv:2102.02420  [pdf, other

    cond-mat.soft

    Role of viscoelasticity on the dynamics and aggregation of chemically active sphere-dimers

    Authors: Soudamini Sahoo, Sunil Pratap Singh, Snigdha Thakur

    Abstract: The impact of complex media on the dynamics of active swimmers has gained a thriving interest in the research community for their prominent applications in various fields. This paper investigates the effect of viscoelasticity on the dynamics and aggregation of chemically powered sphere-dimers by using a coarse-grained hybrid mesoscopic simulation technique. The sphere-dimers perform active motion… ▽ More

    Submitted 4 February, 2021; originally announced February 2021.

    Comments: 11 pages, 14 figures

    Journal ref: Physics of Fluids, 33, 017120 (2021)

  42. arXiv:2101.02029  [pdf

    cs.CY cs.NI

    Detection and Prediction of Infectious Diseases Using IoT Sensors: A Review

    Authors: Mohammad Meraj, Surendra Pal Singh, Prashant Johri, Mohammad Tabrez Quasim

    Abstract: An infectious kind of disease affects a huge number of human beings. A lot of investigation being conducted throughout the world. There are many interactive hardware platform packages like IoT in healthcare including smart tracking, smart sensors, and clinical device integration available in the market. Emerging technology like IoT has a notable ability to hold patients secure and healthful and al… ▽ More

    Submitted 2 January, 2021; originally announced January 2021.

    Comments: 7 pages, 2figures

  43. Revisiting Maximum Entropy Inverse Reinforcement Learning: New Perspectives and Algorithms

    Authors: Aaron J. Snoswell, Surya P. N. Singh, Nan Ye

    Abstract: We provide new perspectives and inference algorithms for Maximum Entropy (MaxEnt) Inverse Reinforcement Learning (IRL), which provides a principled method to find a most non-committal reward function consistent with given expert demonstrations, among many consistent reward functions. We first present a generalized MaxEnt formulation based on minimizing a KL-divergence instead of maximizing an en… ▽ More

    Submitted 4 June, 2021; v1 submitted 1 December, 2020; originally announced December 2020.

    Comments: Published as a conference paper at the 2020 IEEE Symposium Series on Computational Intelligence (SSCI)

  44. AutoKnow: Self-Driving Knowledge Collection for Products of Thousands of Types

    Authors: Xin Luna Dong, Xiang He, Andrey Kan, Xian Li, Yan Liang, Jun Ma, Yifan Ethan Xu, Chenwei Zhang, Tong Zhao, Gabriel Blanco Saldana, Saurabh Deshpande, Alexandre Michetti Manduca, Jay Ren, Surender Pal Singh, Fan Xiao, Haw-Shiuan Chang, Giannis Karamanolakis, Yuning Mao, Yaqing Wang, Christos Faloutsos, Andrew McCallum, Jiawei Han

    Abstract: Can one build a knowledge graph (KG) for all products in the world? Knowledge graphs have firmly established themselves as valuable sources of information for search and question answering, and it is natural to wonder if a KG can contain information about products offered at online retail sites. There have been several successful examples of generic KGs, but organizing information about products p… ▽ More

    Submitted 24 June, 2020; originally announced June 2020.

    Comments: KDD 2020

  45. arXiv:2005.00698  [pdf

    cs.HC cs.LG eess.SP

    Deep ConvLSTM with self-attention for human activity decoding using wearables

    Authors: Satya P. Singh, Aimé Lay-Ekuakille, Deepak Gangwar, Madan Kumar Sharma, Sukrit Gupta

    Abstract: Decoding human activity accurately from wearable sensors can aid in applications related to healthcare and context awareness. The present approaches in this domain use recurrent and/or convolutional models to capture the spatio-temporal features from time-series data from multiple sensors. We propose a deep neural network architecture that not only captures the spatio-temporal features of multiple… ▽ More

    Submitted 17 December, 2020; v1 submitted 2 May, 2020; originally announced May 2020.

    Comments: 8 pages, 2 figures, 3 tables. IEEE Sensors Journal, 2020

  46. arXiv:2004.14340  [pdf, other

    cs.LG stat.ML

    WoodFisher: Efficient Second-Order Approximation for Neural Network Compression

    Authors: Sidak Pal Singh, Dan Alistarh

    Abstract: Second-order information, in the form of Hessian- or Inverse-Hessian-vector products, is a fundamental tool for solving optimization problems. Recently, there has been significant interest in utilizing this information in the context of deep neural networks; however, relatively little is known about the quality of existing approximations in this context. Our work examines this question, identifies… ▽ More

    Submitted 25 November, 2020; v1 submitted 29 April, 2020; originally announced April 2020.

    Comments: NeurIPS 2020

  47. Conformation and dynamics of a self-avoiding active flexible polymer

    Authors: Shalabh K. Anand, Sunil P. Singh

    Abstract: We investigate conformations and dynamics of a polymer considering its monomers to be active Brownian particles. This active polymer shows very intriguing physical behavior which is absent in an active Rouse chain. The chain initially shrinks with active force, which starts swelling on further increase in force. The shrinkage followed by swelling is attributed purely to excluded-volume interaction… ▽ More

    Submitted 10 April, 2020; v1 submitted 9 April, 2020; originally announced April 2020.

    Journal ref: Physical Review E 101, 030501(R) (2020)

  48. arXiv:2004.01996  [pdf, other

    cond-mat.soft

    A phase separation of active colloidal suspension via Quorum-sensing

    Authors: Francis Jose, Shalabh K. Anand, Sunil P. Singh

    Abstract: We present the Brownian dynamics simulation of active colloidal suspension in two dimensions, where the self-propulsion speed of a colloid is regulated according to the local density sensed by it. The role of concentration-dependent motility on the phase-separation of colloids and their dynamics is investigated in detail. Interestingly, the system phase separates at a very low packing fraction (… ▽ More

    Submitted 28 January, 2021; v1 submitted 4 April, 2020; originally announced April 2020.

  49. arXiv:2004.00218  [pdf

    q-bio.QM cs.CV cs.LG eess.IV

    3D Deep Learning on Medical Images: A Review

    Authors: Satya P. Singh, Lipo Wang, Sukrit Gupta, Haveesh Goli, Parasuraman Padmanabhan, Balázs Gulyás

    Abstract: The rapid advancements in machine learning, graphics processing technologies and the availability of medical imaging data have led to a rapid increase in the use of deep learning models in the medical domain. This was exacerbated by the rapid advancements in convolutional neural network (CNN) based architectures, which were adopted by the medical imaging community to assist clinicians in disease d… ▽ More

    Submitted 13 October, 2020; v1 submitted 31 March, 2020; originally announced April 2020.

    Comments: Published in Sensors Journal (https://www.mdpi.com/1424-8220/20/18/5097)

    Journal ref: Sensors 2020, 20, 5097

  50. arXiv:1910.05653  [pdf, other

    cs.LG stat.ML

    Model Fusion via Optimal Transport

    Authors: Sidak Pal Singh, Martin Jaggi

    Abstract: Combining different models is a widely used paradigm in machine learning applications. While the most common approach is to form an ensemble of models and average their individual predictions, this approach is often rendered infeasible by given resource constraints in terms of memory and computation, which grow linearly with the number of models. We present a layer-wise model fusion algorithm for… ▽ More

    Submitted 16 May, 2023; v1 submitted 12 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2020 conference proceedings (early version featured in the Optimal Transport & Machine Learning workshop, NeurIPS 2019)