Zum Hauptinhalt springen

Showing 1–50 of 63 results for author: Shaikh, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.03450  [pdf, other

    cs.LG cs.CE

    Probabilistic Surrogate Model for Accelerating the Design of Electric Vehicle Battery Enclosures for Crash Performance

    Authors: Shadab Anwar Shaikh, Harish Cherukuri, Kranthi Balusu, Ram Devanathan, Ayoub Soulami

    Abstract: This paper presents a probabilistic surrogate model for the accelerated design of electric vehicle battery enclosures with a focus on crash performance. The study integrates high-throughput finite element simulations and Gaussian Process Regression to develop a surrogate model that predicts crash parameters with high accuracy while providing uncertainty estimates. The model was trained using data… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

  2. arXiv:2403.09040  [pdf, other

    cs.CL

    RAGGED: Towards Informed Design of Retrieval Augmented Generation Systems

    Authors: Jennifer Hsia, Afreen Shaikh, Zhiruo Wang, Graham Neubig

    Abstract: Retrieval-augmented generation (RAG) can significantly improve the performance of language models (LMs) by providing additional context for tasks such as document-based question answering (DBQA). However, the effectiveness of RAG is highly dependent on its configuration. To systematically find the optimal configuration, we introduce RAGGED, a framework for analyzing RAG configurations across vario… ▽ More

    Submitted 12 August, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

  3. arXiv:2401.13979  [pdf, other

    cs.CL cs.AI cs.LG

    Routoo: Learning to Route to Large Language Models Effectively

    Authors: Alireza Mohammadshahi, Arshad Rafiq Shaikh, Majid Yazdani

    Abstract: Developing foundational large language models (LLMs) is becoming increasingly costly and inefficient. Also, closed-source and larger open-source models generally offer better response quality but come with higher inference costs than smaller models. In this paper, we introduce Routoo, an architecture designed to optimize the selection of LLMs for specific prompts based on performance, cost, and ef… ▽ More

    Submitted 2 August, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

  4. arXiv:2401.08585  [pdf, other

    q-bio.NC cs.AI quant-ph

    From Conceptual Spaces to Quantum Concepts: Formalising and Learning Structured Conceptual Models

    Authors: Sean Tull, Razin A. Shaikh, Sara Sabrina Zemljic, Stephen Clark

    Abstract: In this article we present a new modelling framework for structured concepts using a category-theoretic generalisation of conceptual spaces, and show how the conceptual representations can be learned automatically from data, using two very different instantiations: one classical and one quantum. A contribution of the work is a thorough category-theoretic formalisation of our framework. We claim th… ▽ More

    Submitted 6 November, 2023; originally announced January 2024.

    Comments: This article consolidates our previous reports on concept formalisation and learning: arXiv:2302.14822 and arXiv:2203.11216

  5. arXiv:2401.08081  [pdf, other

    cs.LG cs.SI

    Predicting Next Useful Location With Context-Awareness: The State-Of-The-Art

    Authors: Alireza Nezhadettehad, Arkady Zaslavsky, Rakib Abdur, Siraj Ahmed Shaikh, Seng W. Loke, Guang-Li Huang, Alireza Hassani

    Abstract: Predicting the future location of mobile objects reinforces location-aware services with proactive intelligence and helps businesses and decision-makers with better planning and near real-time scheduling in different applications such as traffic congestion control, location-aware advertisements, and monitoring public health and well-being. The recent developments in the smartphone and location sen… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

  6. arXiv:2311.17892  [pdf, other

    cs.CL cs.AI

    A Pipeline For Discourse Circuits From CCG

    Authors: Jonathon Liu, Razin A. Shaikh, Benjamin Rodatz, Richie Yeung, Bob Coecke

    Abstract: There is a significant disconnect between linguistic theory and modern NLP practice, which relies heavily on inscrutable black-box architectures. DisCoCirc is a newly proposed model for meaning that aims to bridge this divide, by providing neuro-symbolic models that incorporate linguistic structure. DisCoCirc represents natural language text as a `circuit' that captures the core semantic informati… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: 39 pages, many figures

  7. arXiv:2311.05778  [pdf, other

    cs.CV cs.AI

    DONUT-hole: DONUT Sparsification by Harnessing Knowledge and Optimizing Learning Efficiency

    Authors: Azhar Shaikh, Michael Cochez, Denis Diachkov, Michiel de Rijcke, Sahar Yousefi

    Abstract: This paper introduces DONUT-hole, a sparse OCR-free visual document understanding (VDU) model that addresses the limitations of its predecessor model, dubbed DONUT. The DONUT model, leveraging a transformer architecture, overcoming the challenges of separate optical character recognition (OCR) and visual semantic understanding (VSU) components. However, its deployment in production environments an… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  8. arXiv:2310.19287  [pdf

    cs.LG

    Enhancing Scalability and Reliability in Semi-Decentralized Federated Learning With Blockchain: Trust Penalization and Asynchronous Functionality

    Authors: Ajay Kumar Shrestha, Faijan Ahamad Khan, Mohammed Afaan Shaikh, Amir Jaberzadeh, Jason Geng

    Abstract: The paper presents an innovative approach to address the challenges of scalability and reliability in Distributed Federated Learning by leveraging the integration of blockchain technology. The paper focuses on enhancing the trustworthiness of participating nodes through a trust penalization mechanism while also enabling asynchronous functionality for efficient and robust model updates. By combinin… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: To appear in 2023 IEEE Ubiquitous Computing, Electronics & Mobile Communication Conference (IEEE UEMCON)

  9. arXiv:2309.00637  [pdf

    cs.LG eess.SP

    Finite Element Analysis and Machine Learning Guided Design of Carbon Fiber Organosheet-based Battery Enclosures for Crashworthiness

    Authors: Shadab Anwar Shaikh, M. F. N. Taufique, Kranthi, Balusu, Shank S. Kulkarni, Forrest Hale, Jonathan Oleson, Ram Devanathan, Ayoub Soulami

    Abstract: Carbon fiber composite can be a potential candidate for replacing metal-based battery enclosures of current electric vehicles (E.V.s) owing to its better strength-to-weight ratio and corrosion resistance. However, the strength of carbon fiber-based structures depends on several parameters that should be carefully chosen. In this work, we implemented high throughput finite element analysis (FEA) ba… ▽ More

    Submitted 22 August, 2023; originally announced September 2023.

  10. arXiv:2307.10492  [pdf

    cs.LG

    Blockchain-Based Federated Learning: Incentivizing Data Sharing and Penalizing Dishonest Behavior

    Authors: Amir Jaberzadeh, Ajay Kumar Shrestha, Faijan Ahamad Khan, Mohammed Afaan Shaikh, Bhargav Dave, Jason Geng

    Abstract: With the increasing importance of data sharing for collaboration and innovation, it is becoming more important to ensure that data is managed and shared in a secure and trustworthy manner. Data governance is a common approach to managing data, but it faces many challenges such as data silos, data consistency, privacy, security, and access control. To address these challenges, this paper proposes a… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

    Comments: To appear in the 5th International Congress on Blockchain and Applications (BLOCKCHAIN'23). Publish by the Lecture Notes in Networks and Systems series of Springer Verlag

  11. arXiv:2306.09812  [pdf, other

    cs.HC

    Boundary Blending: Reconsidering the Design of Multi-View Visualizations

    Authors: Maoyuan Sun, Abdul Rahman Shaikh, Yue Ma, David Koop, Hamed Alhoori

    Abstract: Multiple-view visualizations (MVs) have been widely used for visual analysis. Each view shows some part of the data in a usable way, and together multiple views enable a holistic understanding of the data under investigation. For example, an analyst may check a social network graph, a map of sensitive locations, a table of transaction records, and a collection of reports to identify suspicious act… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    ACM Class: H.5.0

  12. arXiv:2303.04190  [pdf, other

    math.GR cs.FL

    Multivariate growth and cogrowth

    Authors: Rostislav Grigorchuk, Jean-Francois Quint, Asif Shaikh

    Abstract: We investigate a multivariate growth series $Γ_L({\bf z}), {\bf z} \in \mathbb{C}^d$ associated with a regular language $L$ over an alphabet of cardinality $d.$ Our focus is on languages coming from subgroups of the free group and from subshifts of finite type. We develop a mechanism for computing the rate of growth $\varphi_L({\bf r})$ of $L$ in the direction ${\bf r} \in \mathbb{R}^d$. Using the… ▽ More

    Submitted 27 November, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

    Comments: 40 pages, 10 figures, the revised version include the correction of definition 4.1 and the replacement of an incorrect figure 6a

    MSC Class: 20E05; 20F69; 05A05; 05A15; 05A16; 60F10; 68Q45

  13. arXiv:2302.14822  [pdf, other

    q-bio.NC cs.AI quant-ph

    Formalising and Learning a Quantum Model of Concepts

    Authors: Sean Tull, Razin A. Shaikh, Sara Sabrina Zemljic, Stephen Clark

    Abstract: In this report we present a new modelling framework for concepts based on quantum theory, and demonstrate how the conceptual representations can be learned automatically from data. A contribution of the work is a thorough category-theoretic formalisation of our framework. We claim that the use of category theory, and in particular the use of string diagrams to describe quantum processes, helps elu… ▽ More

    Submitted 7 February, 2023; originally announced February 2023.

  14. arXiv:2302.00995  [pdf, other

    cs.CV

    Open-Set Multi-Source Multi-Target Domain Adaptation

    Authors: Rohit Lal, Arihant Gaur, Aadhithya Iyer, Muhammed Abdullah Shaikh, Ritik Agrawal

    Abstract: Single-Source Single-Target Domain Adaptation (1S1T) aims to bridge the gap between a labelled source domain and an unlabelled target domain. Despite 1S1T being a well-researched topic, they are typically not deployed to the real world. Methods like Multi-Source Domain Adaptation and Multi-Target Domain Adaptation have evolved to model real-world problems but still do not generalise well. The fact… ▽ More

    Submitted 3 February, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Comments: Accepted in NeurIPS 2021 Workshop on Pre-registration in Machine Learning

  15. arXiv:2210.12467  [pdf, other

    cs.CL

    ECTSum: A New Benchmark Dataset For Bullet Point Summarization of Long Earnings Call Transcripts

    Authors: Rajdeep Mukherjee, Abhinav Bohra, Akash Banerjee, Soumya Sharma, Manjunath Hegde, Afreen Shaikh, Shivani Shrivastava, Koustuv Dasgupta, Niloy Ganguly, Saptarshi Ghosh, Pawan Goyal

    Abstract: Despite tremendous progress in automatic summarization, state-of-the-art methods are predominantly trained to excel in summarizing short newswire articles, or documents with strong layout biases such as scientific articles or government reports. Efficient techniques to summarize financial documents, including facts and figures, have largely been unexplored, majorly due to the unavailability of sui… ▽ More

    Submitted 26 October, 2022; v1 submitted 22 October, 2022; originally announced October 2022.

    Comments: 14 pages; Accepted as a Long Paper in EMNLP 2022 (Main Conference); Codes: https://github.com/rajdeep345/ECTSum

    ACM Class: I.2.7

  16. arXiv:2210.10005  [pdf, other

    cs.CV

    Otsu based Differential Evolution Method for Image Segmentation

    Authors: Afreen Shaikh, Sharmila Botcha, Murali Krishna

    Abstract: This paper proposes an OTSU based differential evolution method for satellite image segmentation and compares it with four other methods such as Modified Artificial Bee Colony Optimizer (MABC), Artificial Bee Colony (ABC), Genetic Algorithm (GA), and Particle Swarm Optimization (PSO) using the objective function proposed by Otsu for optimal multilevel thresholding. The experiments conducted and th… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    ACM Class: I.2.10; I.4.6

  17. arXiv:2210.08961  [pdf

    cs.CY

    Determinants Influencing Intention to Use Social Commerce for Shopping in developing countries: A Case Study of Oman

    Authors: Shamma Al Harizi, Maryam Al Areimi, Abdul. Khalique Shaikh

    Abstract: Social media has had a significant impact on our individual lives, including our behavior regarding the purchasing of daily products. This study investigates the factors influencing Omani nationals' intentions to obtain products via social commerce. The researcher surveyed 202 participants and utilized the Technology Acceptance Model to develop the theoretical framework. The data collection was an… ▽ More

    Submitted 22 September, 2022; originally announced October 2022.

    Comments: 17 Pages

  18. arXiv:2209.11284  [pdf

    cs.DL cs.CY cs.SI

    The Impact of Social Media in Learning and Teaching: A Bibliometric-based Citation Analysis

    Authors: Abdul Shaikh, Saqib Ali, Ramla Al-Maamari

    Abstract: This paper presents the results of a systematic review of the literature on the impact of social media in learning and teaching through bibliometric based Citation analysis. The objective of the review was to map the evolution of the current literature and identify the leading sources of knowledge in terms of the most influential journals, authors, and articles. From a total of 50 top most relevan… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

    Comments: 14 Pages

    Journal ref: 2021

  19. arXiv:2209.02380  [pdf, other

    cs.DL cs.LG

    YouTube and Science: Models for Research Impact

    Authors: Abdul Rahman Shaikh, Hamed Alhoori, Maoyuan Sun

    Abstract: Video communication has been rapidly increasing over the past decade, with YouTube providing a medium where users can post, discover, share, and react to videos. There has also been an increase in the number of videos citing research articles, especially since it has become relatively commonplace for academic conferences to require video submissions. However, the relationship between research arti… ▽ More

    Submitted 1 September, 2022; originally announced September 2022.

    Comments: 21 pages, 12 figures, Scientometrics Journal

  20. arXiv:2207.07558  [pdf, other

    cs.HC

    Toward Systematic Design Considerations of Organizing Multiple Views

    Authors: Abdul Rahman Shaikh, David Koop, Hamed Alhoori, Maoyuan Sun

    Abstract: Multiple-view visualization (MV) has been used for visual analytics in various fields (e.g., bioinformatics, cybersecurity, and intelligence analysis). Because each view encodes data from a particular perspective, analysts often use a set of views laid out in 2D space to link and synthesize information. The difficulty of this process is impacted by the spatial organization of these views. For inst… ▽ More

    Submitted 15 July, 2022; originally announced July 2022.

    Comments: Short paper with 4 pages + 1 reference page, 2 figures, 1 table, accepted at IEEE VIS 2022 conference

  21. arXiv:2205.10962  [pdf, other

    cs.CR

    Digital Twin for Secure Semiconductor Lifecycle Management: Prospects and Applications

    Authors: Hasan Al Shaikh, Mohammad Bin Monjil, Shigang Chen, Navid Asadizanjani, Farimah Farahmandi, Mark Tehranipoor, Fahim Rahman

    Abstract: The expansive globalization of the semiconductor supply chain has introduced numerous untrusted entities into different stages of a device's lifecycle. To make matters worse, the increase complexity in the design as well as aggressive time to market requirements of the newer generation of integrated circuits can lead either designers to unintentionally introduce security vulnerabilities or verific… ▽ More

    Submitted 24 May, 2022; v1 submitted 22 May, 2022; originally announced May 2022.

    Comments: 37 pages including citations, 14 figures, first edit contained minor repositioning of some of the images

  22. arXiv:2205.00466  [pdf, other

    quant-ph cs.LO math.CT

    Categorical Semantics for Feynman Diagrams

    Authors: Razin A. Shaikh, Stefano Gogioso

    Abstract: We introduce a novel compositional description of Feynman diagrams, with well-defined categorical semantics as morphisms in a dagger-compact category. Our chosen setting is suitable for infinite-dimensional diagrammatic reasoning, generalising the ZX calculus and other algebraic gadgets familiar to the categorical quantum theory community. The Feynman diagrams we define look very similar to thei… ▽ More

    Submitted 1 May, 2022; originally announced May 2022.

    Comments: Submitted to QPL 2022

  23. arXiv:2204.02179  [pdf, other

    cs.NE cs.AI

    Towards Robust and Accurate Myoelectric Controller Design based on Multi-objective Optimization using Evolutionary Computation

    Authors: Ahmed Aqeel Shaikh, Anand Kumar Mukhopadhyay, Soumyajit Poddar, Suman Samui

    Abstract: Myoelectric pattern recognition is one of the important aspects in the design of the control strategy for various applications including upper-limb prostheses and bio-robotic hand movement systems. The current work has proposed an approach to design an energy-efficient EMG-based controller by considering a kernelized SVM classifier for decoding the information of surface electromyography (sEMG) si… ▽ More

    Submitted 22 May, 2023; v1 submitted 2 April, 2022; originally announced April 2022.

    Comments: This is the updated paper

  24. arXiv:2203.11216  [pdf, other

    cs.LG cs.AI

    The Conceptual VAE

    Authors: Razin A. Shaikh, Sara Sabrina Zemljic, Sean Tull, Stephen Clark

    Abstract: In this report we present a new model of concepts, based on the framework of variational autoencoders, which is designed to have attractive properties such as factored conceptual domains, and at the same time be learnable from data. The model is inspired by, and closely related to, the Beta-VAE model of concepts, but is designed to be more closely connected with language, so that the names of conc… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

  25. arXiv:2203.00295  [pdf, other

    cs.LG

    A Domain-Theoretic Framework for Robustness Analysis of Neural Networks

    Authors: Can Zhou, Razin A. Shaikh, Yiran Li, Amin Farjudian

    Abstract: A domain-theoretic framework is presented for validated robustness analysis of neural networks. First, global robustness of a general class of networks is analyzed. Then, using the fact that Edalat's domain-theoretic L-derivative coincides with Clarke's generalized gradient, the framework is extended for attack-agnostic local robustness analysis. The proposed framework is ideal for designing algor… ▽ More

    Submitted 9 January, 2023; v1 submitted 1 March, 2022; originally announced March 2022.

    Comments: 35 pages, 10 figures, 3 tables

    MSC Class: 06B35; 68Q55; 49J52; 68T37

  26. arXiv:2202.04650  [pdf

    eess.IV cs.AI cs.CV cs.LG

    Semantic Segmentation of Anaemic RBCs Using Multilevel Deep Convolutional Encoder-Decoder Network

    Authors: Muhammad Shahzad, Arif Iqbal Umar, Syed Hamad Shirazi, Israr Ahmed Shaikh

    Abstract: Pixel-level analysis of blood images plays a pivotal role in diagnosing blood-related diseases, especially Anaemia. These analyses mainly rely on an accurate diagnosis of morphological deformities like shape, size, and precise pixel counting. In traditional segmentation approaches, instance or object-based approaches have been adopted that are not feasible for pixel-level analysis. The convolution… ▽ More

    Submitted 9 February, 2022; originally announced February 2022.

  27. arXiv:2112.09569  [pdf, other

    cs.CV cs.AI cs.LG

    CPPE-5: Medical Personal Protective Equipment Dataset

    Authors: Rishit Dagli, Ali Mustufa Shaikh

    Abstract: We present a new challenging dataset, CPPE - 5 (Medical Personal Protective Equipment), with the goal to allow the study of subordinate categorization of medical personal protective equipments, which is not possible with other popular data sets that focus on broad-level categories (such as PASCAL VOC, ImageNet, Microsoft COCO, OpenImages, etc). To make it easy for models trained on this dataset to… ▽ More

    Submitted 18 February, 2023; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: 18 pages, 6 tables, 6 figures. Code and models are available at https://git.io/cppe5-dataset

  28. arXiv:2111.12317  [pdf, other

    cs.CL

    Handling tree-structured text: parsing directory pages

    Authors: Sarang Shrivastava, Afreen Shaikh, Shivani Shrivastava, Chung Ming Ho, Pradeep Reddy, Vijay Saraswat

    Abstract: The determination of the reading sequence of text is fundamental to document understanding. This problem is easily solved in pages where the text is organized into a sequence of lines and vertical alignment runs the height of the page (producing multiple columns which can be read from left to right). We present a situation -- the directory page parsing problem -- where information is presented on… ▽ More

    Submitted 24 November, 2021; originally announced November 2021.

  29. arXiv:2111.11554  [pdf, other

    cs.OS cs.LG

    KML: Using Machine Learning to Improve Storage Systems

    Authors: Ibrahim Umit Akgun, Ali Selman Aydin, Andrew Burford, Michael McNeill, Michael Arkhangelskiy, Aadil Shaikh, Lukas Velikov, Erez Zadok

    Abstract: Operating systems include many heuristic algorithms designed to improve overall storage performance and throughput. Because such heuristics cannot work well for all conditions and workloads, system designers resorted to exposing numerous tunable parameters to users -- thus burdening users with continually optimizing their own storage systems and applications. Storage systems are usually responsibl… ▽ More

    Submitted 25 January, 2022; v1 submitted 22 November, 2021; originally announced November 2021.

    Comments: 17 pages, 13 figures

    Report number: Stony Brook U. CS TechReport FSL-21-02

  30. arXiv:2109.04993  [pdf, other

    cs.CV cs.AI cs.CL

    LAViTeR: Learning Aligned Visual and Textual Representations Assisted by Image and Caption Generation

    Authors: Mohammad Abuzar Shaikh, Zhanghexuan Ji, Dana Moukheiber, Yan Shen, Sargur Srihari, Mingchen Gao

    Abstract: Pre-training visual and textual representations from large-scale image-text pairs is becoming a standard approach for many downstream vision-language tasks. The transformer-based models learn inter and intra-modal attention through a list of self-supervised learning tasks. This paper proposes LAViTeR, a novel architecture for visual and textual representation learning. The main module, Visual Text… ▽ More

    Submitted 19 October, 2021; v1 submitted 4 September, 2021; originally announced September 2021.

    Comments: 14 pages, 10 Figures, 5 Tables

  31. arXiv:2109.01949  [pdf, other

    cs.LG cs.AI cs.CL cs.CV eess.IV

    Improving Joint Learning of Chest X-Ray and Radiology Report by Word Region Alignment

    Authors: Zhanghexuan Ji, Mohammad Abuzar Shaikh, Dana Moukheiber, Sargur Srihari, Yifan Peng, Mingchen Gao

    Abstract: Self-supervised learning provides an opportunity to explore unlabeled chest X-rays and their associated free-text reports accumulated in clinical routine without manual supervision. This paper proposes a Joint Image Text Representation Learning Network (JoImTeRNet) for pre-training on chest X-ray images and their radiology reports. The model was pre-trained on both the global image-sentence level… ▽ More

    Submitted 4 September, 2021; originally announced September 2021.

    Comments: 10 Pages, 1 Figure, 3 Tables, Accepted in 12th Machine Learning in Medical Imaging (MLMI 2021) workshop

  32. arXiv:2108.12835  [pdf

    cs.NI

    Performance Evaluation of Ad Hoc Multicast Routing Protocols to Facilitate Video Streaming in VANETS

    Authors: Muhammad Danish Khan, Arshad Shaikh

    Abstract: Vehicular Ad Hoc Network (VANET) is a type of mobile ad hoc network (MANET) that facilitates communication among vehicles. VANET provides inter-vehicular communications to serve for the application like road traffic safety and traffic efficiency. Infotainment service has been an anticipating trend in VANETs, and video streaming has a high potential in VANET. Although, this emerging technology is t… ▽ More

    Submitted 29 August, 2021; originally announced August 2021.

  33. SightBi: Exploring Cross-View Data Relationships with Biclusters

    Authors: Maoyuan Sun, Abdul Rahman Shaikh, Hamed Alhoori, Jian Zhao

    Abstract: Multiple-view visualization (MV) has been heavily used in visual analysis tools for sensemaking of data in various domains (e.g., bioinformatics, cybersecurity and text analytics). One common task of visual analysis with multiple views is to relate data across different views. For example, to identify threats, an intelligence analyst needs to link people from a social network graph with locations… ▽ More

    Submitted 27 September, 2021; v1 submitted 2 August, 2021; originally announced August 2021.

    Comments: IEEE VIS 2021, ACM 2012 CCS - Human-centered computing, Visualization, Visualization design and evaluation methods

    ACM Class: H.5.2

    Journal ref: IEEE Transactions on Visualization and Computer Graphics, 2021

  34. Composing Conversational Negation

    Authors: Razin A. Shaikh, Lia Yeh, Benjamin Rodatz, Bob Coecke

    Abstract: Negation in natural language does not follow Boolean logic and is therefore inherently difficult to model. In particular, it takes into account the broader understanding of what is being negated. In previous work, we proposed a framework for the negation of words that accounts for 'worldly context'. This paper extends that proposal now accounting for the compositional structure inherent in languag… ▽ More

    Submitted 3 November, 2022; v1 submitted 14 July, 2021; originally announced July 2021.

    Comments: In Proceedings ACT 2021, arXiv:2211.01102

    Journal ref: EPTCS 372, 2022, pp. 352-367

  35. arXiv:2107.01516  [pdf, other

    cs.IR cs.LG

    Introducing Self-Attention to Target Attentive Graph Neural Networks

    Authors: Sai Mitheran, Abhinav Java, Surya Kant Sahu, Arshad Shaikh

    Abstract: Session-based recommendation systems suggest relevant items to users by modeling user behavior and preferences using short-term anonymous sessions. Existing methods leverage Graph Neural Networks (GNNs) that propagate and aggregate information from neighboring nodes i.e., local message passing. Such graph-based architectures have representational limits, as a single sub-graph is susceptible to ove… ▽ More

    Submitted 7 January, 2022; v1 submitted 3 July, 2021; originally announced July 2021.

    Comments: Accepted at AISP 2022

    ACM Class: H.3.3; I.2.1

  36. arXiv:2105.05748  [pdf, other

    cs.CL math.CT quant-ph

    Conversational Negation using Worldly Context in Compositional Distributional Semantics

    Authors: Benjamin Rodatz, Razin A. Shaikh, Lia Yeh

    Abstract: We propose a framework to model an operational conversational negation by applying worldly context (prior knowledge) to logical negation in compositional distributional semantics. Given a word, our framework can create its negation that is similar to how humans perceive negation. The framework corrects logical negation to weight meanings closer in the entailment hierarchy more than meanings furthe… ▽ More

    Submitted 12 May, 2021; originally announced May 2021.

    Comments: 13 pages, 5 figures, To be published in Proceedings of SEMSPACE 2021 and to appear in the ACL anthology

  37. arXiv:2105.03358  [pdf, other

    eess.IV cs.CV cs.LG

    Soft-Attention Improves Skin Cancer Classification Performance

    Authors: Soumyya Kanti Datta, Mohammad Abuzar Shaikh, Sargur N. Srihari, Mingchen Gao

    Abstract: In clinical applications, neural networks must focus on and highlight the most important parts of an input image. Soft-Attention mechanism enables a neural network toachieve this goal. This paper investigates the effectiveness of Soft-Attention in deep neural architectures. The central aim of Soft-Attention is to boost the value of important features and suppress the noise-inducing features. We co… ▽ More

    Submitted 4 June, 2021; v1 submitted 4 May, 2021; originally announced May 2021.

    Comments: 8 pages, 9 figures, 4 tables

  38. arXiv:2102.03313  [pdf, other

    cs.LG

    Rethinking Neural Networks With Benford's Law

    Authors: Surya Kant Sahu, Abhinav Java, Arshad Shaikh, Yannic Kilcher

    Abstract: Benford's Law (BL) or the Significant Digit Law defines the probability distribution of the first digit of numerical values in a data sample. This Law is observed in many naturally occurring datasets. It can be seen as a measure of naturalness of a given distribution and finds its application in areas like anomaly and fraud detection. In this work, we address the following question: Is the distrib… ▽ More

    Submitted 22 October, 2021; v1 submitted 5 February, 2021; originally announced February 2021.

    Comments: Short version accepted to NeurIPS 2021 ML4PS Workshop

  39. arXiv:2102.02335  [pdf, other

    cs.CL cs.AI cs.IR

    Self-Supervised Claim Identification for Automated Fact Checking

    Authors: Archita Pathak, Mohammad Abuzar Shaikh, Rohini Srihari

    Abstract: We propose a novel, attention-based self-supervised approach to identify "claim-worthy" sentences in a fake news article, an important first step in automated fact-checking. We leverage "aboutness" of headline and content using attention mechanism for this task. The identified claims can be used for downstream task of claim verification for which we are releasing a benchmark dataset of manually se… ▽ More

    Submitted 3 February, 2021; originally announced February 2021.

    Comments: 15 pages, 4 figures, Accepted at ICON 2020

  40. arXiv:2011.13638  [pdf

    cs.HC cs.CY cs.SE

    Human Computations in Citizen Crowds: A Knowledge Management Solution Framework

    Authors: Nadeem Kafi, Zubair Ahmed Shaikh, Muhammad Shahid Shaikh

    Abstract: KG (Knowledge Generation) and understanding have traditionally been a Human-centric activity. KE (Knowledge Engineering) and KM (Knowledge Management) have tried to augment human knowledge on two separate planes: the first deals with machine interpretation of knowledge while the later explore interactions in human networks for KG and understanding. However, both remain computer-centric. Crowdsourc… ▽ More

    Submitted 27 November, 2020; originally announced November 2020.

    Journal ref: Mehran University Research Journal of Engineering & Technology, Vol. 37, No. 3, 513-528 July 2018, p-ISSN: 0254-7821, e-ISSN: 2413-7219

  41. arXiv:2011.10568  [pdf, other

    cs.LG cs.AI cs.NE

    Learn to Bind and Grow Neural Structures

    Authors: Azhar Shaikh, Nishant Sinha

    Abstract: Task-incremental learning involves the challenging problem of learning new tasks continually, without forgetting past knowledge. Many approaches address the problem by expanding the structure of a shared neural network as tasks arrive, but struggle to grow optimally, without losing past knowledge. We present a new framework, Learn to Bind and Grow, which learns a neural architecture for a new task… ▽ More

    Submitted 21 November, 2020; originally announced November 2020.

    Comments: Accepted to 8th ACM IKDD CODS and 26th COMAD (CODS-COMAD '21) conference

  42. arXiv:2009.04532  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Attention based Writer Independent Handwriting Verification

    Authors: Mohammad Abuzar Shaikh, Tiehang Duan, Mihir Chauhan, Sargur Srihari

    Abstract: The task of writer verification is to provide a likelihood score for whether the queried and known handwritten image samples belong to the same writer or not. Such a task calls for the neural network to make it's outcome interpretable, i.e. provide a view into the network's decision making process. We implement and integrate cross-attention and soft-attention mechanisms to capture the highly corre… ▽ More

    Submitted 30 September, 2020; v1 submitted 7 September, 2020; originally announced September 2020.

    Comments: 7 pages, 6 figures, Published in 2020 17th International Conference on Frontiers in Handwriting Recognition (ICFHR)

  43. arXiv:2005.08442  [pdf, other

    cs.LG q-bio.QM stat.ML

    Parsimonious Computing: A Minority Training Regime for Effective Prediction in Large Microarray Expression Data Sets

    Authors: Shailesh Sridhar, Snehanshu Saha, Azhar Shaikh, Rahul Yedida, Sriparna Saha

    Abstract: Rigorous mathematical investigation of learning rates used in back-propagation in shallow neural networks has become a necessity. This is because experimental evidence needs to be endorsed by a theoretical background. Such theory may be helpful in reducing the volume of experimental effort to accomplish desired results. We leveraged the functional property of Mean Square Error, which is Lipschitz… ▽ More

    Submitted 17 May, 2020; originally announced May 2020.

  44. arXiv:2004.03352  [pdf, other

    cs.DB cs.DC

    GeoFlink: A Distributed and Scalable Framework for the Real-time Processing of Spatial Streams

    Authors: Salman Ahmed Shaikh, Komal Mariam, Hiroyuki Kitagawa, Kyoung-Sook Kim

    Abstract: Apache Flink is an open-source system for scalable processing of batch and streaming data. Flink does not natively support efficient processing of spatial data streams, which is a requirement of many applications dealing with spatial data. Besides Flink, other scalable spatial data processing platforms including GeoSpark, Spatial Hadoop, etc. do not support streaming workloads and can only handle… ▽ More

    Submitted 2 August, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

    Comments: CIKM 2020 Preprint

  45. arXiv:2003.06113  [pdf, ps, other

    cs.LG eess.SP stat.ML

    Ultra Efficient Transfer Learning with Meta Update for Cross Subject EEG Classification

    Authors: Tiehang Duan, Mihir Chauhan, Mohammad Abuzar Shaikh, Jun Chu, Sargur Srihari

    Abstract: The pattern of Electroencephalogram (EEG) signal differs significantly across different subjects, and poses challenge for EEG classifiers in terms of 1) effectively adapting a learned classifier onto a new subject, 2) retaining knowledge of known subjects after the adaptation. We propose an efficient transfer learning method, named Meta UPdate Strategy (MUPS-EEG), for continuous EEG classification… ▽ More

    Submitted 1 March, 2021; v1 submitted 13 March, 2020; originally announced March 2020.

  46. arXiv:1909.02548  [pdf, other

    cs.CV cs.LG

    Explanation based Handwriting Verification

    Authors: Mihir Chauhan, Mohammad Abuzar Shaikh, Sargur N. Srihari

    Abstract: Deep learning system have drawback that their output is not accompanied with ex-planation. In a domain such as forensic handwriting verification it is essential to provideexplanation to jurors. The goal of handwriting verification is to find a measure of confi-dence whether the given handwritten samples are written by the same or different writer.We propose a method to generate explanations for th… ▽ More

    Submitted 14 August, 2019; originally announced September 2019.

    Comments: Presented at BMVC 2019: Workshop on Interpretable and Explainable Machine Vision, Cardiff, UK

  47. arXiv:1906.08244  [pdf, other

    cs.DL cs.LG econ.GN

    Predicting Patent Citations to measure Economic Impact of Scholarly Research

    Authors: Abdul Rahman Shaikh, Hamed Alhoori

    Abstract: A crucial goal of funding research and development has always been to advance economic development. On this basis, a consider-able body of research undertaken with the purpose of determining what exactly constitutes economic impact and how to accurately measure that impact has been published. Numerous indicators have been used to measure economic impact, although no single indicator has been widel… ▽ More

    Submitted 7 June, 2019; originally announced June 2019.

    Comments: 2 Pages, 1 figure, JCDL conference

  48. Hybrid Feature Learning for Handwriting Verification

    Authors: Mohammad Abuzar Shaikh, Mihir Chauhan, Jun Chu, Sargur Srihari

    Abstract: We propose an effective Hybrid Deep Learning (HDL) architecture for the task of determining the probability that a questioned handwritten word has been written by a known writer. HDL is an amalgamation of Auto-Learned Features (ALF) and Human-Engineered Features (HEF). To extract auto-learned features we use two methods: First, Two Channel Convolutional Neural Network (TC-CNN); Second, Two Channel… ▽ More

    Submitted 18 November, 2018; originally announced December 2018.

    Comments: Accepted and presented in International Conference on Frontiers in Handwriting Recognition (ICFHR) 2018

  49. arXiv:1802.04845  [pdf

    cs.CY

    Using Naive Bayes Algorithm to Students' bachelor Academic Performances Analysis

    Authors: Fahad Razaque, Nareena Soomro, Shoaib Ahmed Shaikh, Safeeullah Soomro, Javed Ahmed Samo, Natesh Kumar, Huma Dharejo

    Abstract: Academic Data Mining was one of emerging field which comprise procedure of examined students details by different elements such as earlier semester marks, attendance, assignment, discussion, lab work were of used to improved bachelor academic performance of students, and overcome difficulties of low ranks of bachelor students. It was extracted useful knowledge from bachelor academic students data… ▽ More

    Submitted 5 February, 2018; originally announced February 2018.

    Comments: 2017

    Journal ref: IEEE Proceedings ICETAS 2017

  50. arXiv:1801.02430  [pdf

    cs.CR cs.CY q-bio.QM

    A Novel Hybrid Biometric Electronic Voting System: Integrating Finger Print and Face Recognition

    Authors: Shahram Najam Syed, Aamir Zeb Shaikh, Shabbar Naqvi

    Abstract: A novel hybrid design based electronic voting system is proposed, implemented and analyzed. The proposed system uses two voter verification techniques to give better results in comparison to single identification based systems. Finger print and facial recognition based methods are used for voter identification. Cross verification of a voter during an election process provides better accuracy than… ▽ More

    Submitted 5 January, 2018; originally announced January 2018.

    Journal ref: Mehran University Research Journal of Engineering and Technology, Mehran University Research Journal of Engineering and Technology, 2018, 37 (1), pp.59-68. http://publications.muet.edu.pk/index.php/muetrj/article/view/100/50