Zum Hauptinhalt springen

Showing 1–50 of 62 results for author: Zaki, M

Searching in archive cs. Search in all archives.
.
  1. LLaVA-Chef: A Multi-modal Generative Model for Food Recipes

    Authors: Fnu Mohbat, Mohammed J. Zaki

    Abstract: In the rapidly evolving landscape of online recipe sharing within a globalized context, there has been a notable surge in research towards comprehending and generating food recipes. Recent advancements in large language models (LLMs) like GPT-2 and LLaVA have paved the way for Natural Language Processing (NLP) approaches to delve deeper into various facets of food-related tasks, encompassing ingre… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

  2. arXiv:2406.08530  [pdf, other

    cs.DB

    Validating Temporal Compliance Patterns: A Unified Approach with $MTL_f$ over various Data Models

    Authors: Nesma M. Zaki, Iman M. A. Helal, Ehab E. Hassanein, Ahmed Awad

    Abstract: Process mining extracts valuable insights from event data to help organizations improve their business processes, which is essential for their growth and success. By leveraging process mining techniques, organizations gain a comprehensive understanding of their processes' execution, enabling the discovery of process models, detection of deviations, identification of bottlenecks, and assessment of… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  3. arXiv:2405.20587  [pdf, ps, other

    cs.NI eess.SP

    Quality-Aware Task Offloading for Cooperative Perception in Vehicular Edge Computing

    Authors: Amr M. Zaki, Sara A. Elsayed, Khalid Elgazzar, Hossam S. Hassanein

    Abstract: Task offloading in Vehicular Edge Computing (VEC) can advance cooperative perception (CP) to improve traffic awareness in Autonomous Vehicles. In this paper, we propose the Quality-aware Cooperative Perception Task Offloading (QCPTO) scheme. Q-CPTO is the first task offloading scheme that enhances traffic awareness by prioritizing the quality rather than the quantity of cooperative perception. Q-C… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  4. arXiv:2403.15469  [pdf, other

    cs.CL cs.LG eess.AS

    Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning

    Authors: Shivam Ratnakant Mhaskar, Nirmesh J. Shah, Mohammadi Zaki, Ashishkumar P. Gudmalwar, Pankaj Wasnik, Rajiv Ratn Shah

    Abstract: Traditional Automatic Video Dubbing (AVD) pipeline consists of three key modules, namely, Automatic Speech Recognition (ASR), Neural Machine Translation (NMT), and Text-to-Speech (TTS). Within AVD pipelines, isometric-NMT algorithms are employed to regulate the length of the synthesized output text. This is done to guarantee synchronization with respect to the alignment of video and audio subseque… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Accepted in NAACL2024 Findings

  5. arXiv:2402.06185  [pdf, other

    cs.CV cs.AI cs.LG

    Development and validation of an artificial intelligence model to accurately predict spinopelvic parameters

    Authors: Edward S. Harake, Joseph R. Linzey, Cheng Jiang, Rushikesh S. Joshi, Mark M. Zaki, Jaes C. Jones, Siri S. Khalsa, John H. Lee, Zachary Wilseck, Jacob R. Joseph, Todd C. Hollon, Paul Park

    Abstract: Objective. Achieving appropriate spinopelvic alignment has been shown to be associated with improved clinical symptoms. However, measurement of spinopelvic radiographic parameters is time-intensive and interobserver reliability is a concern. Automated measurement tools have the promise of rapid and consistent measurements, but existing tools are still limited by some degree of manual user-entry re… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 10 pages, 5 figures, to appear in Journal of Neurosurgery: Spine

  6. arXiv:2402.04538  [pdf, other

    cs.LG

    Triplet Interaction Improves Graph Transformers: Accurate Molecular Graph Learning with Triplet Graph Transformers

    Authors: Md Shamim Hussain, Mohammed J. Zaki, Dharmashankar Subramanian

    Abstract: Graph transformers typically lack third-order interactions, limiting their geometric understanding which is crucial for tasks like molecular geometry prediction. We propose the Triplet Graph Transformer (TGT) that enables direct communication between pairs within a 3-tuple of nodes via novel triplet attention and aggregation mechanisms. TGT is applied to molecular property prediction by first pred… ▽ More

    Submitted 9 June, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: ICML'24 Accepted Version, 25 pages, 10 figures, 18 tables

  7. arXiv:2310.08383  [pdf, other

    cs.CL cond-mat.mtrl-sci

    Reconstructing Materials Tetrahedron: Challenges in Materials Information Extraction

    Authors: Kausik Hira, Mohd Zaki, Dhruvil Sheth, Mausam, N M Anoop Krishnan

    Abstract: The discovery of new materials has a documented history of propelling human progress for centuries and more. The behaviour of a material is a function of its composition, structure, and properties, which further depend on its processing and testing conditions. Recent developments in deep learning and natural language processing have enabled information extraction at scale from published literature… ▽ More

    Submitted 26 April, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Journal ref: Digital Discovery, 2024, Advance Article

  8. arXiv:2308.09115  [pdf

    cs.CL cond-mat.mtrl-sci

    MaScQA: A Question Answering Dataset for Investigating Materials Science Knowledge of Large Language Models

    Authors: Mohd Zaki, Jayadeva, Mausam, N. M. Anoop Krishnan

    Abstract: Information extraction and textual comprehension from materials literature are vital for developing an exhaustive knowledge base that enables accelerated materials discovery. Language models have demonstrated their capability to answer domain-specific questions and retrieve information from knowledge bases. However, there are no benchmark datasets in the materials domain that can evaluate the unde… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

  9. arXiv:2306.03209  [pdf, other

    cs.LG

    End-to-end Differentiable Clustering with Associative Memories

    Authors: Bishwajit Saha, Dmitry Krotov, Mohammed J. Zaki, Parikshit Ram

    Abstract: Clustering is a widely used unsupervised learning technique involving an intensive discrete optimization problem. Associative Memory models or AMs are differentiable neural networks defining a recursive dynamical system, which have been integrated with various deep learning architectures. We uncover a novel connection between the AM dynamics and the inherent discrete assignment necessary in cluste… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted to ICML 2023

  10. The Information Pathways Hypothesis: Transformers are Dynamic Self-Ensembles

    Authors: Md Shamim Hussain, Mohammed J. Zaki, Dharmashankar Subramanian

    Abstract: Transformers use the dense self-attention mechanism which gives a lot of flexibility for long-range connectivity. Over multiple layers of a deep transformer, the number of possible connectivity patterns increases exponentially. However, very few of these contribute to the performance of the network, and even fewer are essential. We hypothesize that there are sparsely connected sub-networks within… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: KDD23 preprint, 12 pages, 7 figures, 10 tables

  11. arXiv:2305.17219  [pdf

    cs.CV cs.CL cs.LG

    GVdoc: Graph-based Visual Document Classification

    Authors: Fnu Mohbat, Mohammed J. Zaki, Catherine Finegan-Dollak, Ashish Verma

    Abstract: The robustness of a model for real-world deployment is decided by how well it performs on unseen data and distinguishes between in-domain and out-of-domain samples. Visual document classifiers have shown impressive performance on in-distribution test sets. However, they tend to have a hard time correctly classifying and differentiating out-of-distribution examples. Image-based classifiers lack the… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

  12. arXiv:2302.07253  [pdf, other

    cs.LG cond-mat.dis-nn cs.CV q-bio.NC stat.ML

    Energy Transformer

    Authors: Benjamin Hoover, Yuchen Liang, Bao Pham, Rameswar Panda, Hendrik Strobelt, Duen Horng Chau, Mohammed J. Zaki, Dmitry Krotov

    Abstract: Our work combines aspects of three promising paradigms in machine learning, namely, attention mechanism, energy-based models, and associative memory. Attention is the power-house driving modern deep learning successes, but it lacks clear theoretical foundations. Energy-based models allow a principled approach to discriminative and generative tasks, but the design of the energy functional is not st… ▽ More

    Submitted 31 October, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Journal ref: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  13. arXiv:2211.03223  [pdf

    cs.CV cond-mat.mtrl-sci eess.IV

    Cementron: Machine Learning the Constituent Phases in Cement Clinker from Optical Images

    Authors: Mohd Zaki, Siddhant Sharma, Sunil Kumar Gurjar, Raju Goyal, Jayadeva, N. M. Anoop Krishnan

    Abstract: Cement is the most used construction material. The performance of cement hydrate depends on the constituent phases, viz. alite, belite, aluminate, and ferrites present in the cement clinker, both qualitatively and quantitatively. Traditionally, clinker phases are analyzed from optical images relying on a domain expert and simple image processing techniques. However, the non-uniformity of the image… ▽ More

    Submitted 6 November, 2022; originally announced November 2022.

  14. arXiv:2208.14376  [pdf, other

    cs.LG cs.NE cs.SI q-bio.NC stat.ML

    Associative Learning for Network Embedding

    Authors: Yuchen Liang, Dmitry Krotov, Mohammed J. Zaki

    Abstract: The network embedding task is to represent the node in the network as a low-dimensional vector while incorporating the topological and structural information. Most existing approaches solve this problem by factorizing a proximity matrix, either directly or implicitly. In this work, we introduce a network embedding method from a new perspective, which leverages Modern Hopfield Networks (MHN) for as… ▽ More

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: Accepted at the Eighth International Workshop on Deep Learning on Graphs: Methods and Applications (DLG-KDD 2022), Washington DC

  15. arXiv:2207.09090  [pdf, other

    cs.LG cs.AI eess.SY

    Actor-Critic based Improper Reinforcement Learning

    Authors: Mohammadi Zaki, Avinash Mohan, Aditya Gopalan, Shie Mannor

    Abstract: We consider an improper reinforcement learning setting where a learner is given $M$ base controllers for an unknown Markov decision process, and wishes to combine them optimally to produce a potentially new controller that can outperform each of the base ones. This can be useful in tuning across controllers, learnt possibly in mismatched or simulated environments, to obtain a good controller for a… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2102.08201

  16. arXiv:2207.05194  [pdf, other

    cs.CL

    Towards Neural Numeric-To-Text Generation From Temporal Personal Health Data

    Authors: Jonathan Harris, Mohammed J. Zaki

    Abstract: With an increased interest in the production of personal health technologies designed to track user data (e.g., nutrient intake, step counts), there is now more opportunity than ever to surface meaningful behavioral insights to everyday users in the form of natural language. This knowledge can increase their behavioral awareness and allow them to take action to meet their health goals. It can also… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: 5 pages, 2 figures, 1 table

  17. arXiv:2207.01079  [pdf, other

    cs.CL cond-mat.mtrl-sci cs.IR

    DiSCoMaT: Distantly Supervised Composition Extraction from Tables in Materials Science Articles

    Authors: Tanishq Gupta, Mohd Zaki, Devanshi Khatsuriya, Kausik Hira, N. M. Anoop Krishnan, Mausam

    Abstract: A crucial component in the curation of KB for a scientific domain (e.g., materials science, foods & nutrition, fuels) is information extraction from tables in the domain's published research articles. To facilitate research in this direction, we define a novel NLP task of extracting compositions of materials (e.g., glasses) from tables in materials science papers. The task involves solving several… ▽ More

    Submitted 28 January, 2024; v1 submitted 3 July, 2022; originally announced July 2022.

    Comments: Accepted long paper at ACL 2023 (https://2023.aclweb.org/program/accepted_main_conference/)

  18. arXiv:2206.09336  [pdf, other

    cs.DB

    Efficient Checking of Timed Order Compliance Rules over Graph-encoded Event Logs

    Authors: Nesma M. Zaki, Iman M. A. Helal, Ahmed Awad, Ehab E. Hassanein

    Abstract: Validation of compliance rules against process data is a fundamental functionality for business process management. Over the years, the problem has been addressed for different types of process data, i.e., process models, process event data at runtime, and event logs representing historical execution. Several approaches have been proposed to tackle compliance checking over process logs. These appr… ▽ More

    Submitted 19 June, 2022; originally announced June 2022.

    Comments: 18 pages, 5 figures, 6 tables

    MSC Class: 68

  19. arXiv:2206.06952  [pdf, other

    cs.CL cs.AI cs.LG

    FETILDA: An Effective Framework For Fin-tuned Embeddings For Long Financial Text Documents

    Authors: Bolun "Namir" Xia, Vipula D. Rawte, Mohammed J. Zaki, Aparna Gupta

    Abstract: Unstructured data, especially text, continues to grow rapidly in various domains. In particular, in the financial sphere, there is a wealth of accumulated unstructured financial data, such as the textual disclosure documents that companies submit on a regular basis to regulatory agencies, such as the Securities and Exchange Commission (SEC). These documents are typically very long and tend to cont… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: 10 pages, 9 figures, 7 tables

    ACM Class: I.2.7

  20. arXiv:2111.07198  [pdf, ps, other

    cs.CL cs.IR cs.LG

    Keyphrase Extraction Using Neighborhood Knowledge Based on Word Embeddings

    Authors: Yuchen Liang, Mohammed J. Zaki

    Abstract: Keyphrase extraction is the task of finding several interesting phrases in a text document, which provide a list of the main topics within the document. Most existing graph-based models use co-occurrence links as cohesion indicators to model the relationship of syntactic elements. However, a word may have different forms of expression within the document, and may have several synonyms as well. Sim… ▽ More

    Submitted 13 November, 2021; originally announced November 2021.

  21. arXiv:2110.06208  [pdf, other

    cs.CY eess.SY

    Towards formalization and monitoring of microscopic traffic parameters using temporal logic

    Authors: Mariam Nour, Mohamed H. Zaki

    Abstract: Smart cities are revolutionizing the transportation infrastructure by the integration of technology. However, ensuring that various transportation system components are operating as expected and in a safe manner is a great challenge. In this work, we propose the use of formal methods as a means to specify and reason about the traffic network's complex properties. Formal methods provide a flexible… ▽ More

    Submitted 12 October, 2021; originally announced October 2021.

  22. arXiv:2109.15290  [pdf

    cs.CL cond-mat.mtrl-sci

    MatSciBERT: A Materials Domain Language Model for Text Mining and Information Extraction

    Authors: Tanishq Gupta, Mohd Zaki, N. M. Anoop Krishnan, Mausam

    Abstract: An overwhelmingly large amount of knowledge in the materials domain is generated and stored as text published in peer-reviewed scientific literature. Recent developments in natural language processing, such as bidirectional encoder representations from transformers (BERT) models, provide promising tools to extract information from these texts. However, direct application of these models in the mat… ▽ More

    Submitted 30 September, 2021; originally announced September 2021.

  23. Global Self-Attention as a Replacement for Graph Convolution

    Authors: Md Shamim Hussain, Mohammed J. Zaki, Dharmashankar Subramanian

    Abstract: We propose an extension to the transformer neural network architecture for general-purpose graph learning by adding a dedicated pathway for pairwise structural information, called edge channels. The resultant framework - which we call Edge-augmented Graph Transformer (EGT) - can directly accept, process and output structural information of arbitrary form, which is important for effective learning… ▽ More

    Submitted 3 June, 2022; v1 submitted 6 August, 2021; originally announced August 2021.

    Comments: The accepted version in KDD '22

  24. arXiv:2105.00210  [pdf, other

    cs.LG

    Better than the Best: Gradient-based Improper Reinforcement Learning for Network Scheduling

    Authors: Mohammani Zaki, Avi Mohan, Aditya Gopalan, Shie Mannor

    Abstract: We consider the problem of scheduling in constrained queueing networks with a view to minimizing packet delay. Modern communication systems are becoming increasingly complex, and are required to handle multiple types of traffic with widely varying characteristics such as arrival rates and service times. This, coupled with the need for rapid network deployment, render a bottom up approach of first… ▽ More

    Submitted 1 May, 2021; originally announced May 2021.

    Comments: 4 pages, 5 figures, RLNQ workshop at the SIGMETRICS 2021

  25. arXiv:2102.08201  [pdf, other

    cs.LG eess.SY

    Improper Reinforcement Learning with Gradient-based Policy Optimization

    Authors: Mohammadi Zaki, Avinash Mohan, Aditya Gopalan, Shie Mannor

    Abstract: We consider an improper reinforcement learning setting where a learner is given $M$ base controllers for an unknown Markov decision process, and wishes to combine them optimally to produce a potentially new controller that can outperform each of the base ones. This can be useful in tuning across controllers, learnt possibly in mismatched or simulated environments, to obtain a good controller for a… ▽ More

    Submitted 3 July, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

  26. arXiv:2102.05571  [pdf, other

    cs.CR cs.AI cs.IR cs.LG

    TINKER: A framework for Open source Cyberthreat Intelligence

    Authors: Nidhi Rastogi, Sharmishtha Dutta, Mohammed J. Zaki, Alex Gittens, Charu Aggarwal

    Abstract: Threat intelligence on malware attacks and campaigns is increasingly being shared with other security experts for a cost or for free. Other security analysts use this intelligence to inform them of indicators of compromise, attack techniques, and preventative actions. Security analysts prepare threat analysis reports after investigating an attack, an emerging cyber threat, or a recently discovered… ▽ More

    Submitted 19 January, 2023; v1 submitted 10 February, 2021; originally announced February 2021.

    Comments: 9 pages

  27. arXiv:2101.06887  [pdf, other

    cs.CL cs.LG cs.NE q-bio.NC stat.ML

    Can a Fruit Fly Learn Word Embeddings?

    Authors: Yuchen Liang, Chaitanya K. Ryali, Benjamin Hoover, Leopold Grinberg, Saket Navlakha, Mohammed J. Zaki, Dmitry Krotov

    Abstract: The mushroom body of the fruit fly brain is one of the best studied systems in neuroscience. At its core it consists of a population of Kenyon cells, which receive inputs from multiple sensory modalities. These cells are inhibited by the anterior paired lateral neuron, thus creating a sparse high dimensional representation of the inputs. In this work we study a mathematical formalization of this n… ▽ More

    Submitted 14 March, 2021; v1 submitted 18 January, 2021; originally announced January 2021.

    Comments: Accepted for publication at ICLR 2021

  28. Personalized Food Recommendation as Constrained Question Answering over a Large-scale Food Knowledge Graph

    Authors: Yu Chen, Ananya Subburathinam, Ching-Hua Chen, Mohammed J. Zaki

    Abstract: Food recommendation has become an important means to help guide users to adopt healthy dietary habits. Previous works on food recommendation either i) fail to consider users' explicit requirements, ii) ignore crucial health factors (e.g., allergies and nutrition needs), or iii) do not utilize the rich food knowledge for recommending healthy recipes. To address these limitations, we propose a novel… ▽ More

    Submitted 5 January, 2021; originally announced January 2021.

    Comments: 9 pages. Accepted by WSDM 2021. Final version

  29. arXiv:2101.01508  [pdf

    cs.DL physics.comp-ph physics.data-an

    Looking Through Glass: Knowledge Discovery from Materials Science Literature using Natural Language Processing

    Authors: Vineeth Venugopal, Sourav Sahoo, Mohd Zaki, Manish Agarwal, Nitya Nand Gosvami, N. M. Anoop Krishnan

    Abstract: Most of the knowledge in materials science literature is in the form of unstructured data such as text and images. Here, we present a framework employing natural language processing, which automates text and image comprehension and precision knowledge extraction from inorganic glasses' literature. The abstracts are automatically categorized using latent Dirichlet allocation (LDA), providing a way… ▽ More

    Submitted 5 January, 2021; originally announced January 2021.

    Comments: 17 pages, 5 figures

  30. arXiv:2006.13009  [pdf, other

    cs.LG stat.ML

    Iterative Deep Graph Learning for Graph Neural Networks: Better and Robust Node Embeddings

    Authors: Yu Chen, Lingfei Wu, Mohammed J. Zaki

    Abstract: In this paper, we propose an end-to-end graph learning framework, namely Iterative Deep Graph Learning (IDGL), for jointly and iteratively learning graph structure and graph embedding. The key rationale of IDGL is to learn a better graph structure based on better node embeddings, and vice versa (i.e., better node embeddings based on a better graph structure). Our iterative method dynamically stops… ▽ More

    Submitted 22 October, 2020; v1 submitted 21 June, 2020; originally announced June 2020.

    Comments: 19 pages. Accepted by NeurIPS 2020. Final version

  31. MALOnt: An Ontology for Malware Threat Intelligence

    Authors: Nidhi Rastogi, Sharmishtha Dutta, Mohammed J. Zaki, Alex Gittens, Charu Aggarwal

    Abstract: Malware threat intelligence uncovers deep information about malware, threat actors, and their tactics, Indicators of Compromise(IoC), and vulnerabilities in different platforms from scattered threat sources. This collective information can guide decision making in cyber defense applications utilized by security operation centers(SoCs). In this paper, we introduce an open-source malware ontology -… ▽ More

    Submitted 19 June, 2020; originally announced June 2020.

  32. arXiv:2006.07562  [pdf, other

    cs.LG stat.ML

    Explicit Best Arm Identification in Linear Bandits Using No-Regret Learners

    Authors: Mohammadi Zaki, Avi Mohan, Aditya Gopalan

    Abstract: We study the problem of best arm identification in linearly parameterised multi-armed bandits. Given a set of feature vectors $\mathcal{X}\subset\mathbb{R}^d,$ a confidence parameter $δ$ and an unknown vector $θ^*,$ the goal is to identify $\arg\max_{x\in\mathcal{X}}x^Tθ^*$, with probability at least $1-δ,$ using noisy measurements of the form $x^Tθ^*.$ For this fixed confidence ($δ$-PAC) setting,… ▽ More

    Submitted 13 June, 2020; originally announced June 2020.

  33. Toward Subgraph-Guided Knowledge Graph Question Generation with Graph Neural Networks

    Authors: Yu Chen, Lingfei Wu, Mohammed J. Zaki

    Abstract: Knowledge graph (KG) question generation (QG) aims to generate natural language questions from KGs and target answers. Previous works mostly focus on a simple setting which is to generate questions from a single KG triple. In this work, we focus on a more realistic setting where we aim to generate questions from a KG subgraph and target answers. In addition, most of previous works built on either… ▽ More

    Submitted 30 April, 2023; v1 submitted 13 April, 2020; originally announced April 2020.

    Comments: Accepted by TNNLS 2023

  34. arXiv:2004.00071  [pdf, ps, other

    cs.AI cs.IR

    Personal Health Knowledge Graphs for Patients

    Authors: Nidhi Rastogi, Mohammed J. Zaki

    Abstract: Existing patient data analytics platforms fail to incorporate information that has context, is personal, and topical to patients. For a recommendation system to give a suitable response to a query or to derive meaningful insights from patient data, it should consider personal information about the patient's health history, including but not limited to their preferences, locations, and life choices… ▽ More

    Submitted 7 May, 2020; v1 submitted 31 March, 2020; originally announced April 2020.

    Comments: 3 pages, workshop paper

    ACM Class: I.2.4

  35. arXiv:2003.13721  [pdf, other

    cs.CL cs.LG

    Amharic Abstractive Text Summarization

    Authors: Amr M. Zaki, Mahmoud I. Khalil, Hazem M. Abbas

    Abstract: Text Summarization is the task of condensing long text into just a handful of sentences. Many approaches have been proposed for this task, some of the very first were building statistical models (Extractive Methods) capable of selecting important words and copying them to the output, however these models lacked the ability to paraphrase sentences, as they simply select important words without actu… ▽ More

    Submitted 30 March, 2020; originally announced March 2020.

    Comments: content 3 pages, reference 2 pages, 2 figures, presented to AfricaNLP workshop ICLR 2020

  36. arXiv:2003.09530  [pdf, other

    cs.CL cs.DB

    A Framework for Generating Explanations from Temporal Personal Health Data

    Authors: Jonathan J. Harris, Ching-Hua Chen, Mohammed J. Zaki

    Abstract: Whereas it has become easier for individuals to track their personal health data (e.g., heart rate, step count, food log), there is still a wide chasm between the collection of data and the generation of meaningful explanations to help users better understand what their data means to them. With an increased comprehension of their data, users will be able to act upon the newfound information and wo… ▽ More

    Submitted 9 March, 2021; v1 submitted 20 March, 2020; originally announced March 2020.

    Comments: 41 pages, 24 figures. To appear in ACM Transactions on Computing for Healthcare

  37. arXiv:1912.07832  [pdf, other

    cs.LG stat.ML

    Deep Iterative and Adaptive Learning for Graph Neural Networks

    Authors: Yu Chen, Lingfei Wu, Mohammed J. Zaki

    Abstract: In this paper, we propose an end-to-end graph learning framework, namely Deep Iterative and Adaptive Learning for Graph Neural Networks (DIAL-GNN), for jointly learning the graph structure and graph embeddings simultaneously. We first cast the graph structure learning problem as a similarity metric learning problem and leverage an adapted graph regularization for controlling smoothness, connectivi… ▽ More

    Submitted 17 December, 2019; originally announced December 2019.

    Comments: 6 pages. Accepted at the AAAI 2020 Workshop on Deep Learning on Graphs: Methodologies and Applications (AAAI DLGMA 2020). Final Version

  38. arXiv:1911.01695  [pdf, other

    cs.LG math.OC stat.ML

    Towards Optimal and Efficient Best Arm Identification in Linear Bandits

    Authors: Mohammadi Zaki, Avinash Mohan, Aditya Gopalan

    Abstract: We give a new algorithm for best arm identification in linearly parameterised bandits in the fixed confidence setting. The algorithm generalises the well-known LUCB algorithm of Kalyanakrishnan et al. (2012) by playing an arm which minimises a suitable notion of geometric overlap of the statistical confidence set for the unknown parameter, and is fully adaptive and computationally efficient as com… ▽ More

    Submitted 7 November, 2019; v1 submitted 5 November, 2019; originally announced November 2019.

  39. arXiv:1910.08832  [pdf, other

    cs.CL

    Natural Question Generation with Reinforcement Learning Based Graph-to-Sequence Model

    Authors: Yu Chen, Lingfei Wu, Mohammed J. Zaki

    Abstract: Natural question generation (QG) aims to generate questions from a passage and an answer. In this paper, we propose a novel reinforcement learning (RL) based graph-to-sequence (Graph2Seq) model for QG. Our model consists of a Graph2Seq generator where a novel Bidirectional Gated Graph Neural Network is proposed to embed the passage, and a hybrid evaluator with a mixed objective combining both cros… ▽ More

    Submitted 19 October, 2019; originally announced October 2019.

    Comments: 4 pages. Accepted at the NeurIPS 2019 Workshop on Graph Representation Learning (NeurIPS GRL 2019). Final Version. arXiv admin note: substantial text overlap with arXiv:1908.04942

  40. arXiv:1908.04942  [pdf, other

    cs.CL

    Reinforcement Learning Based Graph-to-Sequence Model for Natural Question Generation

    Authors: Yu Chen, Lingfei Wu, Mohammed J. Zaki

    Abstract: Natural question generation (QG) aims to generate questions from a passage and an answer. Previous works on QG either (i) ignore the rich structure information hidden in text, (ii) solely rely on cross-entropy loss that leads to issues like exposure bias and inconsistency between train/test measurement, or (iii) fail to fully exploit the answer information. To address these limitations, in this pa… ▽ More

    Submitted 27 August, 2020; v1 submitted 13 August, 2019; originally announced August 2019.

    Comments: 17 pages. Accepted by ICLR 2020. Final version (fix typo in figure)

  41. arXiv:1908.00059  [pdf, other

    cs.CL

    GraphFlow: Exploiting Conversation Flow with Graph Neural Networks for Conversational Machine Comprehension

    Authors: Yu Chen, Lingfei Wu, Mohammed J. Zaki

    Abstract: Conversational machine comprehension (MC) has proven significantly more challenging compared to traditional MC since it requires better utilization of conversation history. However, most existing approaches do not effectively capture conversation history and thus have trouble handling questions involving coreference or ellipsis. Moreover, when reasoning over passage text, most of them simply treat… ▽ More

    Submitted 15 July, 2020; v1 submitted 31 July, 2019; originally announced August 2019.

    Comments: 7 pages. Accepted by IJCAI 2020. Final Version. The SOLE copyright holder is IJCAI (https://www.ijcai.org), all rights reserved

  42. arXiv:1905.06076  [pdf, other

    stat.ML cs.LG

    Expressive Priors in Bayesian Neural Networks: Kernel Combinations and Periodic Functions

    Authors: Tim Pearce, Russell Tsuchida, Mohamed Zaki, Alexandra Brintrup, Andy Neely

    Abstract: A simple, flexible approach to creating expressive priors in Gaussian process (GP) models makes new kernels from a combination of basic kernels, e.g. summing a periodic and linear kernel can capture seasonal variation with a long term trend. Despite a well-studied link between GPs and Bayesian neural networks (BNNs), the BNN analogue of this has not yet been explored. This paper derives BNN archit… ▽ More

    Submitted 28 June, 2019; v1 submitted 15 May, 2019; originally announced May 2019.

    Journal ref: The 35th Conference on Uncertainty in Artificial Intelligence (UAI 2019)

  43. arXiv:1903.02188  [pdf, other

    cs.CL

    Bidirectional Attentive Memory Networks for Question Answering over Knowledge Bases

    Authors: Yu Chen, Lingfei Wu, Mohammed J. Zaki

    Abstract: When answering natural language questions over knowledge bases (KBs), different question components and KB aspects play different roles. However, most existing embedding-based methods for knowledge base question answering (KBQA) ignore the subtle inter-relationships between the question and the KB (e.g., entity types, relation paths and context). In this work, we propose to directly model the two-… ▽ More

    Submitted 28 May, 2019; v1 submitted 6 March, 2019; originally announced March 2019.

    Comments: 11 pages. Accepted as NAACL 2019 Long Paper. Final Version

  44. arXiv:1901.02620  [pdf

    eess.IV cs.CV

    Fast CNN-Based Object Tracking Using Localization Layers and Deep Features Interpolation

    Authors: Al-Hussein A. El-Shafie, Mohamed Zaki, Serag El-Din Habib

    Abstract: Object trackers based on Convolution Neural Network (CNN) have achieved state-of-the-art performance on recent tracking benchmarks, while they suffer from slow computational speed. The high computational load arises from the extraction of the feature maps of the candidate and training patches in every video frame. The candidate and training patches are typically placed randomly around the previous… ▽ More

    Submitted 9 January, 2019; originally announced January 2019.

  45. arXiv:1811.12188  [pdf, other

    cs.LG stat.ML

    Bayesian Neural Network Ensembles

    Authors: Tim Pearce, Mohamed Zaki, Andy Neely

    Abstract: Ensembles of neural networks (NNs) have long been used to estimate predictive uncertainty; a small number of NNs are trained from different initialisations and sometimes on differing versions of the dataset. The variance of the ensemble's predictions is interpreted as its epistemic uncertainty. The appeal of ensembling stems from being a collection of regular NNs - this makes them both scalable an… ▽ More

    Submitted 27 November, 2018; originally announced November 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1810.05546

  46. arXiv:1810.05546  [pdf, other

    stat.ML cs.LG

    Uncertainty in Neural Networks: Approximately Bayesian Ensembling

    Authors: Tim Pearce, Felix Leibfried, Alexandra Brintrup, Mohamed Zaki, Andy Neely

    Abstract: Understanding the uncertainty of a neural network's (NN) predictions is essential for many purposes. The Bayesian framework provides a principled approach to this, however applying it to NNs is challenging due to large numbers of parameters and data. Ensembling NNs provides an easily implementable, scalable method for uncertainty quantification, however, it has been criticised for not being Bayesi… ▽ More

    Submitted 26 February, 2020; v1 submitted 12 October, 2018; originally announced October 2018.

    Comments: Please cite as published in AISTATS 2020

    Journal ref: The 23rd International Conference on Artificial Intelligence and Statistics, AISTATS 2020

  47. arXiv:1805.11324  [pdf, other

    stat.ML cs.AI cs.LG

    Bayesian Inference with Anchored Ensembles of Neural Networks, and Application to Exploration in Reinforcement Learning

    Authors: Tim Pearce, Nicolas Anastassacos, Mohamed Zaki, Andy Neely

    Abstract: The use of ensembles of neural networks (NNs) for the quantification of predictive uncertainty is widespread. However, the current justification is intuitive rather than analytical. This work proposes one minor modification to the normal ensembling methodology, which we prove allows the ensemble to perform Bayesian inference, hence converging to the corresponding Gaussian Process as both the total… ▽ More

    Submitted 2 July, 2018; v1 submitted 29 May, 2018; originally announced May 2018.

  48. arXiv:1706.06322  [pdf

    cs.NI

    Evaluation of energy consumption of reactive and proactive routing protocols in MANET

    Authors: Mohamad T. Sultan, Salim M. Zaki

    Abstract: Mobile Ad hoc Network (MANET) is a distributed, infrastructure-less and decentralized network. A routing protocol in MANET is used to find routes between mobile nodes to facilitate communication within the network. Numerous routing protocols have been proposed for MANET. Those routing protocols are designed to adaptively accommodate for dynamic unpredictable changes in network's topology. The mobi… ▽ More

    Submitted 20 June, 2017; originally announced June 2017.

  49. arXiv:1705.02033  [pdf, other

    stat.ML cs.LG

    KATE: K-Competitive Autoencoder for Text

    Authors: Yu Chen, Mohammed J. Zaki

    Abstract: Autoencoders have been successful in learning meaningful representations from image datasets. However, their performance on text datasets has not been widely studied. Traditional autoencoders tend to learn possibly trivial representations of text documents due to their confounding properties such as high-dimensionality, sparsity and power-law word distributions. In this paper, we propose a novel k… ▽ More

    Submitted 4 June, 2017; v1 submitted 4 May, 2017; originally announced May 2017.

    Comments: 10 pages, KDD'17

  50. arXiv:1609.01508  [pdf, ps, other

    cs.LG

    Low-rank Bandits with Latent Mixtures

    Authors: Aditya Gopalan, Odalric-Ambrym Maillard, Mohammadi Zaki

    Abstract: We study the task of maximizing rewards from recommending items (actions) to users sequentially interacting with a recommender system. Users are modeled as latent mixtures of C many representative user classes, where each class specifies a mean reward profile across actions. Both the user features (mixture distribution over classes) and the item features (mean reward vector per class) are unknown… ▽ More

    Submitted 6 September, 2016; originally announced September 2016.