Zum Hauptinhalt springen

Showing 1–49 of 49 results for author: Jannesari, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.11798  [pdf, other

    cs.CL cs.DC cs.LG

    PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation

    Authors: Branden Butler, Sixing Yu, Arya Mazaheri, Ali Jannesari

    Abstract: Inference of Large Language Models (LLMs) across computer clusters has become a focal point of research in recent times, with many acceleration techniques taking inspiration from CPU speculative execution. These techniques reduce bottlenecks associated with memory bandwidth, but also increase end-to-end latency per inference run, requiring high speculation acceptance rates to improve performance.… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 11 pages, submitted to SC24 conference

  2. arXiv:2407.07321  [pdf, other

    cs.CL

    RAG vs. Long Context: Examining Frontier Large Language Models for Environmental Review Document Comprehension

    Authors: Hung Phan, Anurag Acharya, Sarthak Chaturvedi, Shivam Sharma, Mike Parker, Dan Nally, Ali Jannesari, Karl Pazdernik, Mahantesh Halappanavar, Sai Munikoti, Sameera Horawalavithana

    Abstract: Large Language Models (LLMs) have been applied to many research problems across various domains. One of the applications of LLMs is providing question-answering systems that cater to users from different fields. The effectiveness of LLM-based question-answering systems has already been established at an acceptable level for users posing questions in popular and public domains such as trivia and li… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 14 pages

  3. arXiv:2407.02238  [pdf, other

    cs.DC cs.LG cs.PF

    MIREncoder: Multi-modal IR-based Pretrained Embeddings for Performance Optimizations

    Authors: Akash Dutta, Ali Jannesari

    Abstract: One of the primary areas of interest in High Performance Computing is the improvement of performance of parallel workloads. Nowadays, compilable source code-based optimization tasks that employ deep learning often exploit LLVM Intermediate Representations (IRs) for extracting features from source code. Most such works target specific tasks, or are designed with a pre-defined set of heuristics. So… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 12 pages, 6 figures, 9 tables, PACT '24 conference

  4. arXiv:2406.13881  [pdf, other

    cs.DC

    Static Generation of Efficient OpenMP Offload Data Mappings

    Authors: Luke Marzen, Akash Dutta, Ali Jannesari

    Abstract: Increasing heterogeneity in HPC architectures and compiler advancements have led to OpenMP being frequently used to enable computations on heterogeneous devices. However, the efficient movement of data on heterogeneous computing platforms is crucial for achieving high utilization. Programmers must explicitly map data between the host and connected accelerator devices to achieve efficient data move… ▽ More

    Submitted 26 August, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepted to the 2024 International Conference for High Performance Computing, Networking, Storage, and Analysis (SC24)

  5. arXiv:2404.15182  [pdf, other

    cs.LG cs.AI

    FLoRA: Enhancing Vision-Language Models with Parameter-Efficient Federated Learning

    Authors: Duy Phuong Nguyen, J. Pablo Munoz, Ali Jannesari

    Abstract: In the rapidly evolving field of artificial intelligence, multimodal models, e.g., integrating vision and language into visual-language models (VLMs), have become pivotal for many applications, ranging from image captioning to multimodal search engines. Among these models, the Contrastive Language-Image Pre-training (CLIP) model has demonstrated remarkable performance in understanding and generati… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 10 pages, 11 figures

  6. arXiv:2404.06638  [pdf, other

    cond-mat.mtrl-sci cs.CV

    SAM-I-Am: Semantic Boosting for Zero-shot Atomic-Scale Electron Micrograph Segmentation

    Authors: Waqwoya Abebe, Jan Strube, Luanzheng Guo, Nathan R. Tallent, Oceane Bel, Steven Spurgeon, Christina Doty, Ali Jannesari

    Abstract: Image segmentation is a critical enabler for tasks ranging from medical diagnostics to autonomous driving. However, the correct segmentation semantics - where are boundaries located? what segments are logically similar? - change depending on the domain, such that state-of-the-art foundation models can generate meaningless and incorrect results. Moreover, in certain domains, fine-tuning and retrain… ▽ More

    Submitted 10 May, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  7. arXiv:2403.07231  [pdf, other

    cs.CV

    Learn and Search: An Elegant Technique for Object Lookup using Contrastive Learning

    Authors: Chandan Kumar, Jansel Herrera-Gerena, John Just, Matthew Darr, Ali Jannesari

    Abstract: The rapid proliferation of digital content and the ever-growing need for precise object recognition and segmentation have driven the advancement of cutting-edge techniques in the field of object classification and segmentation. This paper introduces "Learn and Search", a novel approach for object lookup that leverages the power of contrastive learning to enhance the efficiency and effectiveness of… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 9 pages, 4 figures

  8. arXiv:2403.02518  [pdf, other

    cs.DC cs.SE

    MPI Errors Detection using GNN Embedding and Vector Embedding over LLVM IR

    Authors: Jad El Karchi, Hanze Chen, Ali TehraniJamsaz, Ali Jannesari, Mihail Popov, Emmanuelle Saillard

    Abstract: Identifying errors in parallel MPI programs is a challenging task. Despite the growing number of verification tools, debugging parallel programs remains a significant challenge. This paper is the first to utilize embedding and deep learning graph neural networks (GNNs) to tackle the issue of identifying bugs in MPI programs. Specifically, we have designed and developed two models that can determin… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  9. arXiv:2402.13465  [pdf, other

    cs.CV

    Unsupervised learning based object detection using Contrastive Learning

    Authors: Chandan Kumar, Jansel Herrera-Gerena, John Just, Matthew Darr, Ali Jannesari

    Abstract: Training image-based object detectors presents formidable challenges, as it entails not only the complexities of object detection but also the added intricacies of precisely localizing objects within potentially diverse and noisy environments. However, the collection of imagery itself can often be straightforward; for instance, cameras mounted in vehicles can effortlessly capture vast amounts of d… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: 10 pages, 5 figures

  10. arXiv:2402.02018  [pdf, other

    cs.LG

    The Landscape and Challenges of HPC Research and LLMs

    Authors: Le Chen, Nesreen K. Ahmed, Akash Dutta, Arijit Bhattacharjee, Sixing Yu, Quazi Ishtiaque Mahmud, Waqwoya Abebe, Hung Phan, Aishwarya Sarkar, Branden Butler, Niranjan Hasabnis, Gal Oren, Vy A. Vo, Juan Pablo Munoz, Theodore L. Willke, Tim Mattson, Ali Jannesari

    Abstract: Recently, language models (LMs), especially large language models (LLMs), have revolutionized the field of deep learning. Both encoder-decoder models and prompt-based techniques have shown immense potential for natural language processing and code-based tasks. Over the past several years, many research labs and institutions have invested heavily in high-performance computing, approaching or breach… ▽ More

    Submitted 6 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  11. arXiv:2401.16445  [pdf, other

    cs.SE cs.DC cs.LG

    OMPGPT: A Generative Pre-trained Transformer Model for OpenMP

    Authors: Le Chen, Arijit Bhattacharjee, Nesreen Ahmed, Niranjan Hasabnis, Gal Oren, Vy Vo, Ali Jannesari

    Abstract: Large language models (LLMs)such as ChatGPT have significantly advanced the field of Natural Language Processing (NLP). This trend led to the development of code-based large language models such as StarCoder, WizardCoder, and CodeLlama, which are trained extensively on vast repositories of code and programming languages. While the generic abilities of these code LLMs are useful for many programmer… ▽ More

    Submitted 21 June, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

  12. arXiv:2312.17430  [pdf, other

    cs.LG

    LEFL: Low Entropy Client Sampling in Federated Learning

    Authors: Waqwoya Abebe, Pablo Munoz, Ali Jannesari

    Abstract: Federated learning (FL) is a machine learning paradigm where multiple clients collaborate to optimize a single global model using their private data. The global model is maintained by a central server that orchestrates the FL training process through a series of training rounds. In each round, the server samples clients from a client pool before sending them its latest global model parameters for… ▽ More

    Submitted 13 February, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

  13. arXiv:2311.06505  [pdf, other

    cs.LG

    CompCodeVet: A Compiler-guided Validation and Enhancement Approach for Code Dataset

    Authors: Le Chen, Arijit Bhattacharjee, Nesreen K. Ahmed, Niranjan Hasabnis, Gal Oren, Bin Lei, Ali Jannesari

    Abstract: Large language models (LLMs) have become increasingly prominent in academia and industry due to their remarkable performance in diverse applications. As these models evolve with increasing parameters, they excel in tasks like sentiment analysis and machine translation. However, even models with billions of parameters face challenges in tasks demanding multi-step reasoning. Code generation and comp… ▽ More

    Submitted 11 November, 2023; originally announced November 2023.

  14. arXiv:2310.04047  [pdf, other

    cs.LG

    AUTOPARLLM: GNN-Guided Automatic Code Parallelization using Large Language Models

    Authors: Quazi Ishtiaque Mahmud, Ali TehraniJamsaz, Hung D Phan, Nesreen K. Ahmed, Ali Jannesari

    Abstract: Parallelizing sequentially written programs is a challenging task. Even experienced developers need to spend considerable time finding parallelism opportunities and then actually writing parallel versions of sequentially written programs. To address this issue, we present AUTOPARLLM, a framework for automatically discovering parallelism and generating the parallel version of the sequentially writt… ▽ More

    Submitted 8 October, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: 10 pages

  15. arXiv:2310.00247  [pdf, other

    cs.LG cs.DC

    Bridging the Gap Between Foundation Models and Heterogeneous Federated Learning

    Authors: Sixing Yu, J. Pablo Muñoz, Ali Jannesari

    Abstract: Federated learning (FL) offers privacy-preserving decentralized machine learning, optimizing models at edge clients without sharing private data. Simultaneously, foundation models (FMs) have gained traction in the artificial intelligence (AI) community due to their exceptional performance across various tasks. However, integrating FMs into FL presents challenges, primarily due to their substantial… ▽ More

    Submitted 4 October, 2023; v1 submitted 30 September, 2023; originally announced October 2023.

  16. Evaluating and Optimizing the Effectiveness of Neural Machine Translation in Supporting Code Retrieval Models: A Study on the CAT Benchmark

    Authors: Hung Phan, Ali Jannesari

    Abstract: Neural Machine Translation (NMT) is widely applied in software engineering tasks. The effectiveness of NMT for code retrieval relies on the ability to learn from the sequence of tokens in the source language to the sequence of tokens in the target language. While NMT performs well in pseudocode-to-code translation, it might have challenges in learning to translate from natural language query to so… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: Accepted as Full Paper in Proceedings of the 32nd ACM International Conference on Information and Knowledge Management (CIKM), Birmingham, UK, October 2023

  17. arXiv:2306.00210  [pdf, other

    cs.PL cs.DC cs.LG

    PERFOGRAPH: A Numerical Aware Program Graph Representation for Performance Optimization and Program Analysis

    Authors: Ali TehraniJamsaz, Quazi Ishtiaque Mahmud, Le Chen, Nesreen K. Ahmed, Ali Jannesari

    Abstract: The remarkable growth and significant success of machine learning have expanded its applications into programming languages and program analysis. However, a key challenge in adopting the latest machine learning methods is the representation of programming languages, which directly impacts the ability of machine learning methods to reason about programs. The absence of numerical awareness, aggregat… ▽ More

    Submitted 29 November, 2023; v1 submitted 31 May, 2023; originally announced June 2023.

  18. arXiv:2305.11414  [pdf, other

    cs.LG cs.AI cs.CR

    Federated Foundation Models: Privacy-Preserving and Collaborative Learning for Large Models

    Authors: Sixing Yu, J. Pablo Muñoz, Ali Jannesari

    Abstract: Foundation Models (FMs), such as LLaMA, BERT, GPT, ViT, and CLIP, have demonstrated remarkable success in a wide range of applications, driven by their ability to leverage vast amounts of data for pre-training. However, optimizing FMs often requires access to sensitive data, raising privacy concerns and limiting their applicability in many domains. In this paper, we propose the Federated Foundatio… ▽ More

    Submitted 19 March, 2024; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: Accepted at the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

  19. arXiv:2305.05779  [pdf, other

    cs.LG cs.SE

    Learning to Parallelize with OpenMP by Augmented Heterogeneous AST Representation

    Authors: Le Chen, Quazi Ishtiaque Mahmud, Hung Phan, Nesreen K. Ahmed, Ali Jannesari

    Abstract: Detecting parallelizable code regions is a challenging task, even for experienced developers. Numerous recent studies have explored the use of machine learning for code analysis and program synthesis, including parallelization, in light of the success of machine learning in natural language processing. However, applying machine learning techniques to parallelism detection presents several challeng… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

  20. arXiv:2305.00875  [pdf, other

    cs.SE cs.AI cs.LG

    Redundancy and Concept Analysis for Code-trained Language Models

    Authors: Arushi Sharma, Zefu Hu, Christopher Quinn, Ali Jannesari

    Abstract: Code-trained language models have proven to be highly effective for various code intelligence tasks. However, they can be challenging to train and deploy for many software engineering applications due to computational bottlenecks and memory constraints. Implementing effective strategies to address these issues requires a better understanding of these 'black box' models. In this paper, we perform t… ▽ More

    Submitted 15 February, 2024; v1 submitted 1 May, 2023; originally announced May 2023.

    Comments: 4 figures, 6 tables

  21. arXiv:2304.12568  [pdf, other

    cs.DC cs.LG cs.PF

    Performance Optimization using Multimodal Modeling and Heterogeneous GNN

    Authors: Akash Dutta, Jordi Alcaraz, Ali TehraniJamsaz, Eduardo Cesar, Anna Sikora, Ali Jannesari

    Abstract: Growing heterogeneity and configurability in HPC architectures has made auto-tuning applications and runtime parameters on these systems very complex. Users are presented with a multitude of options to configure parameters. In addition to application specific solutions, a common approach is to use general purpose search strategies, which often might not identify the best configurations or their ti… ▽ More

    Submitted 27 April, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: 14 pages, 9 figures, 3 tables

  22. arXiv:2304.04658  [pdf, other

    cs.SE

    GraphBinMatch: Graph-based Similarity Learning for Cross-Language Binary and Source Code Matching

    Authors: Ali TehraniJamsaz, Hanze Chen, Ali Jannesari

    Abstract: Matching binary to source code and vice versa has various applications in different fields, such as computer security, software engineering, and reverse engineering. Even though there exist methods that try to match source code with binary code to accelerate the reverse engineering process, most of them are designed to focus on one programming language. However, in real life, programs are develope… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

  23. arXiv:2304.03487  [pdf, other

    cs.DC cs.LG cs.PF

    ParaGraph: Weighted Graph Representation for Performance Optimization of HPC Kernels

    Authors: Ali TehraniJamsaz, Alok Mishra, Akash Dutta, Abid M. Malik, Barbara Chapman, Ali Jannesari

    Abstract: GPU-based HPC clusters are attracting more scientific application developers due to their extensive parallelism and energy efficiency. In order to achieve portability among a variety of multi/many core architectures, a popular choice for an application developer is to utilize directive-based parallel programming models, such as OpenMP. However, even with OpenMP, the developer must choose from amon… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

  24. arXiv:2302.11467  [pdf, other

    cs.DC cs.LG cs.PF

    Power Constrained Autotuning using Graph Neural Networks

    Authors: Akash Dutta, Jee Choi, Ali Jannesari

    Abstract: Recent advances in multi and many-core processors have led to significant improvements in the performance of scientific computing applications. However, the addition of a large number of complex cores have also increased the overall power consumption, and power has become a first-order design constraint in modern processors. While we can limit power consumption by simply applying software-based po… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

    Comments: 11 pages, 7 figures, 2 tables, IPDPS '23

  25. arXiv:2301.11787  [pdf, other

    cs.LG cs.DC cs.PF

    Accelerating Domain-aware Deep Learning Models with Distributed Training

    Authors: Aishwarya Sarkar, Chaoqun Lu, Ali Jannesari

    Abstract: Recent advances in data-generating techniques led to an explosive growth of geo-spatiotemporal data. In domains such as hydrology, ecology, and transportation, interpreting the complex underlying patterns of spatiotemporal interactions with the help of deep learning techniques hence becomes the need of the hour. However, applying deep learning techniques without domain-specific knowledge tends to… ▽ More

    Submitted 25 January, 2023; originally announced January 2023.

    Comments: Accepted for Workshop on Multi-scale, Multi-physic and Coupled Problems on Highly Parallel Systems, HPC Asia 2023, 27 February - 2 March 2023, Singapore

  26. arXiv:2212.08743  [pdf, other

    cs.LG cs.DC

    Addressing Data Heterogeneity in Decentralized Learning via Topological Pre-processing

    Authors: Waqwoya Abebe, Ali Jannesari

    Abstract: Recently, local peer topology has been shown to influence the overall convergence of decentralized learning (DL) graphs in the presence of data heterogeneity. In this paper, we demonstrate the advantages of constructing a proxy-based locally heterogeneous DL topology to enhance convergence and maintain data privacy. In particular, we propose a novel peer clumping strategy to efficiently cluster pe… ▽ More

    Submitted 16 December, 2022; originally announced December 2022.

  27. arXiv:2212.06352  [pdf, other

    cs.DC

    Towards Seamless Management of AI Models in High-Performance Computing

    Authors: Sixing Yu, Murali Emani, Chunhua Liao, Pei-Hung Lin, Tristan Vanderbruggen, Xipeng Shen, Ali Jannesari

    Abstract: With the increasing prevalence of artificial intelligence (AI) in diverse science/engineering communities, AI models emerge on an unprecedented scale among various domains. However, given the complexity and diversity of the software and hardware environments, reusing AI artifacts (models and datasets) is extremely challenging, especially with AI-driven science applications. Building an ecosystem t… ▽ More

    Submitted 12 December, 2022; originally announced December 2022.

    Comments: Accepted at the 2nd Annual AAAI Workshop on AI to Accelerate Science and Engineering (AI2ASE)

  28. arXiv:2211.05716  [pdf, other

    cs.LG cs.CV

    Resource-Aware Heterogeneous Federated Learning using Neural Architecture Search

    Authors: Sixing Yu, J. Pablo Muñoz, Ali Jannesari

    Abstract: Federated Learning (FL) is extensively used to train AI/ML models in distributed and privacy-preserving settings. Participant edge devices in FL systems typically contain non-independent and identically distributed (Non-IID) private data and unevenly distributed computational resources. Preserving user data privacy while optimizing AI/ML models in a heterogeneous federated network requires us to a… ▽ More

    Submitted 30 April, 2024; v1 submitted 9 November, 2022; originally announced November 2022.

    Comments: Accepted at the 30th International European Conference on Parallel and Distributed Computing (Euro-Par 2024)

  29. arXiv:2208.07978  [pdf, other

    cs.DC cs.CR cs.LG

    Enhancing Heterogeneous Federated Learning with Knowledge Extraction and Multi-Model Fusion

    Authors: Duy Phuong Nguyen, Sixing Yu, J. Pablo Muñoz, Ali Jannesari

    Abstract: Concerned with user data privacy, this paper presents a new federated learning (FL) method that trains machine learning models on edge devices without accessing sensitive data. Traditional FL methods, although privacy-protective, fail to manage model heterogeneity and incur high communication costs due to their reliance on aggregation methods. To address this limitation, we propose a resource-awar… ▽ More

    Submitted 30 September, 2023; v1 submitted 16 August, 2022; originally announced August 2022.

    Comments: Accept at the 4th workshop on Artificial Intelligence and Machine Learning for Scientific Applications (AI4S), SC 23

  30. Heterogeneous Graph Neural Networks for Software Effort Estimation

    Authors: Hung Phan, Ali Jannesari

    Abstract: Software effort can be measured by story point [35]. Current approaches for automatically estimating story points focus on applying pre-trained embedding models and deep learning for text regression to solve this problem which required expensive embedding models. We propose HeteroSP, a tool for estimating story points from textual input of Agile software project issues. We select GPT2SP [12] and D… ▽ More

    Submitted 30 June, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: Accepted in the Technical Papers Track of the 16th International Symposium on Empirical Software Engineering and Measurement, 2022 (ESEM 2022)

  31. arXiv:2203.03062  [pdf, other

    cs.SE cs.AI

    Story Point Effort Estimation by Text Level Graph Neural Network

    Authors: Hung Phan, Ali Jannesari

    Abstract: Estimating the software projects' efforts developed by agile methods is important for project managers or technical leads. It provides a summary as a first view of how many hours and developers are required to complete the tasks. There are research works on automatic predicting the software efforts, including Term Frequency Inverse Document Frequency (TFIDF) as the traditional approach for this pr… ▽ More

    Submitted 14 March, 2022; v1 submitted 6 March, 2022; originally announced March 2022.

    Comments: accepted at The 1st International Workshop on Natural Language-based Software Engineering (to appear)

  32. arXiv:2203.00611  [pdf, other

    cs.DC cs.AI cs.LG

    Learning Intermediate Representations using Graph Neural Networks for NUMA and Prefetchers Optimization

    Authors: Ali TehraniJamsaz, Mihail Popov, Akash Dutta, Emmanuelle Saillard, Ali Jannesari

    Abstract: There is a large space of NUMA and hardware prefetcher configurations that can significantly impact the performance of an application. Previous studies have demonstrated how a model can automatically select configurations based on the dynamic properties of the code to achieve speedups. This paper demonstrates how the static Intermediate Representation (IR) of the code can guide NUMA/prefetcher opt… ▽ More

    Submitted 1 March, 2022; originally announced March 2022.

  33. arXiv:2112.00847  [pdf, other

    cs.CV cs.LG

    CLAWS: Contrastive Learning with hard Attention and Weak Supervision

    Authors: Jansel Herrera-Gerena, Ramakrishnan Sundareswaran, John Just, Matthew Darr, Ali Jannesari

    Abstract: Learning effective visual representations without human supervision is a long-standing problem in computer vision. Recent advances in self-supervised learning algorithms have utilized contrastive learning, with methods such as SimCLR, which applies a composition of augmentations to an image, and minimizes a contrastive loss between the two augmented images. In this paper, we present CLAWS, an anno… ▽ More

    Submitted 31 January, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

  34. arXiv:2111.14345  [pdf, other

    cs.LG

    SPATL: Salient Parameter Aggregation and Transfer Learning for Heterogeneous Clients in Federated Learning

    Authors: Sixing Yu, Phuong Nguyen, Waqwoya Abebe, Wei Qian, Ali Anwar, Ali Jannesari

    Abstract: Federated learning~(FL) facilitates the training and deploying AI models on edge devices. Preserving user data privacy in FL introduces several challenges, including expensive communication costs, limited resources, and data heterogeneity. In this paper, we propose SPATL, an FL method that addresses these issues by: (a) introducing a salient parameter selection agent and communicating selected par… ▽ More

    Submitted 26 August, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: Accepted at The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC 22)

  35. arXiv:2110.00841  [pdf, other

    cs.LG

    Transfer Learning Approaches for Knowledge Discovery in Grid-based Geo-Spatiotemporal Data

    Authors: Aishwarya Sarkar, Jien Zhang, Chaoqun Lu, Ali Jannesari

    Abstract: Extracting and meticulously analyzing geo-spatiotemporal features is crucial to recognize intricate underlying causes of natural events, such as floods. Limited evidence about hidden factors leading to climate change makes it challenging to predict regional water discharge accurately. In addition, the explosive growth in complex geo-spatiotemporal environment data that requires repeated learning b… ▽ More

    Submitted 1 November, 2021; v1 submitted 2 October, 2021; originally announced October 2021.

  36. arXiv:2109.12714  [pdf, other

    cs.LG cs.CV

    Cluster Analysis with Deep Embeddings and Contrastive Learning

    Authors: Ramakrishnan Sundareswaran, Jansel Herrera-Gerena, John Just, Ali Jannesari

    Abstract: Unsupervised disentangled representation learning is a long-standing problem in computer vision. This work proposes a novel framework for performing image clustering from deep embeddings by combining instance-level contrastive learning with a deep embedding based cluster center predictor. Our approach jointly learns representations and predicts cluster centers in an end-to-end manner. This is acco… ▽ More

    Submitted 2 October, 2021; v1 submitted 26 September, 2021; originally announced September 2021.

  37. arXiv:2109.02145  [pdf, other

    cs.LG

    Temporal Shift Reinforcement Learning

    Authors: Deepak George Thomas, Tichakorn Wongpiromsarn, Ali Jannesari

    Abstract: The function approximators employed by traditional image-based Deep Reinforcement Learning (DRL) algorithms usually lack a temporal learning component and instead focus on learning the spatial component. We propose a technique, Temporal Shift Reinforcement Learning (TSRL), wherein both temporal, as well as spatial components are jointly learned. Moreover, TSRL does not require additional parameter… ▽ More

    Submitted 26 October, 2021; v1 submitted 5 September, 2021; originally announced September 2021.

  38. arXiv:2106.06921  [pdf, other

    cs.LG cs.CV

    Heterogeneous Federated Learning using Dynamic Model Pruning and Adaptive Gradient

    Authors: Sixing Yu, Phuong Nguyen, Ali Anwar, Ali Jannesari

    Abstract: Federated Learning (FL) has emerged as a new paradigm for training machine learning models distributively without sacrificing data security and privacy. Learning models on edge devices such as mobile phones is one of the most common use cases for FL. However, Non-identical independent distributed~(non-IID) data in edge devices easily leads to training failures. Especially, over-parameterized machi… ▽ More

    Submitted 9 February, 2023; v1 submitted 13 June, 2021; originally announced June 2021.

    Comments: Preprint of the CCGrid 2023 Submission

  39. arXiv:2105.12254  [pdf, other

    cs.RO cs.LG

    Interpretable UAV Collision Avoidance using Deep Reinforcement Learning

    Authors: Deepak-George Thomas, Daniil Olshanskyi, Karter Krueger, Tichakorn Wongpiromsarn, Ali Jannesari

    Abstract: The significant components of any successful autonomous flight system are task completion and collision avoidance. Most deep learning algorithms successfully execute these aspects under the environment and conditions they are trained. However, they fail when subjected to novel environments. This paper presents an autonomous multi-rotor flight algorithm, using Deep Reinforcement Learning augmented… ▽ More

    Submitted 4 June, 2021; v1 submitted 25 May, 2021; originally announced May 2021.

  40. arXiv:2103.06403  [pdf, other

    cs.AI cs.CV cs.RO

    A Vision Based Deep Reinforcement Learning Algorithm for UAV Obstacle Avoidance

    Authors: Jeremy Roghair, Kyungtae Ko, Amir Ehsan Niaraki Asli, Ali Jannesari

    Abstract: Integration of reinforcement learning with unmanned aerial vehicles (UAVs) to achieve autonomous flight has been an active research area in recent years. An important part focuses on obstacle detection and avoidance for UAVs navigating through an environment. Exploration in an unseen environment can be tackled with Deep Q-Network (DQN). However, value exploration with uniform sampling of actions m… ▽ More

    Submitted 10 March, 2021; originally announced March 2021.

    Comments: 12 pages, 6 figures

  41. arXiv:2102.03214  [pdf, other

    cs.CV cs.LG

    Topology-Aware Network Pruning using Multi-stage Graph Embedding and Reinforcement Learning

    Authors: Sixing Yu, Arya Mazaheri, Ali Jannesari

    Abstract: Model compression is an essential technique for deploying deep neural networks (DNNs) on power and memory-constrained resources. However, existing model-compression methods often rely on human expertise and focus on parameters' local importance, ignoring the rich topology information within DNNs. In this paper, we propose a novel multi-stage graph embedding technique based on graph neural networks… ▽ More

    Submitted 1 July, 2022; v1 submitted 5 February, 2021; originally announced February 2021.

    Comments: Accepted at ICML 2022 Long presentation

  42. arXiv:2011.12641  [pdf, other

    cs.CV

    Auto Graph Encoder-Decoder for Neural Network Pruning

    Authors: Sixing Yu, Arya Mazaheri, Ali Jannesari

    Abstract: Model compression aims to deploy deep neural networks (DNN) on mobile devices with limited computing and storage resources. However, most of the existing model compression methods rely on manually defined rules, which require domain expertise. DNNs are essentially computational graphs, which contain rich structural information. In this paper, we aim to find a suitable compression policy from DNNs'… ▽ More

    Submitted 9 November, 2021; v1 submitted 25 November, 2020; originally announced November 2020.

    Comments: In proc. of ICCV 2021

  43. arXiv:2010.04328  [pdf, other

    cs.LG cs.AI eess.SP

    HydroDeep -- A Knowledge Guided Deep Neural Network for Geo-Spatiotemporal Data Analysis

    Authors: Aishwarya Sarkar, Jien Zhang, Chaoqun Lu, Ali Jannesari

    Abstract: Due to limited evidence and complex causes of regional climate change, the confidence in predicting fluvial floods remains low. Understanding the fundamental mechanisms intrinsic to geo-spatiotemporal information is crucial to improve the prediction accuracy. This paper demonstrates a hybrid neural network architecture - HydroDeep, that couples a process-based hydro-ecological model with a combina… ▽ More

    Submitted 8 February, 2021; v1 submitted 8 October, 2020; originally announced October 2020.

  44. arXiv:2008.08951  [pdf, other

    cs.LG stat.ML

    Static Neural Compiler Optimization via Deep Reinforcement Learning

    Authors: Rahim Mammadli, Ali Jannesari, Felix Wolf

    Abstract: The phase-ordering problem of modern compilers has received a lot of attention from the research community over the years, yet remains largely unsolved. Various optimization sequences exposed to the user are manually designed by compiler developers. In designing such a sequence developers have to choose the set of optimization passes, their parameters and ordering within a sequence. Resulting sequ… ▽ More

    Submitted 16 October, 2020; v1 submitted 20 August, 2020; originally announced August 2020.

    Comments: 11 pages, 5 figures

  45. arXiv:1909.12217  [pdf, other

    eess.SP cs.AI cs.LG

    Visual Exploration and Energy-aware Path Planning via Reinforcement Learning

    Authors: Amir Niaraki, Jeremy Roghair, Ali Jannesari

    Abstract: Visual exploration and smart data collection via autonomous vehicles is an attractive topic in various disciplines. Disturbances like wind significantly influence both the power consumption of the flying robots and the performance of the camera. We propose a reinforcement learning approach which combines the effects of the power consumption and the object detection modules to develop a policy for… ▽ More

    Submitted 25 January, 2021; v1 submitted 26 September, 2019; originally announced September 2019.

    Comments: 20 Pages, 14 figures

  46. arXiv:1907.07110  [pdf

    cs.DC

    DeepRace: Finding Data Race Bugs via Deep Learning

    Authors: Ali Tehrani, Mohammed Khaleel, Reza Akbari, Ali Jannesari

    Abstract: With the proliferation of multi-core hardware, parallel programs have become ubiquitous. These programs have their own type of bugs known as concurrency bugs and among them, data race bugs have been mostly in the focus of researchers over the past decades. In fact, detecting data races is a very challenging and important task. There have been several research paths in this area with many sophistic… ▽ More

    Submitted 15 July, 2019; originally announced July 2019.

    Comments: 9 pages

  47. arXiv:1907.06205  [pdf, other

    cs.SE cs.CL cs.FL cs.LG cs.PL stat.ML

    Automatic Repair and Type Binding of Undeclared Variables using Neural Networks

    Authors: Venkatesh Theru Mohan, Ali Jannesari

    Abstract: Deep learning had been used in program analysis for the prediction of hidden software defects using software defect datasets, security vulnerabilities using generative adversarial networks as well as identifying syntax errors by learning a trained neural machine translation on program codes. However, all these approaches either require defect datasets or bug-free source codes that are executable f… ▽ More

    Submitted 14 July, 2019; originally announced July 2019.

    Comments: 16 pages, 16 figures

  48. arXiv:1906.00786  [pdf, other

    cs.CV

    Efficient Object Detection Model for Real-Time UAV Applications

    Authors: Subrahmanyam Vaddi, Chandan Kumar, Ali Jannesari

    Abstract: Unmanned Aerial Vehicles (UAVs) especially drones, equipped with vision techniques have become very popular in recent years, with their extensive use in wide range of applications. Many of these applications require use of computer vision techniques, particularly object detection from the information captured by on-board camera. In this paper, we propose an end to end object detection model runnin… ▽ More

    Submitted 30 May, 2019; originally announced June 2019.

    Comments: 10 pages, 4 figures, Under Review. arXiv admin note: substantial text overlap with arXiv:1808.07256 by other authors without attribution; substantial text overlap with arXiv:1807.06789, arXiv:1612.03144, arXiv:1809.03193 by other authors

  49. arXiv:1611.06945  [pdf, other

    cs.NE cs.DC cs.MS

    A Metaprogramming and Autotuning Framework for Deploying Deep Learning Applications

    Authors: Matthew W. Moskewicz, Ali Jannesari, Kurt Keutzer

    Abstract: In recent years, deep neural networks (DNNs), have yielded strong results on a wide range of applications. Graphics Processing Units (GPUs) have been one key enabling factor leading to the current popularity of DNNs. However, despite increasing hardware flexibility and software programming toolchain maturity, high efficiency GPU programming remains difficult: it suffers from high complexity, low p… ▽ More

    Submitted 21 November, 2016; originally announced November 2016.