Skip to main content

Showing 1–31 of 31 results for author: Tan, C W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.04069  [pdf, other

    cs.CL cs.AI cs.LG

    A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations

    Authors: Md Tahmid Rahman Laskar, Sawsan Alqahtani, M Saiful Bari, Mizanur Rahman, Mohammad Abdullah Matin Khan, Haidar Khan, Israt Jahan, Amran Bhuiyan, Chee Wei Tan, Md Rizwan Parvez, Enamul Hoque, Shafiq Joty, Jimmy Huang

    Abstract: Large Language Models (LLMs) have recently gained significant attention due to their remarkable capabilities in performing diverse tasks across various domains. However, a thorough evaluation of these models is crucial before deploying them in real-world applications to ensure they produce reliable performance. Despite the well-established importance of evaluating LLMs in the community, the comple… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  2. arXiv:2403.14890  [pdf

    cs.SI

    Unraveling Contagion Origins: Optimal Estimation through Maximum-Likelihood and Starlike Tree Approximation in Markovian Spreading Models

    Authors: Pei-Duo Yu, Chee Wei Tan, Liang Zheng, Chao Zhao

    Abstract: Identifying the source of epidemic-like spread in networks is crucial for tasks like removing internet viruses or finding the rumor source in online social networks. The challenge lies in tracing the source from a snapshot observation of infected nodes. How do we accurately pinpoint the source? Utilizing snapshot data, we apply a probabilistic approach, focusing on the graph boundary and the obser… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  3. arXiv:2401.08216  [pdf, other

    cs.CR cs.LG

    Towards Efficient and Certified Recovery from Poisoning Attacks in Federated Learning

    Authors: Yu Jiang, Jiyuan Shen, Ziyao Liu, Chee Wei Tan, Kwok-Yan Lam

    Abstract: Federated learning (FL) is vulnerable to poisoning attacks, where malicious clients manipulate their updates to affect the global model. Although various methods exist for detecting those clients in FL, identifying malicious clients requires sufficient model updates, and hence by the time malicious clients are detected, FL models have been already poisoned. Thus, a method is needed to recover an a… ▽ More

    Submitted 19 January, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

  4. arXiv:2312.03998  [pdf, other

    cs.LG

    Series2Vec: Similarity-based Self-supervised Representation Learning for Time Series Classification

    Authors: Navid Mohammadi Foumani, Chang Wei Tan, Geoffrey I. Webb, Hamid Rezatofighi, Mahsa Salehi

    Abstract: We argue that time series analysis is fundamentally different in nature to either vision or natural language processing with respect to the forms of meaningful self-supervised learning tasks that can be defined. Motivated by this insight, we introduce a novel approach called \textit{Series2Vec} for self-supervised representation learning. Unlike other self-supervised methods in time series, which… ▽ More

    Submitted 12 December, 2023; v1 submitted 6 December, 2023; originally announced December 2023.

  5. arXiv:2311.14708  [pdf, other

    cs.CY cs.AI cs.CL cs.HC

    Large Language Model-Driven Classroom Flipping: Empowering Student-Centric Peer Questioning with Flipped Interaction

    Authors: Chee Wei Tan

    Abstract: Reciprocal questioning is essential for effective teaching and learning, fostering active engagement and deeper understanding through collaborative interactions, especially in large classrooms. Can large language model (LLM), such as OpenAI's GPT (Generative Pre-trained Transformer) series, assist in this? This paper investigates a pedagogical approach of classroom flipping based on flipped intera… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: Submitted

  6. arXiv:2307.14349  [pdf, other

    cs.SE cs.AI

    Copilot for Xcode: Exploring AI-Assisted Programming by Prompting Cloud-based Large Language Models

    Authors: Chee Wei Tan, Shangxin Guo, Man Fai Wong, Ching Nam Hang

    Abstract: This paper presents an AI-assisted programming tool called Copilot for Xcode for program composition and design to support human software developers. By seamlessly integrating cloud-based Large Language Models (LLM) with Apple's local development environment, Xcode, this tool enhances productivity and unleashes creativity for software development in Apple software ecosystem (e.g., iOS apps, macOS)… ▽ More

    Submitted 8 July, 2023; originally announced July 2023.

  7. arXiv:2307.13716  [pdf, other

    cs.LG cs.AI

    FedDRL: A Trustworthy Federated Learning Model Fusion Method Based on Staged Reinforcement Learning

    Authors: Leiming Chen, Weishan Zhang, Cihao Dong, Sibo Qiao, Ziling Huang, Yuming Nie, Zhaoxiang Hou, Chee Wei Tan

    Abstract: Traditional federated learning uses the number of samples to calculate the weights of each client model and uses this fixed weight value to fusion the global model. However, in practical scenarios, each client's device and data heterogeneity leads to differences in the quality of each client's model. Thus the contribution to the global model is not wholly determined by the sample size. In addition… ▽ More

    Submitted 19 March, 2024; v1 submitted 25 July, 2023; originally announced July 2023.

  8. Contagion Source Detection in Epidemic and Infodemic Outbreaks: Mathematical Analysis and Network Algorithms

    Authors: Chee Wei Tan, Pei-Duo Yu

    Abstract: This monograph provides an overview of the mathematical theories and computational algorithm design for contagion source detection in large networks. By leveraging network centrality as a tool for statistical inference, we can accurately identify the source of contagions, trace their spread, and predict future trajectories. This approach provides fundamental insights into surveillance capability a… ▽ More

    Submitted 8 July, 2023; originally announced July 2023.

    Comments: Suggested Citation: Chee Wei Tan and Pei-Duo Yu (2023), "Contagion Source Detection in Epidemic and Infodemic Outbreaks: Mathematical Analysis and Network Algorithms", Foundations and Trends in Networking: Vol. 13: No. 2-3, pp 107-251. http://dx.doi.org/10.1561/1300000068

  9. arXiv:2307.02503  [pdf, other

    cs.SE cs.AI cs.CL

    Natural Language Generation and Understanding of Big Code for AI-Assisted Programming: A Review

    Authors: Man Fai Wong, Shangxin Guo, Ching Nam Hang, Siu Wai Ho, Chee Wei Tan

    Abstract: This paper provides a comprehensive review of the literature concerning the utilization of Natural Language Processing (NLP) techniques, with a particular focus on transformer-based large language models (LLMs) trained using Big Code, within the domain of AI-assisted programming tasks. LLMs, augmented with software naturalness, have played a crucial role in facilitating AI-assisted programming app… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

    Journal ref: Entropy(2023), 25(6), 888

  10. Improving Position Encoding of Transformers for Multivariate Time Series Classification

    Authors: Navid Mohammadi Foumani, Chang Wei Tan, Geoffrey I. Webb, Mahsa Salehi

    Abstract: Transformers have demonstrated outstanding performance in many applications of deep learning. When applied to time series data, transformers require effective position encoding to capture the ordering of the time series data. The efficacy of position encoding in time series analysis is not well-studied and remains controversial, e.g., whether it is better to inject absolute position encoding or re… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

  11. arXiv:2305.11921  [pdf, other

    stat.ME cs.AI cs.LG cs.PF

    An Approach to Multiple Comparison Benchmark Evaluations that is Stable Under Manipulation of the Comparate Set

    Authors: Ali Ismail-Fawaz, Angus Dempster, Chang Wei Tan, Matthieu Herrmann, Lynn Miller, Daniel F. Schmidt, Stefano Berretti, Jonathan Weber, Maxime Devanne, Germain Forestier, Geoffrey I. Webb

    Abstract: The measurement of progress using benchmarks evaluations is ubiquitous in computer science and machine learning. However, common approaches to analyzing and presenting the results of benchmark comparisons of multiple algorithms over multiple datasets, such as the critical difference diagram introduced by Demšar (2006), have important shortcomings and, we show, are open to both inadvertent and inte… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

  12. arXiv:2304.05800  [pdf, other

    cs.LG cs.AI

    Proximity Forest 2.0: A new effective and scalable similarity-based classifier for time series

    Authors: Matthieu Herrmann, Chang Wei Tan, Mahsa Salehi, Geoffrey I. Webb

    Abstract: Time series classification (TSC) is a challenging task due to the diversity of types of feature that may be relevant for different classification tasks, including trends, variance, frequency, magnitude, and various patterns. To address this challenge, several alternative classes of approach have been developed, including similarity-based, features and intervals, shapelets, dictionary, kernel, neur… ▽ More

    Submitted 13 April, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

  13. arXiv:2302.02515  [pdf, other

    cs.LG cs.AI cs.CV

    Deep Learning for Time Series Classification and Extrinsic Regression: A Current Survey

    Authors: Navid Mohammadi Foumani, Lynn Miller, Chang Wei Tan, Geoffrey I. Webb, Germain Forestier, Mahsa Salehi

    Abstract: Time Series Classification and Extrinsic Regression are important and challenging machine learning tasks. Deep learning has revolutionized natural language processing and computer vision and holds great promise in other fields such as time series analysis where the relevant features must often be abstracted from the raw data but are not known a priori. This paper surveys the current state of the a… ▽ More

    Submitted 19 December, 2023; v1 submitted 5 February, 2023; originally announced February 2023.

  14. arXiv:2301.13007  [pdf, other

    cs.CV cs.AI cs.LG

    EuclidNet: Deep Visual Reasoning for Constructible Problems in Geometry

    Authors: Man Fai Wong, Xintong Qi, Chee Wei Tan

    Abstract: In this paper, we present a deep learning-based framework for solving geometric construction problems through visual reasoning, which is useful for automated geometry theorem proving. Constructible problems in geometry often ask for the sequence of straightedge-and-compass constructions to construct a given goal given some initial setup. Our EuclidNet framework leverages the neural network archite… ▽ More

    Submitted 27 December, 2022; originally announced January 2023.

    Comments: Accepted by 2nd MATH-AI Workshop at NeurIPS'22

    Journal ref: Adv. Artif. Intell. Mach. Learn.(2023), 3(1):839-852

  15. arXiv:2301.10350  [pdf, other

    cs.LG

    Parameterizing the cost function of Dynamic Time Warping with application to time series classification

    Authors: Matthieu Herrmann, Chang Wei Tan, Geoffrey I. Webb

    Abstract: Dynamic Time Warping (DTW) is a popular time series distance measure that aligns the points in two series with one another. These alignments support warping of the time dimension to allow for processes that unfold at differing rates. The distance is the minimum sum of costs of the resulting alignments over any allowable warping of the time dimension. The cost of an alignment of two points is a fun… ▽ More

    Submitted 28 March, 2023; v1 submitted 24 January, 2023; originally announced January 2023.

  16. arXiv:2211.00880  [pdf, other

    cs.SI physics.soc-ph

    DeepTrace: Learning to Optimize Contact Tracing in Epidemic Networks with Graph Neural Networks

    Authors: Chee Wei Tan, Pei-Duo Yu, Siya Chen, H. Vincent Poor

    Abstract: Digital contact tracing aims to curb epidemics by identifying and mitigating public health emergencies through technology. Backward contact tracing, which tracks the sources of infection, proved crucial in places like Japan for identifying COVID-19 infections from superspreading events. This paper presents a novel perspective of digital contact tracing as online graph exploration and addresses the… ▽ More

    Submitted 24 June, 2024; v1 submitted 2 November, 2022; originally announced November 2022.

  17. arXiv:2209.07018  [pdf, other

    cs.LG

    FRANS: Automatic Feature Extraction for Time Series Forecasting

    Authors: Alexey Chernikov, Chang Wei Tan, Pablo Montero-Manso, Christoph Bergmeir

    Abstract: Feature extraction methods help in dimensionality reduction and capture relevant information. In time series forecasting (TSF), features can be used as auxiliary information to achieve better accuracy. Traditionally, features used in TSF are handcrafted, which requires domain knowledge and significant data-engineering work. In this research, we first introduce a notion of static and dynamic featur… ▽ More

    Submitted 14 September, 2022; originally announced September 2022.

  18. Epidemic Source Detection in Contact Tracing Networks: Epidemic Centrality in Graphs and Message-Passing Algorithms

    Authors: Pei-Duo Yu, Chee Wei Tan, Hung-Lin Fu

    Abstract: We study the epidemic source detection problem in contact tracing networks modeled as a graph-constrained maximum likelihood estimation problem using the susceptible-infected model in epidemiology. Based on a snapshot observation of the infection subgraph, we first study finite degree regular graphs and regular graphs with cycles separately, thereby establishing a mathematical equivalence in maxim… ▽ More

    Submitted 25 February, 2022; v1 submitted 18 January, 2022; originally announced January 2022.

  19. arXiv:2111.14623  [pdf, other

    cs.LG cs.CY stat.AP

    An Overview of Healthcare Data Analytics With Applications to the COVID-19 Pandemic

    Authors: Zhe Fei, Yevgen Ryeznik, Oleksandr Sverdlov, Chee Wei Tan, Weng Kee Wong

    Abstract: In the era of big data, standard analysis tools may be inadequate for making inference and there is a growing need for more efficient and innovative ways to collect, process, analyze and interpret the massive and complex data. We provide an overview of challenges in big data problems and describe how innovative analytical methods, machine learning tools and metaheuristics can tackle general health… ▽ More

    Submitted 25 November, 2021; originally announced November 2021.

    Journal ref: IEEE TRANSACTIONS ON BIG DATA, 12 August 2021

  20. arXiv:2102.08245  [pdf, other

    cs.LG

    Classification of multivariate weakly-labelled time-series with attention

    Authors: Surayez Rahman, Chang Wei Tan

    Abstract: This research identifies a gap in weakly-labelled multivariate time-series classification (TSC), where state-of-the-art TSC models do not per-form well. Weakly labelled time-series are time-series containing noise and significant redundancies. In response to this gap, this paper proposes an approach of exploiting context relevance of subsequences from previous subsequences to improve classificatio… ▽ More

    Submitted 17 September, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

  21. arXiv:2102.00457  [pdf, other

    cs.LG stat.ML

    MultiRocket: Multiple pooling operators and transformations for fast and effective time series classification

    Authors: Chang Wei Tan, Angus Dempster, Christoph Bergmeir, Geoffrey I. Webb

    Abstract: We propose MultiRocket, a fast time series classification (TSC) algorithm that achieves state-of-the-art performance with a tiny fraction of the time and without the complex ensembling structure of many state-of-the-art methods. MultiRocket improves on MiniRocket, one of the fastest TSC algorithms to date, by adding multiple pooling operators and transformations to improve the diversity of the fea… ▽ More

    Submitted 21 February, 2022; v1 submitted 31 January, 2021; originally announced February 2021.

  22. arXiv:2006.12672  [pdf, other

    cs.LG stat.ML

    Time Series Extrinsic Regression

    Authors: Chang Wei Tan, Christoph Bergmeir, Francois Petitjean, Geoffrey I. Webb

    Abstract: This paper studies Time Series Extrinsic Regression (TSER): a regression task of which the aim is to learn the relationship between a time series and a continuous scalar variable; a task closely related to time series classification (TSC), which aims to learn the relationship between a time series and a categorical class label. This task generalizes time series forecasting (TSF), relaxing the requ… ▽ More

    Submitted 3 February, 2021; v1 submitted 22 June, 2020; originally announced June 2020.

  23. arXiv:2006.12246  [pdf, other

    cs.CV cs.HC cs.LG stat.ML

    Pain Intensity Estimation from Mobile Video Using 2D and 3D Facial Keypoints

    Authors: Matthew Lee, Lyndon Kennedy, Andreas Girgensohn, Lynn Wilcox, John Song En Lee, Chin Wen Tan, Ban Leong Sng

    Abstract: Managing post-surgical pain is critical for successful surgical outcomes. One of the challenges of pain management is accurately assessing the pain level of patients. Self-reported numeric pain ratings are limited because they are subjective, can be affected by mood, and can influence the patient's perception of pain when making comparisons. In this paper, we introduce an approach that analyzes 2D… ▽ More

    Submitted 16 June, 2020; originally announced June 2020.

  24. arXiv:2006.10996  [pdf, other

    cs.LG stat.ML

    Monash University, UEA, UCR Time Series Extrinsic Regression Archive

    Authors: Chang Wei Tan, Christoph Bergmeir, Francois Petitjean, Geoffrey I. Webb

    Abstract: Time series research has gathered lots of interests in the last decade, especially for Time Series Classification (TSC) and Time Series Forecasting (TSF). Research in TSC has greatly benefited from the University of California Riverside and University of East Anglia (UCR/UEA) Time Series Archives. On the other hand, the advancement in Time Series Forecasting relies on time series forecasting compe… ▽ More

    Submitted 19 October, 2020; v1 submitted 19 June, 2020; originally announced June 2020.

  25. arXiv:2004.11839  [pdf, other

    eess.SP cs.LG stat.ML

    Detecting Driver's Distraction using Long-term Recurrent Convolutional Network

    Authors: Chang Wei Tan, Mahsa Salehi, Geoffrey Mackellar

    Abstract: In this study we demonstrate a novel Brain Computer Interface (BCI) approach to detect driver distraction events to improve road safety. We use a commercial wireless headset that generates EEG signals from the brain. We collected real EEG signals from participants who undertook a 40-minute driving simulation and were required to perform different tasks while driving. These signals are segmented in… ▽ More

    Submitted 13 April, 2020; originally announced April 2020.

    Comments: 3 pages 2 figures

  26. arXiv:1910.04341  [pdf, other

    cs.LG stat.ML

    Time series classification for varying length series

    Authors: Chang Wei Tan, Francois Petitjean, Eamonn Keogh, Geoffrey I. Webb

    Abstract: Research into time series classification has tended to focus on the case of series of uniform length. However, it is common for real-world time series data to have unequal lengths. Differing time series lengths may arise from a number of fundamentally different mechanisms. In this work, we identify and evaluate two classes of such mechanisms -- variations in sampling rate relative to the relevant… ▽ More

    Submitted 9 October, 2019; originally announced October 2019.

    Comments: 23 pages

  27. arXiv:1808.09617  [pdf, other

    cs.LG stat.ML

    Elastic bands across the path: A new framework and methods to lower bound DTW

    Authors: Chang Wei Tan, Francois Petitjean, Geoffrey I. Webb

    Abstract: There has been renewed recent interest in developing effective lower bounds for Dynamic Time Warping (DTW) distance between time series. These have many applications in time series indexing, clustering, forecasting, regression and classification. One of the key time series classification algorithms, the nearest neighbor algorithm with DTW distance (NN-DTW) is very expensive to compute, due to the… ▽ More

    Submitted 14 February, 2019; v1 submitted 28 August, 2018; originally announced August 2018.

  28. Joint Beamforming and Power Control in Coordinated Multicell: Max-Min Duality, Effective Network and Large System Transition

    Authors: Yichao Huang, Chee Wei Tan, Bhaskar D. Rao

    Abstract: This paper studies joint beamforming and power control in a coordinated multicell downlink system that serves multiple users per cell to maximize the minimum weighted signal-to-interference-plus-noise ratio. The optimal solution and distributed algorithm with geometrically fast convergence rate are derived by employing the nonlinear Perron-Frobenius theory and the multicell network duality. The it… ▽ More

    Submitted 27 June, 2013; v1 submitted 11 March, 2013; originally announced March 2013.

    Comments: Some typos in the version publised in the IEEE Transactions on Wireless Communications are corrected

  29. Rooting out the Rumor Culprit from Suspects

    Authors: Wenxiang Dong, Wenyi Zhang, Chee Wei Tan

    Abstract: Suppose that a rumor originating from a single source among a set of suspects spreads in a network, how to root out this rumor source? With the a priori knowledge of suspect nodes and an observation of infected nodes, we construct a maximum a posteriori (MAP) estimator to identify the rumor source using the susceptible-infected (SI) model. The a priori suspect set and its associated connectivity b… ▽ More

    Submitted 9 May, 2013; v1 submitted 26 January, 2013; originally announced January 2013.

    Comments: Submitted to IEEE Transactions on Information Theory

  30. arXiv:1208.4743   

    cs.NI

    Optimal Charging of Electric Vehicles in Smart Grid: Characterization and Valley-Filling Algorithms

    Authors: Niangjun Chen, Chee Wei Tan, Tony Q. S. Quek

    Abstract: Electric vehicles (EVs) offer an attractive long-term solution to reduce the dependence on fossil fuel and greenhouse gas emission. However, a fleet of EVs with different EV battery charging rate constraints, that is distributed across a smart power grid network requires a coordinated charging schedule to minimize the power generation and EV charging costs. In this paper, we study a joint optimal… ▽ More

    Submitted 7 April, 2013; v1 submitted 23 August, 2012; originally announced August 2012.

    Comments: This paper is temporarily withdrawn in preparation for journal submission

  31. arXiv:1103.0038  [pdf, ps, other

    cs.IT

    On the Sum-Capacity with Successive Decoding in Interference Channels

    Authors: Yue Zhao, Chee Wei Tan, A. Salman Avestimehr, Suhas N. Diggavi, Gregory J. Pottie

    Abstract: In this paper, we investigate the sum-capacity of the two-user Gaussian interference channel with Gaussian superposition coding and successive decoding. We first examine an approximate deterministic formulation of the problem, and introduce the complementarity conditions that capture the use of Gaussian coding and successive decoding. In the deterministic channel problem, we find the constrained s… ▽ More

    Submitted 28 March, 2011; v1 submitted 28 February, 2011; originally announced March 2011.

    Comments: 32 pages, 21 figures