-
Exploring the potential of AI in nurturing learner empathy, prosocial values and environmental stewardship
Authors:
Kenneth Y T Lim,
Minh Anh Nguyen Duc,
Minh Tuan Nguyen Thien
Abstract:
With Artificial Intelligence (AI) becoming a powerful tool for education (Zawacki-Richter et al., 2019), this chapter describes the concept of combining generative and traditional AI, citizen-science physiological, neuroergonomic wearables and environmental sensors into activities for learners to understand their own well-being and emotional states better with a view to developing empathy and environmental stewardship. Alongside bespoke and affordable wearables (DIY EEG headsets and biometric wristbands), interpretable AI and data science are used for learners to explore how the environment affects them physiologically and mentally in authentic environments. For example, relationships between environmental changes (e.g. poorer air quality) and their well-being (e.g. cognitive functioning) can be discovered. This is particularly crucial, as relevant knowledge can influence the way people treat the environment, as suggested by the disciplines of environmental neuroscience and environmental psychology (Doell et al., 2023). Yet, according to Palme and Salvati, there have been relatively few studies on the relationships between microclimates and human health and emotions (Palme and Salvati, 2021). As anthropogenic environmental pollution is becoming a prevalent problem, our research also aims to leverage on generative AI to introduce hypothetical scenarios of the environment as emotionally strong stimuli of relevance to the learners. This would provoke an emotional response for them to learn about their own physiological and neurological responses (using neuro-physiological data). Ultimately, we hope to establish a bidirectional understanding of how the environment affects humans physiologically and mentally; after which, to gain insights as to how AI can be used to effectively foster empathy, pro-environmental attitudes and stewardship.
Submitted 28 August, 2024;
originally announced August 2024.
-
Accelerator-as-a-Service in Public Clouds: An Intra-Host Traffic Management View for Performance Isolation in the Wild
Authors:
Jiechen Zhao,
Ran Shu,
Katie Lim,
Zewen Fan,
Thomas Anderson,
Mingyu Gao,
Natalie Enright Jerger
Abstract:
I/O devices in public clouds have integrated increasing numbers of hardware accelerators, e.g., AWS Nitro, Azure FPGA and Nvidia BlueField. However, such specialized compute (1) is not explicitly accessible to cloud users with performance guarantee, (2) cannot be leveraged simultaneously by both providers and users, unlike general-purpose compute (e.g., CPUs). Through ten observations, we present that the fundamental difficulty of democratizing accelerators is insufficient performance isolation support. The key obstacles to enforcing accelerator isolation are (1) too many unknown traffic patterns in public clouds and (2) too many possible contention sources in the datapath. In this work, instead of scheduling such complex traffic on-the-fly and augmenting isolation support on each system component, we propose to model traffic as network flows and proactively re-shape the traffic to avoid unpredictable contention. We discuss the implications of our findings on the design of future I/O management stacks and device interfaces.
Submitted 14 July, 2024;
originally announced July 2024.
-
Representation Type of the Descent Algebras of Type $\mathbb{A}$
Authors:
Karin Erdmann,
Kay Jin Lim
Abstract:
We classify the representation type of the descent algebras of type $\mathbb{A}$ in the positive characteristic case.
Submitted 8 July, 2024;
originally announced July 2024.
-
Projective Modules and Cohomology for Integral Basic Algebras
Authors:
David J. Benson,
Kay Jin Lim
Abstract:
Algebras defined over fields of characteristic zero and positive characteristic usually do not behave the same way. However, for certain algebras, for example the group algebras, they behave the same way as the characteristic zero case at good enough prime. In this paper, we initiate the study of this topic by imposing increasingly strong hypotheses on basic algebras. When the algebras satisfy the right hypotheses, we have equalities of the dimensions of their cohomology groups between simple modules and equalities of graded Cartan numbers. The examples include the Solomon descent algebras of finite Coxeter groups at large enough primes, nil-Coxeter algebra, and certain finite semigroup algebras at an arbitrary prime.
Submitted 7 July, 2024;
originally announced July 2024.
-
Predictive power-sharing scaling law in double-null L-mode plasmas
Authors:
K. Lim,
P. Ricci,
L. Stenger,
B. De Lucca,
G. Durr-Legoupil-Nicoud,
O. Février,
C. Theiler,
K. Verhaegh
Abstract:
The physical mechanisms regulating the power sharing at the outer targets of L-mode double-null (DN) configurations are investigated using nonlinear, flux-driven, three-dimensional two-fluid simulations. Scans of parameters that regulate the turbulent level, such as the plasma resistivity and the magnetic imbalance, reveal that the power asymmetry in DN configurations is determined by the combined effects of diamagnetic drift, turbulence, and geometrical factor. Leveraging these observations, an analytical theory-based scaling law for the power-sharing asymmetry is derived and compared with nonlinear simulations. These comparisons indicate that the scaling law effectively captures the trends observed in simulations. Validation with experimental data from TCV DN discharges demonstrates agreement of the scaling law with the experimental results.
Submitted 28 June, 2024;
originally announced June 2024.
-
Smoothed NPMLEs in nonparametric Poisson mixtures and beyond
Authors:
Keunwoo Lim,
Fang Han
Abstract:
We discuss nonparametric mixing distribution estimation under the Gaussian-smoothed optimal transport (GOT) distance. It is shown that a recently formulated conjecture -- that the Poisson nonparametric maximum likelihood estimator can achieve root-$n$ rate of convergence under the GOT distance -- holds up to some logarithmic terms. We also establish the same conclusion for other minimum-distance estimators, and discuss mixture models beyond the Poisson.
Submitted 13 June, 2024;
originally announced June 2024.
-
Enhancing Language Models for Financial Relation Extraction with Named Entities and Part-of-Speech
Authors:
Menglin Li,
Kwan Hui Lim
Abstract:
The Financial Relation Extraction (FinRE) task involves identifying the entities and their relation, given a piece of financial statement/text. To solve this FinRE problem, we propose a simple but effective strategy that improves the performance of pre-trained language models by augmenting them with Named Entity Recognition (NER) and Part-Of-Speech (POS), as well as different approaches to combine this information. Experiments on a financial relations dataset show promising results and highlight the benefits of incorporating NER and POS in existing models. Our dataset and codes are available at https://github.com/kwanhui/FinRelExtract.
Submitted 2 May, 2024;
originally announced May 2024.
-
Gravitating Scalarons with Inverted Higgs Potential
Authors:
Xiao Yan Chew,
Kok-Geng Lim
Abstract:
Previously, a class of regular and asymptotically flat gravitating scalar solitons (scalarons) has been constructed in the Einstein--Klein--Gordon (EKG) theory by adopting a phantom field with Higgs-like potential where the kinetic term has the wrong sign and the scalaron possesses the negative Arnowitt--Deser--Misner (ADM) mass as a consequence. In this paper, we demonstrate that the use of the phantom field can be avoided by inverting the Higgs-like potential in the EKG system when the kinetic term has a proper sign, such that the corresponding gravitating scalaron can possess the positive ADM mass. We systematically study the basic properties of the gravitating scalaron, such as the ADM mass, the energy conditions, the geodesics of test particles, etc. Moreover, we find that it can be smoothly connected to the counterpart hairy black hole solutions from our recent work in the small horizon limit.
Submitted 10 May, 2024;
originally announced May 2024.
-
Towards Precise Observations of Neural Model Robustness in Classification
Authors:
Wenchuan Mu,
Kwan Hui Lim
Abstract:
In deep learning applications, robustness measures the ability of neural models to handle slight changes in input data, which could lead to potential safety hazards, especially in safety-critical applications. Pre-deployment assessment of model robustness is essential, but existing methods often suffer from either high costs or imprecise results. To enhance safety in real-world scenarios, metrics that effectively capture the model's robustness are needed. To address this issue, we compare the rigour and usage conditions of various assessment methods based on different definitions. Then, we propose a straightforward and practical metric utilizing hypothesis testing for probabilistic robustness and have integrated it into the TorchAttacks library. Through a comparative analysis of diverse robustness assessment methods, our approach contributes to a deeper understanding of model robustness in safety-critical applications.
Submitted 25 April, 2024;
originally announced April 2024.
-
Label-Free Topic-Focused Summarization Using Query Augmentation
Authors:
Wenchuan Mu,
Kwan Hui Lim
Abstract:
In today's data and information-rich world, summarization techniques are essential in harnessing vast text to extract key information and enhance decision-making and efficiency. In particular, topic-focused summarization is important due to its ability to tailor content to specific aspects of an extended text. However, this usually requires extensive labelled datasets and considerable computational power. This study introduces a novel method, Augmented-Query Summarization (AQS), for topic-focused summarization without the need for extensive labelled datasets, leveraging query augmentation and hierarchical clustering. This approach facilitates the transferability of machine learning models to the task of summarization, circumventing the need for topic-specific training. Through real-world tests, our method demonstrates the ability to generate relevant and accurate summaries, showing its potential as a cost-effective solution in data-rich environments. This innovation paves the way for broader application and accessibility in the field of topic-focused summarization technology, offering a scalable, efficient method for personalized content extraction.
Submitted 25 April, 2024;
originally announced April 2024.
-
FewUser: Few-Shot Social User Geolocation via Contrastive Learning
Authors:
Menglin Li,
Kwan Hui Lim
Abstract:
To address the challenges of scarcity in geotagged data for social user geolocation, we propose FewUser, a novel framework for Few-shot social User geolocation. We incorporate a contrastive learning strategy between users and locations to improve geolocation performance with no or limited training data. FewUser features a user representation module that harnesses a pre-trained language model (PLM) and a user encoder to process and fuse diverse social media inputs effectively. To bridge the gap between PLM's knowledge and geographical data, we introduce a geographical prompting module with hard, soft, and semi-soft prompts, to enhance the encoding of location information. Contrastive learning is implemented through a contrastive loss and a matching loss, complemented by a hard negative mining strategy to refine the learning process. We construct two datasets TwiU and FliU, containing richer metadata than existing benchmarks, to evaluate FewUser and the extensive experiments demonstrate that FewUser significantly outperforms state-of-the-art methods in both zero-shot and various few-shot settings, achieving absolute improvements of 26.95\% and \textbf{41.62\%} on TwiU and FliU, respectively, with only one training sample per class. We further conduct a comprehensive analysis to investigate the impact of user representation on geolocation performance and the effectiveness of FewUser's components, offering valuable insights for future research in this area.
Submitted 28 March, 2024;
originally announced April 2024.
-
HyperCLOVA X Technical Report
Authors:
Kang Min Yoo,
Jaegeun Han,
Sookyo In,
Heewon Jeon,
Jisu Jeong,
Jaewook Kang,
Hyunwook Kim,
Kyung-Min Kim,
Munhyong Kim,
Sungju Kim,
Donghyun Kwak,
Hanock Kwak,
Se Jung Kwon,
Bado Lee,
Dongsoo Lee,
Gichang Lee,
Jooho Lee,
Baeseong Park,
Seongjin Shin,
Joonsang Yu,
Seolki Baek,
Sumin Byeon,
Eungsup Cho,
Dooseok Choe,
Jeesung Han
, et al. (371 additional authors not shown)
Abstract:
We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment to responsible AI. The model is evaluated across various benchmarks, including comprehensive reasoning, knowledge, commonsense, factuality, coding, math, chatting, instruction-following, and harmlessness, in both Korean and English. HyperCLOVA X exhibits strong reasoning capabilities in Korean backed by a deep understanding of the language and cultural nuances. Further analysis of the inherent bilingual nature and its extension to multilingualism highlights the model's cross-lingual proficiency and strong generalization ability to untargeted languages, including machine translation between several language pairs and cross-lingual inference tasks. We believe that HyperCLOVA X can provide helpful guidance for regions or countries in developing their sovereign LLMs.
Submitted 13 April, 2024; v1 submitted 2 April, 2024;
originally announced April 2024.
-
Beehive: A Flexible Network Stack for Direct-Attached Accelerators
Authors:
Katie Lim,
Matthew Giordano,
Theano Stavrinos,
Pratyush Patel,
Jacob Nelson,
Irene Zhang,
Baris Kasikci,
Tom Anderson
Abstract:
Direct-attached accelerators, where application accelerators are directly connected to the datacenter network via a hardware network stack, offer substantial benefits in terms of reduced latency, CPU overhead, and energy use. However, a key challenge is that modern datacenter network stacks are complex, with interleaved protocol layers, network management functions, and virtualization support. To operators, network feature agility, diagnostics, and manageability are often considered just as important as raw performance. By contrast, existing hardware network stacks only support basic protocols and are often difficult to extend since they use fixed processing pipelines.
We propose Beehive, a new, open-source FPGA network stack for direct-attached accelerators designed to enable flexible and adaptive construction of complex network functionality in hardware. Application and network protocol elements are modularized as tiles over a network-on-chip substrate. Elements can be added or scaled up/down to match workload characteristics with minimal effort or changes to other elements. Flexible diagnostics and control are integral, with tooling to ensure deadlock safety. Our implementation interoperates with standard Linux TCP and UDP clients, with a 4x improvement in end-to-end remote procedure call tail latency for Linux UDP clients versus a CPU-attached accelerator.
Submitted 30 May, 2024; v1 submitted 21 March, 2024;
originally announced March 2024.
-
Spatial-Temporal Graph Representation Learning for Tactical Networks Future State Prediction
Authors:
Junhua Liu,
Justin Albrethsen,
Lincoln Goh,
David Yau,
Kwan Hui Lim
Abstract:
Resource allocation in tactical ad-hoc networks presents unique challenges due to their dynamic and multi-hop nature. Accurate prediction of future network connectivity is essential for effective resource allocation in such environments. In this paper, we introduce the Spatial-Temporal Graph Encoder-Decoder (STGED) framework for Tactical Communication Networks that leverages both spatial and temporal features of network states to learn latent tactical behaviors effectively. STGED hierarchically utilizes graph-based attention mechanism to spatially encode a series of communication network states, leverages a recurrent neural network to temporally encode the evolution of states, and a fully-connected feed-forward network to decode the connectivity in the future state. Through extensive experiments, we demonstrate that STGED consistently outperforms baseline models by large margins across different time-steps input, achieving an accuracy of up to 99.2\% for the future state prediction task of tactical communication networks.
Submitted 14 July, 2024; v1 submitted 20 March, 2024;
originally announced March 2024.
-
X-LLaVA: Optimizing Bilingual Large Vision-Language Alignment
Authors:
Dongjae Shin,
Hyeonseok Lim,
Inho Won,
Changsu Choi,
Minjun Kim,
Seungwoo Song,
Hangyeol Yoo,
Sangmin Kim,
Kyungtae Lim
Abstract:
The impressive development of large language models (LLMs) is expanding into the realm of large multimodal models (LMMs), which incorporate multiple types of data beyond text. However, the nature of multimodal models leads to significant expenses in the creation of training data. Furthermore, constructing multilingual data for LMMs presents its own set of challenges due to language diversity and complexity. Therefore, in this study, we propose two cost-effective methods to solve this problem: (1) vocabulary expansion and pretraining of multilingual LLM for specific languages, and (2) automatic and elaborate construction of multimodal datasets using GPT4-V. Based on these methods, we constructed a 91K English-Korean-Chinese multilingual, multimodal training dataset. Additionally, we developed a bilingual multimodal model that exhibits excellent performance in both Korean and English, surpassing existing approaches.
Submitted 1 April, 2024; v1 submitted 17 March, 2024;
originally announced March 2024.
-
Optimizing Language Augmentation for Multilingual Large Language Models: A Case Study on Korean
Authors:
ChangSu Choi,
Yongbin Jeong,
Seoyoon Park,
InHo Won,
HyeonSeok Lim,
SangMin Kim,
Yejee Kang,
Chanhyuk Yoon,
Jaewan Park,
Yiseul Lee,
HyeJin Lee,
Younggyun Hahm,
Hansaem Kim,
KyungTae Lim
Abstract:
Large language models (LLMs) use pretraining to predict the subsequent word; however, their expansion requires significant computing resources. Numerous big tech companies and research institutes have developed multilingual LLMs (MLLMs) to meet current demands, overlooking less-resourced languages (LRLs). This study proposed three strategies to enhance the performance of LRLs based on the publicly available MLLMs. First, the MLLM vocabularies of LRLs were expanded to enhance expressiveness. Second, bilingual data were used for pretraining to align the high- and less-resourced languages. Third, a high-quality small-scale instruction dataset was constructed and instruction-tuning was performed to augment the LRL. The experiments employed the Llama2 model and Korean was used as the LRL, which was quantitatively evaluated against other developed LLMs across eight tasks. Furthermore, a qualitative assessment was performed based on human evaluation and GPT4. Experimental results showed that our proposed Bllossom model exhibited superior performance in qualitative analyses compared to previously proposed Korean monolingual models.
Submitted 21 March, 2024; v1 submitted 16 March, 2024;
originally announced March 2024.
-
Leveraging Contrastive Learning for Few-shot Geolocation of Social Posts
Authors:
Menglin Li,
Kwan Hui Lim
Abstract:
Social geolocation is an important problem of predicting the originating locations of social media posts. However, this task is challenging due to the need for a substantial volume of training data, alongside well-annotated labels. These issues are further exacerbated by new or less popular locations with insufficient labels, further leading to an imbalanced dataset. In this paper, we propose \textbf{ContrastGeo}, a \textbf{Contrast}ive learning enhanced framework for few-shot social \textbf{Geo}location. Specifically, a Tweet-Location Contrastive learning objective is introduced to align representations of tweets and locations within tweet-location pairs. To capture the correlations between tweets and locations, a Tweet-Location Matching objective is further adopted into the framework and refined via an online hard negative mining approach. We also develop three fusion strategies with various fusion encoders to better generate joint representations of tweets and locations. Comprehensive experiments on three social media datasets highlight ContrastGeo's superior performance over several state-of-the-art baselines in few-shot social geolocation.
Submitted 19 February, 2024;
originally announced March 2024.
-
Generation and optimization of entanglement between giant atoms chirally coupled to spin cavities
Authors:
Jia-Bin You,
Jian Feng Kong,
Davit Aghamalyan,
Wai-Keong Mok,
Kian Hwee Lim,
Jun Ye,
Ching Eng Png,
Francisco J. García-Vidal
Abstract:
We explore a scheme for entanglement generation and optimization in giant atoms by coupling them to finite one-dimensional arrays of spins that behave as cavities. We find that high values for the concurrence can be achieved in small-sized cavities, with very short generation times. When exciting the system by external means, optimal concurrence is obtained for very weak drivings. We also analyze the effect of disorder in these systems, showing that although the average concurrence decreases with disorder, high concurrences can still be obtained even in scenarios presenting strong disorder. This result leads us to propose an optimization procedure in which by engineering the on-site energies or hoppings in the cavity, concurrences close to 1 can be reached within an extremely short period of time.
Submitted 29 February, 2024;
originally announced March 2024.
-
Ocassionally Secure: A Comparative Analysis of Code Generation Assistants
Authors:
Ran Elgedawy,
John Sadik,
Senjuti Dutta,
Anuj Gautam,
Konstantinos Georgiou,
Farzin Gholamrezae,
Fujiao Ji,
Kyungchan Lim,
Qian Liu,
Scott Ruoti
Abstract:
Large Language Models (LLMs) are being increasingly utilized in various applications, with code generations being a notable example. While previous research has shown that LLMs have the capability to generate both secure and insecure code, the literature does not take into account what factors help generate secure and effective code. Therefore in this paper we focus on identifying and understanding the conditions and contexts in which LLMs can be effectively and safely deployed in real-world scenarios to generate quality code. We conducted a comparative analysis of four advanced LLMs--GPT-3.5 and GPT-4 using ChatGPT and Bard and Gemini from Google--using 9 separate tasks to assess each model's code generation capabilities. We contextualized our study to represent the typical use cases of a real-life developer employing LLMs for everyday tasks as work. Additionally, we place an emphasis on security awareness which is represented through the use of two distinct versions of our developer persona. In total, we collected 61 code outputs and analyzed them across several aspects: functionality, security, performance, complexity, and reliability. These insights are crucial for understanding the models' capabilities and limitations, guiding future development and practical applications in the field of automated code generation.
Submitted 1 February, 2024;
originally announced February 2024.
-
Contrastive Learning in Distilled Models
Authors:
Valerie Lim,
Kai Wen Ng,
Kenneth Lim
Abstract:
Natural Language Processing models like BERT can provide state-of-the-art word embeddings for downstream NLP tasks. However, these models have yet to perform well on Semantic Textual Similarity, and may be too large to be deployed as lightweight edge applications. We seek to apply a suitable contrastive learning method, based on the SimCSE paper, to a model architecture adapted from a knowledge-distillation-based model, DistilBERT, to address these two issues. Our final lightweight model DistilFace achieves an average of 72.1 in Spearman's correlation on STS tasks, a 34.2 percent improvement over BERT base.
Submitted 22 January, 2024;
originally announced January 2024.
-
BOK-VQA: Bilingual outside Knowledge-Based Visual Question Answering via Graph Representation Pretraining
Authors:
Minjun Kim,
Seungwoo Song,
Youhan Lee,
Haneol Jang,
Kyungtae Lim
Abstract:
The current research direction in generative models, such as the recently developed GPT4, aims to find relevant knowledge information for multimodal and multilingual inputs to provide answers. Under these research circumstances, the demand for multilingual evaluation of visual question answering (VQA) tasks, a representative task of multimodal systems, has increased. Accordingly, we propose a bilingual outside-knowledge VQA (BOK-VQA) dataset in this study that can be extended to multilingualism. The proposed data include 17K images, 17K question-answer pairs for both Korean and English and 280K instances of knowledge information related to question-answer content. We also present a framework that can effectively inject knowledge information into a VQA system by pretraining the knowledge information of BOK-VQA data in the form of graph embeddings. Finally, through in-depth analysis, we demonstrated the actual effect of the knowledge information contained in the constructed training data on VQA.
Submitted 15 March, 2024; v1 submitted 12 January, 2024;
originally announced January 2024.
-
Assessing AI Detectors in Identifying AI-Generated Code: Implications for Education
Authors:
Wei Hung Pan,
Ming Jie Chok,
Jonathan Leong Shan Wong,
Yung Xin Shin,
Yeong Shian Poon,
Zhou Yang,
Chun Yong Chong,
David Lo,
Mei Kuan Lim
Abstract:
Educators are increasingly concerned about the usage of Large Language Models (LLMs) such as ChatGPT in programming education, particularly regarding the potential exploitation of imperfections in Artificial Intelligence Generated Content (AIGC) Detectors for academic misconduct. In this paper, we present an empirical study where the LLM is examined for its attempts to bypass detection by AIGC Detectors. This is achieved by generating code in response to a given question using different variants. We collected a dataset comprising 5,069 samples, with each sample consisting of a textual description of a coding problem and its corresponding human-written Python solution codes. These samples were obtained from various sources, including 80 from Quescol, 3,264 from Kaggle, and 1,725 from LeetCode. From the dataset, we created 13 sets of code problem variant prompts, which were used to instruct ChatGPT to generate the outputs. Subsequently, we assessed the performance of five AIGC detectors. Our results demonstrate that existing AIGC Detectors perform poorly in distinguishing between human-written code and AI-generated code.
Submitted 8 January, 2024;
originally announced January 2024.
-
Causal Discovery for fMRI data: Challenges, Solutions, and a Case Study
Authors:
Eric Rawls,
Bryan Andrews,
Kelvin Lim,
Erich Kummerfeld
Abstract:
Designing studies that apply causal discovery requires navigating many researcher degrees of freedom. This complexity is exacerbated when the study involves fMRI data. In this paper we (i) describe nine challenges that occur when applying causal discovery to fMRI data, (ii) discuss the space of decisions that need to be made, (iii) review how a recent case study made those decisions, and (iv) identify existing gaps that could potentially be solved by the development of new methods. Overall, causal discovery is a promising approach for analyzing fMRI data, and multiple successful applications have indicated that it is superior to traditional fMRI functional connectivity methods, but current causal discovery methods for fMRI leave room for improvement.
Submitted 19 December, 2023;
originally announced December 2023.
-
Appropriate State-Dependent Friction Coefficient Accelerates Kinetic Langevin Dynamics
Authors:
Keunwoo Lim,
Molei Tao
Abstract:
We consider the convergence of kinetic Langevin dynamics to its ergodic invariant measure, which is the Gibbs distribution. Instead of the standard setup where the friction coefficient is a constant scalar, we investigate a position-dependent friction coefficient and the possible accelerated convergence it enables. We show that by choosing this coefficient matrix to be $2\sqrt{\text{Hess}V}$, convergence is accelerated in the sense that no constant scalar friction coefficient can lead to faster convergence for a large subset of (nonlinear) strongly convex potentials $V$. The speed of convergence is quantified in terms of chi-square divergence from the target distribution, and proved using a Lyapunov approach, based on viewing sampling as optimization in the infinite-dimensional space of probability distributions.
Submitted 30 June, 2024; v1 submitted 12 December, 2023;
originally announced December 2023.
-
Utilizing Language Models for Tour Itinerary Recommendation
Authors:
Ngai Lam Ho,
Kwan Hui Lim
Abstract:
Tour itinerary recommendation involves planning a sequence of relevant Points-of-Interest (POIs), which combines challenges from the fields of both Operations Research (OR) and Recommendation Systems (RS). As an OR problem, there is the need to maximize a certain utility (e.g., popularity of POIs in the tour) while adhering to some constraints (e.g., maximum time for the tour). As an RS problem, it is closely related to the problem of filtering or ranking a subset of POIs that are relevant to a user and recommending it as part of an itinerary. In this paper, we explore the use of language models for the task of tour itinerary recommendation and planning. This task has the unique requirement of recommending personalized POIs relevant to users and planning these POIs as an itinerary that satisfies various constraints. We discuss some approaches in this area, such as using word embedding techniques like Word2Vec and GloVe for learning POI embeddings and transformer-based techniques like BERT for generating itineraries.
Submitted 21 November, 2023;
originally announced November 2023.
-
SBTRec- A Transformer Framework for Personalized Tour Recommendation Problem with Sentiment Analysis
Authors:
Ngai Lam Ho,
Roy Ka-Wei Lee,
Kwan Hui Lim
Abstract:
When traveling to an unfamiliar city for holidays, tourists often rely on guidebooks, travel websites, or recommendation systems to plan their daily itineraries and explore popular points of interest (POIs). However, these approaches may lack optimization in terms of time feasibility, localities, and user preferences. In this paper, we propose the SBTRec algorithm: a BERT-based Trajectory Recommendation with sentiment analysis, for recommending personalized sequences of POIs as itineraries. The key contributions of this work include analyzing users' check-ins and uploaded photos to understand the relationship between POI visits and distance. We introduce SBTRec, which encompasses sentiment analysis to improve recommendation accuracy by understanding users' preferences and satisfaction levels from reviews and comments about different POIs. Our proposed algorithms are evaluated against other sequence prediction methods using datasets from 8 cities. The results demonstrate that SBTRec achieves an average F1 score of 61.45%, outperforming baseline algorithms.
The paper further discusses the flexibility of the SBTRec algorithm, its ability to adapt to different scenarios and cities without modification, and its potential for extension by incorporating additional information for more reliable predictions. Overall, SBTRec provides personalized and relevant POI recommendations, enhancing tourists' overall trip experiences. Future work includes fine-tuning personalized embeddings for users, with evaluation of users' comments on POIs, to further enhance prediction accuracy.
Submitted 18 November, 2023;
originally announced November 2023.
-
BTRec: BERT-Based Trajectory Recommendation for Personalized Tours
Authors:
Ngai Lam Ho,
Roy Ka-Wei Lee,
Kwan Hui Lim
Abstract:
An essential task for tourists having a pleasant holiday is to have a well-planned itinerary with relevant recommendations, especially when visiting unfamiliar cities. Many tour recommendation tools only take into account a limited number of factors, such as popular Points of Interest (POIs) and routing constraints. Consequently, the solutions they provide may not always align with the individual users of the system. We propose an iterative algorithm in this paper, namely: BTREC (BERT-based Trajectory Recommendation), that extends from the POIBERT embedding algorithm to recommend personalized itineraries on POIs using the BERT framework. Our BTREC algorithm incorporates users' demographic information alongside past POI visits into a modified BERT language model to recommend a personalized POI itinerary prediction given a pair of source and destination POIs. Our recommendation system can create a travel itinerary that maximizes POIs visited, while also taking into account user preferences for categories of POIs and time availability. Our recommendation algorithm is largely inspired by the problem of sentence completion in natural language processing (NLP). Using a dataset of eight cities of different sizes, our experimental results demonstrate that our proposed algorithm is stable and outperforms many other sequence prediction algorithms, measured by recall, precision, and F1-scores.
Submitted 30 October, 2023;
originally announced October 2023.
-
Exponentially faster preparation of quantum dimers via driven-dissipative stabilization
Authors:
Kian Hwee Lim,
Wai-Keong Mok,
Jia-Bin You,
Jian Feng Kong,
Davit Aghamalyan
Abstract:
We propose a novel rapid, high-fidelity, and noise-resistant scheme to generate many-body entanglement between multiple qubits stabilized by dissipation into a 1D bath. Using a carefully designed time-dependent drive, our scheme achieves a provably exponential speedup over state-of-the-art dissipative stabilization schemes in 1D baths, which require a timescale that diverges as the target fidelity approaches unity and scales exponentially with the number of qubits. To prepare quantum dimer pairs, our scheme only requires local 2-qubit control Hamiltonians, with a protocol time that is independent of system size. This provides a scalable and robust protocol for generating a large number of entangled dimer pairs on-demand, serving as a fundamental resource for many quantum metrology and quantum information processing tasks.
Submitted 29 July, 2024; v1 submitted 22 September, 2023;
originally announced September 2023.
-
Scalar Hairy Black Holes with Inverted Mexican Hat Potential
Authors:
Xiao Yan Chew,
Kok-Geng Lim
Abstract:
We numerically construct the asymptotically flat solutions of hairy black holes supported by a symmetric inverted Mexican hat potential with a local minimum and two degenerate global maxima of a real scalar field that contains a quartic self-interaction term. The solutions of hairy black holes emerge from the Schwarzschild black hole when the non-trivial scalar field exists outside the event horizon. Therefore, we perform a comprehensive study on the properties of the hairy black holes such as the area of horizon, the Hawking temperature, the innermost stable circular orbit, the photon sphere, etc. We also numerically study their linear stability in the mode analysis, hence finding that they are unstable against the linear perturbation.
Submitted 26 July, 2023;
originally announced July 2023.
-
Anti-noise window: Subjective perception of active noise reduction and effect of informational masking
Authors:
Bhan Lam,
Kelvin Chee Quan Lim,
Kenneth Ooi,
Zhen-Ting Ong,
Dongyuan Shi,
Woon-Seng Gan
Abstract:
Reviving natural ventilation (NV) for urban sustainability presents challenges for indoor acoustic comfort. Active control and interference-based noise mitigation strategies, such as the use of loudspeakers, offer potential solutions to achieve acoustic comfort while maintaining NV. However, these approaches are not commonly integrated or evaluated from a perceptual standpoint. This study examines the perceptual and objective aspects of an active-noise-control (ANC)-based "anti-noise" window (ANW) and its integration with informational masking (IM) in a model bedroom. Forty participants assessed the ANW in a three-way interaction involving noise types (traffic, train, and aircraft), maskers (bird, water), and ANC (on, off). The evaluation focused on perceived annoyance (PAY; ISO/TS 15666), perceived affective quality (ISO/TS 12913-2), loudness (PLN), and included an open-ended qualitative assessment. Despite minimal objective reduction in decibel-based indicators and a slight increase in psychoacoustic sharpness, the ANW alone demonstrated significant reductions in PAY and PLN, as well as an improvement in ISO pleasantness across all noise types. The addition of maskers generally enhanced overall acoustic comfort, although water masking led to increased PLN. Furthermore, the combination of ANC with maskers showed interaction effects, with both maskers significantly reducing PAY compared to ANC alone.
Submitted 8 July, 2023;
originally announced July 2023.
-
PAtt-Lite: Lightweight Patch and Attention MobileNet for Challenging Facial Expression Recognition
Authors:
Jia Le Ngwe,
Kian Ming Lim,
Chin Poo Lee,
Thian Song Ong
Abstract:
Facial Expression Recognition (FER) is a machine learning problem that deals with recognizing human facial expressions. While existing work has achieved performance improvements in recent years, FER in the wild and under challenging conditions remains a challenge. In this paper, a lightweight patch and attention network based on MobileNetV1, referred to as PAtt-Lite, is proposed to improve FER performance under challenging conditions. A truncated ImageNet-pre-trained MobileNetV1 is utilized as the backbone feature extractor of the proposed method. In place of the truncated layers is a patch extraction block that is proposed for extracting significant local facial features to enhance the representation from MobileNetV1, especially under challenging conditions. An attention classifier is also proposed to improve the learning of these patched feature maps from the extremely lightweight feature extractor. The experimental results on public benchmark databases proved the effectiveness of the proposed method. PAtt-Lite achieved state-of-the-art results on CK+, RAF-DB, FER2013, FERPlus, and the challenging conditions subsets for RAF-DB and FERPlus.
Submitted 13 August, 2024; v1 submitted 16 June, 2023;
originally announced June 2023.
-
Synthesizing Speech Test Cases with Text-to-Speech? An Empirical Study on the False Alarms in Automated Speech Recognition Testing
Authors:
Julia Kaiwen Lau,
Kelvin Kai Wen Kong,
Julian Hao Yong,
Per Hoong Tan,
Zhou Yang,
Zi Qian Yong,
Joshua Chern Wey Low,
Chun Yong Chong,
Mei Kuan Lim,
David Lo
Abstract:
Recent studies have proposed the use of Text-To-Speech (TTS) systems to automatically synthesise speech test cases at scale and uncover a large number of failures in ASR systems. However, the failures uncovered by synthetic test cases may not reflect the actual performance of an ASR system when it transcribes human audio; we refer to such failures as false alarms. Given a failed test case synthesised from TTS systems, which consists of TTS-generated audio and the corresponding ground truth text, we feed the human audio stating the same text to an ASR system. If the human audio can be correctly transcribed, an instance of a false alarm is detected. In this study, we investigate false alarm occurrences in five popular ASR systems using synthetic audio generated from four TTS systems and human audio obtained from two commonly used datasets. Our results show that the fewest false alarms are identified when testing Deepspeech, and the number of false alarms is the highest when testing Wav2vec2. On average, false alarm rates range from 21% to 34% in all five ASR systems. Among the four TTS systems used, Google TTS produces the fewest false alarms (17%), and Espeak TTS produces the most (32%). Additionally, we build a false alarm estimator that flags potential false alarms, which achieves promising results: a precision of 98.3%, a recall of 96.4%, an accuracy of 98.5%, and an F1 score of 97.3%. Our study provides insight into the appropriate selection of TTS systems to generate high-quality speech to test ASR systems. Additionally, a false alarm estimator can be a way to minimise the impact of false alarms and help developers choose suitable test inputs when evaluating ASR systems. The source code used in this paper is publicly available on GitHub at https://github.com/julianyonghao/FAinASRtest.
Submitted 18 July, 2023; v1 submitted 27 May, 2023;
originally announced May 2023.
-
K-UniMorph: Korean Universal Morphology and its Feature Schema
Authors:
Eunkyul Leah Jo,
Kyuwon Kim,
Xihan Wu,
KyungTae Lim,
Jungyeul Park,
Chulwoo Park
Abstract:
We present in this work a new Universal Morphology dataset for Korean. Previously, the Korean language has been underrepresented in the field of morphological paradigms amongst hundreds of diverse world languages. Hence, we propose these Universal Morphological paradigms for the Korean language that preserve its distinct characteristics. For our K-UniMorph dataset, we outline each grammatical criterion in detail for the verbal endings, clarify how to extract inflected forms, and demonstrate how we generate the morphological schemata. This dataset adopts the morphological feature schema from Sylak-Glassman et al. (2015) and Sylak-Glassman (2016) for the Korean language as we extract inflected verb forms from the Sejong morphologically analyzed corpus, which is one of the largest annotated corpora for Korean. During the data creation, our methodology also includes investigating the correctness of the conversion from the Sejong corpus. Furthermore, we carry out the inflection task using three different Korean word forms: letters, syllables and morphemes. Finally, we discuss and describe future perspectives on Korean morphological paradigms and the dataset.
Submitted 17 May, 2023; v1 submitted 10 May, 2023;
originally announced May 2023.
-
Korean Named Entity Recognition Based on Language-Specific Features
Authors:
Yige Chen,
KyungTae Lim,
Jungyeul Park
Abstract:
In the paper, we propose a novel way of improving named entity recognition in the Korean language using its language-specific features. While the field of named entity recognition has been studied extensively in recent years, the mechanism of efficiently recognizing named entities in Korean has hardly been explored. This is because the Korean language has distinct linguistic properties that prevent models from achieving their best performances. Therefore, an annotation scheme for Korean corpora by adopting the CoNLL-U format, which decomposes Korean words into morphemes and reduces the ambiguity of named entities in the original segmentation that may contain functional morphemes such as postpositions and particles, is proposed herein. We investigate how the named entity tags are best represented in this morpheme-based scheme and implement an algorithm to convert word-based and syllable-based Korean corpora with named entities into the proposed morpheme-based format. Analyses of the results of statistical and neural models reveal that the proposed morpheme-based format is feasible, and the varied performances of the models under the influence of various additional language-specific features are demonstrated. Extrinsic conditions were also considered to observe the variance of the performances of the proposed models, given different types of data, including the original segmentation and different types of tagging formats.
Submitted 10 May, 2023;
originally announced May 2023.
-
Effect of triangularity on plasma turbulence and the SOL-width scaling in L-mode diverted tokamak configurations
Authors:
Kyungtak Lim,
Maurizio Giacomin,
Paolo Ricci,
António Coelho,
Olivier Février,
Davide Mancini,
Davide Silvagni,
Louis Stenger
Abstract:
The effect of triangularity on tokamak boundary plasma turbulence is investigated by using global, flux-driven, three-dimensional, two-fluid simulations. The simulations show that negative triangularity stabilizes boundary plasma turbulence, and linear investigations reveal that this is due to a reduction of the magnetic curvature drive of interchange instabilities, such as the resistive ballooning mode. As a consequence, the pressure decay length $L_p$, related to the SOL power fall-off length $\lambda_q$, is found to be affected by triangularity. Leveraging considerations on the effect of triangularity on the linear growth rate and nonlinear evolution of the resistive ballooning mode, the analytical theory-based scaling law for $L_p$ in L-mode plasmas, derived by Giacomin et al. [Nucl. Fusion 61, 076002 (2021), https://doi.org/10.1088/1741-4326/abf8f6], is extended to include the effect of triangularity. The scaling is in agreement with nonlinear simulations and a multi-machine experimental database, which includes recent TCV discharges dedicated to the study of the effect of triangularity in L-mode diverted discharges. Overall, the present results highlight that negative triangularity narrows $L_p$ and that considering the effect of triangularity is important for a reliable extrapolation of $\lambda_q$ from present experiments to larger devices.
Submitted 25 April, 2023;
originally announced April 2023.
-
Self-consistent gyrokinetic modelling of turbulent and neoclassical tungsten transport in toroidally rotating plasmas
Authors:
Kyungtak Lim,
Xavier Garbet,
Yanick Sarazin,
Etienne Gravier,
Maxime Lesur,
Guillaume Lo-Cascio,
Timothe Rouyer
Abstract:
The effect of toroidal rotation on both turbulent and neoclassical transport of tungsten (W) in tokamaks is investigated using the flux-driven, global, nonlinear 5D gyrokinetic code GYSELA. Nonlinear simulations are carried out with different levels of momentum injection that drive W to the supersonic regime, while the toroidal velocity of the main ions remains in the subsonic regime. The numerical simulations demonstrate that toroidal rotation induces centrifugal forces that cause W to accumulate in the outboard region, generating an in-out poloidal asymmetry. This asymmetry enhances neoclassical inward convection, which can lead to central accumulation of W in cases of strong plasma rotation. The core accumulation of W is mainly driven by inward neoclassical convection. However, as momentum injection continues, roto-diffusion, proportional to the radial gradient of the toroidal velocity, becomes significant and generates outward turbulent flux in the case of ion temperature gradient (ITG) turbulence. Overall, the numerical results from nonlinear GYSELA simulations are in qualitative agreement with the theoretical predictions for impurity transport, as well as experimental observations.
Submitted 25 April, 2023;
originally announced April 2023.
-
Optimizing Group Utility in Itinerary Planning: A Strategic and Crowd-Aware Approach
Authors:
Junhua Liu,
Kwan Hui Lim,
Kristin L. Wood,
Menglin Li
Abstract:
Itinerary recommendation is a complex sequence prediction problem with numerous real-world applications. This task becomes even more challenging when considering the optimization of multiple user queuing times and crowd levels, as well as numerous involved parameters, such as attraction popularity, queuing time, walking time, and operating hours. Existing solutions typically focus on single-person perspectives and fail to address real-world issues resulting from natural crowd behavior, like the Selfish Routing problem. In this paper, we introduce the Strategic and Crowd-Aware Itinerary Recommendation (SCAIR) algorithm, which optimizes group utility in real-world settings. We model the route recommendation strategy as a Markov Decision Process and propose a State Encoding mechanism that enables real-time planning and allocation in linear time. We evaluate our algorithm against various competitive and realistic baselines using a theme park dataset, demonstrating that SCAIR outperforms these baselines in addressing the Selfish Routing problem across four theme parks.
Submitted 10 September, 2023; v1 submitted 4 April, 2023;
originally announced April 2023.
-
Hybrid Computing for Interactive Datacenter Applications
Authors:
Pratyush Patel,
Katie Lim,
Kushal Jhunjhunwalla,
Ashlie Martinez,
Max Demoulin,
Jacob Nelson,
Irene Zhang,
Thomas Anderson
Abstract:
Field-Programmable Gate Arrays (FPGAs) are more energy efficient and cost effective than CPUs for a wide variety of datacenter applications. Yet, for latency-sensitive and bursty workloads, this advantage can be difficult to harness due to high FPGA spin-up costs. We propose that a hybrid FPGA and CPU computing framework can harness the energy efficiency benefits of FPGAs for such workloads at reasonable cost. Our key insight is to use FPGAs for stable-state workload and CPUs for short-term workload bursts. Using this insight, we design Spork, a lightweight hybrid scheduler that can realize these energy efficiency and cost benefits in practice. Depending on the desired objective, Spork can trade off energy efficiency for cost reduction and vice versa. It is parameterized with key differences between FPGAs and CPUs in terms of power draw, performance, cost, and spin-up latency. We vary this parameter space and analyze various application and worker configurations on production and synthetic traces. Our evaluation of cloud workloads shows that energy-optimized Spork is not only more energy efficient but it is also cheaper than homogeneous platforms--for short application requests with tight deadlines, it is 1.53x more energy efficient and 2.14x cheaper than using only FPGAs. Relative to an idealized version of an existing cost-optimized hybrid scheduler, energy-optimized Spork provides 1.2-2.4x higher energy efficiency at comparable cost, while cost-optimized Spork provides 1.1-2x higher energy efficiency at 1.06-1.2x lower cost.
Submitted 10 April, 2023;
originally announced April 2023.
-
Robustness Evaluation in Hand Pose Estimation Models using Metamorphic Testing
Authors:
Muxin Pu,
Chun Yong Chong,
Mei Kuan Lim
Abstract:
Hand pose estimation (HPE) is a task that predicts and describes the hand poses from images or video frames. When HPE models estimate hand poses captured in a laboratory or under controlled environments, they normally deliver good performance. However, the real-world environment is complex, and various uncertainties may arise, which could degrade the performance of HPE models. For example, the hands could be occluded, the visibility of hands could be reduced by imperfect exposure rate, and the contours of hands are prone to blurring during fast hand movements. In this work, we adopt metamorphic testing to evaluate the robustness of HPE models and provide suggestions on the choice of HPE models for different applications. The robustness evaluation was conducted on four state-of-the-art models, namely MediaPipe hands, OpenPose, BodyHands, and NSRM hand. We found that on average more than 80\% of the hands could not be identified by BodyHands, and at least 50\% of hands could not be identified by MediaPipe hands when diagonal motion blur is introduced, while an average of more than 50\% of strongly underexposed hands could not be correctly estimated by NSRM hand. Similarly, applying occlusions on only four hand joints also largely degrades the performance of these models. The experimental results show that occlusions, illumination variations, and motion blur are the main obstacles to the performance of existing HPE models. These findings may pave the way for researchers to improve the performance and robustness of hand pose estimation models and their applications.
Submitted 8 March, 2023;
originally announced March 2023.
-
SkillRec: A Data-Driven Approach to Job Skill Recommendation for Career Insights
Authors:
Xiang Qian Ong,
Kwan Hui Lim
Abstract:
Understanding the skill sets and knowledge required for any career is of utmost importance, but it is increasingly challenging in today's dynamic world with rapid changes in terms of the tools and techniques used. Thus, it is especially important to be able to accurately identify the required skill sets for any job for better career insights and development. In this paper, we propose and develop the Skill Recommendation (SkillRec) system for recommending the relevant job skills required for a given job based on the job title. SkillRec collects and identifies the skill set required for a job based on the job descriptions published by companies hiring for these roles. In addition to the data collection and pre-processing capabilities, SkillRec also utilises word/sentence embedding techniques for job title representation, alongside a feed-forward neural network for job skill recommendation based on the job title representation. Based on our preliminary experiments on a dataset of 6,000 job titles and descriptions, SkillRec shows promising performance in terms of accuracy and F1-score.
Submitted 20 February, 2023;
originally announced February 2023.
-
ASDF: A Differential Testing Framework for Automatic Speech Recognition Systems
Authors:
Daniel Hao Xian Yuen,
Andrew Yong Chen Pang,
Zhou Yang,
Chun Yong Chong,
Mei Kuan Lim,
David Lo
Abstract:
Recent years have witnessed wider adoption of Automated Speech Recognition (ASR) techniques in various domains. Consequently, evaluating and enhancing the quality of ASR systems is of great importance. This paper proposes ASDF, an Automated Speech Recognition Differential Testing Framework for testing ASR systems. ASDF extends an existing ASR testing tool, CrossASR++, which synthesizes test cases from a text corpus. However, CrossASR++ fails to make use of the text corpus efficiently and provides limited information on how the failed test cases can improve ASR systems. To address these limitations, our tool incorporates two novel features: (1) a text transformation module to boost the number of generated test cases and uncover more errors in ASR systems and (2) a phonetic analysis module to identify the phonemes on which the ASR system tends to produce errors. ASDF generates more high-quality test cases by applying various text transformation methods (e.g., change tense) to the texts in failed test cases. By doing so, ASDF can utilize a small text corpus to generate a large number of audio test cases, something which CrossASR++ is not capable of. In addition, ASDF implements more metrics to evaluate the performance of ASR systems from multiple perspectives. ASDF performs phonetic analysis on the identified failed test cases to identify the phonemes that ASR systems tend to transcribe incorrectly, providing useful information for developers to improve ASR systems. The demonstration video of our tool is available online at https://www.youtube.com/watch?v=DzVwfc3h9As. The implementation is available at https://github.com/danielyuenhx/asdf-differential-testing.
Submitted 10 February, 2023;
originally announced February 2023.
-
Partial fillings of the bosonic $E_8$ quantum Hall state
Authors:
Pak Kau Lim,
Michael Mulligan,
Jeffrey C. Y. Teo
Abstract:
We study bosonic topological phases constructed from electrons. In addition to a bulk excitation energy gap, these bosonic phases also have a fermion energy gap, below which all local excitations in the bulk and on the edge are even combinations of electrons. We focus on chiral phases, in which all low-energy edge excitations move in the same direction, that arise from the short-range entangled $E_8$ quantum Hall state, the bosonic analog of the filled lowest Landau level of electrons. The $E_8$ edge-state theory features an $E_8$ Kac-Moody symmetry that can be decomposed into ${\cal G}_A \times {\cal G}_B$ subalgebras, such as $SU(3) \times E_6$, $SO(M) \times SO(16-M)$, and $G_2 \times F_4$. (Here, $\{SO(M) \}$, $\{SU(N)\}$, and $\{E_8, G_2, F_4 \}$ denote orthogonal, unitary, and exceptional Lie algebras.) Using these symmetry decompositions, we construct exactly solvable coupled-wire model Hamiltonians for families of long-range entangled ${\cal G}_A$ or ${\cal G}_B$ bosonic fractional quantum Hall states that ``partially fill'' the $E_8$ state and are pairwise related by a generalized particle-hole symmetry. These long-range entangled states feature either Abelian or non-Abelian topological order. Some support the emergence of non-local Dirac and Majorana fermions, Ising anyons, metaplectic anyons, Fibonacci anyons, as well as deconfined $\mathbb{Z}_2$ gauge fluxes and charges.
Submitted 24 July, 2023; v1 submitted 30 December, 2022;
originally announced December 2022.
-
POIBERT: A Transformer-based Model for the Tour Recommendation Problem
Authors:
Ngai Lam Ho,
Kwan Hui Lim
Abstract:
Tour itinerary planning and recommendation are challenging problems for tourists visiting unfamiliar cities. Many tour recommendation algorithms only consider factors such as the location and popularity of Points of Interest (POIs) but their solutions may not align well with the user's own preferences and other location constraints. Additionally, these solutions do not take into consideration the users' preferences based on their past POI selections. In this paper, we propose POIBERT, an algorithm for recommending personalized itineraries using the BERT language model on POIs. POIBERT builds upon the highly successful BERT language model with the novel adaptation of a language model to our itinerary recommendation task, alongside an iterative approach to generate consecutive POIs.
Our recommendation algorithm is able to generate a sequence of POIs that optimizes time and users' preference in POI categories based on past trajectories from similar tourists. Our tour recommendation algorithm is modeled by adapting the itinerary recommendation problem to the sentence completion problem in natural language processing (NLP). We also devise an iterative algorithm to generate travel itineraries that satisfy the time constraints and are most likely given past trajectories. Using a Flickr dataset of seven cities, experimental results show that our algorithm out-performs many sequence prediction algorithms based on measures in recall, precision and F1-scores.
Submitted 16 December, 2022;
originally announced December 2022.
-
Software Architecture and System Design of Rubin Observatory
Authors:
William O'Mullane,
Frossie Economou,
Kian-Tat Lim,
Fritz Mueller,
Tim Jenness,
Gregory P. Dubois-Felsmann,
Leanne P. Guy,
Ian S. Sullivan,
Yusra AlSayyad,
John D. Swinbank,
K. Simon Krughoff
Abstract:
Starting from a description of the Rubin Observatory Data Management System Architecture, and drawing on our experience with and involvement in a range of other projects including Gaia, SDSS, UKIRT, and JCMT, we derive a series of generic design patterns and lessons learned.
Submitted 24 November, 2022;
originally announced November 2022.
-
A Transformer-based Framework for POI-level Social Post Geolocation
Authors:
Menglin Li,
Kwan Hui Lim,
Teng Guo,
Junhua Liu
Abstract:
POI-level geo-information of social posts is critical to many location-based applications and services. However, the multi-modality, complexity and diverse nature of social media data and their platforms limit the performance of inferring such fine-grained locations and their subsequent applications. To address this issue, we present a transformer-based general framework, which builds upon pre-trained language models and considers non-textual data, for social post geolocation at the POI level. To this end, inputs are categorized to handle different social data, and an optimal combination strategy is provided for feature representations. Moreover, a uniform representation of hierarchy is proposed to learn temporal information, and a concatenated version of encodings is employed to capture feature-wise positions better. Experimental results on various social datasets demonstrate that three variants of our proposed framework outperform multiple state-of-the-art baselines by a large margin in terms of accuracy and distance error metrics.
Submitted 26 October, 2022;
originally announced November 2022.
-
Universal Evasion Attacks on Summarization Scoring
Authors:
Wenchuan Mu,
Kwan Hui Lim
Abstract:
The automatic scoring of summaries is important as it guides the development of summarizers. Scoring is also complex, as it involves multiple aspects such as fluency, grammar, and even textual entailment with the source text. However, summary scoring has not been considered a machine learning task to study its accuracy and robustness. In this study, we place automatic scoring in the context of regression machine learning tasks and perform evasion attacks to explore its robustness. Attack systems predict a non-summary string from each input, and these non-summary strings achieve competitive scores with good summarizers on the most popular metrics: ROUGE, METEOR, and BERTScore. Attack systems also "outperform" state-of-the-art summarization methods on ROUGE-1 and ROUGE-L, and score the second-highest on METEOR. Furthermore, a BERTScore backdoor is observed: a simple trigger can score higher than any automatic summarization method. The evasion attacks in this work indicate the low robustness of current scoring systems at the system level. We hope that our highlighting of these proposed attacks will facilitate the development of summary scores.
Submitted 25 October, 2022;
originally announced October 2022.
-
Revision for Concision: A Constrained Paraphrase Generation Task
Authors:
Wenchuan Mu,
Kwan Hui Lim
Abstract:
Academic writing should be concise as concise sentences better keep the readers' attention and convey meaning clearly. Writing concisely is challenging, for writers often struggle to revise their drafts. We introduce and formulate revising for concision as a natural language processing task at the sentence level. Revising for concision requires algorithms to use only necessary words to rewrite a sentence while preserving its meaning. The revised sentence should be evaluated according to its word choice, sentence structure, and organization. The revised sentence also needs to fulfil semantic retention and syntactic soundness. To aid these efforts, we curate and make available a benchmark parallel dataset that can depict revising for concision. The dataset contains 536 pairs of sentences before and after revising, and all pairs are collected from college writing centres. We also present and evaluate the approaches to this problem, which may assist researchers in this area.
Submitted 25 October, 2022;
originally announced October 2022.
-
Yet Another Format of Universal Dependencies for Korean
Authors:
Yige Chen,
Eunkyul Leah Jo,
Yundong Yao,
KyungTae Lim,
Miikka Silfverberg,
Francis M. Tyers,
Jungyeul Park
Abstract:
In this study, we propose a morpheme-based scheme for Korean dependency parsing and adapt the proposed scheme to Universal Dependencies. We present the linguistic rationale that illustrates the motivation and the necessity of adopting the morpheme-based format, and develop scripts that convert between the original format used by Universal Dependencies and the proposed morpheme-based format automatically. The effectiveness of the proposed format for Korean dependency parsing is then verified with both statistical and neural models, including UDPipe and Stanza, with our carefully constructed morpheme-based word embedding for Korean. morphUD outperforms parsing results for all Korean UD treebanks, and we also present detailed error analyses.
Submitted 20 September, 2022;
originally announced September 2022.
-
Bi-color atomic beam slower and magnetic field compensation for ultracold gases
Authors:
Jianing Li,
Kelvin Lim,
Swarup Das,
Thomas Zanon-Willette,
Chen-Hao Feng,
Paul Robert,
Andrea Bertoldi,
Philippe Bouyer,
Chang Chi Kwong,
Shau-Yu Lan,
David Wilkowski
Abstract:
Transversely loaded bidimensional-magneto-optical-traps (2D-MOT) have been recently developed as high flux sources for cold strontium atoms to realize a new generation of compact experimental setups. Here, we discuss the implementation of a cross-polarized bi-color slower for a strontium atomic beam, which improves the 2D-MOT loading and increases the number of atoms in a final MOT by eleven times. Our slowing scheme addresses simultaneously two excited Zeeman substates of the 88Sr 1S0->1P1 transition at 461 nm. We also realized a 3-axis active feedback control of the magnetic field down to the microgauss regime. This compensation is performed using a network of eight magnetic field probes arranged in a cuboid configuration around the cold atomic sample, and a pair of coils in Helmholtz configuration along each of three Cartesian directions. Our active feedback is capable of efficiently suppressing most of the magnetically-induced position fluctuations of the 689 nm intercombination-line MOT.
Submitted 5 January, 2023; v1 submitted 18 September, 2022;
originally announced September 2022.
-
Oscillating bound states in non-Markovian photonic lattices
Authors:
Kian Hwee Lim,
Wai-Keong Mok,
Leong-Chuan Kwek
Abstract:
It is known that the superposition of two bound states in the continuum (BIC) leads to the phenomenon of an oscillating bound state, where excitations mediated by the continuum modes oscillate persistently. We perform exact calculations for the oscillating BICs in a 1D photonic lattice coupled to a "giant atom" at multiple points. Our work is significantly distinct from previous proposals of oscillating BICs in continuous waveguide systems due to the presence of a finite energy band contributing band-edge effects. In particular, we show that the bound states outside the energy band are detrimental to the oscillating BIC phenomenon, and can be suppressed by increasing either the number of coupling points or the separation between each coupling point. Crucially, non-Markovianity is necessary for the existence of oscillating BIC, and the oscillation amplitude increases with the characteristic delay time of the giant atom interactions. We also propose a novel initialization scheme in the BIC subspace. Our work can be experimentally implemented on current photonic waveguide array platforms and opens up new prospects in utilizing reservoir engineering for the storage of quantum information in photonic lattices.
Submitted 17 February, 2023; v1 submitted 23 August, 2022;
originally announced August 2022.