Search | arXiv e-print repository

arXiv:2408.10538 [pdf, other]

Surgical Workflow Recognition and Blocking Effectiveness Detection in Laparoscopic Liver Resections with Pringle Maneuver

Authors: Diandian Guo, Weixin Si, Zhixi Li, Jialun Pei, Pheng-Ann Heng

Abstract: Pringle maneuver (PM) in laparoscopic liver resection aims to reduce blood loss and provide a clear surgical view by intermittently blocking blood inflow of the liver, whereas prolonged PM may cause ischemic injury. To comprehensively monitor this surgical procedure and provide timely warnings of ineffective and prolonged blocking, we suggest two complementary AI-assisted surgical monitoring tasks… ▽ More Pringle maneuver (PM) in laparoscopic liver resection aims to reduce blood loss and provide a clear surgical view by intermittently blocking blood inflow of the liver, whereas prolonged PM may cause ischemic injury. To comprehensively monitor this surgical procedure and provide timely warnings of ineffective and prolonged blocking, we suggest two complementary AI-assisted surgical monitoring tasks: workflow recognition and blocking effectiveness detection in liver resections. The former presents challenges in real-time capturing of short-term PM, while the latter involves the intraoperative discrimination of long-term liver ischemia states. To address these challenges, we meticulously collect a novel dataset, called PmLR50, consisting of 25,037 video frames covering various surgical phases from 50 laparoscopic liver resection procedures. Additionally, we develop an online baseline for PmLR50, termed PmNet. This model embraces Masked Temporal Encoding (MTE) and Compressed Sequence Modeling (CSM) for efficient short-term and long-term temporal information modeling, and embeds Contrastive Prototype Separation (CPS) to enhance action discrimination between similar intraoperative operations. Experimental results demonstrate that PmNet outperforms existing state-of-the-art surgical workflow recognition methods on the PmLR50 benchmark. Our research offers potential clinical applications for the laparoscopic liver surgery community. Source code and data will be publicly available. △ Less

Submitted 21 August, 2024; v1 submitted 20 August, 2024; originally announced August 2024.

arXiv:2407.06955 [pdf, other]

ICLGuard: Controlling In-Context Learning Behavior for Applicability Authorization

Authors: Wai Man Si, Michael Backes, Yang Zhang

Abstract: In-context learning (ICL) is a recent advancement in the capabilities of large language models (LLMs). This feature allows users to perform a new task without updating the model. Concretely, users can address tasks during the inference time by conditioning on a few input-label pair demonstrations along with the test input. It is different than the conventional fine-tuning paradigm and offers more… ▽ More In-context learning (ICL) is a recent advancement in the capabilities of large language models (LLMs). This feature allows users to perform a new task without updating the model. Concretely, users can address tasks during the inference time by conditioning on a few input-label pair demonstrations along with the test input. It is different than the conventional fine-tuning paradigm and offers more flexibility. However, this capability also introduces potential issues. For example, users may use the model on any data without restriction, such as performing tasks with improper or sensitive content, which might violate the model policy or conflict with the model owner's interests. As a model owner, it is crucial to establish a mechanism to control the model's behavior under ICL, depending on the model owner's requirements for various content. To this end, we introduce the concept of "applicability authorization" tailored for LLMs, particularly for ICL behavior, and propose a simple approach, ICLGuard. It is a fine-tuning framework designed to allow the model owner to regulate ICL behavior on different data. ICLGuard preserves the original LLM and fine-tunes only a minimal set of additional trainable parameters to "guard" the LLM. Empirical results show that the guarded LLM can deactivate its ICL ability on target data without affecting its ICL ability on other data and its general functionality across all data. △ Less

Submitted 9 July, 2024; originally announced July 2024.

arXiv:2406.17858 [pdf, other]

Depth-Driven Geometric Prompt Learning for Laparoscopic Liver Landmark Detection

Authors: Jialun Pei, Ruize Cui, Yaoqian Li, Weixin Si, Jing Qin, Pheng-Ann Heng

Abstract: Laparoscopic liver surgery poses a complex intraoperative dynamic environment for surgeons, where remains a significant challenge to distinguish critical or even hidden structures inside the liver. Liver anatomical landmarks, e.g., ridge and ligament, serve as important markers for 2D-3D alignment, which can significantly enhance the spatial perception of surgeons for precise surgery. To facilitat… ▽ More Laparoscopic liver surgery poses a complex intraoperative dynamic environment for surgeons, where remains a significant challenge to distinguish critical or even hidden structures inside the liver. Liver anatomical landmarks, e.g., ridge and ligament, serve as important markers for 2D-3D alignment, which can significantly enhance the spatial perception of surgeons for precise surgery. To facilitate the detection of laparoscopic liver landmarks, we collect a novel dataset called L3D, which comprises 1,152 frames with elaborated landmark annotations from surgical videos of 39 patients across two medical sites. For benchmarking purposes, 12 mainstream detection methods are selected and comprehensively evaluated on L3D. Further, we propose a depth-driven geometric prompt learning network, namely D2GPLand. Specifically, we design a Depth-aware Prompt Embedding (DPE) module that is guided by self-supervised prompts and generates semantically relevant geometric information with the benefit of global depth cues extracted from SAM-based features. Additionally, a Semantic-specific Geometric Augmentation (SGA) scheme is introduced to efficiently merge RGB-D spatial and geometric information through reverse anatomic perception. The experimental results indicate that D2GPLand obtains state-of-the-art performance on L3D, with 63.52% DICE and 48.68% IoU scores. Together with 2D-3D fusion technology, our method can directly provide the surgeon with intuitive guidance information in laparoscopic scenarios. △ Less

Submitted 27 June, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

Comments: This paper has been accepted by MICCAI 2024

arXiv:2406.00485 [pdf]

TacShade A New 3D-printed Soft Optical Tactile Sensor Based on Light, Shadow and Greyscale for Shape Reconstruction

Authors: Zhenyu Lu, Jialong Yang, Haoran Li, Yifan Li, Weiyong Si, Nathan Lepora, Chenguang Yang

Abstract: In this paper, we present the TacShade a newly designed 3D-printed soft optical tactile sensor. The sensor is developed for shape reconstruction under the inspiration of sketch drawing that uses the density of sketch lines to draw light and shadow, resulting in the creation of a 3D-view effect. TacShade, building upon the strengths of the TacTip, a single-camera tactile sensor of large in-depth de… ▽ More In this paper, we present the TacShade a newly designed 3D-printed soft optical tactile sensor. The sensor is developed for shape reconstruction under the inspiration of sketch drawing that uses the density of sketch lines to draw light and shadow, resulting in the creation of a 3D-view effect. TacShade, building upon the strengths of the TacTip, a single-camera tactile sensor of large in-depth deformation and being sensitive to edge and surface following, improves the structure in that the markers are distributed within the gap of papillae pins. Variations in light, dark, and grey effects can be generated inside the sensor through external contact interactions. The contours of the contacting objects are outlined by white markers, while the contact depth characteristics can be indirectly obtained from the distribution of black pins and white markers, creating a 2.5D visualization. Based on the imaging effect, we improve the Shape from Shading (SFS) algorithm to process tactile images, enabling a coarse but fast reconstruction for the contact objects. Two experiments are performed. The first verifies TacShade s ability to reconstruct the shape of the contact objects through one image for object distinction. The second experiment shows the shape reconstruction capability of TacShade for a large panel with ridged patterns based on the location of robots and image splicing technology. △ Less

Submitted 1 June, 2024; originally announced June 2024.

Comments: This paper has been accepted by ICRA 2024

arXiv:2405.08365 [pdf, ps, other]

A Riemannian Proximal Newton-CG Method

Authors: Wen Huang, Wutao Si

Abstract: Recently, a Riemannian proximal Newton method has been developed for optimizing problems in the form of $\min_{x\in\mathcal{M}} f(x) + μ\|x\|_1$, where $\mathcal{M}$ is a compact embedded submanifold and $f(x)$ is smooth. Although this method converges superlinearly locally, global convergence is not guaranteed. The existing remedy relies on a hybrid approach: running a Riemannian proximal gradien… ▽ More Recently, a Riemannian proximal Newton method has been developed for optimizing problems in the form of $\min_{x\in\mathcal{M}} f(x) + μ\|x\|_1$, where $\mathcal{M}$ is a compact embedded submanifold and $f(x)$ is smooth. Although this method converges superlinearly locally, global convergence is not guaranteed. The existing remedy relies on a hybrid approach: running a Riemannian proximal gradient method until the iterate is sufficiently accurate and switching to the Riemannian proximal Newton method. This existing approach is sensitive to the switching parameter. This paper proposes a Riemannian proximal Newton-CG method that merges the truncated conjugate gradient method with the Riemannian proximal Newton method. The global convergence and local superlinear convergence are proven. Numerical experiments show that the proposed method outperforms other state-of-the-art methods. △ Less

Submitted 14 May, 2024; originally announced May 2024.

arXiv:2404.18934 [pdf]

The Visual Experience Dataset: Over 200 Recorded Hours of Integrated Eye Movement, Odometry, and Egocentric Video

Authors: Michelle R. Greene, Benjamin J. Balas, Mark D. Lescroart, Paul R. MacNeilage, Jennifer A. Hart, Kamran Binaee, Peter A. Hausamann, Ronald Mezile, Bharath Shankar, Christian B. Sinnott, Kaylie Capurro, Savannah Halow, Hunter Howe, Mariam Josyula, Annie Li, Abraham Mieses, Amina Mohamed, Ilya Nudnou, Ezra Parkhill, Peter Riley, Brett Schmidt, Matthew W. Shinkle, Wentao Si, Brian Szekely, Joaquin M. Torres , et al. (1 additional authors not shown)

Abstract: We introduce the Visual Experience Dataset (VEDB), a compilation of over 240 hours of egocentric video combined with gaze- and head-tracking data that offers an unprecedented view of the visual world as experienced by human observers. The dataset consists of 717 sessions, recorded by 58 observers ranging from 6-49 years old. This paper outlines the data collection, processing, and labeling protoco… ▽ More We introduce the Visual Experience Dataset (VEDB), a compilation of over 240 hours of egocentric video combined with gaze- and head-tracking data that offers an unprecedented view of the visual world as experienced by human observers. The dataset consists of 717 sessions, recorded by 58 observers ranging from 6-49 years old. This paper outlines the data collection, processing, and labeling protocols undertaken to ensure a representative sample and discusses the potential sources of error or bias within the dataset. The VEDB's potential applications are vast, including improving gaze tracking methodologies, assessing spatiotemporal image statistics, and refining deep neural networks for scene and activity recognition. The VEDB is accessible through established open science platforms and is intended to be a living dataset with plans for expansion and community contributions. It is released with an emphasis on ethical considerations, such as participant privacy and the mitigation of potential biases. By providing a dataset grounded in real-world experiences and accompanied by extensive metadata and supporting code, the authors invite the research community to utilize and contribute to the VEDB, facilitating a richer understanding of visual perception and behavior in naturalistic settings. △ Less

Submitted 13 August, 2024; v1 submitted 15 February, 2024; originally announced April 2024.

Comments: 40 pages, 1 table, 9 figures

arXiv:2404.11304 [pdf]

Dynamic Phasor Modeling of Single-Phase Grid-Forming Converters

Authors: Wenjia Si, Chenming Liu, Steven Liu, Hongchang Li, Chenghui Zhang, Jingyang Fang

Abstract: In modern power systems, grid-forming power converters (GFMCs) have emerged as an enabling technology. However, the modeling of single-phase GFMCs faces new challenges. In particular, the nonlinear orthogonal signal generation unit, crucial for power measurement, still lacks an accurate model. To overcome the challenges, this letter proposes a dynamic phasor model of single-phase GFMCs. Moreover,… ▽ More In modern power systems, grid-forming power converters (GFMCs) have emerged as an enabling technology. However, the modeling of single-phase GFMCs faces new challenges. In particular, the nonlinear orthogonal signal generation unit, crucial for power measurement, still lacks an accurate model. To overcome the challenges, this letter proposes a dynamic phasor model of single-phase GFMCs. Moreover, we linearize the proposed model and perform stability analysis, which confirm that the proposed model is more accurate than existing models. Experimental results validate the improved accuracy of the proposed dynamic phasor model. △ Less

Submitted 17 April, 2024; originally announced April 2024.

arXiv:2402.00199 [pdf, other]

ViTacTip: Design and Verification of a Novel Biomimetic Physical Vision-Tactile Fusion Sensor

Authors: Wen Fan, Haoran Li, Weiyong Si, Shan Luo, Nathan Lepora, Dandan Zhang

Abstract: Tactile sensing is significant for robotics since it can obtain physical contact information during manipulation. To capture multimodal contact information within a compact framework, we designed a novel sensor called ViTacTip, which seamlessly integrates both tactile and visual perception capabilities into a single, integrated sensor unit. ViTacTip features a transparent skin to capture fine feat… ▽ More Tactile sensing is significant for robotics since it can obtain physical contact information during manipulation. To capture multimodal contact information within a compact framework, we designed a novel sensor called ViTacTip, which seamlessly integrates both tactile and visual perception capabilities into a single, integrated sensor unit. ViTacTip features a transparent skin to capture fine features of objects during contact, which can be known as the see-through-skin mechanism. In the meantime, the biomimetic tips embedded in ViTacTip can amplify touch motions during tactile perception. For comparative analysis, we also fabricated a ViTac sensor devoid of biomimetic tips, as well as a TacTip sensor with opaque skin. Furthermore, we develop a Generative Adversarial Network (GAN)-based approach for modality switching between different perception modes, effectively alternating the emphasis between vision and tactile perception modes. We conducted a performance evaluation of the proposed sensor across three distinct tasks: i) grating identification, ii) pose regression, and iii) contact localization and force estimation. In the grating identification task, ViTacTip demonstrated an accuracy of 99.72%, surpassing TacTip, which achieved 94.60%. It also exhibited superior performance in both pose and force estimation tasks with the minimum error of 0.08mm and 0.03N, respectively, in contrast to ViTac's 0.12mm and 0.15N. Results indicate that ViTacTip outperforms single-modality sensors. △ Less

Submitted 31 January, 2024; originally announced February 2024.

Comments: 7 pages, 5 figures, 2024 IEEE International Conference on Robotics and Automation (ICRA 2024)

arXiv:2401.13097 [pdf]

Digital Divides in Scene Recognition: Uncovering Socioeconomic Biases in Deep Learning Systems

Authors: Michelle R. Greene, Mariam Josyula, Wentao Si, Jennifer A. Hart

Abstract: Computer-based scene understanding has influenced fields ranging from urban planning to autonomous vehicle performance, yet little is known about how well these technologies work across social differences. We investigate the biases of deep convolutional neural networks (dCNNs) in scene classification, using nearly one million images from global and US sources, including user-submitted home photogr… ▽ More Computer-based scene understanding has influenced fields ranging from urban planning to autonomous vehicle performance, yet little is known about how well these technologies work across social differences. We investigate the biases of deep convolutional neural networks (dCNNs) in scene classification, using nearly one million images from global and US sources, including user-submitted home photographs and Airbnb listings. We applied statistical models to quantify the impact of socioeconomic indicators such as family income, Human Development Index (HDI), and demographic factors from public data sources (CIA and US Census) on dCNN performance. Our analyses revealed significant socioeconomic bias, where pretrained dCNNs demonstrated lower classification accuracy, lower classification confidence, and a higher tendency to assign labels that could be offensive when applied to homes (e.g., "ruin", "slum"), especially in images from homes with lower socioeconomic status (SES). This trend is consistent across two datasets of international images and within the diverse economic and racial landscapes of the United States. This research contributes to understanding biases in computer vision, emphasizing the need for more inclusive and representative training datasets. By mitigating the bias in the computer vision pipelines, we can ensure fairer and more equitable outcomes for applied computer vision, including home valuation and smart home security systems. There is urgency in addressing these biases, which can significantly impact critical decisions in urban development and resource allocation. Our findings also motivate the development of AI systems that better understand and serve diverse communities, moving towards technology that equitably benefits all sectors of society. △ Less

Submitted 23 January, 2024; originally announced January 2024.

Comments: 20 pages, 3 figures, 3 tables

MSC Class: 68-02 ACM Class: I.2.m

arXiv:2312.17458 [pdf, ps, other]

The conditional Lyapunov exponents and synchronisation of rotating turbulent flows

Authors: Jian Li, Mengdan Tian, Yi Li, Wenwen Si, Huda Khaleel Mohammed

Abstract: The synchronisation between rotating turbulent flows in periodic boxes is investigated numerically. The flows are coupled via a master-slave coupling, taking the Fourier modes with wavenumber below a given value $k_m$ as the master modes. It is found that synchronisation happens when $k_m$ exceeds a threshold value $k_c$, and $k_c$ depends strongly on the forcing scheme. In rotating Kolmogorov flo… ▽ More The synchronisation between rotating turbulent flows in periodic boxes is investigated numerically. The flows are coupled via a master-slave coupling, taking the Fourier modes with wavenumber below a given value $k_m$ as the master modes. It is found that synchronisation happens when $k_m$ exceeds a threshold value $k_c$, and $k_c$ depends strongly on the forcing scheme. In rotating Kolmogorov flows, $k_cη$ does not change with rotation in the range of rotation rates considered, $η$ being the Kolmogorov length scale. Even though the energy spectrum has a steeper slope, the value of $k_cη$ is the same as that found in isotropic turbulence. In flows driven by a forcing term maintaining constant energy injection rate, synchronisation becomes easier when rotation is stronger. $k_cη$ decreases with rotation, and it is reduced significantly for strong rotations when the slope of the energy spectrum approaches $-3$. It is shown that the conditional Lyapunov exponent for a given $k_m$ is reduced by rotation in the flows driven by the second type of forcing, but it increases mildly with rotation for the Kolmogorov flows. The local conditional Lyapunov exponents fluctuate more strongly as rotation is increased, although synchronisation occurs as long as the average conditional Lyapunov exponents are negative. We also look for the relationship between $k_c$ and the energy spectra of the Lyapunov vectors. We find that the spectra always seem to peak around $k_c$, and synchronisation fails when the energy spectra of the conditional Lyapunov vectors have a local maximum in the slaved modes. △ Less

Submitted 28 December, 2023; originally announced December 2023.

arXiv:2311.14685 [pdf, other]

Comprehensive Assessment of Toxicity in ChatGPT

Authors: Boyang Zhang, Xinyue Shen, Wai Man Si, Zeyang Sha, Zeyuan Chen, Ahmed Salem, Yun Shen, Michael Backes, Yang Zhang

Abstract: Moderating offensive, hateful, and toxic language has always been an important but challenging topic in the domain of safe use in NLP. The emerging large language models (LLMs), such as ChatGPT, can potentially further accentuate this threat. Previous works have discovered that ChatGPT can generate toxic responses using carefully crafted inputs. However, limited research has been done to systemati… ▽ More Moderating offensive, hateful, and toxic language has always been an important but challenging topic in the domain of safe use in NLP. The emerging large language models (LLMs), such as ChatGPT, can potentially further accentuate this threat. Previous works have discovered that ChatGPT can generate toxic responses using carefully crafted inputs. However, limited research has been done to systematically examine when ChatGPT generates toxic responses. In this paper, we comprehensively evaluate the toxicity in ChatGPT by utilizing instruction-tuning datasets that closely align with real-world scenarios. Our results show that ChatGPT's toxicity varies based on different properties and settings of the prompts, including tasks, domains, length, and languages. Notably, prompts in creative writing tasks can be 2x more likely than others to elicit toxic responses. Prompting in German and Portuguese can also double the response toxicity. Additionally, we discover that certain deliberately toxic prompts, designed in earlier studies, no longer yield harmful responses. We hope our discoveries can guide model developers to better regulate these AI systems and the users to avoid undesirable outputs. △ Less

Submitted 3 November, 2023; originally announced November 2023.

arXiv:2310.12964 [pdf, other]

PAC Prediction Sets Under Label Shift

Authors: Wenwen Si, Sangdon Park, Insup Lee, Edgar Dobriban, Osbert Bastani

Abstract: Prediction sets capture uncertainty by predicting sets of labels rather than individual labels, enabling downstream decisions to conservatively account for all plausible outcomes. Conformal inference algorithms construct prediction sets guaranteed to contain the true label with high probability. These guarantees fail to hold in the face of distribution shift, which is precisely when reliable uncer… ▽ More Prediction sets capture uncertainty by predicting sets of labels rather than individual labels, enabling downstream decisions to conservatively account for all plausible outcomes. Conformal inference algorithms construct prediction sets guaranteed to contain the true label with high probability. These guarantees fail to hold in the face of distribution shift, which is precisely when reliable uncertainty quantification can be most useful. We propose a novel algorithm for constructing prediction sets with PAC guarantees in the label shift setting. This method estimates the predicted probabilities of the classes in a target domain, as well as the confusion matrix, then propagates uncertainty in these estimates through a Gaussian elimination algorithm to compute confidence intervals for importance weights. Finally, it uses these intervals to construct prediction sets. We evaluate our approach on five datasets: the CIFAR-10, ChestX-Ray and Entity-13 image datasets, the tabular CDC Heart dataset, and the AGNews text dataset. Our algorithm satisfies the PAC guarantee while producing smaller, more informative, prediction sets compared to several baselines. △ Less

Submitted 19 October, 2023; originally announced October 2023.

arXiv:2308.03558 [pdf, other]

Mondrian: Prompt Abstraction Attack Against Large Language Models for Cheaper API Pricing

Authors: Wai Man Si, Michael Backes, Yang Zhang

Abstract: The Machine Learning as a Service (MLaaS) market is rapidly expanding and becoming more mature. For example, OpenAI's ChatGPT is an advanced large language model (LLM) that generates responses for various queries with associated fees. Although these models can deliver satisfactory performance, they are far from perfect. Researchers have long studied the vulnerabilities and limitations of LLMs, suc… ▽ More The Machine Learning as a Service (MLaaS) market is rapidly expanding and becoming more mature. For example, OpenAI's ChatGPT is an advanced large language model (LLM) that generates responses for various queries with associated fees. Although these models can deliver satisfactory performance, they are far from perfect. Researchers have long studied the vulnerabilities and limitations of LLMs, such as adversarial attacks and model toxicity. Inevitably, commercial ML models are also not exempt from such issues, which can be problematic as MLaaS continues to grow. In this paper, we discover a new attack strategy against LLM APIs, namely the prompt abstraction attack. Specifically, we propose Mondrian, a simple and straightforward method that abstracts sentences, which can lower the cost of using LLM APIs. In this approach, the adversary first creates a pseudo API (with a lower established price) to serve as the proxy of the target API (with a higher established price). Next, the pseudo API leverages Mondrian to modify the user query, obtain the abstracted response from the target API, and forward it back to the end user. Our results show that Mondrian successfully reduces user queries' token length ranging from 13% to 23% across various tasks, including text classification, generation, and question answering. Meanwhile, these abstracted queries do not significantly affect the utility of task-specific and general language models like ChatGPT. Mondrian also reduces instruction prompts' token length by at least 11% without compromising output quality. As a result, the prompt abstraction attack enables the adversary to profit without bearing the cost of API development and deployment. △ Less

Submitted 7 August, 2023; originally announced August 2023.

arXiv:2305.07406 [pdf, other]

Two-in-One: A Model Hijacking Attack Against Text Generation Models

Authors: Wai Man Si, Michael Backes, Yang Zhang, Ahmed Salem

Abstract: Machine learning has progressed significantly in various applications ranging from face recognition to text generation. However, its success has been accompanied by different attacks. Recently a new attack has been proposed which raises both accountability and parasitic computing risks, namely the model hijacking attack. Nevertheless, this attack has only focused on image classification tasks. In… ▽ More Machine learning has progressed significantly in various applications ranging from face recognition to text generation. However, its success has been accompanied by different attacks. Recently a new attack has been proposed which raises both accountability and parasitic computing risks, namely the model hijacking attack. Nevertheless, this attack has only focused on image classification tasks. In this work, we broaden the scope of this attack to include text generation and classification models, hence showing its broader applicability. More concretely, we propose a new model hijacking attack, Ditto, that can hijack different text classification tasks into multiple generation ones, e.g., language translation, text summarization, and language modeling. We use a range of text benchmark datasets such as SST-2, TweetEval, AGnews, QNLI, and IMDB to evaluate the performance of our attacks. Our results show that by using Ditto, an adversary can successfully hijack text generation models without jeopardizing their utility. △ Less

Submitted 12 May, 2023; originally announced May 2023.

Comments: To appear in the 32nd USENIX Security Symposium, August 2023, Anaheim, CA, USA

arXiv:2304.13919 [pdf, other]

Detection of Adversarial Physical Attacks in Time-Series Image Data

Authors: Ramneet Kaur, Yiannis Kantaros, Wenwen Si, James Weimer, Insup Lee

Abstract: Deep neural networks (DNN) have become a common sensing modality in autonomous systems as they allow for semantically perceiving the ambient environment given input images. Nevertheless, DNN models have proven to be vulnerable to adversarial digital and physical attacks. To mitigate this issue, several detection frameworks have been proposed to detect whether a single input image has been manipula… ▽ More Deep neural networks (DNN) have become a common sensing modality in autonomous systems as they allow for semantically perceiving the ambient environment given input images. Nevertheless, DNN models have proven to be vulnerable to adversarial digital and physical attacks. To mitigate this issue, several detection frameworks have been proposed to detect whether a single input image has been manipulated by adversarial digital noise or not. In our prior work, we proposed a real-time detector, called VisionGuard (VG), for adversarial physical attacks against single input images to DNN models. Building upon that work, we propose VisionGuard* (VG), which couples VG with majority-vote methods, to detect adversarial physical attacks in time-series image data, e.g., videos. This is motivated by autonomous systems applications where images are collected over time using onboard sensors for decision-making purposes. We emphasize that majority-vote mechanisms are quite common in autonomous system applications (among many other applications), as e.g., in autonomous driving stacks for object detection. In this paper, we investigate, both theoretically and experimentally, how this widely used mechanism can be leveraged to enhance the performance of adversarial detectors. We have evaluated VG* on videos of both clean and physically attacked traffic signs generated by a state-of-the-art robust physical attack. We provide extensive comparative experiments against detectors that have been designed originally for out-of-distribution data and digitally attacked images. △ Less

Submitted 26 April, 2023; originally announced April 2023.

arXiv:2304.04032 [pdf, ps, other]

A Riemannian Proximal Newton Method

Authors: Wutao Si, P. -A. Absil, Wen Huang, Rujun Jiang, Simon Vary

Abstract: In recent years, the proximal gradient method and its variants have been generalized to Riemannian manifolds for solving optimization problems with an additively separable structure, i.e., $f + h$, where $f$ is continuously differentiable, and $h$ may be nonsmooth but convex with computationally reasonable proximal mapping. In this paper, we generalize the proximal Newton method to embedded subman… ▽ More In recent years, the proximal gradient method and its variants have been generalized to Riemannian manifolds for solving optimization problems with an additively separable structure, i.e., $f + h$, where $f$ is continuously differentiable, and $h$ may be nonsmooth but convex with computationally reasonable proximal mapping. In this paper, we generalize the proximal Newton method to embedded submanifolds for solving the type of problem with $h(x) = μ\|x\|_1$. The generalization relies on the Weingarten and semismooth analysis. It is shown that the Riemannian proximal Newton method has a local quadratic convergence rate under certain reasonable assumptions. Moreover, a hybrid version is given by concatenating a Riemannian proximal gradient method and the Riemannian proximal Newton method. It is shown that if the switch parameter is chosen appropriately, then the hybrid method converges globally and also has a local quadratic convergence rate. Numerical experiments on random and synthetic data are used to demonstrate the performance of the proposed methods. △ Less

Submitted 3 April, 2024; v1 submitted 8 April, 2023; originally announced April 2023.

Comments: Updates compared to the previous version: *) the updates for the published version. Additionally updates: *) local quadratic convergence rate *) update the proofs of Lemma 3.3. *) update the proofs of Proposition 3.13

arXiv:2211.09345 [pdf]

More Effective Centrality-Based Attacks on Weighted Networks

Authors: Balume Mburano, Weisheng Si, Qing Cao, Wei Xing Zheng

Abstract: Only when understanding hackers' tactics, can we thwart their attacks. With this spirit, this paper studies how hackers can effectively launch the so-called 'targeted node attacks', in which iterative attacks are staged on a network, and in each iteration the most important node is removed. In the existing attacks for weighted networks, the node importance is typically measured by the centralities… ▽ More Only when understanding hackers' tactics, can we thwart their attacks. With this spirit, this paper studies how hackers can effectively launch the so-called 'targeted node attacks', in which iterative attacks are staged on a network, and in each iteration the most important node is removed. In the existing attacks for weighted networks, the node importance is typically measured by the centralities related to shortest-path lengths, and the attack effectiveness is also measured mostly by length-related metrics. However, this paper argues that flows can better reflect network functioning than shortest-path lengths for those networks with carrying traffic as the main functionality. Thus, this paper proposes metrics based on flows for measuring the node importance and the attack effectiveness, respectively. Our node importance metrics include three flow-based centralities (flow betweenness, current-flow betweenness and current-flow closeness), which have not been proposed for use in the attacks on weighted networks yet. Our attack effectiveness metric is a new one proposed by us based on average network flow. Extensive experiments on both artificial and real-world networks show that the attack methods with our three suggested centralities are more effective than the existing attack methods when evaluated under our proposed attack effectiveness metric. △ Less

Submitted 17 November, 2022; originally announced November 2022.

arXiv:2211.00099 [pdf, other]

doi 10.1145/3550469.3555378

UmeTrack: Unified multi-view end-to-end hand tracking for VR

Authors: Shangchen Han, Po-chen Wu, Yubo Zhang, Beibei Liu, Linguang Zhang, Zheng Wang, Weiguang Si, Peizhao Zhang, Yujun Cai, Tomas Hodan, Randi Cabezas, Luan Tran, Muzaffer Akbay, Tsz-Ho Yu, Cem Keskin, Robert Wang

Abstract: Real-time tracking of 3D hand pose in world space is a challenging problem and plays an important role in VR interaction. Existing work in this space are limited to either producing root-relative (versus world space) 3D pose or rely on multiple stages such as generating heatmaps and kinematic optimization to obtain 3D pose. Moreover, the typical VR scenario, which involves multi-view tracking from… ▽ More Real-time tracking of 3D hand pose in world space is a challenging problem and plays an important role in VR interaction. Existing work in this space are limited to either producing root-relative (versus world space) 3D pose or rely on multiple stages such as generating heatmaps and kinematic optimization to obtain 3D pose. Moreover, the typical VR scenario, which involves multi-view tracking from wide \ac{fov} cameras is seldom addressed by these methods. In this paper, we present a unified end-to-end differentiable framework for multi-view, multi-frame hand tracking that directly predicts 3D hand pose in world space. We demonstrate the benefits of end-to-end differentiabilty by extending our framework with downstream tasks such as jitter reduction and pinch prediction. To demonstrate the efficacy of our model, we further present a new large-scale egocentric hand pose dataset that consists of both real and synthetic data. Experiments show that our system trained on this dataset handles various challenging interactive motions, and has been successfully applied to real-time VR applications. △ Less

Submitted 31 October, 2022; originally announced November 2022.

Comments: SIGGRAPH Asia 2022 Conference Papers, 8 pages

arXiv:2209.03463 [pdf, other]

Why So Toxic? Measuring and Triggering Toxic Behavior in Open-Domain Chatbots

Authors: Wai Man Si, Michael Backes, Jeremy Blackburn, Emiliano De Cristofaro, Gianluca Stringhini, Savvas Zannettou, Yang Zhang

Abstract: Chatbots are used in many applications, e.g., automated agents, smart home assistants, interactive characters in online games, etc. Therefore, it is crucial to ensure they do not behave in undesired manners, providing offensive or toxic responses to users. This is not a trivial task as state-of-the-art chatbot models are trained on large, public datasets openly collected from the Internet. This pa… ▽ More Chatbots are used in many applications, e.g., automated agents, smart home assistants, interactive characters in online games, etc. Therefore, it is crucial to ensure they do not behave in undesired manners, providing offensive or toxic responses to users. This is not a trivial task as state-of-the-art chatbot models are trained on large, public datasets openly collected from the Internet. This paper presents a first-of-its-kind, large-scale measurement of toxicity in chatbots. We show that publicly available chatbots are prone to providing toxic responses when fed toxic queries. Even more worryingly, some non-toxic queries can trigger toxic responses too. We then set out to design and experiment with an attack, ToxicBuddy, which relies on fine-tuning GPT-2 to generate non-toxic queries that make chatbots respond in a toxic manner. Our extensive experimental evaluation demonstrates that our attack is effective against public chatbot models and outperforms manually-crafted malicious queries proposed by previous work. We also evaluate three defense mechanisms against ToxicBuddy, showing that they either reduce the attack performance at the cost of affecting the chatbot's utility or are only effective at mitigating a portion of the attack. This highlights the need for more research from the computer security and online safety communities to ensure that chatbot models do not hurt their users. Overall, we are confident that ToxicBuddy can be used as an auditing tool and that our work will pave the way toward designing more effective defenses for chatbot safety. △ Less

Submitted 9 September, 2022; v1 submitted 7 September, 2022; originally announced September 2022.

Journal ref: Published in ACM CCS 2022. Please cite the CCS version

arXiv:2202.05445 [pdf, other]

doi 10.1103/PhysRevX.12.011022

Neutron spectroscopy evidence on the dual nature of magnetic excitations in a van der Waals metallic ferromagnet Fe$_{2.72}$GeTe$_{2}$

Authors: Song Bao, Wei Wang, Yanyan Shangguan, Zhengwei Cai, Zhao-Yang Dong, Zhentao Huang, Wenda Si, Zhen Ma, Ryoichi Kajimoto, Kazuhiko Ikeuchi, Shin-ichiro Yano, Shun-Li Yu, Xiangang Wan, Jian-Xin Li, Jinsheng Wen

Abstract: In the local or itinerant extreme, magnetic excitations can be described by the Heisenberg model which treats electron spins as localized moments, or by the itinerant-electron model where the exchange interaction between electrons leads to unequal numbers of electrons with up and down spins. However, it has been elusive when both local moments and itinerant electrons are present in the intermediat… ▽ More In the local or itinerant extreme, magnetic excitations can be described by the Heisenberg model which treats electron spins as localized moments, or by the itinerant-electron model where the exchange interaction between electrons leads to unequal numbers of electrons with up and down spins. However, it has been elusive when both local moments and itinerant electrons are present in the intermediate range. Using inelastic neutron scattering, we provide direct spectroscopic evidence on the coexistence of and interplay between local moments and itinerant electrons in a van der Waals metallic ferromagnet Fe$_{2.72}$GeTe$_{2}$, which can sustain tunable room-temperature ferromagnetism down to the monolayer limit. We find that there exist ferromagnetic spin-wave excitations dispersing from the zone center at low energies resulting from local moments, and a column-like broad continuum at the zone boundary at high energies up to over 100 meV resulting from itinerant electrons. Unlike the two-dimensional crystal structure, the low-energy mode exhibits a three-dimensional nature, and the high-energy mode also has an out-of-plane dependence. Both modes persist well above the Curie temperature of 160 K. Our neutron spectroscopic data reveal that the low-energy spin waves at 100 K are more coherent than those at 4 K, which is evidence of the weakening of the Kondo screening at high temperatures. These results unambiguously demonstrate the coexistence of local moments and itinerant electrons, and the Kondo effect between these two components in Fe$_{2.72}$GeTe$_{2}$. Such behaviors are generally expected in heavy-fermion systems with heavy $f$ electrons but rarely clearly observed in materials with light $d$ electrons. These findings shed light on the understanding of magnetism in transition-metal compounds. △ Less

Submitted 10 February, 2022; originally announced February 2022.

Comments: Main text 15 pages, 5 figures. Supplementary Materials available on PRX

Journal ref: Phys. Rev. X 12, 011022 (2022)

arXiv:2111.05598 [pdf, other]

Tilt grain boundaries of hexagonal structures: a spectral viewpoint

Authors: Kai Jiang, Wei Si, Jie Xu

Abstract: We propose a spectral viewpoint for grain boundaries that are generally quasiperiodic. To accurately capture the spectra computationally, it is crucial to adopt the projection method for quasiperiodic functions. Armed with the Lifshitz-Petrich free energy, we take the spectral viewpoint to examine tilt grain boundaries of the hexagonal phase. Several ingredients of grain boundaries are extracted,… ▽ More We propose a spectral viewpoint for grain boundaries that are generally quasiperiodic. To accurately capture the spectra computationally, it is crucial to adopt the projection method for quasiperiodic functions. Armed with the Lifshitz-Petrich free energy, we take the spectral viewpoint to examine tilt grain boundaries of the hexagonal phase. Several ingredients of grain boundaries are extracted, which are not easy to obtain from real-space profiles. We find that only a few spectra substantially contribute to the formation of grain boundaries. Their linear relation to the intrinsic spectra of the bulk hexagonal phase is independent of the tilt angle. By examining the feature of the spectral intensities, we propose a definition of the interface width. The widths calculated from this definition are consistent with visual estimation. △ Less

Submitted 10 November, 2021; originally announced November 2021.

Comments: 19 pages, 10 figures, 3 tables

MSC Class: 65N35; 70H12; 82B24

arXiv:2107.02486 [pdf, other]

doi 10.1103/PhysRevB.104.L020402

Topological magnon insulator spin excitations in the two-dimensional ferromagnet CrBr$_3$

Authors: Zhengwei Cai, Song Bao, Zhao-Long Gu, Yi-Peng Gao, Zhen Ma, Yanyan Shangguan, Wenda Si, Zhao-Yang Dong, Wei Wang, Yizhang Wu, Dongjing Lin, Jinghui Wang, Kejing Ran, Shichao Li, Devashibhai Adroja, Xiaoxiang Xi, Shun-Li Yu, Xiaoshan Wu, Jian-Xin Li, Jinsheng Wen

Abstract: Topological magnons are bosonic analogues of topological fermions in electronic systems. They have been studied extensively by theory but rarely realized by experiment. Here, by performing inelastic neutron scattering measurements on single crystals of a two-dimensional ferromagnet CrBr$_3$, which was classified as Dirac magnon semimetal featured by the linear bands crossing at the Dirac points, w… ▽ More Topological magnons are bosonic analogues of topological fermions in electronic systems. They have been studied extensively by theory but rarely realized by experiment. Here, by performing inelastic neutron scattering measurements on single crystals of a two-dimensional ferromagnet CrBr$_3$, which was classified as Dirac magnon semimetal featured by the linear bands crossing at the Dirac points, we fully map out the magnetic excitation spectra, and reveal that there is an apparent gap of $\sim$3.5~meV between the acoustic and optical branches of the magnons at the K point. By collaborative efforts between experiment and theoretical calculations using a five-orbital Hubbard model obtained from first-principles calculations to derive the exchange parameters, we find that a Hamiltonian with Heisenberg exchange interactions, next-nearest-neighbor Dzyaloshinskii-Moriya (DM) interaction, and single-ion anisotropy is more appropriate to describe the system. Calculations using the model show that the lower and upper magnon bands separated by the gap exhibit Chern numbers of $\pm1$. These results indicate that CrBr$_3$ is a topological magnon insulator, where the nontrivial gap is a result of the DM interaction. △ Less

Submitted 6 July, 2021; originally announced July 2021.

Comments: Version as published in PRB Letter, main text 7 pages, supplementary materials 6 pages

Journal ref: Phys. Rev. B 104, L020402 (2021)

arXiv:2105.15054 [pdf, other]

Telling Stories through Multi-User Dialogue by Modeling Character Relations

Authors: Wai Man Si, Prithviraj Ammanabrolu, Mark O. Riedl

Abstract: This paper explores character-driven story continuation, in which the story emerges through characters' first- and second-person narration as well as dialogue -- requiring models to select language that is consistent with a character's persona and their relationships with other characters while following and advancing the story. We hypothesize that a multi-task model that trains on character dialo… ▽ More This paper explores character-driven story continuation, in which the story emerges through characters' first- and second-person narration as well as dialogue -- requiring models to select language that is consistent with a character's persona and their relationships with other characters while following and advancing the story. We hypothesize that a multi-task model that trains on character dialogue plus character relationship information improves transformer-based story continuation. To this end, we extend the Critical Role Dungeons and Dragons Dataset (Rameshkumar and Bailey, 2020) -- consisting of dialogue transcripts of people collaboratively telling a story while playing the role-playing game Dungeons and Dragons -- with automatically extracted relationships between each pair of interacting characters as well as their personas. A series of ablations lend evidence to our hypothesis, showing that our multi-task model using character relationships improves story continuation accuracy over strong baselines. △ Less

Submitted 31 May, 2021; originally announced May 2021.

Comments: In Proceedings of SIGDIAL 2021

arXiv:2102.05792 [pdf, other]

Rate-Splitting Multiple Access for Multigateway Multibeam Satellite Systems with Feeder Link Interference

Authors: Zhi Wen Si, Longfei Yin, Bruno Clerckx

Abstract: This paper studies the precoder design problem of achieving max-min fairness (MMF) amongst users in multigateway multibeam satellite communication systems with feeder link interference. We propose a beamforming strategy based on a newly introduced transmission scheme known as rate-splitting multiple access (RSMA). RSMA relies on multi-antenna rate-splitting at the transmitter and successive interf… ▽ More This paper studies the precoder design problem of achieving max-min fairness (MMF) amongst users in multigateway multibeam satellite communication systems with feeder link interference. We propose a beamforming strategy based on a newly introduced transmission scheme known as rate-splitting multiple access (RSMA). RSMA relies on multi-antenna rate-splitting at the transmitter and successive interference cancellation (SIC) at the receivers, such that the intended message for a user is split into a common part and a private part and the interference is partially decoded and partially treated as noise. In this paper, we formulate the MMF problem subject to per-antenna power constraints at the satellite for the system with imperfect channel state information at the transmitter (CSIT). We also consider the case of two-stage precoding which is assisted by on-board processing (OBP) at the satellite. Numerical results obtained through simulations for RSMA and the conventional linear precoding method are compared. When RSMA is used, MMF rate gain is promised and this gain increases when OBP is used. RSMA is proven to be promising for multigateway multibeam satellite systems whereby there are various practical challenges such as feeder link interference, CSIT uncertainty, per-antenna power constraints, uneven user distribution per beam and frame-based processing. △ Less

Submitted 10 February, 2021; originally announced February 2021.

Comments: Submitted for publication

arXiv:2012.00877 [pdf]

Measuring Network Robustness by Average Network Flow

Authors: Weisheng Si, Balume Mburano, Wei Xing Zheng, Tie Qiu

Abstract: Infrastructure networks such as the Internet backbone and power grids are essential for our everyday lives. With the prevalence of cyber-attacks on them, measuring their robustness has become an important issue. To date, many robustness metrics have been proposed. It is desirable for a robustness metric to possess the following three properties: considering global network topologies, strictly incr… ▽ More Infrastructure networks such as the Internet backbone and power grids are essential for our everyday lives. With the prevalence of cyber-attacks on them, measuring their robustness has become an important issue. To date, many robustness metrics have been proposed. It is desirable for a robustness metric to possess the following three properties: considering global network topologies, strictly increasing upon link additions, and having a quadratic complexity in terms of the number of nodes on sparse networks. This paper proposes to use Average Network Flow (ANF) as a robustness metric, and proves that it increases strictly, and gives an algorithm to compute ANF with a quadratic complexity by leveraging Gomory-Hu trees. Thus, with ANF intrinsically considering global network topologies, ANF is unveiled to be a new robustness metric satisfying those three properties. Moreover, this paper compares ANF with seven existing representative metrics, showing that each metric has its own characteristics, so there is no silver bullet in measuring network robustness and it is recommended to apply several metrics together to gain a comprehensive view. Finally, by experimenting on the scenarios in which network topologies preserve the same numbers of nodes and links, some interesting behaviors of robustness metrics are reported. △ Less

Submitted 2 July, 2021; v1 submitted 1 December, 2020; originally announced December 2020.

arXiv:2008.12637 [pdf, other]

High-order energy stable schemes of incommensurate phase-field crystal model

Authors: Kai Jiang, Wei Si

Abstract: This article focuses on the development of high-order energy stable schemes for the multi-length-scale incommensurate phase-field crystal model which is able to study the phase behavior of aperiodic structures. These high-order schemes based on the scalar auxiliary variable (SAV) and spectral deferred correction (SDC) approaches are suitable for the L 2 gradient flow equation, i.e., the Allen-Cahn… ▽ More This article focuses on the development of high-order energy stable schemes for the multi-length-scale incommensurate phase-field crystal model which is able to study the phase behavior of aperiodic structures. These high-order schemes based on the scalar auxiliary variable (SAV) and spectral deferred correction (SDC) approaches are suitable for the L 2 gradient flow equation, i.e., the Allen-Cahn dynamic equation. Concretely, we propose a second-order Crank-Nicolson (CN) scheme of the SAV system, prove the energy dissipation law, and give the error estimate in the almost periodic function sense. Moreover, we use the SDC method to improve the computational accuracy of the SAV/CN scheme. Numerical results demonstrate the advantages of high-order numerical methods in numerical computations and show the influence of length-scales on the formation of ordered structures. △ Less

Submitted 28 August, 2020; originally announced August 2020.

Comments: 17 pages, 6 figures

arXiv:2007.15193 [pdf]

Fermionic Order by Disorder in a van der Waals Antiferromagnet

Authors: R. Okuma, D. Ueta, S. Kuniyoshi, Y. Fujisawa, B. Smith, C. H. Hsu, Y. Inagaki, W. Si, T. Kawae, H. Lin, F. C. Chuang, T. Masuda, R. Kobayashi, Y. Okada

Abstract: CeTe3 is a unique platform to investigate the itinerant magnetism in a van der Waals (vdW) coupled metal. Despite chemical pressure being a promising route to boost quantum fluctuation in this system, a systematic study on the chemical pressure effect on Ce3+(4f1) states is absent. Here, we report on the successful growth of a series of Se doped single crystals of CeTe3. We found a fluctuation dri… ▽ More CeTe3 is a unique platform to investigate the itinerant magnetism in a van der Waals (vdW) coupled metal. Despite chemical pressure being a promising route to boost quantum fluctuation in this system, a systematic study on the chemical pressure effect on Ce3+(4f1) states is absent. Here, we report on the successful growth of a series of Se doped single crystals of CeTe3. We found a fluctuation driven exotic magnetic rotation from the usual easy-axis ordering to an unusual hard-axis ordering. Unlike in localized magnetic systems, near-critical magnetism can increase itinerancy hand-in-hand with enhancing fluctuation of magnetism. Thus, seemingly unstable hard-axis ordering emerges through kinetic energy gain, with the self-consistent observation of enhanced magnetic fluctuation (disorder). As far as we recognize, this order-by-disorder process in fermionic system is observed for the first time within vdW materials. Our finding opens a unique experimental platform for direct visualization of the rich quasiparticle Fermi surface deformation associated with the Fermionic order-by-disorder process. Also, the search for emergent exotic phases by further tuning of quantum fluctuation is suggested as a promising future challenge. △ Less

Submitted 29 July, 2020; originally announced July 2020.

arXiv:2006.07012 [pdf, other]

doi 10.1103/PhysRevB.101.214419

Evidence for magnon-phonon coupling in the topological magnet Cu$_3$TeO$_6$

Authors: Song Bao, Zhengwei Cai, Wenda Si, Wei Wang, Xiaomeng Wang, Yanyan Shangguan, Zhen Ma, Zhao-Yang Dong, Ryoichi Kajimoto, Kazuhiko Ikeuchi, Shun-Li Yu, Jian Sun, Jian-Xin Li, Jinsheng Wen

Abstract: We perform thermodynamic and inelastic neutron scattering (INS) measurements to study the lattice dynamics (phonons) of a cubic collinear antiferromagnet Cu$_3$TeO$_6$ which hosts topological spin excitations (magnons). While the specific heat and thermal conductivity results show that the thermal transport is dominated by phonons, the deviation of the thermal conductivity from a pure phononic mod… ▽ More We perform thermodynamic and inelastic neutron scattering (INS) measurements to study the lattice dynamics (phonons) of a cubic collinear antiferromagnet Cu$_3$TeO$_6$ which hosts topological spin excitations (magnons). While the specific heat and thermal conductivity results show that the thermal transport is dominated by phonons, the deviation of the thermal conductivity from a pure phononic model indicates that there is a strong coupling between magnons and phonons. In the INS measurements, we find a mode in the excitation spectra at 4.5 K, which exhibits a slight downward dispersion around the Brillouin zone center. This mode disappears above the Néel temperature, and thus cannot be a phonon. Furthermore, the dispersion is distinct from that of a magnon. Instead, it can be explained by the magnon-polaron mode, which is new collective excitations resulting from the hybridization between magnons and phonons. We consider the suppression of the thermal conductivity and emergence of the magnon-polaron mode to be evidence for magnon-phonon coupling in Cu$_3$TeO$_6$. △ Less

Submitted 14 June, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

Comments: 9 pages, 4 figures, formula Cu3TeO6 updated in the title and abstract

Journal ref: Phys. Rev. B 101, 214419 (2020)

arXiv:2004.12314 [pdf]

A Global Benchmark of Algorithms for Segmenting Late Gadolinium-Enhanced Cardiac Magnetic Resonance Imaging

Authors: Zhaohan Xiong, Qing Xia, Zhiqiang Hu, Ning Huang, Cheng Bian, Yefeng Zheng, Sulaiman Vesal, Nishant Ravikumar, Andreas Maier, Xin Yang, Pheng-Ann Heng, Dong Ni, Caizi Li, Qianqian Tong, Weixin Si, Elodie Puybareau, Younes Khoudli, Thierry Geraud, Chen Chen, Wenjia Bai, Daniel Rueckert, Lingchao Xu, Xiahai Zhuang, Xinzhe Luo, Shuman Jia , et al. (19 additional authors not shown)

Abstract: Segmentation of cardiac images, particularly late gadolinium-enhanced magnetic resonance imaging (LGE-MRI) widely used for visualizing diseased cardiac structures, is a crucial first step for clinical diagnosis and treatment. However, direct segmentation of LGE-MRIs is challenging due to its attenuated contrast. Since most clinical studies have relied on manual and labor-intensive approaches, auto… ▽ More Segmentation of cardiac images, particularly late gadolinium-enhanced magnetic resonance imaging (LGE-MRI) widely used for visualizing diseased cardiac structures, is a crucial first step for clinical diagnosis and treatment. However, direct segmentation of LGE-MRIs is challenging due to its attenuated contrast. Since most clinical studies have relied on manual and labor-intensive approaches, automatic methods are of high interest, particularly optimized machine learning approaches. To address this, we organized the "2018 Left Atrium Segmentation Challenge" using 154 3D LGE-MRIs, currently the world's largest cardiac LGE-MRI dataset, and associated labels of the left atrium segmented by three medical experts, ultimately attracting the participation of 27 international teams. In this paper, extensive analysis of the submitted algorithms using technical and biological metrics was performed by undergoing subgroup analysis and conducting hyper-parameter analysis, offering an overall picture of the major design choices of convolutional neural networks (CNNs) and practical considerations for achieving state-of-the-art left atrium segmentation. Results show the top method achieved a dice score of 93.2% and a mean surface to a surface distance of 0.7 mm, significantly outperforming prior state-of-the-art. Particularly, our analysis demonstrated that double, sequentially used CNNs, in which a first CNN is used for automatic region-of-interest localization and a subsequent CNN is used for refined regional segmentation, achieved far superior results than traditional methods and pipelines containing single CNNs. This large-scale benchmarking study makes a significant step towards much-improved segmentation methods for cardiac LGE-MRIs, and will serve as an important benchmark for evaluating and comparing the future works in the field. △ Less

Submitted 7 May, 2020; v1 submitted 26 April, 2020; originally announced April 2020.

arXiv:2002.09898 [pdf, other]

Efficient numerical methods for computing the stationary states of phase field crystal models

Authors: Kai Jiang, Wei Si, Chen Chang, Chenglong Bao

Abstract: Finding the stationary states of a free energy functional is an important problem in phase field crystal (PFC) models. Many efforts have been devoted for designing numerical schemes with energy dissipation and mass conservation properties. However, most existing approaches are time-consuming due to the requirement of small effective step sizes. In this paper, we discretize the energy functional an… ▽ More Finding the stationary states of a free energy functional is an important problem in phase field crystal (PFC) models. Many efforts have been devoted for designing numerical schemes with energy dissipation and mass conservation properties. However, most existing approaches are time-consuming due to the requirement of small effective step sizes. In this paper, we discretize the energy functional and propose efficient numerical algorithms for solving the constrained non-convex minimization problem. A class of gradient based approaches, which is the so-called adaptive accelerated Bregman proximal gradient (AA-BPG) methods, is proposed and the convergence property is established without the global Lipschitz constant requirements. A practical Newton method is also designed to further accelerate the local convergence with convergence guarantee. One key feature of our algorithms is that the energy dissipation and mass conservation properties hold during the iteration process. Moreover, we develop a hybrid acceleration framework to accelerate the AA-BPG methods and most of existing approaches through coupling with the practical Newton method. Extensive numerical experiments, including two three dimensional periodic crystals in Landau-Brazovskii (LB) model and a two dimensional quasicrystal in Lifshitz-Petrich (LP) model, demonstrate that our approaches have adaptive step sizes which lead to a significant acceleration over many existing methods when computing complex structures. △ Less

Submitted 10 November, 2020; v1 submitted 23 February, 2020; originally announced February 2020.

Comments: 28 pages, 8 figures

arXiv:2001.04179 [pdf, other]

Randomized extended block Kaczmarz for solving least squares

Authors: Kui Du, Wutao Si, Xiaohui Sun

Abstract: Randomized iterative algorithms have recently been proposed to solve large-scale linear systems. In this paper, we present a simple randomized extended block Kaczmarz algorithm that exponentially converges in the mean square to the unique minimum $\ell_2$-norm least squares solution of a given linear system of equations. The proposed algorithm is pseudoinverse-free and therefore different from the… ▽ More Randomized iterative algorithms have recently been proposed to solve large-scale linear systems. In this paper, we present a simple randomized extended block Kaczmarz algorithm that exponentially converges in the mean square to the unique minimum $\ell_2$-norm least squares solution of a given linear system of equations. The proposed algorithm is pseudoinverse-free and therefore different from the projection-based randomized double block Kaczmarz algorithm of Needell, Zhao, and Zouzias. We emphasize that our method works for all types of linear systems (consistent or inconsistent, overdetermined or underdetermined, full-rank or rank-deficient). Moreover, our approach can utilize efficient implementations on distributed computing units, yielding remarkable improvements in computational time. Numerical examples are given to show the efficiency of the new algorithm. △ Less

Submitted 8 July, 2020; v1 submitted 13 January, 2020; originally announced January 2020.

Comments: 20 pages, 3 figures, more general results are presented

MSC Class: 65F10; 65F20

arXiv:1911.06550 [pdf]

doi 10.1021/acs.jpcc.9b05590

Yttrium Tantalum Oxynitride Multiphases as Photoanodes for Water Oxidation

Authors: Wenping Si, Zahra Pourmand Tehrani, Fatima Haydous, Nicola Marzari, Ivano E. Castelli, Daniele Pergolesi, Thomas Lippert

Abstract: Perovskite yttrium tantalum oxynitride is theoretically proposed as a promising semiconductor for solar water splitting because of the predicted bandgap and energy positions of band edges. In experiment, however, we show here that depending on processing parameters, yttrium tantalum oxynitrides exist in multiphases, including the desired perovskite YTaON2, defect fluorite YTa(O,N,o)4, and N-doped… ▽ More Perovskite yttrium tantalum oxynitride is theoretically proposed as a promising semiconductor for solar water splitting because of the predicted bandgap and energy positions of band edges. In experiment, however, we show here that depending on processing parameters, yttrium tantalum oxynitrides exist in multiphases, including the desired perovskite YTaON2, defect fluorite YTa(O,N,o)4, and N-doped YTaO4. These multiphases have bandgaps ranging between 2.13 and 2.31 eV, all responsive to visible light. The N-doped YTaO4, perovskite main phase, and fluorite main phase derived from crystalline fergusonite oxide precursors exhibit interesting photoelectrochemical performances for water oxidation, while the defect fluorite derived from low crystallized scheelite-type oxide precursors show negligible activity. Preliminarily measurements show that loading IrOx cocatalyst on N-doped YTaO4 significantly improves its photoelectrochemical performance encouraging further studies to optimize this new material for solar fuel production. △ Less

Submitted 15 November, 2019; originally announced November 2019.

Journal ref: J. Phys. Chem. C, 2019, 123, 43, 26211

arXiv:1911.06549 [pdf]

doi 10.1021/acsaem.9b00420

Suppressed charge recombination in hematite photoanode via protonation and annealing

Authors: Wenping Si, Fatima Haydous, Ugljesa Babic, Daniele Pergolesi, Thomas Lippert

Abstract: Hematite as promising photoanode for solar water splitting suffers from severe bulk and surface charge recombination. This work describes that a protonation-annealing treatment can effectively suppress both bulk and surface charge recombination in hematite. Protons/electrons are electrochemically incorporated into hematite under 0.2 VRHE followed by annealing at 120 oC. The photocurrent density in… ▽ More Hematite as promising photoanode for solar water splitting suffers from severe bulk and surface charge recombination. This work describes that a protonation-annealing treatment can effectively suppress both bulk and surface charge recombination in hematite. Protons/electrons are electrochemically incorporated into hematite under 0.2 VRHE followed by annealing at 120 oC. The photocurrent density increases from ~0.9 mA cm-2 to 1.8 mA cm-2 at 1.23 VRHE under 1 sun, and further to 2.7 mA cm-2 after loading cobalt phosphate, stabilizing at round 2.4 mA cm-2. A cathodic shift of the onset potential of photocurrent is also observed. H2O2 oxidation, impedance spectroscopy and Mott-Schottky measurements show that the protonation suppresses bulk recombination and enhances donor density, but introducing more surface recombination. The annealing reduces surface recombination, while preserving relatively high bulk charge separation efficiency. Different from previous reports on the electrochemically reduced hematite, this work demonstrates that the performance improvement should be ascribed to the proton incorporation instead of the formation of Fe3O4 or metal Fe. This facile treatment by protonation and annealing could be applied in other semiconductors to promote the development of high performing photoelectrodes. △ Less

Submitted 15 November, 2019; originally announced November 2019.

Journal ref: ACS Applied Energy Materials 2019, 2, 5438

arXiv:1909.00305 [pdf, other]

An efficient method for computing stationary states of phase field crystal models

Authors: Kai Jiang, Wei Si, Chenglong Bao

Abstract: Computing stationary states is an important topic for phase field crystal (PFC) models. Great efforts have been made for energy dissipation of the numerical schemes when using gradient flows. However, it is always time-consuming due to the requirement of small effective time steps. In this paper, we propose an adaptive accelerated proximal gradient method for finding the stationary states of PFC m… ▽ More Computing stationary states is an important topic for phase field crystal (PFC) models. Great efforts have been made for energy dissipation of the numerical schemes when using gradient flows. However, it is always time-consuming due to the requirement of small effective time steps. In this paper, we propose an adaptive accelerated proximal gradient method for finding the stationary states of PFC models. The energy dissipation is guaranteed and the convergence property is established for the discretized energy functional. Moreover, the connections between generalized proximal operator with classical (semi-)implicit and explicit schemes for gradient flow are given. Extensive numerical experiments, including two three dimensional periodic crystals in Landau-Brazovskii (LB) model and a two dimensional quasicrystal in Lifshitz-Petrich (LP) model, demonstrate that our approach has adaptive time steps which lead to significant acceleration over semi-implicit methods for computing complex structures. Furthermore, our result reveals a deep physical mechanism of the simple LB model via which the sigma phase is first discovered. △ Less

Submitted 31 August, 2019; originally announced September 2019.

Comments: 18 pages, 9 figures

arXiv:1903.07859 [pdf, other]

doi 10.1080/14786435.2019.1671997

Stability of three-dimensional icosahedral quasicrystals in multi-component systems

Authors: Kai Jiang, Wei Si

Abstract: The relative stability of three-dimensional icosahedral quasicrystals in multi-component systems has been investigated based on a coupled-mode Swift-Hohenberg model with two-length-scales. A recently developed projection method, which provides a unified numerical framework to study periodic crystals and quasicrystals, is used to compute free energies to high accuracy. Compared with traditional app… ▽ More The relative stability of three-dimensional icosahedral quasicrystals in multi-component systems has been investigated based on a coupled-mode Swift-Hohenberg model with two-length-scales. A recently developed projection method, which provides a unified numerical framework to study periodic crystals and quasicrystals, is used to compute free energies to high accuracy. Compared with traditional approaches, the advantage of the projection method has been also discussed detailedly. A rigorous and systematical computation demonstrates that three-dimensional icosahedral quasicrystal, two-dimensional decagonal quasicrystal are stable phases in such a simple multi-component coupled-mode Swift-Hohenberg model. The result extends the multiple length-scales interaction mechanism which can stabilize quasicrystals from single-component to multi-component systems. △ Less

Submitted 5 September, 2019; v1 submitted 19 March, 2019; originally announced March 2019.

Comments: 17 pages, 11 figures

arXiv:1903.04497 [pdf]

doi 10.1088/1361-6471/ab4574

Searching for long-lived particles beyond the Standard Model at the Large Hadron Collider

Authors: Juliette Alimena, James Beacham, Martino Borsato, Yangyang Cheng, Xabier Cid Vidal, Giovanna Cottin, Albert De Roeck, Nishita Desai, David Curtin, Jared A. Evans, Simon Knapen, Sabine Kraml, Andre Lessa, Zhen Liu, Sascha Mehlhase, Michael J. Ramsey-Musolf, Heather Russell, Jessie Shelton, Brian Shuve, Monica Verducci, Jose Zurita, Todd Adams, Michael Adersberger, Cristiano Alpigiani, Artur Apresyan , et al. (176 additional authors not shown)

Abstract: Particles beyond the Standard Model (SM) can generically have lifetimes that are long compared to SM particles at the weak scale. When produced at experiments such as the Large Hadron Collider (LHC) at CERN, these long-lived particles (LLPs) can decay far from the interaction vertex of the primary proton-proton collision. Such LLP signatures are distinct from those of promptly decaying particles t… ▽ More Particles beyond the Standard Model (SM) can generically have lifetimes that are long compared to SM particles at the weak scale. When produced at experiments such as the Large Hadron Collider (LHC) at CERN, these long-lived particles (LLPs) can decay far from the interaction vertex of the primary proton-proton collision. Such LLP signatures are distinct from those of promptly decaying particles that are targeted by the majority of searches for new physics at the LHC, often requiring customized techniques to identify, for example, significantly displaced decay vertices, tracks with atypical properties, and short track segments. Given their non-standard nature, a comprehensive overview of LLP signatures at the LHC is beneficial to ensure that possible avenues of the discovery of new physics are not overlooked. Here we report on the joint work of a community of theorists and experimentalists with the ATLAS, CMS, and LHCb experiments --- as well as those working on dedicated experiments such as MoEDAL, milliQan, MATHUSLA, CODEX-b, and FASER --- to survey the current state of LLP searches at the LHC, and to chart a path for the development of LLP searches into the future, both in the upcoming Run 3 and at the High-Luminosity LHC. The work is organized around the current and future potential capabilities of LHC experiments to generally discover new LLPs, and takes a signature-based approach to surveying classes of models that give rise to LLPs rather than emphasizing any particular theory motivation. We develop a set of simplified models; assess the coverage of current searches; document known, often unexpected backgrounds; explore the capabilities of proposed detector upgrades; provide recommendations for the presentation of search results; and look towards the newest frontiers, namely high-multiplicity "dark showers", highlighting opportunities for expanding the LHC reach for these signals. △ Less

Submitted 11 March, 2019; originally announced March 2019.

Journal ref: J. Phys. G: Nucl. Part. Phys. 47 090501 (2020)

arXiv:1902.07880 [pdf, other]

Evaluation of Algorithms for Multi-Modality Whole Heart Segmentation: An Open-Access Grand Challenge

Authors: Xiahai Zhuang, Lei Li, Christian Payer, Darko Stern, Martin Urschler, Mattias P. Heinrich, Julien Oster, Chunliang Wang, Orjan Smedby, Cheng Bian, Xin Yang, Pheng-Ann Heng, Aliasghar Mortazi, Ulas Bagci, Guanyu Yang, Chenchen Sun, Gaetan Galisot, Jean-Yves Ramel, Thierry Brouard, Qianqian Tong, Weixin Si, Xiangyun Liao, Guodong Zeng, Zenglin Shi, Guoyan Zheng , et al. (9 additional authors not shown)

Abstract: Knowledge of whole heart anatomy is a prerequisite for many clinical applications. Whole heart segmentation (WHS), which delineates substructures of the heart, can be very valuable for modeling and analysis of the anatomy and functions of the heart. However, automating this segmentation can be arduous due to the large variation of the heart shape, and different image qualities of the clinical data… ▽ More Knowledge of whole heart anatomy is a prerequisite for many clinical applications. Whole heart segmentation (WHS), which delineates substructures of the heart, can be very valuable for modeling and analysis of the anatomy and functions of the heart. However, automating this segmentation can be arduous due to the large variation of the heart shape, and different image qualities of the clinical data. To achieve this goal, a set of training data is generally needed for constructing priors or for training. In addition, it is difficult to perform comparisons between different methods, largely due to differences in the datasets and evaluation metrics used. This manuscript presents the methodologies and evaluation results for the WHS algorithms selected from the submissions to the Multi-Modality Whole Heart Segmentation (MM-WHS) challenge, in conjunction with MICCAI 2017. The challenge provides 120 three-dimensional cardiac images covering the whole heart, including 60 CT and 60 MRI volumes, all acquired in clinical environments with manual delineation. Ten algorithms for CT data and eleven algorithms for MRI data, submitted from twelve groups, have been evaluated. The results show that many of the deep learning (DL) based methods achieved high accuracy, even though the number of training datasets was limited. A number of them also reported poor results in the blinded evaluation, probably due to overfitting in their training. The conventional algorithms, mainly based on multi-atlas segmentation, demonstrated robust and stable performance, even though the accuracy is not as good as the best DL method in CT segmentation. The challenge, including the provision of the annotated training data and the blinded evaluation for submitted algorithms on the test data, continues as an ongoing benchmarking resource via its homepage (\url{www.sdspeople.fudan.edu.cn/zhuangxiahai/0/mmwhs/}). △ Less

Submitted 21 February, 2019; originally announced February 2019.

Comments: 14 pages, 7 figures, sumitted to Medical Image Analysis

arXiv:1902.07482 [pdf]

doi 10.1021/acsaem.8b01811

Oxynitride Thin Films versus Particle-Based Photoanodes: a Comparative Study for Photoelectrochemical Solar Water Splitting

Authors: Fatima Haydous, Max Döbeli, Wenping Si, Friedrich Waag, Fei Li, Ekaterina Pomjakushina, Alexander Wokaun, Bilal Gökce, Daniele Pergolesi, Thomas Lippert

Abstract: The solar water splitting process assisted by semiconductor photocatalysts attracts growing research interests worldwide for the production of hydrogen as a clean and sustainable energy carrier. Due to their optical and electrical properties several oxynitride materials show great promise for the fabrication of efficient photocatalysts for solar water splitting. This study reports a comparative in… ▽ More The solar water splitting process assisted by semiconductor photocatalysts attracts growing research interests worldwide for the production of hydrogen as a clean and sustainable energy carrier. Due to their optical and electrical properties several oxynitride materials show great promise for the fabrication of efficient photocatalysts for solar water splitting. This study reports a comparative investigation of particle- and thin films-based photocatalysts using three different oxynitride materials. The absolute comparison of the photoelectrochemical activities favors the particle-based electrodes due to the better absorption properties and larger electrochemical surface area. However, thin films surpass the particle-based photoelectrodes due to their more suitable morphological features that improve the separation and mobility of the photo-generated charge carriers. Our analysis identifies what specific insights into the properties of materials can be achieved with the two complementary approaches. △ Less

Submitted 20 February, 2019; originally announced February 2019.

Journal ref: ACS Appl. Enrgy Mater. 2, 2019, 754-763

arXiv:1902.07470 [pdf]

doi 10.1021/acs.jpcc.8b09629

Improved Photoelectrochemical Water Splitting of CaNbO2N Photoanodes by Co-Pi Photodeposition and Surface Passivation

Authors: Fatima Haydous, Wenping Si, Vitaliy A. Guzenko, Friedrich Waag, Ekaterina Pomjakushina, Mario El Kazzi, Laurent Sévery, Alexander Wokaun, Daniele Pergolesi, Thomas Lippert

Abstract: Photoelectrochemical solar water splitting is a promising approach to convert solar energy into sustainable hydrogen fuel using semiconductor electrodes. Due to their visible light absorption properties, oxynitrides have shown to be attractive photocatalysts for this application. In this study, the influence of the preparation method of CaNbO2N particles on their morphological and optical properti… ▽ More Photoelectrochemical solar water splitting is a promising approach to convert solar energy into sustainable hydrogen fuel using semiconductor electrodes. Due to their visible light absorption properties, oxynitrides have shown to be attractive photocatalysts for this application. In this study, the influence of the preparation method of CaNbO2N particles on their morphological and optical properties, and thereby their photoelectrochemical performance, is investigated. The best performing CaNbO2N photoanode is produced by ammonolysis of Nb enriched calcium niobium oxide. The enhanced photoactivity arises from an enlarged surface area and superior visible light absorption properties. The photoactivity of this photoanode was further enhanced by photodeposition of Co-Pi co-catalyst and by atomic layer deposition of an Al2O3 overlayer. A photocurrent density of 70 microA.cm-2 at 1.23 V vs RHE was achieved. The observed enhancement of the photoelectrochemical performance after Co-Pi/Al2O3 deposition is the combined effect of the improved kinetics of oxygen evolution due to the Co-Pi co-catalyst and the reduced surface recombination of the photogenerated carriers at the Al2O3 surface layer. △ Less

Submitted 20 February, 2019; originally announced February 2019.

Journal ref: J. Phys. Chem. C, 2019, 123, 1059-1068

arXiv:1902.03832 [pdf]

doi 10.1002/adfm.201605690

LaTiOxNy thin film model systems for photocatalytic water splitting: physicochemical evolution of the solid-liquid interface and the role of the crystallographic orientation

Authors: Markus Pichler, Wenping Si, Fatima Haydous, Helena Téllez, John Druce, Emiliana Fabbri, Mario El Kazzi, Max Döbeli, Silviya Ninova, Ulrich Aschauer, Alexander Wokaun, Daniele Pergolesi, Thomas Lippert

Abstract: The size of the band gap and the energy position of the band edges make several oxynitride semiconductors promising candidates for efficient hydrogen and oxygen production under solar light illumination. The intense research efforts dedicated to oxynitride materials have unveiled the majority of their most important properties. However, two crucial aspects have received much less attention. One is… ▽ More The size of the band gap and the energy position of the band edges make several oxynitride semiconductors promising candidates for efficient hydrogen and oxygen production under solar light illumination. The intense research efforts dedicated to oxynitride materials have unveiled the majority of their most important properties. However, two crucial aspects have received much less attention. One is the critical issue of the compositional/structural surface modifications occurring during operation and how these affect the photoelectrochemical performance. The second concerns the relation between the electrochemical response and the crystallographic surface orientation of the oxynitride semiconductor. These are indeed topics of fundamental importance since it is exactly at the surface where the visible light-driven electrochemical reaction takes place. In contrast to conventional powder samples, thin films represent the best model system for these investigations. This study reviews current state-of-the-art of oxynitride thin film fabrication and characterization before focusing on LaTiO2N selected as representative photocatalyst. We report the investigation of the initial physicochemical evolution of the surface. Then we show that, after stabilization, the absorbed photon-to-current conversion efficiency of epitaxial thin films can differ by about 50% for different crystallographic surface orientations and be up to 5 times larger than for polycrystalline samples. △ Less

Submitted 11 February, 2019; originally announced February 2019.

Journal ref: M. Pichler, et al., Adv. Funct. Mat. 2017, 27, 1605690

arXiv:1812.09523 [pdf, other]

doi 10.1016/j.jcp.2019.02.047

A finite element method of the self-consistent field theory on general curved surfaces

Authors: Huayi Wei, Ming Xu, Wei Si, Kai Jiang

Abstract: Block copolymers provide a wonderful platform in studying the soft condensed matter systems. Many fascinating ordered structures have been discovered in bulk and confined systems. Among various theories, the self-consistent field theory (SCFT) has been proven to be a powerful tool for studying the equilibrium ordered structures. Many numerical methods have been developed to solve the SCFT model. H… ▽ More Block copolymers provide a wonderful platform in studying the soft condensed matter systems. Many fascinating ordered structures have been discovered in bulk and confined systems. Among various theories, the self-consistent field theory (SCFT) has been proven to be a powerful tool for studying the equilibrium ordered structures. Many numerical methods have been developed to solve the SCFT model. However, most of these focus on the bulk systems, and little work on the confined systems, especially on general curved surfaces. In this work, we developed a linear surface finite element method, which has a rigorous mathematical theory to guarantee numerical precsion, to study the self-assembled phases of block copolymers on general curved surfaces based on the SCFT. Furthermore, to capture the consistent surface for a given self-assembled pattern, an adaptive approach to optimize the size of the general curved surface has been proposed. To demonstrate the power of this approach, we investigate the self-assembled patterns of diblock copolymers on several distinct curved surfaces, including five closed surfaces and an unclosed surface. Numerical results illustrate the efficiency of the proposed method. The obtained ordered structures are consistent with the previous results on standard surfaces, such as sphere and torus. Certainly, the proposed numerical framework has the capability of studying the phase behaviors on general surfaces precisely. △ Less

Submitted 22 December, 2018; originally announced December 2018.

arXiv:1812.09486 [pdf, other]

High-order energy stable schemes of incommensurate phase-field crystal model

Authors: Kai Jiang, Wei Si

Abstract: This article focuses on the development of high-order energy stable schemes for the multi-length-scale incommensurate phase-field crystal model which is able to study the phase behavior of aperiodic structures. These high-order schemes based on the scalar auxiliary variable (SAV) and spectral deferred correction (SDC) approaches are suitable for the L 2 gradient flow equation, i.e., the Allen-Cahn… ▽ More This article focuses on the development of high-order energy stable schemes for the multi-length-scale incommensurate phase-field crystal model which is able to study the phase behavior of aperiodic structures. These high-order schemes based on the scalar auxiliary variable (SAV) and spectral deferred correction (SDC) approaches are suitable for the L 2 gradient flow equation, i.e., the Allen-Cahn dynamic equation. Concretely, we propose a second-order Crank-Nicolson (CN) scheme of the SAV system, prove the energy dissipation law, and give the error estimate in the almost periodic function sense. Moreover, we use the SDC method to improve the computational accuracy of the SAV/CN scheme. Numerical results demonstrate the advantages of high-order numerical methods in numerical computations and show the influence of length-scales on the formation of ordered structures. △ Less

Submitted 25 May, 2020; v1 submitted 22 December, 2018; originally announced December 2018.

Comments: 22 pages, 6 figures

arXiv:1804.07099 [pdf, ps, other]

Loop Restricted Existential Rules and First-order Rewritability for Query Answering

Authors: Vernon Asuncion, Yan Zhang, Heng Zhang, Yun Bai, Weisheng Si

Abstract: In ontology-based data access (OBDA), the classical database is enhanced with an ontology in the form of logical assertions generating new intensional knowledge. A powerful form of such logical assertions is the tuple-generating dependencies (TGDs), also called existential rules, where Horn rules are extended by allowing existential quantifiers to appear in the rule heads. In this paper we introdu… ▽ More In ontology-based data access (OBDA), the classical database is enhanced with an ontology in the form of logical assertions generating new intensional knowledge. A powerful form of such logical assertions is the tuple-generating dependencies (TGDs), also called existential rules, where Horn rules are extended by allowing existential quantifiers to appear in the rule heads. In this paper we introduce a new language called loop restricted (LR) TGDs (existential rules), which are TGDs with certain restrictions on the loops embedded in the underlying rule set. We study the complexity of this new language. We show that the conjunctive query answering (CQA) under the LR TGDs is decid- able. In particular, we prove that this language satisfies the so-called bounded derivation-depth prop- erty (BDDP), which implies that the CQA is first-order rewritable, and its data complexity is in AC0 . We also prove that the combined complexity of the CQA is EXPTIME complete, while the language membership is PSPACE complete. Then we extend the LR TGDs language to the generalised loop restricted (GLR) TGDs language, and prove that this class of TGDs still remains to be first-order rewritable and properly contains most of other first-order rewritable TGDs classes discovered in the literature so far. △ Less

Submitted 1 August, 2018; v1 submitted 19 April, 2018; originally announced April 2018.

Comments: Full paper version of extended abstract

arXiv:1709.10283 [pdf, other]

Commissioning and Operation of the New CMS Phase-1 Pixel Detector

Authors: Weinan Si

Abstract: The Phase-1 upgrade of the CMS pixel detector is built out of four barrel layers (BPix) and three forward disks in each endcap (FPix). It comprises a total of 124M pixel channels in 1,856 modules, and it is designed to withstand instantaneous luminosities of up to $2 \times 10^{34}\,$cm$^{-2}$s$^{-1}$. Different parts of the detector were assembled over the last year and later brought to CERN for… ▽ More The Phase-1 upgrade of the CMS pixel detector is built out of four barrel layers (BPix) and three forward disks in each endcap (FPix). It comprises a total of 124M pixel channels in 1,856 modules, and it is designed to withstand instantaneous luminosities of up to $2 \times 10^{34}\,$cm$^{-2}$s$^{-1}$. Different parts of the detector were assembled over the last year and later brought to CERN for installation inside the CMS tracker. At various stages during the assembly tests have been performed to ensure that the readout and power electronics and the cooling system meet the design specifications. After tests of the individual components, system tests were performed before the installation inside CMS. In addition to reviewing these tests, we also present results from the final commissioning of the detector in-situ using the central CMS DAQ system. Finally we review results from the initial operation of the detector first with cosmic rays and then with pp collisions. △ Less

Submitted 29 September, 2017; originally announced September 2017.

Comments: Talk presented at the APS Division of Particles and Fields Meeting (DPF 2017), July 31-August 4, 2017, Fermilab. C170731

Report number: CMS-CR-2017-254

arXiv:1706.00222 [pdf, other]

doi 10.1088/1748-0221/12/05/P05022

Test Beam Performance Measurements for the Phase I Upgrade of the CMS Pixel Detector

Authors: M. Dragicevic, M. Friedl, J. Hrubec, H. Steininger, A. Gädda, J. Härkönen, T. Lampén, P. Luukka, T. Peltola, E. Tuominen, E. Tuovinen, A. Winkler, P. Eerola, T. Tuuva, G. Baulieu, G. Boudoul, L. Caponetto, C. Combaret, D. Contardo, T. Dupasquier, G. Gallbit, N. Lumb, L. Mirabito, S. Perries, M. Vander Donckt , et al. (462 additional authors not shown)

Abstract: A new pixel detector for the CMS experiment was built in order to cope with the instantaneous luminosities anticipated for the Phase~I Upgrade of the LHC. The new CMS pixel detector provides four-hit tracking with a reduced material budget as well as new cooling and powering schemes. A new front-end readout chip mitigates buffering and bandwidth limitations, and allows operation at low comparator… ▽ More A new pixel detector for the CMS experiment was built in order to cope with the instantaneous luminosities anticipated for the Phase~I Upgrade of the LHC. The new CMS pixel detector provides four-hit tracking with a reduced material budget as well as new cooling and powering schemes. A new front-end readout chip mitigates buffering and bandwidth limitations, and allows operation at low comparator thresholds. In this paper, comprehensive test beam studies are presented, which have been conducted to verify the design and to quantify the performance of the new detector assemblies in terms of tracking efficiency and spatial resolution. Under optimal conditions, the tracking efficiency is $99.95\pm0.05\,\%$, while the intrinsic spatial resolutions are $4.80\pm0.25\,μ\mathrm{m}$ and $7.99\pm0.21\,μ\mathrm{m}$ along the $100\,μ\mathrm{m}$ and $150\,μ\mathrm{m}$ pixel pitch, respectively. The findings are compared to a detailed Monte Carlo simulation of the pixel detector and good agreement is found. △ Less

Submitted 1 June, 2017; originally announced June 2017.

Report number: CMS-NOTE-2017-002

arXiv:1705.06580 [pdf, ps, other]

The correlation of co-located hydrogen masers

Authors: Y. C. Guo, B. Wang, H. W. Si, Z. W. Cai, A. M. Zhang, X. Zhu, J. Yang, C. H. Han, T. C. Li, L. J. Wang

Abstract: The correlation of co-located hydrogen masers (H-masers) is difficult to measure because their common-mode noise induced by the environment will be cancelled out during the comparison measurement. With the development of fibre-based high-precision time and frequency transfer technique, the correlation of co-located hydrogen masers can be directly measured with the help of remote H-masers. Recently… ▽ More The correlation of co-located hydrogen masers (H-masers) is difficult to measure because their common-mode noise induced by the environment will be cancelled out during the comparison measurement. With the development of fibre-based high-precision time and frequency transfer technique, the correlation of co-located hydrogen masers can be directly measured with the help of remote H-masers. Recently, a fiber-based frequency synchronization network was constructed in the Beijing region by connecting 5 H-masers from 4 institutions. The correlation coefficient of atomic clocks is defined and the correlation between two co-located H-masers is measured using both experimental and simulative methods. The results show that the correlation is not prominent until the averaging time is larger than $\sim10^3$s; then, the coefficient grows rapidly for averaging times ranging from $\sim10^3$s to $\sim10^5$s and decreases beyond $\sim10^5$s up to 5 days. △ Less

Submitted 22 March, 2018; v1 submitted 15 May, 2017; originally announced May 2017.

arXiv:1608.00054 [pdf]

doi 10.1088/1674-1056/24/10/108201

Ion and water transport in charge-modified graphene nanopores

Authors: Yinghua Qiu, Kun Li, Weiyu Chen, Wei Si, Qiyan Tan, Yunfei Chen

Abstract: Porous graphene has high mechanical strength and atomic layer thickness, which make it a promising material for material separation and biomolecule sensing. Electrostatic interactions between charges in aqueous solution are a kind of strong long-range interaction which may have great influence on the fluid transport through nanopores. Here, molecular dynamics simulations were conducted to investig… ▽ More Porous graphene has high mechanical strength and atomic layer thickness, which make it a promising material for material separation and biomolecule sensing. Electrostatic interactions between charges in aqueous solution are a kind of strong long-range interaction which may have great influence on the fluid transport through nanopores. Here, molecular dynamics simulations were conducted to investigate ion and water transport through a 1.05-nm-in-diameter monolayer graphene nanopore with its edge charge-modified. From the results, it is found that the nanopores are selective to counterions when they are charged. As the charge amount increases, the total ionic currents show an increase-decrease profile while the co-ion currents monotonously decrease. The co-ions rejection can reach 75% and 90% when the nanopores are negatively and positively charged, respectively. Cl ions current increases and reaches a plateau, and Na+ current decreases with the charge amount in the systems where they act as counterions. Besides, the charge modification can enhance the water transport through nanopores obviously. This is mainly due to the ion selection of nanopores. Especially, positive charges on the pore edge facilitate the water transport much more than negative charges. △ Less

Submitted 29 July, 2016; originally announced August 2016.

Journal ref: Chinese Physics B 24(10) 108201, 2015

arXiv:1601.04474 [pdf, ps, other]

doi 10.1063/1.4945778

Drift of Charge Carriers in Crystalline Organic Semiconductors

Authors: Jingjuan Dong, Wei Si, Chang-Qin Wu

Abstract: We investigate the direct-current response of crystalline organic semiconductors in the presence of finite external electric fields by the quantum-classical Ehrenfest dynamics complemented with instantaneous decoherence corrections (IDC). The IDC is carried out in the real-space representation with the energy-dependent reweighing factors to account for both intermolecular decoherence and energy re… ▽ More We investigate the direct-current response of crystalline organic semiconductors in the presence of finite external electric fields by the quantum-classical Ehrenfest dynamics complemented with instantaneous decoherence corrections (IDC). The IDC is carried out in the real-space representation with the energy-dependent reweighing factors to account for both intermolecular decoherence and energy relaxation by which conduction occurs. In this way, both the diffusion and drift motion of charge carriers are described in a unified framework. Based on an off-diagonal electron-phonon coupling model for pentacene, we find that the drift velocity initially increases with the electric field and then decreases at higher fields due to the Wannier-Stark localization, and a negative electric-field dependence of mobility is observed. The Einstein relation, which is a manifestation of the fluctuation-dissipation theorem, is found to be restored in electric fields up to ~$10^5$ V/cm for a wide temperature region studied. Furthermore, we show that the incorporated decoherence and energy relaxation could explain the large discrepancy between the mobilities calculated by the Ehrenfest dynamics and the full quantum methods, which proves the effectiveness of our approach to take back these missing processes. △ Less

Submitted 18 January, 2016; originally announced January 2016.

Comments: 8 pages, 5 figures

arXiv:1509.02153 [pdf, ps, other]

Hybrid-BCP: A Robust Load Balancing and Routing Protocol for Intra-Car Wired/Wireless Networks

Authors: Wei Si, David Starobinski, Moshe Laifenfeld

Abstract: With the emergence of connected and autonomous vehicles, sensors are increasingly deployed within cars to support new functionalities. Traffic generated by these sensors congest traditional intra-car networks, such as CAN buses. Furthermore, the large amount of wires needed to connect sensors makes it harder to design cars in a modular way. To alleviate these limitations, we propose, simulate, and… ▽ More With the emergence of connected and autonomous vehicles, sensors are increasingly deployed within cars to support new functionalities. Traffic generated by these sensors congest traditional intra-car networks, such as CAN buses. Furthermore, the large amount of wires needed to connect sensors makes it harder to design cars in a modular way. To alleviate these limitations, we propose, simulate, and implement a hybrid wired/wireless architecture, in which each node is connected to either a wired interface or a wireless interface or both. Specifically, we propose a new protocol, called Hybrid-Backpressure Collection Protocol (Hybrid-BCP), to efficiently collect data from sensors in intra-car networks. Hybrid-BCP is backward-compatible with the CAN bus technology, and builds on the BCP protocol, designed for wireless sensor networks. Hybrid-BCP achieves high throughput and shows resilience to dynamic network conditions, including adversarial interferences. Our testbed implementation, based on CAN and ZigBee transceivers, demonstrates the load balancing and routing functionalities of Hybrid-BCP and its resilience to DoS attacks. We further provide simulation results, obtained with the ns-3 simulator and based on real intra-car RSSI traces, that compare between the performance of Hybrid-BCP and a tree-based collection protocol. Notably, the simulations show that Hybrid-BCP can achieve the same performance as the tree-based protocol while reducing the radio transmission power by a factor of 10. △ Less

Submitted 7 September, 2015; originally announced September 2015.

arXiv:1505.02234 [pdf, ps, other]

doi 10.1063/1.4926534

Decoherence and Energy Relaxation in the Quantum-Classical Dynamics for Charge Transport in Organic Semiconducting Crystals: an Instantaneous Decoherence Correction Approach

Authors: Wei Si, Chang-Qin Wu

Abstract: We explore an instantaneous decoherence correction (IDC) approach for the decoherence and energy relaxation in the quantum-classical dynamics of charge transport in organic semiconducting crystals. These effects, originating from environmental fluctuations, are essential ingredients of the carrier dynamics. The IDC is carried out by measurement-like operations in the adiabatic representation. Whil… ▽ More We explore an instantaneous decoherence correction (IDC) approach for the decoherence and energy relaxation in the quantum-classical dynamics of charge transport in organic semiconducting crystals. These effects, originating from environmental fluctuations, are essential ingredients of the carrier dynamics. The IDC is carried out by measurement-like operations in the adiabatic representation. While decoherence is inherent in the IDC, energy relaxation is taken into account by considering the detailed balance through the introduction of energy-dependent reweighing factors, which could be either Boltzmann (IDC-BM) or Miller-Abrahams (IDC-MA) type. For a non-diagonal electron-phonon coupling model, it is shown that the IDC tends to enhance diffusion while energy relaxation weakens this enhancement. As expected, both the IDC-BM and IDC-MA achieve a near-equilibrium distribution at finite temperatures in the diffusion process, while the Ehrenfest dynamics renders system tending to infinite temperature limit. The resulting energy relaxation times with the two kinds of factors lie in different regimes and exhibit different dependence on temperature, decoherence time and electron-phonon coupling strength, due to different dominant relaxation process. △ Less

Submitted 9 May, 2015; originally announced May 2015.

Comments: 8 pages, 4 figures

Journal ref: Journal of Chemical Physics 143, 024103 (2015)

Showing 1–50 of 67 results for author: Si, W