-
LLMAEL: Large Language Models are Good Context Augmenters for Entity Linking
Authors:
Amy Xin,
Yunjia Qi,
Zijun Yao,
Fangwei Zhu,
Kaisheng Zeng,
Xu Bin,
Lei Hou,
Juanzi Li
Abstract:
Entity Linking (EL) models are well-trained at mapping mentions to their corresponding entities according to a given context. However, EL models struggle to disambiguate long-tail entities due to their limited training data. Meanwhile, large language models (LLMs) are more robust at interpreting uncommon mentions. Yet, due to a lack of specialized training, LLMs suffer at generating correct entity…
▽ More
Entity Linking (EL) models are well-trained at mapping mentions to their corresponding entities according to a given context. However, EL models struggle to disambiguate long-tail entities due to their limited training data. Meanwhile, large language models (LLMs) are more robust at interpreting uncommon mentions. Yet, due to a lack of specialized training, LLMs suffer at generating correct entity IDs. Furthermore, training an LLM to perform EL is cost-intensive. Building upon these insights, we introduce LLM-Augmented Entity Linking LLMAEL, a plug-and-play approach to enhance entity linking through LLM data augmentation. We leverage LLMs as knowledgeable context augmenters, generating mention-centered descriptions as additional input, while preserving traditional EL models for task specific processing. Experiments on 6 standard datasets show that the vanilla LLMAEL outperforms baseline EL models in most cases, while the fine-tuned LLMAEL set the new state-of-the-art results across all 6 benchmarks.
△ Less
Submitted 15 July, 2024; v1 submitted 4 July, 2024;
originally announced July 2024.
-
Atomic-scale tunable phonon transport at tailored grain boundaries
Authors:
Xiaowang Wang,
Chaitanya A. Gadre,
Runqing Yang,
Wanjuan Zou,
Xing Bin,
Christopher Addiego,
Toshihiro Aoki,
Yujie Quan,
Wei-Tao Peng,
Yifeng Huang,
Chaojie Du,
Mingjie Xu,
Xingxu Yan,
Ruqian Wu,
Shyue Ping Ong,
Bolin Liao,
Penghui Cao,
Xiaoqing Pan
Abstract:
Manipulating thermal properties in materials has been of fundamental importance for advancing innovative technologies. Heat carriers such as phonons are impeded by breaking crystal symmetry or periodicity. Notable methods of impeding the phonon propagation include varying the density of defects, interfaces, and nanostructures, as well as changing composition. However, a robust link between the ind…
▽ More
Manipulating thermal properties in materials has been of fundamental importance for advancing innovative technologies. Heat carriers such as phonons are impeded by breaking crystal symmetry or periodicity. Notable methods of impeding the phonon propagation include varying the density of defects, interfaces, and nanostructures, as well as changing composition. However, a robust link between the individual nanoscale defect structures, phonon states, and macroscopic thermal conductivity is lacking. Here we reveal from nanoscale structure-phonon mechanisms on how the grain boundary (GB) tilt and twist angles fundamentally drive the changes in atom rearrangements, exotic vibrational states, and finally macroscopic heat transport at different bicrystal strontium titanate GBs using emerging atomic resolution vibrational spectroscopy. The 10 deg and 22 deg tilt GBs exhibit reduced phonon populations by 54% and 16% compared to the bulk value, respectively, consistent with measured thermal conductivities. A tiny twist angle further introduces a fine and local tunning of thermal conductivity by introducing twist induced defects periodically embedded with the tilt induced GB defects. Our results demonstrate that varying the tilt angle coarsely modifies the phonon population along entire GB while varying the twist angle incurs a finer adjustment at periodic locations on the GB. Our study offers a systematic approach to understanding and manipulating cross GB thermal transport of arbitrary GBs predictably and precisely.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
Trinity: Syncretizing Multi-/Long-tail/Long-term Interests All in One
Authors:
Jing Yan,
Liu Jiang,
Jianfei Cui,
Zhichen Zhao,
Xingyan Bin,
Feng Zhang,
Zuotao Liu
Abstract:
Interest modeling in recommender system has been a constant topic for improving user experience, and typical interest modeling tasks (e.g. multi-interest, long-tail interest and long-term interest) have been investigated in many existing works. However, most of them only consider one interest in isolation, while neglecting their interrelationships. In this paper, we argue that these tasks suffer f…
▽ More
Interest modeling in recommender system has been a constant topic for improving user experience, and typical interest modeling tasks (e.g. multi-interest, long-tail interest and long-term interest) have been investigated in many existing works. However, most of them only consider one interest in isolation, while neglecting their interrelationships. In this paper, we argue that these tasks suffer from a common "interest amnesia" problem, and a solution exists to mitigate it simultaneously. We figure that long-term cues can be the cornerstone since they reveal multi-interest and clarify long-tail interest. Inspired by the observation, we propose a novel and unified framework in the retrieval stage, "Trinity", to solve interest amnesia problem and improve multiple interest modeling tasks. We construct a real-time clustering system that enables us to project items into enumerable clusters, and calculate statistical interest histograms over these clusters. Based on these histograms, Trinity recognizes underdelivered themes and remains stable when facing emerging hot topics. Trinity is more appropriate for large-scale industry scenarios because of its modest computational overheads. Its derived retrievers have been deployed on the recommender system of Douyin, significantly improving user experience and retention. We believe that such practical experience can be well generalized to other scenarios.
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
Deep Retrieval: Learning A Retrievable Structure for Large-Scale Recommendations
Authors:
Weihao Gao,
Xiangjun Fan,
Chong Wang,
Jiankai Sun,
Kai Jia,
Wenzhi Xiao,
Ruofan Ding,
Xingyan Bin,
Hui Yang,
Xiaobing Liu
Abstract:
One of the core problems in large-scale recommendations is to retrieve top relevant candidates accurately and efficiently, preferably in sub-linear time. Previous approaches are mostly based on a two-step procedure: first learn an inner-product model, and then use some approximate nearest neighbor (ANN) search algorithm to find top candidates. In this paper, we present Deep Retrieval (DR), to lear…
▽ More
One of the core problems in large-scale recommendations is to retrieve top relevant candidates accurately and efficiently, preferably in sub-linear time. Previous approaches are mostly based on a two-step procedure: first learn an inner-product model, and then use some approximate nearest neighbor (ANN) search algorithm to find top candidates. In this paper, we present Deep Retrieval (DR), to learn a retrievable structure directly with user-item interaction data (e.g. clicks) without resorting to the Euclidean space assumption in ANN algorithms. DR's structure encodes all candidate items into a discrete latent space. Those latent codes for the candidates are model parameters and learnt together with other neural network parameters to maximize the same objective function. With the model learnt, a beam search over the structure is performed to retrieve the top candidates for reranking. Empirically, we first demonstrate that DR, with sub-linear computational complexity, can achieve almost the same accuracy as the brute-force baseline on two public datasets. Moreover, we show that, in a live production recommendation system, a deployed DR approach significantly outperforms a well-tuned ANN baseline in terms of engagement metrics. To the best of our knowledge, DR is among the first non-ANN algorithms successfully deployed at the scale of hundreds of millions of items for industrial recommendation systems.
△ Less
Submitted 18 May, 2021; v1 submitted 12 July, 2020;
originally announced July 2020.
-
Modulation of interfacial thermal transport between fumed silica nanoparticles by surface chemical functionalization for advanced thermal insulation
Authors:
Takashi Kodama,
Nobuhiro Shinohara,
Shih-Wei Hung,
Xu Bin,
Masanao Obori,
Donguk Suh,
Junichiro Shiomi
Abstract:
Since solid-state heat transport in a highly porous nanocomposite strongly depends on the thermal boundary conductance (TBC) between constituent nanomaterials, further suppression of the TBC is important for improving performance of thermal insulators. Here, targeting a nanocomposite fabricated by stamping fumed silica nanoparticles, we perform a wide variety of surface functionalization on fumed…
▽ More
Since solid-state heat transport in a highly porous nanocomposite strongly depends on the thermal boundary conductance (TBC) between constituent nanomaterials, further suppression of the TBC is important for improving performance of thermal insulators. Here, targeting a nanocomposite fabricated by stamping fumed silica nanoparticles, we perform a wide variety of surface functionalization on fumed silica nanoparticles by silane coupling method and investigate the impact on the thermal conductivity (Km). The Km of the silica nanocomposite is approximately 20 and 9 mW/m/K under atmospheric and vacuum condition at the material density of 0.2 g/cm3 without surface functionalization, respectively, and the experimental results indicate that the Km can be modulated depending on the chemical structure of molecules. The surface modification with a linear alkyl chain of optimal length significantly suppresses Km by approximately 30%, and the suppression can be further enhanced to approximately 50% with the infrared opacifier. The magnitude of suppression was found to sensitively depend on the length of terminal chain. The magnitude is also related to the number of reactive silanol groups in the chemical structure, where the surface modification with fluorocarbon gives the largest suppression. The surface hydrophobization merits thermal insulation through significant suppression of the TBC, presumably by reducing the water molecules that otherwise would serve as heat conduction channels at the interface. On the other hand, when the chain length is long, the suppression is counteracted by the enhanced phonon transmission through the silane coupling molecules that grows with the chain length. This is supported by the analytical model and present simulation results, leading to predict the optimal chemical structure for better thermal insulation.
△ Less
Submitted 16 March, 2021; v1 submitted 23 June, 2020;
originally announced June 2020.
-
Abnormal Subspace Sparse PCA for Anomaly Detection and Interpretation
Authors:
Xingyan Bin,
Ying Zhao,
Bilong Shen
Abstract:
The main shortage of principle component analysis (PCA) based anomaly detection models is their interpretability. In this paper, our goal is to propose an interpretable PCA-based model for anomaly detection and interpretation. The propose ASPCA model constructs principal components with sparse and orthogonal loading vectors to represent the abnormal subspace, and uses them to interpret detected an…
▽ More
The main shortage of principle component analysis (PCA) based anomaly detection models is their interpretability. In this paper, our goal is to propose an interpretable PCA-based model for anomaly detection and interpretation. The propose ASPCA model constructs principal components with sparse and orthogonal loading vectors to represent the abnormal subspace, and uses them to interpret detected anomalies. Our experiments on a synthetic dataset and two real world datasets showed that the proposed ASPCA models achieved comparable detection accuracies as the PCA model, and can provide interpretations for individual anomalies.
△ Less
Submitted 15 May, 2016;
originally announced May 2016.
-
Numerical study of the depinning transition of a ferromagnetic magnetic domain wall in films
Authors:
Bin Xi,
Meng-Bo Luo,
Valerii M. Vinokur,
Xiao Hu
Abstract:
We report first principle numerical study of domain wall (DW) depinning in two-dimensional magnetic film, which is modeled by 2D random-field Ising system with the dipole-dipole interaction. We observe nonconventional activation-type motion of DW and reveal its fractal structure of DW near the depinning transition. We determine scaling functions describing critical dynamics near the transition and…
▽ More
We report first principle numerical study of domain wall (DW) depinning in two-dimensional magnetic film, which is modeled by 2D random-field Ising system with the dipole-dipole interaction. We observe nonconventional activation-type motion of DW and reveal its fractal structure of DW near the depinning transition. We determine scaling functions describing critical dynamics near the transition and obtain universal exponents establishing connection between thermal softening of pinning potential and critical dynamics. We observe that tuning the strength of the dipole-dipole interaction switches DW dynamics between two different universality classes corresponding to two distinct dynamic regimes, motion in the random potential and that in the random force.
△ Less
Submitted 19 January, 2015;
originally announced January 2015.
-
Objective Information Theory: A Sextuple Model and 9 Kinds of Metrics
Authors:
Xu Jianfeng,
Tang Jun,
Ma Xuefeng,
Xu Bin,
Shen Yanli,
Qiao Yongjie
Abstract:
In the contemporary era, the importance of information is undisputed, but there has never been a common understanding of information, nor a unanimous conclusion to the researches on information metrics. Based on the previous studies, this paper analyzes the important achievements in the researches of the properties and metrics of information as well as their main insufficiencies, and explores the…
▽ More
In the contemporary era, the importance of information is undisputed, but there has never been a common understanding of information, nor a unanimous conclusion to the researches on information metrics. Based on the previous studies, this paper analyzes the important achievements in the researches of the properties and metrics of information as well as their main insufficiencies, and explores the essence and connotation, the mathematical expressions and other basic problems related to information. On the basis of the understanding of the objectivity of information, it proposes the definitions and a Sextuple model of information; discusses the basic properties of information, and brings forward the definitions and mathematical expressions of nine kinds of metrics of information, i.e., extensity, detailedness, sustainability, containability, delay, richness, distribution, validity and matchability. Through these, this paper establishes a basic theory frame of Objective Information Theory to support the analysis and research on information and information system systematically and comprehensively.
△ Less
Submitted 3 April, 2014; v1 submitted 15 August, 2013;
originally announced August 2013.
-
HCMU metrics with cusp singularities and conical singularities
Authors:
Chen Qing,
Wu Yingyi,
Xu Bin
Abstract:
An HCMU metric is a conformal metric which has a finite number of singularities on a compact Riemann surface and satisfies the equation of the extremal Kähler metric. In this paper, we give a necessary and sufficient condition for the existence of a kind of HCMU metrics which has both cusp singularities and conical singularities.
An HCMU metric is a conformal metric which has a finite number of singularities on a compact Riemann surface and satisfies the equation of the extremal Kähler metric. In this paper, we give a necessary and sufficient condition for the existence of a kind of HCMU metrics which has both cusp singularities and conical singularities.
△ Less
Submitted 26 February, 2013;
originally announced February 2013.
-
Low-energy properties of anisotropic two-dimensional spin-1/2 Heisenberg models in staggered magnetic fields
Authors:
Bin Xi,
Shijie Hu,
Jize Zhao,
Gang Su,
B. Normand,
Xiaoqun Wang
Abstract:
We present a systematic study of the anisotropic spin-1/2 Heisenberg model in staggered magnetic fields in two dimensions (2D). To mimic real materials, we consider a system of coupled, antiferromagnetic chains, whose interchain interaction can be either ferro- or antiferromagnetic. When the staggered field is commensurate with the magnetic interactions, an energy gap opens immediately and follows…
▽ More
We present a systematic study of the anisotropic spin-1/2 Heisenberg model in staggered magnetic fields in two dimensions (2D). To mimic real materials, we consider a system of coupled, antiferromagnetic chains, whose interchain interaction can be either ferro- or antiferromagnetic. When the staggered field is commensurate with the magnetic interactions, an energy gap opens immediately and follows a power law as a function of the applied field, similar to the situation in 1D. When the field competes with the interactions, a quantum phase transition (QPT) occurs from a gapless, magnetically ordered phase at low fields to a gapped, disordered regime. We use a continuous-time Monte Carlo method to compute the staggered moment of the ordered phases and the spin gap of the disordered phases. We deduce the phase diagrams as functions of the anisotropy ratio and the applied field, and calculate the scaling behavior of the models in both quantities. We show that in the competitive case, the staggered field acts to maintain a regime of quasi-1D behavior around the QPT, and we discuss as a consequence the nature of the crossover from 1D to 2D physics.
△ Less
Submitted 22 June, 2011;
originally announced June 2011.