
Showing 1–50 of 959 results for author: Lu, S

  1. arXiv:2408.14372  [pdf, other]

    cond-mat.dis-nn cond-mat.stat-mech cond-mat.str-el

    Exact Diagonalization Study on Avalanches in Many-Body Localized Constrained Spin Chains

    Authors: Shuangyuan Lu

    Abstract: Avalanche is believed to be the mechanism behind the transition from many-body localization to the thermal phase. We utilize spin chains with constraints to study the physics of quantum avalanches by exact diagonalization of disordered systems coupled to a thermal bath. Single-spin observables are used to characterize localization and quantify the influence of the thermal bath on long disordered s…

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: 17 pages, 46 figures

  2. arXiv:2408.14158  [pdf, other]

    cs.DC cs.AI

    Fire-Flyer AI-HPC: A Cost-Effective Software-Hardware Co-Design for Deep Learning

    Authors: Wei An, Xiao Bi, Guanting Chen, Shanhuang Chen, Chengqi Deng, Honghui Ding, Kai Dong, Qiushi Du, Wenjun Gao, Kang Guan, Jianzhong Guo, Yongqiang Guo, Zhe Fu, Ying He, Panpan Huang, Jiashi Li, Wenfeng Liang, Xiaodong Liu, Xin Liu, Yiyuan Liu, Yuxuan Liu, Shanghao Lu, Xuan Lu, Xiaotao Nie, Tian Pei, et al. (27 additional authors not shown)

    Abstract: The rapid progress in Deep Learning (DL) and Large Language Models (LLMs) has exponentially increased demands of computational power and bandwidth. This, combined with the high costs of faster computing chips and interconnects, has significantly inflated High Performance Computing (HPC) construction costs. To address these challenges, we introduce the Fire-Flyer AI-HPC architecture, a synergistic…

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: This is the preprint version of the paper accepted for presentation at the 2024 International Conference for High Performance Computing, Networking, Storage, and Analysis (SC'24). © 2024 IEEE. Personal use of this material is permitted. For other uses, permission from IEEE must be obtained. Please refer to IEEE Xplore for the final published version

  3. arXiv:2408.11801  [pdf, other]

    cs.CV

    Story3D-Agent: Exploring 3D Storytelling Visualization with Large Language Models

    Authors: Yuzhou Huang, Yiran Qin, Shunlin Lu, Xintao Wang, Rui Huang, Ying Shan, Ruimao Zhang

    Abstract: Traditional visual storytelling is complex, requiring specialized knowledge and substantial resources, yet often constrained by human creativity and creation precision. While Large Language Models (LLMs) enhance visual storytelling, current approaches often limit themselves to 2D visuals or oversimplify stories through motion synthesis and behavioral simulation, failing to create comprehensive, mu…

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: Project page: https://yuzhou914.github.io/Story3D-Agent/

  4. arXiv:2408.09380   

    cs.AI cs.IR

    ELASTIC: Efficient Linear Attention for Sequential Interest Compression

    Authors: Jiaxin Deng, Shiyao Wang, Song Lu, Yinfeng Li, Xinchen Luo, Yuanjun Liu, Peixing Xu, Guorui Zhou

    Abstract: State-of-the-art sequential recommendation models heavily rely on the transformer's attention mechanism. However, the quadratic computational and memory complexities of self-attention have limited its scalability for modeling users' long range behaviour sequences. To address this problem, we propose ELASTIC, an Efficient Linear Attention for SequenTial Interest Compression, requiring only linear time…

    Submitted 20 August, 2024; v1 submitted 18 August, 2024; originally announced August 2024.

    Comments: We hereby withdraw this paper from arXiv due to incomplete experiments. Upon further review, we have determined that additional experimental work is necessary to fully validate our findings and conclusions

  5. arXiv:2408.09085  [pdf, other]

    cs.CV

    Segment Anything with Multiple Modalities

    Authors: Aoran Xiao, Weihao Xuan, Heli Qi, Yun Xing, Naoto Yokoya, Shijian Lu

    Abstract: Robust and accurate segmentation of scenes has become one core functionality in various visual recognition and navigation tasks. This has inspired the recent development of Segment Anything Model (SAM), a foundation model for general mask segmentation. However, SAM is largely tailored for single-modal RGB images, limiting its applicability to multi-modal data captured with widely-adopted sensor su…

    Submitted 16 August, 2024; originally announced August 2024.

    Comments: Project page: https://xiaoaoran.github.io/projects/MM-SAM

  6. arXiv:2408.08143  [pdf, other]

    cs.CR cs.CV

    Unlearnable Examples Detection via Iterative Filtering

    Authors: Yi Yu, Qichen Zheng, Siyuan Yang, Wenhan Yang, Jun Liu, Shijian Lu, Yap-Peng Tan, Kwok-Yan Lam, Alex Kot

    Abstract: Deep neural networks are proven to be vulnerable to data poisoning attacks. Recently, a specific type of data poisoning attack known as availability attacks has led to the failure of data utilization for model learning by adding imperceptible perturbations to images. Consequently, it is quite beneficial and challenging to detect poisoned samples, also known as Unlearnable Examples (UEs), from a mi…

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: Accepted by ICANN 2024

  7. arXiv:2408.06381  [pdf, other]

    eess.IV cs.AI cs.CV

    Assessment of Cell Nuclei AI Foundation Models in Kidney Pathology

    Authors: Junlin Guo, Siqi Lu, Can Cui, Ruining Deng, Tianyuan Yao, Zhewen Tao, Yizhe Lin, Marilyn Lionts, Quan Liu, Juming Xiong, Catie Chang, Mitchell Wilkes, Mengmeng Yin, Haichun Yang, Yuankai Huo

    Abstract: Cell nuclei instance segmentation is a crucial task in digital kidney pathology. Traditional automatic segmentation methods often lack generalizability when applied to unseen datasets. Recently, the success of foundation models (FMs) has provided a more generalizable solution, potentially enabling the segmentation of any cell type. In this study, we perform a large-scale evaluation of three widely…

    Submitted 9 August, 2024; originally announced August 2024.

  8. arXiv:2408.05196  [pdf, other]

    cs.LG q-bio.BM

    Cell Morphology-Guided Small Molecule Generation with GFlowNets

    Authors: Stephen Zhewen Lu, Ziqing Lu, Ehsan Hajiramezanali, Tommaso Biancalani, Yoshua Bengio, Gabriele Scalia, Michał Koziarski

    Abstract: High-content phenotypic screening, including high-content imaging (HCI), has gained popularity in the last few years for its ability to characterize novel therapeutics without prior knowledge of the protein target. When combined with deep learning techniques to predict and represent molecular-phenotype interactions, these advancements hold the potential to significantly accelerate and enhance drug…

    Submitted 9 August, 2024; originally announced August 2024.

  9. arXiv:2408.03464  [pdf, other]

    cs.CV cs.LG

    AI Foundation Models in Remote Sensing: A Survey

    Authors: Siqi Lu, Junlin Guo, James R Zimmer-Dauphinee, Jordan M Nieusma, Xiao Wang, Parker VanValkenburgh, Steven A Wernke, Yuankai Huo

    Abstract: Artificial Intelligence (AI) technologies have profoundly transformed the field of remote sensing, revolutionizing data collection, processing, and analysis. Traditionally reliant on manual interpretation and task-specific models, remote sensing has been significantly enhanced by the advent of foundation models--large-scale, pre-trained AI models capable of performing a wide array of tasks with un…

    Submitted 6 August, 2024; originally announced August 2024.

  10. arXiv:2408.02669  [pdf, other]

    astro-ph.GA

    SDSS-IV MaNGA: Stellar rotational support in disk galaxies vs. central surface density and stellar population age

    Authors: Xiaohan Wang, Yifei Luo, S. M. Faber, David C. Koo, Shude Mao, Kyle B. Westfall, Shengdong Lu, Weichen Wang, Kevin Bundy, N. Boardman, Vladimir Avila-Reese, José G. Fernández-Trincado, Richard R. Lane

    Abstract: We investigate how the stellar rotational support changes as a function of spatially resolved stellar population age ($\rm D_n4000$) and relative central stellar surface density ($ΔΣ_1$) for MaNGA isolated/central disk galaxies. We find that the galaxy rotational support $λ_{R_\mathrm{e}}$ varies smoothly as a function of $ΔΣ_1$ and $\rm D_n4000$. $\rm D_n4000$ vs. $ΔΣ_1$ follows a "J-shape", with…

    Submitted 5 August, 2024; originally announced August 2024.

    Comments: 24 pages, 22 figures (including Appendix), accepted for publication in MNRAS

  11. arXiv:2408.01897  [pdf, other]

    cs.CV

    CAF-YOLO: A Robust Framework for Multi-Scale Lesion Detection in Biomedical Imagery

    Authors: Zilin Chen, Shengnan Lu

    Abstract: Object detection is of paramount importance in biomedical image analysis, particularly for lesion identification. While current methodologies are proficient in identifying and pinpointing lesions, they often lack the precision needed to detect minute biomedical entities (e.g., abnormal cells, lung nodules smaller than 3 mm), which are critical in blood and lung pathology. To address this challenge…

    Submitted 3 August, 2024; originally announced August 2024.

  12. arXiv:2408.00278  [pdf, other]

    cs.LG cs.AI cs.NE

    High Performance Im2win and Direct Convolutions using Three Tensor Layouts on SIMD Architectures

    Authors: Xiang Fu, Xinpeng Zhang, Jixiang Ma, Peng Zhao, Shuai Lu, Xu T. Liu

    Abstract: Convolution is the core component within deep neural networks and it is computationally intensive and time consuming. Tensor data layouts significantly impact convolution operations in terms of memory access and computational efficiency. Yet, there is still a lack of comprehensive performance characterization on data layouts on SIMD architectures concerning convolution methods. This paper proposes…

    Submitted 1 August, 2024; originally announced August 2024.

  13. arXiv:2407.19521  [pdf, other]

    astro-ph.CO gr-qc hep-ph

    Ultra-light dark matter with non-canonical kinetics reopening the mass window

    Authors: Shiyun Lu, Amara Ilyas, Xiao-Han Ma, Bo Wang, Dongdong Zhang, Yi-Fu Cai

    Abstract: Fuzzy dark matter (FDM) with mass around $10^{-22}$ eV is viewed as a promising paradigm in understanding the structure formation of the local universe at small scales. Recent observations, however, begin to challenge FDM in return. We focus on the arguments between the solution to CDM small-scale curiosities and recent observations on matter power spectrum, and find its implication on an earlier…

    Submitted 28 July, 2024; originally announced July 2024.

    Comments: 25 pages, 10 figures

  14. arXiv:2407.18581  [pdf, other]

    cs.CL cs.AI

    Dynamic Language Group-Based MoE: Enhancing Code-Switching Speech Recognition with Hierarchical Routing

    Authors: Hukai Huang, Shenghui Lu, Yahui Shan, He Qu, Wenhao Guan, Qingyang Hong, Lin Li

    Abstract: The Mixture of Experts (MoE) approach is well-suited for multilingual and code-switching (CS) tasks due to its multi-expert architecture. This work introduces the DLG-MoE, a Dynamic Language Group-based MoE optimized for bilingual and CS scenarios. DLG-MoE operates based on a hierarchical routing mechanism. First, the language router explicitly models the language and dispatches the representation…

    Submitted 7 August, 2024; v1 submitted 26 July, 2024; originally announced July 2024.

  15. arXiv:2407.18365  [pdf, other]

    cs.LG cs.AI cs.DC math.OC

    FADAS: Towards Federated Adaptive Asynchronous Optimization

    Authors: Yujia Wang, Shiqiang Wang, Songtao Lu, Jinghui Chen

    Abstract: Federated learning (FL) has emerged as a widely adopted training paradigm for privacy-preserving machine learning. While the SGD-based FL algorithms have demonstrated considerable success in the past, there is a growing trend towards adopting adaptive federated optimization methods, particularly for training large-scale models. However, the conventional synchronous aggregation design poses a signi…

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: Accepted by ICML 2024

  16. arXiv:2407.09299  [pdf, other]

    cs.CV

    PID: Physics-Informed Diffusion Model for Infrared Image Generation

    Authors: Fangyuan Mao, Jilin Mei, Shun Lu, Fuyang Liu, Liang Chen, Fangzhou Zhao, Yu Hu

    Abstract: Infrared imaging technology has gained significant attention for its reliable sensing ability in low visibility conditions, prompting many studies to convert the abundant RGB images to infrared images. However, most existing image translation methods treat infrared images as a stylistic variation, neglecting the underlying physical laws, which limits their practical application. To address these i…

    Submitted 12 July, 2024; originally announced July 2024.

  17. arXiv:2407.08496  [pdf, ps, other]

    math.DG math.GT

    Convergences of Combinatorial Ricci Flows to Degenerated Circle Packings in Hyperbolic Background Geometry

    Authors: Guangming Hu, Sicheng Lu, Dong Tan, Youliang Zhong, Puchun Zhou

    Abstract: This paper investigates a kind of degenerated circle packings in hyperbolic background geometry. A main problem is whether a prescribed total geodesic curvature data can be realized by a degenerated circle packing or not. We fully characterize the sufficient and necessary conditions and show the uniqueness. Furthermore, we introduce the combinatorial Ricci flow to find the desired degenerated circl…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 36 pages, 9 figures

    MSC Class: 52C26; 57M50

  18. arXiv:2407.07016  [pdf]

    cond-mat.mtrl-sci

    Is Large Language Model All You Need to Predict the Synthesizability and Precursors of Crystal Structures?

    Authors: Zhilong Song, Shuaihua Lu, Minggang Ju, Qionghua Zhou, Jinlan Wang

    Abstract: Assessing the synthesizability of crystal structures is pivotal for advancing the practical application of theoretical material structures designed by machine learning or high-throughput screening. However, a significant gap exists between the actual synthesizability and thermodynamic or kinetic stability, which is commonly used for screening theoretical structures for experiments. To address this…

    Submitted 9 July, 2024; originally announced July 2024.

  19. arXiv:2407.06915  [pdf, ps, other]

    cs.RO

    FE-GUT: Factor Graph Optimization hybrid with Extended Kalman Filter for tightly coupled GNSS/UWB Integration

    Authors: Qijia Zhao, Shaolin Lü, Jianan Lou, Rong Zhang

    Abstract: Precise positioning and navigation information has become increasingly important with the development of the consumer electronics market. Because the Global Navigation Satellite System (GNSS) has some deficits, such as susceptibility to interference, integrating GNSS with additional alternative sensors is a promising approach to overcome the performance limitations of GNSS-based localization systems. Ult…

    Submitted 9 July, 2024; originally announced July 2024.

  20. arXiv:2407.06784  [pdf, other]

    math.NA

    Preasymptotic error estimates of EEM and CIP-EEM for the time-harmonic Maxwell equations with large wave number

    Authors: Shuaishuai Lu, Haijun Wu

    Abstract: Preasymptotic error estimates are derived for the linear edge element method (EEM) and the linear $\boldsymbol{H}(\boldsymbol{\mathrm{curl}})$-conforming interior penalty edge element method (CIP-EEM) for the time-harmonic Maxwell equations with large wave number. It is shown that under the mesh condition that $κ^3 h^2$ is sufficiently small, the errors of the solutions to both methods are bounded…

    Submitted 9 July, 2024; originally announced July 2024.

  21. arXiv:2407.06489  [pdf]

    cond-mat.mtrl-sci

    T2MAT (text-to-materials): A universal framework for generating material structures with goal properties from a single sentence

    Authors: Zhilong Song, Shuaihua Lu, Qionghua Zhou, Jinlan Wang

    Abstract: Artificial Intelligence-Generated Content (AIGC), content autonomously produced by AI systems without human intervention, has significantly boosted efficiency across various fields. However, the AIGC in material science faces challenges in the ability to efficiently discover innovative materials that surpass existing databases, alongside the invariances and stability considerations of crystal struct…

    Submitted 8 July, 2024; originally announced July 2024.

  22. arXiv:2407.05078  [pdf, ps, other]

    math.NA

    Function and derivative approximation by shallow neural networks

    Authors: Yuanyuan Li, Shuai Lu

    Abstract: We investigate a Tikhonov regularization scheme specifically tailored for shallow neural networks within the context of solving a classic inverse problem: approximating an unknown function and its derivatives within a unit cubic domain based on noisy measurements. The proposed Tikhonov regularization scheme incorporates a penalty term that takes three distinct yet intricately related network (semi…

    Submitted 6 July, 2024; originally announced July 2024.

    MSC Class: 65D15; 65F22; 65J20

  23. arXiv:2407.04995  [pdf]

    physics.optics

    A Broadband Algorithm for Adiabatic Mode Evolution and its Application on Polarization Splitter-Rotator on LNOI Platform

    Authors: Geng Chen, Chijun Li, Xuanhao Wang, An Pan, Junjie Wei, Yuankang Huang, Siyu Lu, Yiqi Dai, Xiangyu Meng, Cheng Zeng, Jinsong Xia

    Abstract: Adiabatic mode evolution waveguides (AMEWs) are widely utilized in integrated photonics, including tapered waveguides, edge couplers, mode converters, splitters, etc. An analytical theory and a novel AMEW design algorithm are developed to create shortcuts to adiabaticity (STA). This new algorithm is effective in shortening the total length of the AMEW while maintaining the desired wavelength range…

    Submitted 22 July, 2024; v1 submitted 6 July, 2024; originally announced July 2024.

    Comments: 16 pages, 6 figures, 2 tables

  24. arXiv:2407.04909  [pdf, ps, other]

    math.PR

    The averaging principle of stochastic functional partial differential equations with Hölder coefficients and infinite delay

    Authors: Shuaishuai Lu, Xue Yang, Yong Li

    Abstract: In this paper, we establish the averaging principle for stochastic functional partial differential equations (SFPDEs) characterized by Hölder coefficients and infinite delay. Firstly, we rigorously establish the existence and uniqueness of strong solutions for a specific class of finite-dimensional systems characterized by Hölder continuous coefficients and infinite delay. We extend these results…

    Submitted 5 July, 2024; originally announced July 2024.

  25. arXiv:2407.02973  [pdf, other]

    astro-ph.GA

    NOEMA formIng Cluster survEy (NICE): Characterizing eight massive galaxy groups at $1.5 < z < 4$ in the COSMOS field

    Authors: Nikolaj B. Sillassen, Shuowen Jin, Georgios E. Magdis, Emanuele Daddi, Tao Wang, Shiying Lu, Hanwen Sun, Vinod Arumugam, Daizhong Liu, Malte Brinch, Chiara D'Eugenio, Raphael Gobat, Carlos Gómez-Guijarro, Michael Rich, Eva Schinnerer, Veronica Strazzullo, Qinghua Tan, Francesco Valentino, Yijun Wang, Mengyuan Xiao, Luwenjia Zhou, David Blánquez-Sesé, Zheng Cai, Yanmei Chen, Laure Ciesla, et al. (19 additional authors not shown)

    Abstract: The NOEMA formIng Cluster survEy (NICE) is a large program targeting 69 massive galaxy group candidates at $z>2$ in six deep fields. We report spectroscopic confirmation of eight groups at $1.65\leq z\leq3.61$ in COSMOS. Homogeneously selected as significant overdensities of red IRAC sources with red Herschel colors, four groups are confirmed by CO and [CI] with NOEMA 3mm observations, three are c…

    Submitted 5 July, 2024; v1 submitted 3 July, 2024; originally announced July 2024.

    Comments: 44 pages (27pp appendix), 32 figures, 18 tables, accepted for publication in A&A

  26. arXiv:2407.00588  [pdf, other]

    math.AP math.NA

    Forward and backward problems for coupled subdiffusion systems

    Authors: Dian Feng, Yikan Liu, Shuai Lu

    Abstract: In this article, we investigate both forward and backward problems for coupled systems of time-fractional diffusion equations, encompassing scenarios of strong coupling. For the forward problem, we establish the well-posedness of the system, leveraging the eigensystem of the corresponding elliptic system as the foundation. When considering the backward problem, specifically the determination of in…

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 26 pages, 7 figures

    MSC Class: 35R11; 35K58; 35B44

  27. arXiv:2407.00178  [pdf, other]

    physics.ins-det

    Shower Separation in Five Dimensions for Highly Granular Calorimeters using Machine Learning

    Authors: S. Lai, J. Utehs, A. Wilhahn, M. C. Fouz, O. Bach, E. Brianne, A. Ebrahimi, K. Gadow, P. Göttlicher, O. Hartbrich, D. Heuchel, A. Irles, K. Krüger, J. Kvasnicka, S. Lu, C. Neubüser, A. Provenza, M. Reinecke, F. Sefkow, S. Schuwalow, M. De Silva, Y. Sudo, H. L. Tran, L. Liu, R. Masuda, et al. (26 additional authors not shown)

    Abstract: To achieve state-of-the-art jet energy resolution for Particle Flow, sophisticated energy clustering algorithms must be developed that can fully exploit available information to separate energy deposits from charged and neutral particles. Three published neural network-based shower separation models were applied to simulation and experimental data to measure the performance of the highly granular…

    Submitted 28 June, 2024; originally announced July 2024.

  28. arXiv:2406.16005  [pdf, other]

    cs.DC

    A Tale of Two Paths: Toward a Hybrid Data Plane for Efficient Far-Memory Applications

    Authors: Lei Chen, Shi Liu, Chenxi Wang, Haoran Ma, Yifan Qiao, Zhe Wang, Chenggang Wu, Youyou Lu, Xiaobing Feng, Huimin Cui, Shan Lu, Harry Xu

    Abstract: With rapid advances in network hardware, far memory has gained a great deal of traction due to its ability to break the memory capacity wall. Existing far memory systems fall into one of two data paths: one that uses the kernel's paging system to transparently access far memory at the page granularity, and a second that bypasses the kernel, fetching data at the object granularity. While it is gene…

    Submitted 23 June, 2024; originally announced June 2024.

  29. arXiv:2406.15484  [pdf, other]

    cs.CL cs.AI cs.CY

    JobFair: A Framework for Benchmarking Gender Hiring Bias in Large Language Models

    Authors: Ze Wang, Zekun Wu, Xin Guan, Michael Thaler, Adriano Koshiyama, Skylar Lu, Sachin Beepath, Ediz Ertekin Jr., Maria Perez-Ortiz

    Abstract: This paper presents a novel framework for benchmarking hierarchical gender hiring bias in Large Language Models (LLMs) for resume scoring, revealing significant issues of reverse bias and overdebiasing. Our contributions are fourfold: First, we introduce a framework using a real, anonymized resume dataset from the Healthcare, Finance, and Construction industries, meticulously used to avoid confoun…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Submitted to EMNLP 2024

  30. arXiv:2406.14523  [pdf, other]

    cond-mat.str-el cond-mat.supr-con

    Optical and Raman selection rules for odd-parity clean superconductors

    Authors: Shuangyuan Lu, Xu Yang, Yuan-Ming Lu

    Abstract: We derive selection rules in optical absorption and Raman scattering spectra, that can determine the parity of pairing order parameters under inversion symmetry in two classes of \emph{clean} superconductors: (i) chiral superconductors with strong spin-orbit couplings, (ii) singlet superconductors with negligible spin-orbit couplings. Experimentally, the inversion parity of pair wave functions can…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 16 pages, 12 figures

    Journal ref: Phys. Rev. B 109, 245119 (2024)

  31. arXiv:2406.12718  [pdf, other]

    cs.CV cs.AI cs.CL

    AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention

    Authors: Wenbin An, Feng Tian, Sicong Leng, Jiahao Nie, Haonan Lin, QianYing Wang, Guang Dai, Ping Chen, Shijian Lu

    Abstract: Despite their great success across various multimodal tasks, Large Vision-Language Models (LVLMs) are facing a prevalent problem with object hallucinations, where the generated textual responses are inconsistent with ground-truth objects in the given image. This paper investigates various LVLMs and pinpoints attention deficiency toward discriminative local image features as one root cause of objec…

    Submitted 21 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  32. arXiv:2406.12386  [pdf, other]

    cs.CL

    IPEval: A Bilingual Intellectual Property Agency Consultation Evaluation Benchmark for Large Language Models

    Authors: Qiyao Wang, Jianguo Huang, Shule Lu, Yuan Lin, Kan Xu, Liang Yang, Hongfei Lin

    Abstract: The rapid development of Large Language Models (LLMs) in vertical domains, including intellectual property (IP), lacks a specific evaluation benchmark for assessing their understanding, application, and reasoning abilities. To fill this gap, we introduce IPEval, the first evaluation benchmark tailored for IP agency and consulting tasks. IPEval comprises 2657 multiple-choice questions across four m…

    Submitted 18 June, 2024; originally announced June 2024.

  33. arXiv:2406.11937  [pdf, other]

    physics.ins-det hep-ex physics.data-an

    Using graph neural networks to reconstruct charged pion showers in the CMS High Granularity Calorimeter

    Authors: M. Aamir, B. Acar, G. Adamov, T. Adams, C. Adloff, S. Afanasiev, C. Agrawal, C. Agrawal, A. Ahmad, H. A. Ahmed, S. Akbar, N. Akchurin, B. Akgul, B. Akgun, R. O. Akpinar, E. Aktas, A. AlKadhim, V. Alexakhin, J. Alimena, J. Alison, A. Alpana, W. Alshehri, P. Alvarez Dominguez, M. Alyari, C. Amendola, et al. (550 additional authors not shown)

    Abstract: A novel method to reconstruct the energy of hadronic showers in the CMS High Granularity Calorimeter (HGCAL) is presented. The HGCAL is a sampling calorimeter with very fine transverse and longitudinal granularity. The active media are silicon sensors and scintillator tiles read out by SiPMs, and the absorbers are a combination of lead and Cu/CuW in the electromagnetic section, and steel in the hadr…

    Submitted 30 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: Prepared for submission to JINST

  34. arXiv:2406.11571  [pdf, other]

    astro-ph.GA

    PRIMER: JWST/MIRI reveals the evolution of star-forming structures in galaxies at z<2.5

    Authors: Yipeng Lyu, Benjamin Magnelli, David Elbaz, Pablo G. Pérez-González, Camila Correa, Emanuele Daddi, Carlos Gómez-Guijarro, James S. Dunlop, Norman A. Grogin, Anton M. Koekemoer, Derek J. McLeod, Shiying Lu

    Abstract: The stellar structures of star-forming galaxies (SFGs) undergo significant size growth during their mass assembly and must pass through a compaction phase as they evolve into quiescent galaxies (QGs). To shed light on the mechanisms behind this structural evolution, we study the morphology of the star-forming components of 665 SFGs at 0<z<2.5 measured using JWST/MIRI observation and compare them w…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 24 pages, 17 figures, submitted to A&A, comments are welcome

  35. arXiv:2406.10724  [pdf, other]

    eess.IV cs.CV cs.LG

    Beyond the Visible: Jointly Attending to Spectral and Spatial Dimensions with HSI-Diffusion for the FINCH Spacecraft

    Authors: Ian Vyse, Rishit Dagli, Dav Vrat Chadha, John P. Ma, Hector Chen, Isha Ruparelia, Prithvi Seran, Matthew Xie, Eesa Aamer, Aidan Armstrong, Naveen Black, Ben Borstein, Kevin Caldwell, Orrin Dahanaggamaarachchi, Joe Dai, Abeer Fatima, Stephanie Lu, Maxime Michet, Anoushka Paul, Carrie Ann Po, Shivesh Prakash, Noa Prosser, Riddhiman Roy, Mirai Shinjo, Iliya Shofman, et al. (4 additional authors not shown)

    Abstract: Satellite remote sensing missions have gained popularity over the past fifteen years due to their ability to cover large swaths of land at regular intervals, making them ideal for monitoring environmental trends. The FINCH mission, a 3U+ CubeSat equipped with a hyperspectral camera, aims to monitor crop residue cover in agricultural fields. Although hyperspectral imaging captures both spectral and…

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: To appear in 38th Annual Small Satellite Conference

  36. arXiv:2406.10511  [pdf, other]

    cs.DC cs.AR cs.PF math.NA

    Efficient Hardware Accelerator Based on Medium Granularity Dataflow for SpTRSV

    Authors: Qian Chen, Xiaofeng Yang, Shengli Lu

    Abstract: Sparse triangular solve (SpTRSV) is widely used in various domains. Numerous studies have been conducted using CPUs, GPUs, and specific hardware accelerators, where dataflow can be categorized into coarse and fine granularity. Coarse dataflow offers good spatial locality but suffers from low parallelism, while fine dataflow provides high parallelism but disrupts the spatial structure, leading to i…

    Submitted 27 June, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

  37. arXiv:2406.10416  [pdf, other]

    cs.CR cs.DC cs.LG

    Byzantine-Robust Decentralized Federated Learning

    Authors: Minghong Fang, Zifan Zhang, Hairi, Prashant Khanduri, Jia Liu, Songtao Lu, Yuchen Liu, Neil Gong

    Abstract: Federated learning (FL) enables multiple clients to collaboratively train machine learning models without revealing their private training data. In conventional FL, the system follows the server-assisted architecture (server-assisted FL), where the training process is coordinated by a central server. However, the server-assisted FL framework suffers from poor scalability due to a communication bot…

    Submitted 13 July, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: To appear in ACM Conference on Computer and Communications Security 2024 (CCS '24)

  38. arXiv:2406.09121  [pdf, other]

    cs.CV

    MMRel: A Relation Understanding Dataset and Benchmark in the MLLM Era

    Authors: Jiahao Nie, Gongjie Zhang, Wenbin An, Yap-Peng Tan, Alex C. Kot, Shijian Lu

    Abstract: Despite the recent advancements in Multi-modal Large Language Models (MLLMs), understanding inter-object relations, i.e., interactions or associations between distinct objects, remains a major challenge for such models. This issue significantly hinders their advanced reasoning capabilities and is primarily due to the lack of large-scale, high-quality, and diverse multi-modal data essential for tra…

    Submitted 13 June, 2024; originally announced June 2024.

  39. arXiv:2406.04252  [pdf]

    cond-mat.mtrl-sci cond-mat.mes-hall

    Sub-nanometer depth resolution and single dopant visualization achieved by tilt-coupled multislice electron ptychography

    Authors: Zehao Dong, Yang Zhang, Chun-Chien Chiu, Sicheng Lu, Jianbing Zhang, Yu-Chen Liu, Suya Liu, Jan-Chi Yang, Pu Yu, Yayu Wang, Zhen Chen

    Abstract: Real-space imaging of three-dimensional atomic structures is a critical yet challenging task in materials science. Although scanning transmission electron microscopy has achieved sub-angstrom lateral resolution through techniques like electron ptychography [1,2], depth resolution remains limited to only 2 to 3 nanometers with a single projection setup [3,4]. Attaining better depth resolution typically n…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 27 pages, 5 figures, 10 supplementary figures

  40. arXiv:2406.03496  [pdf, other]

    cs.CL cs.AI cs.LG

    Wings: Learning Multimodal LLMs without Text-only Forgetting

    Authors: Yi-Kai Zhang, Shiyin Lu, Yang Li, Yanqing Ma, Qing-Guo Chen, Zhao Xu, Weihua Luo, Kaifu Zhang, De-Chuan Zhan, Han-Jia Ye

    Abstract: Multimodal large language models (MLLMs), initiated with a trained LLM, first align images with text and then fine-tune on multimodal mixed inputs. However, the MLLM catastrophically forgets the text-only instructions, which do not include images and can be addressed within the initial LLM. In this paper, we present Wings, a novel MLLM that excels in both text-only dialogues and multimodal compreh…

    Submitted 5 June, 2024; originally announced June 2024.

  41. arXiv:2406.02672  [pdf, other]

    astro-ph.GA astro-ph.CO

    A comparison of pre-existing $Λ$CDM predictions with the abundance of JWST galaxies at high redshift

    Authors: Shengdong Lu, Carlos S. Frenk, Sownak Bose, Cedric G. Lacey, Shaun Cole, Carlton M. Baugh, John C. Helly

    Abstract: Observations with the James Webb Space Telescope have revealed a high abundance of bright galaxies at redshift $z\gtrsim 12$, which has been widely interpreted as conflicting with the $Λ$CDM model. In Cowley et al. (2018), predictions were made, prior to the JWST observations, for the expected abundance of these galaxies using the Durham semi-analytic galaxy formation model, GALFORM, which is kn…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 14 pages, 8 figures, submitted to MNRAS on 4 June, 2024

  42. arXiv:2406.02539  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    Parrot: Multilingual Visual Instruction Tuning

    Authors: Hai-Long Sun, Da-Wei Zhou, Yang Li, Shiyin Lu, Chao Yi, Qing-Guo Chen, Zhao Xu, Weihua Luo, Kaifu Zhang, De-Chuan Zhan, Han-Jia Ye

    Abstract: The rapid development of Multimodal Large Language Models (MLLMs) like GPT-4V has marked a significant step towards artificial general intelligence. Existing methods mainly focus on aligning vision encoders with LLMs through supervised fine-tuning (SFT) to endow LLMs with multimodal abilities, making MLLMs' inherent ability to react to multiple languages progressively deteriorate as the training p…

    Submitted 11 August, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: Code is available at: https://github.com/AIDC-AI/Parrot

  43. arXiv:2406.02260  [pdf]

    cond-mat.mes-hall cond-mat.mtrl-sci

    Near-Room-Temperature Field-Controllable Exchange Bias in 2D van der Waals Ferromagnet Fe3GaTe2

    Authors: Jifeng Shao, Xiaolong Yin, Chunhao Bao, Sirong Lu, Xiaoming Ma, Shu Guo, Le Wang, Xi Zhang, Zhiyue Li, Longxiang Li, Yue Zhao, Tingyong Chen

    Abstract: Exchange bias (EB) is a cornerstone of modern magnetic memory and sensing technologies. Its extension to the realm of two-dimensional (2D) van der Waals (vdW) magnets holds promise for revolutionary advancements in miniaturized and efficient atomic spintronic devices. However, the blocking temperature of EB in 2D vdW magnets is currently well below room temperature, at ~130 K. This study reports a rob…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 14 pages, 5 figures

  44. arXiv:2406.00734  [pdf, other]

    cs.LG

    GLADformer: A Mixed Perspective for Graph-level Anomaly Detection

    Authors: Fan Xu, Nan Wang, Hao Wu, Xuezhi Wen, Dalin Zhang, Siyang Lu, Binyong Li, Wei Gong, Hai Wan, Xibin Zhao

    Abstract: Graph-Level Anomaly Detection (GLAD) aims to distinguish anomalous graphs within a graph dataset. However, current methods are constrained by their receptive fields, struggling to learn global features within the graphs. Moreover, most contemporary methods are based on the spatial domain and lack exploration of spectral characteristics. In this paper, we propose a multi-perspective hybrid graph-level…

    Submitted 3 July, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

  45. arXiv:2405.20797  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    Ovis: Structural Embedding Alignment for Multimodal Large Language Model

    Authors: Shiyin Lu, Yang Li, Qing-Guo Chen, Zhao Xu, Weihua Luo, Kaifu Zhang, Han-Jia Ye

    Abstract: Current Multimodal Large Language Models (MLLMs) typically integrate a pre-trained LLM with another pre-trained vision transformer through a connector, such as an MLP, endowing the LLM with visual capabilities. However, the misalignment between two embedding strategies in MLLMs -- the structural textual embeddings based on an embedding look-up table and the continuous embeddings generated directly…

    Submitted 17 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

  46. arXiv:2405.20598  [pdf]

    cond-mat.str-el cond-mat.mes-hall

    Mott insulating phase and coherent-incoherent crossover across magnetic phase transition in 2D antiferromagnetic CrSBr

    Authors: Fan Wu, Xuefeng Zhang, Yi Chen, Ding Pei, Mengwen Zhan, Zicheng Tao, Cheng Chen, Shipeng Lu, Jingzhi Chen, Shujie Tang, Xia Wang, Yanfeng Guo, Lexian Yang, Yan Zhang, Yulin Chen, Qixi Mi, Gang Li, Zhongkai Liu

    Abstract: In two-dimensional van der Waals magnetic materials, the interplay between magnetism and electron correlation can give rise to new ground states and lead to novel transport and optical properties. A fundamental question in these materials is how the electron correlation manifests and interacts with the magnetic orders. In this study, we demonstrate that the recently discovered 2D antiferromagnetic…

    Submitted 30 May, 2024; originally announced May 2024.

  47. arXiv:2405.20340  [pdf, other]

    cs.CV

    MotionLLM: Understanding Human Behaviors from Human Motions and Videos

    Authors: Ling-Hao Chen, Shunlin Lu, Ailing Zeng, Hao Zhang, Benyou Wang, Ruimao Zhang, Lei Zhang

    Abstract: This study delves into the realm of multi-modality (i.e., video and motion modalities) human behavior understanding by leveraging the powerful capabilities of Large Language Models (LLMs). Diverging from recent LLMs designed for video-only or motion-only understanding, we argue that understanding human behavior necessitates joint modeling from both videos and motion sequences (e.g., SMPL sequences…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: MotionLLM version 1.0, project page see https://lhchen.top/MotionLLM

  48. arXiv:2405.19767  [pdf]

    physics.geo-ph

    MAE-GAN: A Novel Strategy for Simultaneous Super-resolution Reconstruction and Denoising of Post-stack Seismic Profile

    Authors: Wenshuo Yu, Shiqi Dong, Shaoping Lu, Xintong Dong

    Abstract: Post-stack seismic profiles are images reflecting geological structures, which provide a critical foundation for understanding the distribution of oil and gas resources. However, due to the limitations of seismic acquisition equipment and data collecting geometry, the post-stack profiles suffer from low resolution and strong noise issues, which severely affect subsequent seismic interp…

    Submitted 30 May, 2024; originally announced May 2024.

  49. arXiv:2405.19487  [pdf, other]

    cs.CL

    A Full-duplex Speech Dialogue Scheme Based On Large Language Models

    Authors: Peng Wang, Songshuo Lu, Yaohua Tang, Sijie Yan, Yuanjun Xiong, Wei Xia

    Abstract: We present a generative dialogue system capable of operating in a full-duplex manner, allowing for seamless interaction. It is based on a large language model (LLM) carefully aligned to be aware of a perception module, a motor function module, and the concept of a simple finite state machine (called neural FSM) with two states. The perception and motor function modules operate simultaneously, allo…

    Submitted 29 May, 2024; originally announced May 2024.

  50. arXiv:2405.18891  [pdf]

    cond-mat.mtrl-sci physics.app-ph

    Inverse Design of Promising Alloys for Electrocatalytic CO$_2$ Reduction via Generative Graph Neural Networks Combined with Bird Swarm Algorithm

    Authors: Zhilong Song, Linfeng Fan, Shuaihua Lu, Qionghua Zhou, Chongyi Ling, Jinlan Wang

    Abstract: Directly generating material structures with optimal properties is a long-standing goal in material design. One of the fundamental challenges lies in how to overcome the limitation of traditional generative models to efficiently explore the global chemical space rather than a small localized space. Herein, we develop a framework named MAGECS to address this dilemma, by integrating the bird swarm a…

    Submitted 29 May, 2024; originally announced May 2024.