Zum Hauptinhalt springen

Showing 1–20 of 20 results for author: Xi, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.13005  [pdf

    cs.CL cs.AI cs.SI

    Understanding the Rare Inflammatory Disease Using Large Language Models and Social Media Data

    Authors: Nan Miles Xi, Hong-Long Ji, Lin Wang

    Abstract: Sarcoidosis is a rare inflammatory disease characterized by the formation of granulomas in various organs. The disease presents diagnostic and treatment challenges due to its diverse manifestations and unpredictable nature. In this study, we employed a Large Language Model (LLM) to analyze sarcoidosis-related discussions on the social media platform Reddit. Our findings underscore the efficacy of… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

  2. arXiv:2403.01969  [pdf, other

    cs.CL

    AS-ES Learning: Towards Efficient CoT Learning in Small Models

    Authors: Nuwa Xi, Yuhan Chen, Sendong Zhao, Haochun Wang, Bing Qin, Ting Liu

    Abstract: Chain-of-Thought (CoT) serves as a critical emerging ability in LLMs, especially when it comes to logical reasoning. Attempts have been made to induce such ability in small models as well by distilling from the data with CoT generated by Large Language Models (LLMs). However, existing methods often simply generate and incorporate more data from LLMs and fail to note the importance of efficiently u… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  3. arXiv:2402.01349  [pdf, other

    cs.CL cs.AI

    Beyond the Answers: Reviewing the Rationality of Multiple Choice Question Answering for the Evaluation of Large Language Models

    Authors: Haochun Wang, Sendong Zhao, Zewen Qiang, Nuwa Xi, Bing Qin, Ting Liu

    Abstract: In the field of natural language processing (NLP), Large Language Models (LLMs) have precipitated a paradigm shift, markedly enhancing performance in natural language generation tasks. Despite these advancements, the comprehensive evaluation of LLMs remains an inevitable challenge for the community. Recently, the utilization of Multiple Choice Question Answering (MCQA) as a benchmark for LLMs has… ▽ More

    Submitted 29 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 17 pages, 8 figures

  4. arXiv:2401.16107  [pdf, other

    cs.CL cs.AI

    Beyond Direct Diagnosis: LLM-based Multi-Specialist Agent Consultation for Automatic Diagnosis

    Authors: Haochun Wang, Sendong Zhao, Zewen Qiang, Nuwa Xi, Bing Qin, Ting Liu

    Abstract: Automatic diagnosis is a significant application of AI in healthcare, where diagnoses are generated based on the symptom description of patients. Previous works have approached this task directly by modeling the relationship between the normalized symptoms and all possible diseases. However, in the clinical diagnostic process, patients are initially consulted by a general practitioner and, if nece… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  5. CToMP: A Cycle-task-oriented Memory Protection Scheme for Unmanned Systems

    Authors: Chengyan Ma, Ning Xi, Di Lu, Yebo Feng, Jianfeng Ma

    Abstract: Memory corruption attacks (MCAs) refer to malicious behaviors of system intruders that modify the contents of a memory location to disrupt the normal operation of computing systems, causing leakage of sensitive data or perturbations to ongoing processes. Unlike general-purpose systems, unmanned systems cannot deploy complete security protection schemes, due to their limitations in size, cost and p… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: This paper has been accepted by SCIENCE CHINA Information Sciences

  6. arXiv:2309.05203  [pdf, other

    cs.CL

    From Artificially Real to Real: Leveraging Pseudo Data from Large Language Models for Low-Resource Molecule Discovery

    Authors: Yuhan Chen, Nuwa Xi, Yanrui Du, Haochun Wang, Jianyu Chen, Sendong Zhao, Bing Qin

    Abstract: Molecule discovery serves as a cornerstone in numerous scientific domains, fueling the development of new materials and innovative drug designs. Recent developments of in-silico molecule discovery have highlighted the promising results of cross-modal techniques, which bridge molecular structures with their descriptive annotations. However, these cross-modal methods frequently encounter the issue o… ▽ More

    Submitted 5 March, 2024; v1 submitted 10 September, 2023; originally announced September 2023.

    Comments: AAAI2024

  7. arXiv:2309.04175  [pdf, other

    cs.CL cs.AI

    Knowledge-tuning Large Language Models with Structured Medical Knowledge Bases for Reliable Response Generation in Chinese

    Authors: Haochun Wang, Sendong Zhao, Zewen Qiang, Zijian Li, Nuwa Xi, Yanrui Du, MuZhen Cai, Haoqiang Guo, Yuhan Chen, Haoming Xu, Bing Qin, Ting Liu

    Abstract: Large Language Models (LLMs) have demonstrated remarkable success in diverse natural language processing (NLP) tasks in general domains. However, LLMs sometimes generate responses with the hallucination about medical facts due to limited domain knowledge. Such shortcomings pose potential risks in the utilization of LLMs within medical contexts. To address this challenge, we propose knowledge-tunin… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: 11 pages, 5 figures

  8. arXiv:2309.04174  [pdf, other

    cs.CL cs.AI

    Manifold-based Verbalizer Space Re-embedding for Tuning-free Prompt-based Classification

    Authors: Haochun Wang, Sendong Zhao, Chi Liu, Nuwa Xi, Muzhen Cai, Bing Qin, Ting Liu

    Abstract: Prompt-based classification adapts tasks to a cloze question format utilizing the [MASK] token and the filled tokens are then mapped to labels through pre-defined verbalizers. Recent studies have explored the use of verbalizer embeddings to reduce labor in this process. However, all existing studies require a tuning process for either the pre-trained models or additional trainable embeddings. Mean… ▽ More

    Submitted 29 January, 2024; v1 submitted 8 September, 2023; originally announced September 2023.

    Comments: Accepted by AAAI 2024, 11 pages, 3 figures

  9. arXiv:2307.09769  [pdf, other

    cs.CV

    Source-Free Domain Adaptation for Medical Image Segmentation via Prototype-Anchored Feature Alignment and Contrastive Learning

    Authors: Qinji Yu, Nan Xi, Junsong Yuan, Ziyu Zhou, Kang Dang, Xiaowei Ding

    Abstract: Unsupervised domain adaptation (UDA) has increasingly gained interests for its capacity to transfer the knowledge learned from a labeled source domain to an unlabeled target domain. However, typical UDA methods require concurrent access to both the source and target domain data, which largely limits its application in medical scenarios where source data is often unavailable due to privacy concern.… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

    Comments: Accepted by MICCAI23

  10. arXiv:2307.05355  [pdf, other

    eess.SP cs.CL

    UniCoRN: Unified Cognitive Signal ReconstructioN bridging cognitive signals and human language

    Authors: Nuwa Xi, Sendong Zhao, Haochun Wang, Chi Liu, Bing Qin, Ting Liu

    Abstract: Decoding text stimuli from cognitive signals (e.g. fMRI) enhances our understanding of the human language system, paving the way for building versatile Brain-Computer Interface. However, existing studies largely focus on decoding individual word-level fMRI volumes from a restricted vocabulary, which is far too idealized for real-world application. In this paper, we propose fMRI2text, the first ope… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: the 61st Annual Meeting of the Association for Computational Linguistics

  11. arXiv:2304.06975  [pdf, other

    cs.CL

    HuaTuo: Tuning LLaMA Model with Chinese Medical Knowledge

    Authors: Haochun Wang, Chi Liu, Nuwa Xi, Zewen Qiang, Sendong Zhao, Bing Qin, Ting Liu

    Abstract: Large Language Models (LLMs), such as the LLaMA model, have demonstrated their effectiveness in various general-domain natural language processing (NLP) tasks. Nevertheless, LLMs have not yet performed optimally in biomedical domain tasks due to the need for medical expertise in the responses. In response to this challenge, we propose HuaTuo, a LLaMA-based model that has been supervised-fine-tuned… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

    Comments: LLaMA-based Chinese Medical model - HuaTuo. Model, code and training data are available at https://github.com/SCIR-HI/Huatuo-Llama-Med-Chinese

  12. arXiv:2304.05642  [pdf, other

    cs.CL

    Global Prompt Cell: A Portable Control Module for Effective Prompt Tuning

    Authors: Chi Liu, Haochun Wang, Nuwa Xi, Sendong Zhao, Bing Qin

    Abstract: As a novel approach to tuning pre-trained models, prompt tuning involves freezing the parameters in downstream tasks while inserting trainable embeddings into inputs in the first layer. However, previous methods have mainly focused on the initialization of prompt embeddings. The strategy of training and utilizing prompt embeddings in a reasonable way has become a limiting factor in the effectivene… ▽ More

    Submitted 13 May, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

  13. arXiv:2212.12114  [pdf

    q-bio.QM cs.LG

    Predicting Survival of Tongue Cancer Patients by Machine Learning Models

    Authors: Angelos Vasilopoulos, Nan Miles Xi

    Abstract: Tongue cancer is a common oral cavity malignancy that originates in the mouth and throat. Much effort has been invested in improving its diagnosis, treatment, and management. Surgical removal, chemotherapy, and radiation therapy remain the major treatment for tongue cancer. The survival of patients determines the treatment effect. Previous studies have identified certain survival and risk factors… ▽ More

    Submitted 22 December, 2022; originally announced December 2022.

  14. arXiv:2209.06453  [pdf, other

    cs.CL

    Prompt Combines Paraphrase: Teaching Pre-trained Models to Understand Rare Biomedical Words

    Authors: Haochun Wang, Chi Liu, Nuwa Xi, Sendong Zhao, Meizhi Ju, Shiwei Zhang, Ziheng Zhang, Yefeng Zheng, Bing Qin, Ting Liu

    Abstract: Prompt-based fine-tuning for pre-trained models has proven effective for many natural language processing tasks under few-shot settings in general domain. However, tuning with prompt in biomedical domain has not been investigated thoroughly. Biomedical words are often rare in general domain, but quite ubiquitous in biomedical contexts, which dramatically deteriorates the performance of pre-trained… ▽ More

    Submitted 14 September, 2022; originally announced September 2022.

    Comments: Accepted to COLING 2022

  15. arXiv:2205.03238  [pdf

    eess.SP cond-mat.mtrl-sci cs.LG

    Ultra-sensitive Flexible Sponge-Sensor Array for Muscle Activities Detection and Human Limb Motion Recognition

    Authors: Jiao Suo, Yifan Liu, Clio Cheng, Keer Wang, Meng Chen, Ho-yin Chan, Roy Vellaisamy, Ning Xi, Vivian W. Q. Lou, Wen Jung Li

    Abstract: Human limb motion tracking and recognition plays an important role in medical rehabilitation training, lower limb assistance, prosthetics design for amputees, feedback control for assistive robots, etc. Lightweight wearable sensors, including inertial sensors, surface electromyography sensors, and flexible strain/pressure, are promising to become the next-generation human motion capture devices. H… ▽ More

    Submitted 29 June, 2022; v1 submitted 30 April, 2022; originally announced May 2022.

    Comments: 17 pages, 6 figures

  16. arXiv:2203.15804  [pdf

    cs.LG q-bio.QM stat.AP

    Improving The Diagnosis of Thyroid Cancer by Machine Learning and Clinical Data

    Authors: Nan Miles Xi, Lin Wang, Chuanjia Yang

    Abstract: Thyroid cancer is a common endocrine carcinoma that occurs in the thyroid gland. Much effort has been invested in improving its diagnosis, and thyroidectomy remains the primary treatment method. A successful operation without unnecessary side injuries relies on an accurate preoperative diagnosis. Current human assessment of thyroid nodule malignancy is prone to errors and may not guarantee an accu… ▽ More

    Submitted 27 March, 2022; originally announced March 2022.

  17. arXiv:2201.05669  [pdf

    q-bio.QM cs.LG

    Prediction of Drug-Induced TdP Risks Using Machine Learning and Rabbit Ventricular Wedge Assay

    Authors: Nan Miles Xi, Dalong Patrick Huang

    Abstract: The evaluation of drug-induced Torsades de pointes (TdP) risks is crucial in drug safety assessment. In this study, we discuss machine learning approaches in the prediction of drug-induced TdP risks using preclinical data. Specifically, the random forest model was trained on the dataset generated by the rabbit ventricular wedge assay. The model prediction performance was measured on 28 drugs from… ▽ More

    Submitted 14 January, 2022; originally announced January 2022.

    Comments: arXiv admin note: text overlap with arXiv:2108.00543

  18. arXiv:2106.04951  [pdf, ps, other

    cs.CR

    Information flow based defensive chain for data leakage detection and prevention: a survey

    Authors: Ning Xi, Chao Chen, Jun Zhang, Cong Sun, Shigang Liu, Pengbin Feng, Jianfeng Ma

    Abstract: Mobile and IoT applications have greatly enriched our daily life by providing convenient and intelligent services. However, these smart applications have been a prime target of adversaries for stealing sensitive data. It poses a crucial threat to users' identity security, financial security, or even life security. Research communities and industries have proposed many Information Flow Control (IFC… ▽ More

    Submitted 9 June, 2021; originally announced June 2021.

    Comments: 36 pages, 6 figures, 6 tables

  19. arXiv:1907.09594  [pdf, other

    cs.SI cs.CV cs.HC cs.MM stat.AP

    Understanding the Political Ideology of Legislators from Social Media Images

    Authors: Nan Xi, Di Ma, Marcus Liou, Zachary C. Steinert-Threlkeld, Jason Anastasopoulos, Jungseock Joo

    Abstract: In this paper, we seek to understand how politicians use images to express ideological rhetoric through Facebook images posted by members of the U.S. House and Senate. In the era of social media, politics has become saturated with imagery, a potent and emotionally salient form of political rhetoric which has been used by politicians and political organizations to influence public sentiment and vot… ▽ More

    Submitted 22 July, 2019; originally announced July 2019.

    Comments: To appear in the Proceedings of International AAAI Conference on Web and Social Media (ICWSM 2020)

  20. arXiv:1212.0217  [pdf, ps, other

    physics.soc-ph cs.SI

    Cultural evolution and personalization

    Authors: Ning Xi, Zi-Ke Zhang, Yi-Cheng Zhang

    Abstract: In social sciences, there is currently no consensus on the mechanism for cultural evolution. The evolution of first names of newborn babies offers a remarkable example for the researches in the field. Here we perform statistical analyses on over 100 years of data in the United States. We focus in particular on how the frequency-rank distribution and inequality of baby names change over time. We pr… ▽ More

    Submitted 2 December, 2012; originally announced December 2012.