Skip to main content

Showing 1–42 of 42 results for author: Pan, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.02964  [pdf, other

    cs.CL

    FSM: A Finite State Machine Based Zero-Shot Prompting Paradigm for Multi-Hop Question Answering

    Authors: Xiaochen Wang, Junqing He, Zhe yang, Yiru Wang, Xiangdi Meng, Kunhao Pan, Zhifang Sui

    Abstract: Large Language Models (LLMs) with chain-of-thought (COT) prompting have demonstrated impressive abilities on simple nature language inference tasks. However, they tend to perform poorly on Multi-hop Question Answering (MHQA) tasks due to several challenges, including hallucination, error propagation and limited context length. We propose a prompting method, Finite State Machine (FSM) to enhance th… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  2. arXiv:2406.19593  [pdf, other

    cs.CL cs.CV

    SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs

    Authors: Xin Su, Man Luo, Kris W Pan, Tien Pei Chou, Vasudev Lal, Phillip Howard

    Abstract: Synthetic data generation has gained significant attention recently for its utility in training large vision and language models. However, the application of synthetic data to the training of multimodal context-augmented generation systems has been relatively unexplored. This gap in existing work is important because existing vision and language models (VLMs) are not trained specifically for conte… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  3. arXiv:2406.18070  [pdf, other

    cs.CV

    EgoVideo: Exploring Egocentric Foundation Model and Downstream Adaptation

    Authors: Baoqi Pei, Guo Chen, Jilan Xu, Yuping He, Yicheng Liu, Kanghua Pan, Yifei Huang, Yali Wang, Tong Lu, Limin Wang, Yu Qiao

    Abstract: In this report, we present our solutions to the EgoVis Challenges in CVPR 2024, including five tracks in the Ego4D challenge and three tracks in the EPIC-Kitchens challenge. Building upon the video-language two-tower model and leveraging our meticulously organized egocentric video data, we introduce a novel foundation model called EgoVideo. This model is specifically designed to cater to the uniqu… ▽ More

    Submitted 30 June, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

    Comments: Champion solutions in the EgoVis CVPR 2024 workshop

  4. arXiv:2405.01926  [pdf, other

    cs.CV

    Auto-Encoding Morph-Tokens for Multimodal LLM

    Authors: Kaihang Pan, Siliang Tang, Juncheng Li, Zhaoyu Fan, Wei Chow, Shuicheng Yan, Tat-Seng Chua, Yueting Zhuang, Hanwang Zhang

    Abstract: For multimodal LLMs, the synergy of visual comprehension (textual output) and generation (visual output) presents an ongoing challenge. This is due to a conflicting objective: for comprehension, an MLLM needs to abstract the visuals; for generation, it needs to preserve the visuals as much as possible. Thus, the objective is a dilemma for visual-tokens. To resolve the conflict, we propose encoding… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: Accepted by ICML 2024

  5. arXiv:2312.07226  [pdf, other

    eess.IV cs.CV

    Super-Resolution on Rotationally Scanned Photoacoustic Microscopy Images Incorporating Scanning Prior

    Authors: Kai Pan, Linyang Li, Li Lin, Pujin Cheng, Junyan Lyu, Lei Xi, Xiaoyin Tang

    Abstract: Photoacoustic Microscopy (PAM) images integrating the advantages of optical contrast and acoustic resolution have been widely used in brain studies. However, there exists a trade-off between scanning speed and image resolution. Compared with traditional raster scanning, rotational scanning provides good opportunities for fast PAM imaging by optimizing the scanning mechanism. Recently, there is a t… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  6. arXiv:2311.09198  [pdf, other

    cs.CL cs.AI

    Never Lost in the Middle: Improving Large Language Models via Attention Strengthening Question Answering

    Authors: Junqing He, Kunhao Pan, Xiaoqun Dong, Zhuoyang Song, Yibo Liu, Yuxin Liang, Hao Wang, Qianguo Sun, Songxin Zhang, Zejian Xie, Jiaxing Zhang

    Abstract: While large language models (LLMs) are equipped with longer text input capabilities than before, they are struggling to seek correct information in long contexts. The "lost in the middle" problem challenges most LLMs, referring to the dramatic decline in accuracy when correct information is located in the middle. To overcome this crucial issue, this paper proposes to enhance the information search… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  7. arXiv:2311.03301  [pdf, other

    cs.CL

    Ziya2: Data-centric Learning is All LLMs Need

    Authors: Ruyi Gan, Ziwei Wu, Renliang Sun, Junyu Lu, Xiaojun Wu, Dixiang Zhang, Kunhao Pan, Junqing He, Yuanhe Tian, Ping Yang, Qi Yang, Hao Wang, Jiaxing Zhang, Yan Song

    Abstract: Various large language models (LLMs) have been proposed in recent years, including closed- and open-source ones, continually setting new records on multiple benchmarks. However, the development of LLMs still faces several issues, such as high cost of training models from scratch, and continual pre-training leading to catastrophic forgetting, etc. Although many such issues are addressed along the l… ▽ More

    Submitted 4 April, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

  8. arXiv:2310.02821  [pdf, other

    cs.CV cs.AI

    Improving Vision Anomaly Detection with the Guidance of Language Modality

    Authors: Dong Chen, Kaihang Pan, Guoming Wang, Yueting Zhuang, Siliang Tang

    Abstract: Recent years have seen a surge of interest in anomaly detection for tackling industrial defect detection, event detection, etc. However, existing unsupervised anomaly detectors, particularly those for the vision modality, face significant challenges due to redundant information and sparse latent space. Conversely, the language modality performs well due to its relatively single data. This paper ta… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: 9 pages, 10 figures

  9. arXiv:2309.09526  [pdf, other

    cs.CV cs.AI

    DFIL: Deepfake Incremental Learning by Exploiting Domain-invariant Forgery Clues

    Authors: Kun Pan, Yin Yifang, Yao Wei, Feng Lin, Zhongjie Ba, Zhenguang Liu, ZhiBo Wang, Lorenzo Cavallaro, Kui Ren

    Abstract: The malicious use and widespread dissemination of deepfake pose a significant crisis of trust. Current deepfake detection models can generally recognize forgery images by training on a large dataset. However, the accuracy of detection models degrades significantly on images generated by new deepfake methods due to the difference in data distribution. To tackle this issue, we present a novel increm… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: Accepted by ACMMM2023

  10. arXiv:2308.10025  [pdf, other

    cs.CL

    I3: Intent-Introspective Retrieval Conditioned on Instructions

    Authors: Kaihang Pan, Juncheng Li, Wenjie Wang, Hao Fei, Hongye Song, Wei Ji, Jun Lin, Xiaozhong Liu, Tat-Seng Chua, Siliang Tang

    Abstract: Recent studies indicate that dense retrieval models struggle to perform well on a wide variety of retrieval tasks that lack dedicated training data, as different retrieval tasks often entail distinct search intents. To address this challenge, in this work we leverage instructions to flexibly describe retrieval intents and introduce I3, a unified retrieval system that performs Intent-Introspective… ▽ More

    Submitted 25 April, 2024; v1 submitted 19 August, 2023; originally announced August 2023.

    Comments: Accepted by SIGIR 2024

  11. arXiv:2308.04152  [pdf, other

    cs.CV

    Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions

    Authors: Juncheng Li, Kaihang Pan, Zhiqi Ge, Minghe Gao, Wei Ji, Wenqiao Zhang, Tat-Seng Chua, Siliang Tang, Hanwang Zhang, Yueting Zhuang

    Abstract: Recent advancements in Multimodal Large Language Models (MLLMs) have been utilizing Visual Prompt Generators (VPGs) to convert visual features into tokens that LLMs can recognize. This is achieved by training the VPGs on millions of image-caption pairs, where the VPG-generated tokens of images are fed into a frozen LLM to generate the corresponding captions. However, this image-captioning based tr… ▽ More

    Submitted 25 May, 2024; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: Accepted by ICLR 2024 (Spotlight)

  12. arXiv:2307.16180  [pdf, other

    cs.CL

    Do LLMs Possess a Personality? Making the MBTI Test an Amazing Evaluation for Large Language Models

    Authors: Keyu Pan, Yawen Zeng

    Abstract: The field of large language models (LLMs) has made significant progress, and their knowledge storage capacity is approaching that of human beings. Furthermore, advanced techniques, such as prompt learning and reinforcement learning, are being employed to address ethical concerns and hallucination problems associated with LLMs, bringing them closer to aligning with human values. This situation natu… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

  13. arXiv:2303.12314  [pdf, other

    cs.CL cs.LG

    Self-supervised Meta-Prompt Learning with Meta-Gradient Regularization for Few-shot Generalization

    Authors: Kaihang Pan, Juncheng Li, Hongye Song, Jun Lin, Xiaozhong Liu, Siliang Tang

    Abstract: Prompt tuning is a parameter-efficient method, which learns soft prompts and conditions frozen language models to perform specific downstream tasks. Though effective, prompt tuning under few-shot settings on the one hand heavily relies on a good initialization of soft prompts. On the other hand, it can easily overfit to few-shot training samples, thereby undermining generalizability. Existing work… ▽ More

    Submitted 23 October, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: Accepted by EMNLP 2023 (Findings)

  14. arXiv:2211.11304  [pdf, other

    cs.CL

    TCBERT: A Technical Report for Chinese Topic Classification BERT

    Authors: Ting Han, Kunhao Pan, Xinyu Chen, Dingjie Song, Yuchen Fan, Xinyu Gao, Ruyi Gan, Jiaxing Zhang

    Abstract: Bidirectional Encoder Representations from Transformers or BERT~\cite{devlin-etal-2019-bert} has been one of the base models for various NLP tasks due to its remarkable performance. Variants customized for different languages and tasks are proposed to further improve the performance. In this work, we investigate supervised continued pre-training~\cite{gururangan-etal-2020-dont} on BERT for Chinese… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

  15. arXiv:2209.02970  [pdf, other

    cs.CL

    Fengshenbang 1.0: Being the Foundation of Chinese Cognitive Intelligence

    Authors: Jiaxing Zhang, Ruyi Gan, Junjie Wang, Yuxiang Zhang, Lin Zhang, Ping Yang, Xinyu Gao, Ziwei Wu, Xiaoqun Dong, Junqing He, Jianheng Zhuo, Qi Yang, Yongfeng Huang, Xiayu Li, Yanghan Wu, Junyu Lu, Xinyu Zhu, Weifeng Chen, Ting Han, Kunhao Pan, Rui Wang, Hao Wang, Xiaojun Wu, Zhongshen Zeng, Chongpei Chen

    Abstract: Nowadays, foundation models become one of fundamental infrastructures in artificial intelligence, paving ways to the general intelligence. However, the reality presents two urgent challenges: existing foundation models are dominated by the English-language community; users are often given limited resources and thus cannot always use foundation models. To support the development of the Chinese-lang… ▽ More

    Submitted 30 March, 2023; v1 submitted 7 September, 2022; originally announced September 2022.

    Comments: Added the Chinese version and is now a bilingual paper

  16. arXiv:2203.06920  [pdf, other

    eess.IV cs.CV

    DS3-Net: Difficulty-perceived Common-to-T1ce Semi-Supervised Multimodal MRI Synthesis Network

    Authors: Ziqi Huang, Li Lin, Pujin Cheng, Kai Pan, Xiaoying Tang

    Abstract: Contrast-enhanced T1 (T1ce) is one of the most essential magnetic resonance imaging (MRI) modalities for diagnosing and analyzing brain tumors, especially gliomas. In clinical practice, common MRI modalities such as T1, T2, and fluid attenuation inversion recovery are relatively easy to access while T1ce is more challenging considering the additional cost and potential risk of allergies to the con… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

    Comments: 10 pages, 2 figures

  17. arXiv:2104.06677  [pdf, ps, other

    cs.LG

    Multi-Party Dual Learning

    Authors: Maoguo Gong, Yuan Gao, Yu Xie, A. K. Qin, Ke Pan, Yew-Soon Ong

    Abstract: The performance of machine learning algorithms heavily relies on the availability of a large amount of training data. However, in reality, data usually reside in distributed parties such as different institutions and may not be directly gathered and integrated due to various data policy constraints. As a result, some parties may suffer from insufficient data available for training machine learning… ▽ More

    Submitted 14 April, 2021; originally announced April 2021.

    Comments: submitted to IEEE Transactions on Cybernetics

  18. arXiv:2006.14390  [pdf, other

    cs.LG stat.ML

    A New Modal Autoencoder for Functionally Independent Feature Extraction

    Authors: Yuzhu Guo, Kang Pan, Simeng Li, Zongchang Han, Kexin Wang, Li Li

    Abstract: Autoencoders have been widely used for dimensional reduction and feature extraction. Various types of autoencoders have been proposed by introducing regularization terms. Most of these regularizations improve representation learning by constraining the weights in the encoder part, which maps input into hidden nodes and affects the generation of features. In this study, we show that a constraint to… ▽ More

    Submitted 25 June, 2020; originally announced June 2020.

  19. arXiv:2004.13927  [pdf, other

    math.OC cs.CR

    Dynamic Anomaly Detection with High-fidelity Simulators: A Convex Optimization Approach

    Authors: Kaikai Pan, Peter Palensky, Peyman Mohajerin Esfahani

    Abstract: The main objective of this article is to develop scalable dynamic anomaly detectors when high-fidelity simulators of power systems are at our disposal. On the one hand, mathematical models of these high-fidelity simulators are typically "intractable" to apply existing model-based approaches. On the other hand, pure data-driven methods developed primarily in the machine learning literature neglect… ▽ More

    Submitted 6 October, 2020; v1 submitted 28 April, 2020; originally announced April 2020.

    Comments: 19 pages

  20. Detection of False Data Injection Attacks Using the Autoencoder Approach

    Authors: Chenguang Wang, Simon Tindemans, Kaikai Pan, Peter Palensky

    Abstract: State estimation is of considerable significance for the power system operation and control. However, well-designed false data injection attacks can utilize blind spots in conventional residual-based bad data detection methods to manipulate measurements in a coordinated manner and thus affect the secure operation and economic dispatch of grids. In this paper, we propose a detection approach based… ▽ More

    Submitted 14 December, 2022; v1 submitted 4 March, 2020; originally announced March 2020.

    Comments: 6 pages, 5 figures, 1 table, conference

    Journal ref: 2020 International Conference on Probabilistic Methods Applied to Power Systems (PMAPS), IEEE, Liege, Belgium, 2020, pp. 1-6

  21. arXiv:2001.07068  [pdf, other

    math.OC cs.CR

    False Data Injection Attacks on Hybrid AC/HVDC Interconnected System with Virtual Inertia -- Vulnerability, Impact and Detection

    Authors: Kaikai Pan, Elyas Rakhshani, Peter Palensky

    Abstract: Power systems are moving towards hybrid AC/DC grids with the integration of HVDC links, renewable resources and energy storage modules. New models of frequency control have to consider the complex interactions between these components. Meanwhile, more attention should be paid to cyber security concerns as these control strategies highly depend on data communications which may be exposed to cyber a… ▽ More

    Submitted 30 May, 2020; v1 submitted 20 January, 2020; originally announced January 2020.

  22. arXiv:1911.08802  [pdf

    cs.NI

    Blockchain-Assisted Spectrum Trading between Elastic Virtual Optical Networks

    Authors: Shifeng Ding, Kevin X. Pan, Sanjay K. Bose, Qiong Zhang, Gangxiang Shen

    Abstract: In communication networks, network virtualization can usually provide better capacity utilization and quality of service (QoS) than what can be achieved otherwise. However, conventional resource allocation for virtualized networks would still follow a fixed pattern based on the predicted capacity needs of the users, even though, in reality, the actual traffic demand of a user will always tend to f… ▽ More

    Submitted 20 November, 2019; originally announced November 2019.

    Comments: 7 pages, 5 figures

  23. arXiv:1708.08355  [pdf, other

    cs.CR

    Data Attacks on Power System State Estimation: Limited Adversarial Knowledge vs. Limited Attack Resources

    Authors: Kaikai Pan, André Teixeira, Milos Cvetkovic, Peter Palensky

    Abstract: A class of data integrity attack, known as false data injection (FDI) attack, has been studied with a considerable amount of work. It has shown that with perfect knowledge of the system model and the capability to manipulate a certain number of measurements, the FDI attacks can coordinate measurements corruption to keep stealth against the bad data detection. However, a more realistic attack is es… ▽ More

    Submitted 28 August, 2017; originally announced August 2017.

    Comments: Accepted in the 43rd Annual Conference of the IEEE Industrial Electronics Society (IECON 2017)

  24. arXiv:1708.08349  [pdf, other

    cs.CR

    Cyber Risk Analysis of Combined Data Attacks Against Power System State Estimation

    Authors: Kaikai Pan, André Teixeira, Milos Cvetkovic, Peter Palensky

    Abstract: Understanding smart grid cyber attacks is key for developing appropriate protection and recovery measures. Advanced attacks pursue maximized impact at minimized costs and detectability. This paper conducts risk analysis of combined data integrity and availability attacks against the power system state estimation. We compare the combined attacks with pure integrity attacks - false data injection (F… ▽ More

    Submitted 28 August, 2017; originally announced August 2017.

    Comments: Submitted to IEEE Transactions on Smart Grid on August 14th, 2017

  25. arXiv:1708.08322  [pdf, other

    cs.CR

    Co-simulation for Cyber Security Analysis: Data Attacks against Energy Management System

    Authors: Kaikai Pan, André Teixeira, Claudio López, Peter Palensky

    Abstract: It is challenging to assess the vulnerability of a cyber-physical power system to data attacks from an integral perspective. In order to support vulnerability assessment except analytic analysis, suitable platform for security tests needs to be developed. In this paper we analyze the cyber security of energy management system (EMS) against data attacks. First we extend our analytic framework that… ▽ More

    Submitted 28 August, 2017; originally announced August 2017.

    Comments: Accepted in 8th IEEE International Conference on Smart Grid Communications (SmartGridComm 2017)

  26. arXiv:1607.05606  [pdf, other

    cs.DL cs.SI physics.soc-ph

    The Memory of Science: Inflation, Myopia, and the Knowledge Network

    Authors: Raj K. Pan, Alexander M. Petersen, Fabio Pammolli, Santo Fortunato

    Abstract: Science is a growing system, exhibiting ~4% annual growth in publications and ~1.8% annual growth in the number of references per publication. Combined these trends correspond to a 12-year doubling period in the total supply of references, thereby challenging traditional methods of evaluating scientific production, from researchers to institutions. Against this background, we analyzed a citation n… ▽ More

    Submitted 19 July, 2016; originally announced July 2016.

    Comments: 17 pages, 8 figures, Supplementary Material available at http://physics.bu.edu/~amp17/webpage_files/MyPapers/pppf_Arxiv_July2016_SI.pdf

    Journal ref: Journal of Informetrics 12, 656-678 (2018)

  27. arXiv:1503.01881  [pdf, other

    physics.soc-ph cs.CY cs.DL cs.SI physics.data-an

    Attention decay in science

    Authors: Pietro Della Briotta Parolo, Raj Kumar Pan, Rumi Ghosh, Bernardo A. Huberman, Kimmo Kaski, Santo Fortunato

    Abstract: The exponential growth in the number of scientific papers makes it increasingly difficult for researchers to keep track of all the publications relevant to their work. Consequently, the attention that can be devoted to individual papers, measured by their citation counts, is bound to decay rapidly. In this work we make a thorough study of the life-cycle of papers in different disciplines. Typicall… ▽ More

    Submitted 23 November, 2015; v1 submitted 6 March, 2015; originally announced March 2015.

    Comments: Published version. 14 pages, 9 Figures,

    Journal ref: Journal of Informetrics, 9, 734-745 (2015)

  28. arXiv:1405.7136  [pdf, other

    physics.soc-ph cs.DL

    The Nobel Prize delay

    Authors: Francesco Becattini, Arnab Chatterjee, Santo Fortunato, Marija Mitrović, Raj Kumar Pan, Pietro Della Briotta Parolo

    Abstract: The time lag between the publication of a Nobel discovery and the conferment of the prize has been rapidly increasing for all disciplines, especially for Physics. Does this mean that fundamental science is running out of groundbreaking discoveries?

    Submitted 28 May, 2014; originally announced May 2014.

    Comments: Extended version of Nature 508, 186 (2014) http://www.nature.com/nature/journal/v508/n7495/full/508186a.html ; http://scitation.aip.org/content/aip/magazine/physicstoday/news/10.1063/PT.5.2012

  29. arXiv:1403.1177  [pdf, other

    physics.soc-ph cs.SI physics.data-an

    Effects of temporal correlations on cascades: Threshold models on temporal networks

    Authors: Ville-Pekka Backlund, Jari Saramäki, Raj Kumar Pan

    Abstract: A person's decision to adopt an idea or product is often driven by the decisions of peers, mediated through a network of social ties. A common way of modeling adoption dynamics is to use threshold models, where a node may become an adopter given a high enough rate of contacts with adopted neighbors. We study the dynamics of threshold models that take both the network topology and the timings of co… ▽ More

    Submitted 27 June, 2014; v1 submitted 5 March, 2014; originally announced March 2014.

    Comments: 9 pages, 7 figures, Published version

    Journal ref: Phys. Rev. E 89, 062815 (2014)

  30. arXiv:1312.2650  [pdf, other

    physics.soc-ph cs.DL physics.data-an

    Author Impact Factor: tracking the dynamics of individual scientific impact

    Authors: Raj Kumar Pan, Santo Fortunato

    Abstract: The impact factor (IF) of scientific journals has acquired a major role in the evaluations of the output of scholars, departments and whole institutions. Typically papers appearing in journals with large values of the IF receive a high weight in such evaluations. However, at the end of the day one is interested in assessing the impact of individuals, rather than papers. Here we introduce Author Im… ▽ More

    Submitted 12 May, 2014; v1 submitted 9 December, 2013; originally announced December 2013.

    Comments: Published version. 6 pages, 5 figures + Appendix

    Journal ref: Sci. Rep. 4, 4880 (2014)

  31. arXiv:1306.0114  [pdf, other

    physics.soc-ph cs.DL

    On the Predictability of Future Impact in Science

    Authors: Orion Penner, Raj Kumar Pan, Alexander M. Petersen, Kimmo Kaski, Santo Fortunato

    Abstract: Correctly assessing a scientist's past research impact and potential for future impact is key in recruitment decisions and other evaluation processes. While a candidate's future impact is the main concern for these decisions, most measures only quantify the impact of previous work. Recently, it has been argued that linear regression models are capable of predicting a scientist's future impact. By… ▽ More

    Submitted 29 October, 2013; v1 submitted 1 June, 2013; originally announced June 2013.

    Comments: Published version, 8 pages, 5 figures + Appendix

    Journal ref: Scientific Reports 3, 3052 (2013)

  32. arXiv:1304.0627  [pdf, other

    physics.soc-ph cs.DL physics.data-an

    The case for caution in predicting scientists' future impact

    Authors: Orion Penner, Raj K. Pan, Alexander M. Petersen, Santo Fortunato

    Abstract: We stress-test the career predictability model proposed by Acuna et al. [Nature 489, 201-202 2012] by applying their model to a longitudinal career data set of 100 Assistant professors in physics, two from each of the top 50 physics departments in the US. The Acuna model claims to predict h(t+Δt), a scientist's h-index Δt years into the future, using a linear combination of 5 cumulative career mea… ▽ More

    Submitted 2 April, 2013; originally announced April 2013.

    Comments: 2 pages, 1 figure

    Journal ref: Physics Today 66, 8-9 (2013)

  33. arXiv:1303.7274  [pdf, other

    physics.soc-ph cs.DL physics.data-an

    Reputation and Impact in Academic Careers

    Authors: Alexander M. Petersen, Santo Fortunato, Raj K. Pan, Kimmo Kaski, Orion Penner, Armando Rungi, Massimo Riccaboni, H. Eugene Stanley, Fabio Pammolli

    Abstract: Reputation is an important social construct in science, which enables informed quality assessments of both publications and careers of scientists in the absence of complete systemic information. However, the relation between reputation and career growth of an individual remains poorly understood, despite recent proliferation of quantitative research evaluation methods. Here we develop an original… ▽ More

    Submitted 7 October, 2014; v1 submitted 28 March, 2013; originally announced March 2013.

    Comments: Final published version of the main manuscript including additional analysis: 9 pages, 4 figures, 1 table, and full reference list, including those in the Supplementary Information. For the SI Appendix, see http://physics.bu.edu/~amp17/webpage_files/MyPapers/Reputation_SI.pdf

    Journal ref: Proceedings of the National Academy of Sciences 111, 15316-15321 (2014)

  34. arXiv:1209.0781  [pdf, other

    physics.soc-ph cs.DL cs.SI physics.data-an

    World citation and collaboration networks: uncovering the role of geography in science

    Authors: Raj Kumar Pan, Kimmo Kaski, Santo Fortunato

    Abstract: Modern information and communication technologies, especially the Internet, have diminished the role of spatial distances and territorial boundaries on the access and transmissibility of information. This has enabled scientists for closer collaboration and internationalization. Nevertheless, geography remains an important factor affecting the dynamics of science. Here we present a systematic analy… ▽ More

    Submitted 17 December, 2012; v1 submitted 4 September, 2012; originally announced September 2012.

    Comments: Published version. 9 pages, 5 figures + Appendix, The world citation and collaboration networks at both city and country level are available at http://becs.aalto.fi/~rajkp/datasets.html

    Journal ref: Scientific Reports 2, 902 (2012)

  35. arXiv:1206.0108  [pdf, other

    physics.soc-ph cs.SI physics.data-an

    The evolution of interdisciplinarity in physics research

    Authors: Raj Kumar Pan, Sitabhra Sinha, Kimmo Kaski, Jari Saramäki

    Abstract: Science, being a social enterprise, is subject to fragmentation into groups that focus on specialized areas or topics. Often new advances occur through cross-fertilization of ideas between sub-fields that otherwise have little overlap as they study dissimilar phenomena using different techniques. Thus to explore the nature and dynamics of scientific progress one needs to consider the large-scale o… ▽ More

    Submitted 16 August, 2012; v1 submitted 1 June, 2012; originally announced June 2012.

    Comments: Published version, 10 pages, 8 figures + Supplementary Information

    Journal ref: Scientific Reports 2, 551 (2012)

  36. arXiv:1112.4312  [pdf, other

    physics.soc-ph cs.SI physics.data-an

    Multiscale Analysis of Spreading in a Large Communication Network

    Authors: Mikko Kivelä, Raj Kumar Pan, Kimmo Kaski, János Kertész, Jari Saramäki, Márton Karsai

    Abstract: In temporal networks, both the topology of the underlying network and the timings of interaction events can be crucial in determining how some dynamic process mediated by the network unfolds. We have explored the limiting case of the speed of spreading in the SI model, set up such that an event between an infectious and susceptible individual always transmits the infection. The speed of this proce… ▽ More

    Submitted 19 December, 2011; originally announced December 2011.

    Journal ref: J. Stat. Mech. (2012) P03005

  37. arXiv:1106.5249  [pdf, ps, other

    physics.soc-ph cs.SI physics.data-an

    The strength of strong ties in scientific collaboration networks

    Authors: Raj Kumar Pan, Jari Saramäki

    Abstract: Network topology and its relationship to tie strengths may hinder or enhance the spreading of information in social networks. We study the correlations between tie strengths and topology in networks of scientific collaboration, and show that these are very different from ordinary social networks. For the latter, it has earlier been shown that strong ties are associated with dense network neighborh… ▽ More

    Submitted 11 January, 2012; v1 submitted 26 June, 2011; originally announced June 2011.

    Comments: 6 Pages, 6 Figures, Published version, Minor changes, Results also verified using new weight-scheme

    Journal ref: Europhys. Lett. 97, 18007 (2012)

  38. arXiv:1106.0288  [pdf, other

    physics.soc-ph cs.SI

    Emergence of Bursts and Communities in Evolving Weighted Networks

    Authors: Hang-Hyun Jo, Raj Kumar Pan, Kimmo Kaski

    Abstract: Understanding the patterns of human dynamics and social interaction, and the way they lead to the formation of an organized and functional society are important issues especially for techno-social development. Addressing these issues of social networks has recently become possible through large scale data analysis of e.g. mobile phone call records, which has revealed the existence of modular or co… ▽ More

    Submitted 1 June, 2011; originally announced June 2011.

    Comments: 9 pages, 6 figures

    Journal ref: PLoS ONE 6(8): e22687 (2011)

  39. arXiv:1101.5913  [pdf, other

    physics.soc-ph cond-mat.dis-nn cs.SI physics.data-an

    Path lengths, correlations, and centrality in temporal networks

    Authors: Raj Kumar Pan, Jari Saramäki

    Abstract: In temporal networks, where nodes interact via sequences of temporary events, information or resources can only flow through paths that follow the time-ordering of events. Such temporal paths play a crucial role in dynamic processes. However, since networks have so far been usually considered static or quasi-static, the properties of temporal paths are not yet well understood. Building on a defini… ▽ More

    Submitted 19 July, 2011; v1 submitted 31 January, 2011; originally announced January 2011.

    Comments: 10 pages, 8 figures, Published version

    Journal ref: Phys. Rev. E 84, 016105 (2011)

  40. arXiv:1010.3171  [pdf, other

    cond-mat.dis-nn cond-mat.stat-mech cs.SI physics.soc-ph

    Using explosive percolation in analysis of real-world networks

    Authors: Raj Kumar Pan, Mikko Kivelä, Jari Saramäki, Kimmo Kaski, János Kertész

    Abstract: We apply a variant of the explosive percolation procedure to large real-world networks, and show with finite-size scaling that the university class, ordinary or explosive, of the resulting percolation transition depends on the structural properties of the network as well as the number of unoccupied links considered for comparison in our procedure. We observe that in our social networks, the percol… ▽ More

    Submitted 18 April, 2011; v1 submitted 15 October, 2010; originally announced October 2010.

    Comments: 6 pages, 4 figures. Published version. Elongated to include the results and figures of finite-size scaling and modularity analysis

    Journal ref: Phys. Rev. E 83, 046112 (2011)

  41. arXiv:1006.2125  [pdf, ps, other

    physics.soc-ph cs.SI nlin.AO physics.bio-ph

    Small But Slow World: How Network Topology and Burstiness Slow Down Spreading

    Authors: M. Karsai, M. Kivelä, R. K. Pan, K. Kaski, J. Kertész, A. -L. Barabási, J. Saramäki

    Abstract: Communication networks show the small-world property of short paths, but the spreading dynamics in them turns out slow. We follow the time evolution of information propagation through communication networks by using the SI model with empirical data on contact sequences. We introduce null models where the sequences are randomly shuffled in different ways, enabling us to distinguish between the cont… ▽ More

    Submitted 22 August, 2010; v1 submitted 10 June, 2010; originally announced June 2010.

    Journal ref: Phys. Rev. E 83, 025102(R) (2011)

  42. arXiv:1005.4997  [pdf, ps, other

    cs.CL physics.data-an physics.soc-ph

    Network analysis of a corpus of undeciphered Indus civilization inscriptions indicates syntactic organization

    Authors: Sitabhra Sinha, Md Izhar Ashraf, Raj Kumar Pan, Bryan Kenneth Wells

    Abstract: Archaeological excavations in the sites of the Indus Valley civilization (2500-1900 BCE) in Pakistan and northwestern India have unearthed a large number of artifacts with inscriptions made up of hundreds of distinct signs. To date there is no generally accepted decipherment of these sign sequences and there have been suggestions that the signs could be non-linguistic. Here we apply complex networ… ▽ More

    Submitted 27 May, 2010; originally announced May 2010.

    Comments: 17 pages (includes 4 page appendix containing Indus sign list), 14 figures

    Journal ref: Computer Speech and Language, 25 (2011) 639-654