Zum Hauptinhalt springen

Showing 1–39 of 39 results for author: Mai, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.12821  [pdf, other

    cs.CV cs.AI

    Examining the Commitments and Difficulties Inherent in Multimodal Foundation Models for Street View Imagery

    Authors: Zhenyuan Yang, Xuhui Lin, Qinyi He, Ziye Huang, Zhengliang Liu, Hanqi Jiang, Peng Shu, Zihao Wu, Yiwei Li, Stephen Law, Gengchen Mai, Tianming Liu, Tao Yang

    Abstract: The emergence of Large Language Models (LLMs) and multimodal foundation models (FMs) has generated heightened interest in their applications that integrate vision and language. This paper investigates the capabilities of ChatGPT-4V and Gemini Pro for Street View Imagery, Built Environment, and Interior by evaluating their performance across various tasks. The assessments include street furniture i… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

  2. arXiv:2408.06761  [pdf, other

    cs.CV cs.AI

    Cross-View Geolocalization and Disaster Mapping with Street-View and VHR Satellite Imagery: A Case Study of Hurricane IAN

    Authors: Hao Li, Fabian Deuser, Wenping Yina, Xuanshu Luo, Paul Walther, Gengchen Mai, Wei Huang, Martin Werner

    Abstract: Nature disasters play a key role in shaping human-urban infrastructure interactions. Effective and efficient response to natural disasters is essential for building resilience and a sustainable urban environment. Two types of information are usually the most necessary and difficult to gather in disaster response. The first information is about disaster damage perception, which shows how badly peop… ▽ More

    Submitted 13 August, 2024; originally announced August 2024.

  3. arXiv:2406.15658  [pdf, other

    cs.CV cs.AI

    TorchSpatial: A Location Encoding Framework and Benchmark for Spatial Representation Learning

    Authors: Nemin Wu, Qian Cao, Zhangyu Wang, Zeping Liu, Yanlin Qi, Jielu Zhang, Joshua Ni, Xiaobai Yao, Hongxu Ma, Lan Mu, Stefano Ermon, Tanuja Ganu, Akshay Nambi, Ni Lao, Gengchen Mai

    Abstract: Spatial representation learning (SRL) aims at learning general-purpose neural network representations from various types of spatial data (e.g., points, polylines, polygons, networks, images, etc.) in their native formats. Learning good spatial representations is a fundamental problem for various downstream applications such as species distribution modeling, weather forecasting, trajectory generati… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 9 pages, 2 figures. Submitted to NeurIPS 2024 Datasets and Benchmarks Track. Under review

  4. arXiv:2405.18459  [pdf, other

    cs.IT cs.AI cs.LG stat.ME

    Probing the Information Theoretical Roots of Spatial Dependence Measures

    Authors: Zhangyu Wang, Krzysztof Janowicz, Gengchen Mai, Ivan Majic

    Abstract: Intuitively, there is a relation between measures of spatial dependence and information theoretical measures of entropy. For instance, we can provide an intuition of why spatial data is special by stating that, on average, spatial data samples contain less than expected information. Similarly, spatial data, e.g., remotely sensed imagery, that is easy to compress is also likely to show significant… ▽ More

    Submitted 23 July, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: COSIT-2024 Conference Proceedings

  5. arXiv:2405.18395  [pdf, other

    cs.LG cs.AI stat.AP

    MC-GTA: Metric-Constrained Model-Based Clustering using Goodness-of-fit Tests with Autocorrelations

    Authors: Zhangyu Wang, Gengchen Mai, Krzysztof Janowicz, Ni Lao

    Abstract: A wide range of (multivariate) temporal (1D) and spatial (2D) data analysis tasks, such as grouping vehicle sensor trajectories, can be formulated as clustering with given metric constraints. Existing metric-constrained clustering algorithms overlook the rich correlation between feature similarity and metric distance, i.e., metric autocorrelation. The model-based variations of these clustering alg… ▽ More

    Submitted 2 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: ICML-2024 Proceedings

  6. Img2Loc: Revisiting Image Geolocalization using Multi-modality Foundation Models and Image-based Retrieval-Augmented Generation

    Authors: Zhongliang Zhou, Jielu Zhang, Zihan Guan, Mengxuan Hu, Ni Lao, Lan Mu, Sheng Li, Gengchen Mai

    Abstract: Geolocating precise locations from images presents a challenging problem in computer vision and information retrieval.Traditional methods typically employ either classification, which dividing the Earth surface into grid cells and classifying images accordingly, or retrieval, which identifying locations by matching images with a database of image-location pairs. However, classification-based appro… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  7. arXiv:2402.15398  [pdf, other

    cs.LG cs.AI cs.CY

    TransFlower: An Explainable Transformer-Based Model with Flow-to-Flow Attention for Commuting Flow Prediction

    Authors: Yan Luo, Zhuoyue Wan, Yuzhong Chen, Gengchen Mai, Fu-lai Chung, Kent Larson

    Abstract: Understanding the link between urban planning and commuting flows is crucial for guiding urban development and policymaking. This research, bridging computer science and urban studies, addresses the challenge of integrating these fields with their distinct focuses. Traditional urban studies methods, like the gravity and radiation models, often underperform in complex scenarios due to their limited… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  8. arXiv:2401.11641  [pdf, other

    cs.CL

    Revolutionizing Finance with LLMs: An Overview of Applications and Insights

    Authors: Huaqin Zhao, Zhengliang Liu, Zihao Wu, Yiwei Li, Tianze Yang, Peng Shu, Shaochen Xu, Haixing Dai, Lin Zhao, Gengchen Mai, Ninghao Liu, Tianming Liu

    Abstract: In recent years, Large Language Models (LLMs) like ChatGPT have seen considerable advancements and have been applied in diverse fields. Built on the Transformer architecture, these models are trained on extensive datasets, enabling them to understand and generate human language effectively. In the financial domain, the deployment of LLMs is gaining momentum. These models are being utilized for aut… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

  9. arXiv:2312.17016  [pdf, other

    cs.CV cs.AI

    On the Promises and Challenges of Multimodal Foundation Models for Geographical, Environmental, Agricultural, and Urban Planning Applications

    Authors: Chenjiao Tan, Qian Cao, Yiwei Li, Jielu Zhang, Xiao Yang, Huaqin Zhao, Zihao Wu, Zhengliang Liu, Hao Yang, Nemin Wu, Tao Tang, Xinyue Ye, Lilong Chai, Ninghao Liu, Changying Li, Lan Mu, Tianming Liu, Gengchen Mai

    Abstract: The advent of large language models (LLMs) has heightened interest in their potential for multimodal applications that integrate language and vision. This paper explores the capabilities of GPT-4V in the realms of geography, environmental science, agriculture, and urban planning by evaluating its performance across a variety of tasks. Data sources comprise satellite imagery, aerial photos, ground-… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

    Comments: 110 Pages; 61 Figures

    ACM Class: I.2.7; I.2.10; I.4.6; I.4.8; J.2

  10. arXiv:2312.06037  [pdf, other

    cs.AI

    Multimodality of AI for Education: Towards Artificial General Intelligence

    Authors: Gyeong-Geon Lee, Lehong Shi, Ehsan Latif, Yizhu Gao, Arne Bewersdorff, Matthew Nyaaba, Shuchen Guo, Zihao Wu, Zhengliang Liu, Hui Wang, Gengchen Mai, Tiaming Liu, Xiaoming Zhai

    Abstract: This paper presents a comprehensive examination of how multimodal artificial intelligence (AI) approaches are paving the way towards the realization of Artificial General Intelligence (AGI) in educational contexts. It scrutinizes the evolution and integration of AI in educational systems, emphasizing the crucial role of multimodality, which encompasses auditory, visual, kinesthetic, and linguistic… ▽ More

    Submitted 12 December, 2023; v1 submitted 10 December, 2023; originally announced December 2023.

  11. arXiv:2310.19626  [pdf, other

    cs.AI

    Transformation vs Tradition: Artificial General Intelligence (AGI) for Arts and Humanities

    Authors: Zhengliang Liu, Yiwei Li, Qian Cao, Junwen Chen, Tianze Yang, Zihao Wu, John Hale, John Gibbs, Khaled Rasheed, Ninghao Liu, Gengchen Mai, Tianming Liu

    Abstract: Recent advances in artificial general intelligence (AGI), particularly large language models and creative image generation systems have demonstrated impressive capabilities on diverse tasks spanning the arts and humanities. However, the swift evolution of AGI has also raised critical questions about its responsible deployment in these culturally significant domains traditionally seen as profoundly… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    ACM Class: J.5; I.2.7; I.2.10

  12. Geo-knowledge-guided GPT models improve the extraction of location descriptions from disaster-related social media messages

    Authors: Yingjie Hu, Gengchen Mai, Chris Cundy, Kristy Choi, Ni Lao, Wei Liu, Gaurish Lakhanpal, Ryan Zhenqi Zhou, Kenneth Joseph

    Abstract: Social media messages posted by people during natural disasters often contain important location descriptions, such as the locations of victims. Recent research has shown that many of these location descriptions go beyond simple place names, such as city names and street names, and are difficult to extract using typical named entity recognition (NER) tools. While advanced machine learning models c… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

    Journal ref: International Journal of Geographical Information Science, 2023

  13. arXiv:2310.06213  [pdf, other

    cs.CL cs.LG

    GeoLLM: Extracting Geospatial Knowledge from Large Language Models

    Authors: Rohin Manvi, Samar Khanna, Gengchen Mai, Marshall Burke, David Lobell, Stefano Ermon

    Abstract: The application of machine learning (ML) in a range of geospatial tasks is increasingly common but often relies on globally available covariates such as satellite imagery that can either be expensive or lack predictive power. Here we explore the question of whether the vast amounts of knowledge found in Internet language corpora, now compressed within large language models (LLMs), can be leveraged… ▽ More

    Submitted 24 February, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: Accepted to ICLR 2024

  14. arXiv:2310.00413  [pdf, other

    cs.CV cs.LG eess.IV

    SSIF: Learning Continuous Image Representation for Spatial-Spectral Super-Resolution

    Authors: Gengchen Mai, Ni Lao, Weiwei Sun, Yuchi Ma, Jiaming Song, Chenlin Meng, Hongxu Ma, Jinmeng Rao, Ziyuan Li, Stefano Ermon

    Abstract: Existing digital sensors capture images at fixed spatial and spectral resolutions (e.g., RGB, multispectral, and hyperspectral images), and each combination requires bespoke machine learning models. Neural Implicit Functions partially overcome the spatial resolution challenge by representing an image in a resolution-independent way. However, they still operate at fixed, pre-defined spectral resolu… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

    MSC Class: 68T07; 68T45 ACM Class: I.4.10; I.2.10; I.4.6

  15. Building Privacy-Preserving and Secure Geospatial Artificial Intelligence Foundation Models

    Authors: Jinmeng Rao, Song Gao, Gengchen Mai, Krzysztof Janowicz

    Abstract: In recent years we have seen substantial advances in foundation models for artificial intelligence, including language, vision, and multimodal models. Recent studies have highlighted the potential of using foundation models in geospatial artificial intelligence, known as GeoAI Foundation Models, for geographic question answering, remote sensing image understanding, map generation, and location-bas… ▽ More

    Submitted 12 October, 2023; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: 1 figure

    ACM Class: I.2.0

    Journal ref: ACM SIGSPATIAL 2023

  16. arXiv:2309.07438  [pdf, other

    cs.AI cs.NI

    Towards Artificial General Intelligence (AGI) in the Internet of Things (IoT): Opportunities and Challenges

    Authors: Fei Dou, Jin Ye, Geng Yuan, Qin Lu, Wei Niu, Haijian Sun, Le Guan, Guoyu Lu, Gengchen Mai, Ninghao Liu, Jin Lu, Zhengliang Liu, Zihao Wu, Chenjiao Tan, Shaochen Xu, Xianqiao Wang, Guoming Li, Lilong Chai, Sheng Li, Jin Sun, Hongyue Sun, Yunli Shao, Changying Li, Tianming Liu, Wenzhan Song

    Abstract: Artificial General Intelligence (AGI), possessing the capacity to comprehend, learn, and execute tasks with human cognitive abilities, engenders significant anticipation and intrigue across scientific, commercial, and societal arenas. This fascination extends particularly to the Internet of Things (IoT), a landscape characterized by the interconnection of countless devices, sensors, and systems, c… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  17. arXiv:2306.17624  [pdf, other

    cs.CV cs.AI cs.LG

    Sphere2Vec: A General-Purpose Location Representation Learning over a Spherical Surface for Large-Scale Geospatial Predictions

    Authors: Gengchen Mai, Yao Xuan, Wenyun Zuo, Yutong He, Jiaming Song, Stefano Ermon, Krzysztof Janowicz, Ni Lao

    Abstract: Generating learning-friendly representations for points in space is a fundamental and long-standing problem in ML. Recently, multi-scale encoding schemes (such as Space2Vec and NeRF) were proposed to directly encode any point in 2D/3D Euclidean space as a high-dimensional vector, and has been successfully applied to various geospatial prediction and generative tasks. However, all current 2D and 3D… ▽ More

    Submitted 2 July, 2023; v1 submitted 30 June, 2023; originally announced June 2023.

    Comments: 30 Pages, 16 figures. Accepted to ISPRS Journal of Photogrammetry and Remote Sensing

    MSC Class: 68T07; 68T45 ACM Class: I.2.0; I.2.6; I.2.10; I.5.1; J.2

    Journal ref: ISPRS Journal of Photogrammetry and Remote Sensing, 2023

  18. arXiv:2306.11892  [pdf, other

    cs.CL

    Exploring New Frontiers in Agricultural NLP: Investigating the Potential of Large Language Models for Food Applications

    Authors: Saed Rezayi, Zhengliang Liu, Zihao Wu, Chandra Dhakal, Bao Ge, Haixing Dai, Gengchen Mai, Ninghao Liu, Chen Zhen, Tianming Liu, Sheng Li

    Abstract: This paper explores new frontiers in agricultural natural language processing by investigating the effectiveness of using food-related text corpora for pretraining transformer-based language models. In particular, we focus on the task of semantic matching, which involves establishing mappings between food descriptions and nutrition data. To accomplish this, we fine-tune a pre-trained transformer-b… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

  19. arXiv:2306.10095  [pdf, other

    cs.CL cs.AI cs.IR

    AD-AutoGPT: An Autonomous GPT for Alzheimer's Disease Infodemiology

    Authors: Haixing Dai, Yiwei Li, Zhengliang Liu, Lin Zhao, Zihao Wu, Suhang Song, Ye Shen, Dajiang Zhu, Xiang Li, Sheng Li, Xiaobai Yao, Lu Shi, Quanzheng Li, Zhuo Chen, Donglan Zhang, Gengchen Mai, Tianming Liu

    Abstract: In this pioneering study, inspired by AutoGPT, the state-of-the-art open-source application based on the GPT-4 large language model, we develop a novel tool called AD-AutoGPT which can conduct data collection, processing, and analysis about complex health narratives of Alzheimer's Disease in an autonomous manner via users' textual prompts. We collated comprehensive data from a variety of news sour… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: 20 pages, 4 figures

    MSC Class: 68T01; 68T50; 92C50 ACM Class: I.2.7; I.2.1; J.3

  20. arXiv:2305.03513  [pdf, other

    cs.CL cs.AI cs.LG

    ChatGraph: Interpretable Text Classification by Converting ChatGPT Knowledge to Graphs

    Authors: Yucheng Shi, Hehuan Ma, Wenliang Zhong, Qiaoyu Tan, Gengchen Mai, Xiang Li, Tianming Liu, Junzhou Huang

    Abstract: ChatGPT, as a recently launched large language model (LLM), has shown superior performance in various natural language processing (NLP) tasks. However, two major limitations hinder its potential applications: (1) the inflexibility of finetuning on downstream tasks and (2) the lack of interpretability in the decision-making process. To tackle these limitations, we propose a novel framework that lev… ▽ More

    Submitted 19 September, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Comments: 6 pages, 2 figures

  21. arXiv:2305.01118  [pdf, other

    cs.CV cs.AI cs.LG

    CSP: Self-Supervised Contrastive Spatial Pre-Training for Geospatial-Visual Representations

    Authors: Gengchen Mai, Ni Lao, Yutong He, Jiaming Song, Stefano Ermon

    Abstract: Geo-tagged images are publicly available in large quantities, whereas labels such as object classes are rather scarce and expensive to collect. Meanwhile, contrastive learning has achieved tremendous success in various natural image and language tasks with limited labeled data. However, existing methods fail to fully leverage geospatial information, which can be paramount to distinguishing objects… ▽ More

    Submitted 8 May, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

    Comments: In: ICML 2023, Jul 23 - 29, 2023, Honolulu, Hawaii, USA

    MSC Class: 68T07; 68T45 ACM Class: I.2.10; I.5.4; I.5.1; J.2

  22. arXiv:2304.12479  [pdf, other

    cs.AI

    AGI: Artificial General Intelligence for Education

    Authors: Ehsan Latif, Gengchen Mai, Matthew Nyaaba, Xuansheng Wu, Ninghao Liu, Guoyu Lu, Sheng Li, Tianming Liu, Xiaoming Zhai

    Abstract: Artificial general intelligence (AGI) has gained global recognition as a future technology due to the emergence of breakthrough large language models and chatbots such as GPT-4 and ChatGPT, respectively. Compared to conventional AI models, typically designed for a limited range of tasks, demand significant amounts of domain-specific data for training and may not always consider intricate interpers… ▽ More

    Submitted 13 March, 2024; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: Position Paper on AGI for Education, Submitted to Technology and Society

  23. arXiv:2304.10597  [pdf, other

    cs.CV cs.AI

    Text2Seg: Remote Sensing Image Semantic Segmentation via Text-Guided Visual Foundation Models

    Authors: Jielu Zhang, Zhongliang Zhou, Gengchen Mai, Mengxuan Hu, Zihan Guan, Sheng Li, Lan Mu

    Abstract: Remote sensing imagery has attracted significant attention in recent years due to its instrumental role in global environmental monitoring, land usage monitoring, and more. As image databases grow each year, performing automatic segmentation with deep learning models has gradually become the standard approach for processing the data. Despite the improved performance of current models, certain limi… ▽ More

    Submitted 24 August, 2024; v1 submitted 20 April, 2023; originally announced April 2023.

    Comments: 10 pages, 3 figures

  24. arXiv:2304.06798  [pdf, other

    cs.AI cs.CL cs.CV

    On the Opportunities and Challenges of Foundation Models for Geospatial Artificial Intelligence

    Authors: Gengchen Mai, Weiming Huang, Jin Sun, Suhang Song, Deepak Mishra, Ninghao Liu, Song Gao, Tianming Liu, Gao Cong, Yingjie Hu, Chris Cundy, Ziyuan Li, Rui Zhu, Ni Lao

    Abstract: Large pre-trained models, also known as foundation models (FMs), are trained in a task-agnostic manner on large-scale data and can be adapted to a wide range of downstream tasks by fine-tuning, few-shot, or even zero-shot learning. Despite their successes in language and vision tasks, we have yet seen an attempt to develop foundation models for geospatial artificial intelligence (GeoAI). In this w… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

    ACM Class: I.2.0; I.2.4; I.2.7; I.2.10; I.5.1

  25. arXiv:2304.06136  [pdf, other

    cs.AI cs.CY

    AGI for Agriculture

    Authors: Guoyu Lu, Sheng Li, Gengchen Mai, Jin Sun, Dajiang Zhu, Lilong Chai, Haijian Sun, Xianqiao Wang, Haixing Dai, Ninghao Liu, Rui Xu, Daniel Petti, Changying Li, Tianming Liu, Changying Li

    Abstract: Artificial General Intelligence (AGI) is poised to revolutionize a variety of sectors, including healthcare, finance, transportation, and education. Within healthcare, AGI is being utilized to analyze clinical medical notes, recognize patterns in patient data, and aid in patient management. Agriculture is another critical sector that impacts the lives of individuals worldwide. It serves as a found… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

  26. arXiv:2304.04893  [pdf, other

    cs.AI

    EVKG: An Interlinked and Interoperable Electric Vehicle Knowledge Graph for Smart Transportation System

    Authors: Yanlin Qi, Gengchen Mai, Rui Zhu, Michael Zhang

    Abstract: Over the past decade, the electric vehicle industry has experienced unprecedented growth and diversification, resulting in a complex ecosystem. To effectively manage this multifaceted field, we present an EV-centric knowledge graph (EVKG) as a comprehensive, cross-domain, extensible, and open geospatial knowledge management system. The EVKG encapsulates essential EV-related knowledge, including EV… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

  27. arXiv:2209.15458  [pdf, other

    cs.CV cs.AI cs.LG

    Towards General-Purpose Representation Learning of Polygonal Geometries

    Authors: Gengchen Mai, Chiyu Jiang, Weiwei Sun, Rui Zhu, Yao Xuan, Ling Cai, Krzysztof Janowicz, Stefano Ermon, Ni Lao

    Abstract: Neural network representation learning for spatial data is a common need for geographic artificial intelligence (GeoAI) problems. In recent years, many advancements have been made in representation learning for points, polylines, and networks, whereas little progress has been made for polygons, especially complex polygonal geometries. In this work, we focus on developing a general-purpose polygon… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: 58 pages, 20 figures, Accepted to GeoInformatica

    MSC Class: 68T07; 68T10; 68T30 ACM Class: I.2.6; I.3.5; I.5.4

  28. arXiv:2201.10489  [pdf, other

    cs.CV cs.AI cs.LG

    Sphere2Vec: Multi-Scale Representation Learning over a Spherical Surface for Geospatial Predictions

    Authors: Gengchen Mai, Yao Xuan, Wenyun Zuo, Krzysztof Janowicz, Ni Lao

    Abstract: Generating learning-friendly representations for points in a 2D space is a fundamental and long-standing problem in machine learning. Recently, multi-scale encoding schemes (such as Space2Vec) were proposed to directly encode any point in 2D space as a high-dimensional vector, and has been successfully applied to various (geo)spatial prediction tasks. However, a map projection distortion problem r… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

    ACM Class: I.2.10; I.5.1

  29. arXiv:2112.00970  [pdf, other

    cs.AI cs.HC

    Narrative Cartography with Knowledge Graphs

    Authors: Gengchen Mai, Weiming Huang, Ling Cai, Rui Zhu, Ni Lao

    Abstract: Narrative cartography is a discipline which studies the interwoven nature of stories and maps. However, conventional geovisualization techniques of narratives often encounter several prominent challenges, including the data acquisition & integration challenge and the semantic challenge. To tackle these challenges, in this paper, we propose the idea of narrative cartography with knowledge graphs (K… ▽ More

    Submitted 10 March, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: 33 pages, 5 figures, Accepted to Journal of Geovisualization and Spatial Analysis

    MSC Class: 68T30 ACM Class: I.2.4

  30. Time in a Box: Advancing Knowledge Graph Completion with Temporal Scopes

    Authors: Ling Cai, Krzysztof Janowic, Bo Yan, Rui Zhu, Gengchen Mai

    Abstract: Almost all statements in knowledge bases have a temporal scope during which they are valid. Hence, knowledge base completion (KBC) on temporal knowledge bases (TKB), where each statement \textit{may} be associated with a temporal scope, has attracted growing attention. Prior works assume that each statement in a TKB \textit{must} be associated with a temporal scope. This ignores the fact that the… ▽ More

    Submitted 12 November, 2021; originally announced November 2021.

  31. A Review of Location Encoding for GeoAI: Methods and Applications

    Authors: Gengchen Mai, Krzysztof Janowicz, Yingjie Hu, Song Gao, Bo Yan, Rui Zhu, Ling Cai, Ni Lao

    Abstract: A common need for artificial intelligence models in the broader geoscience is to represent and encode various types of spatial data, such as points (e.g., points of interest), polylines (e.g., trajectories), polygons (e.g., administrative regions), graphs (e.g., transportation networks), or rasters (e.g., remote sensing images), in a hidden embedding space so that they can be readily incorporated… ▽ More

    Submitted 10 March, 2022; v1 submitted 7 November, 2021; originally announced November 2021.

    Comments: 32 Pages, 5 Figures, Accepted to International Journal of Geographical Information Science, 2021

    MSC Class: 68T07 ACM Class: I.2.0; I.5.1

    Journal ref: International Journal of Geographical Information Science, 2021

  32. arXiv:2105.09392  [pdf, other

    cs.CL cs.AI

    Geographic Question Answering: Challenges, Uniqueness, Classification, and Future Directions

    Authors: Gengchen Mai, Krzysztof Janowicz, Rui Zhu, Ling Cai, Ni Lao

    Abstract: As an important part of Artificial Intelligence (AI), Question Answering (QA) aims at generating answers to questions phrased in natural language. While there has been substantial progress in open-domain question answering, QA systems are still struggling to answer questions which involve geographic entities or concepts and that require spatial operations. In this paper, we discuss the problem of… ▽ More

    Submitted 19 May, 2021; originally announced May 2021.

    Comments: 20 pages, 3 figure, Full paper accepted to AGILE 2021

    MSC Class: 68T50; 68T30; 68T07; 03B65; 91F20 ACM Class: I.2.7; I.2.4

    Journal ref: AGILE 2021

  33. arXiv:2004.14171  [pdf, other

    cs.DB cs.AI cs.CL cs.LG stat.ML

    SE-KGE: A Location-Aware Knowledge Graph Embedding Model for Geographic Question Answering and Spatial Semantic Lifting

    Authors: Gengchen Mai, Krzysztof Janowicz, Ling Cai, Rui Zhu, Blake Regalia, Bo Yan, Meilin Shi, Ni Lao

    Abstract: Learning knowledge graph (KG) embeddings is an emerging technique for a variety of downstream tasks such as summarization, link prediction, information retrieval, and question answering. However, most existing KG embedding models neglect space and, therefore, do not perform well when applied to (geo)spatial data and tasks. For those models that consider space, most of them primarily rely on some n… ▽ More

    Submitted 25 April, 2020; originally announced April 2020.

    Comments: Accepted to Transactions in GIS

    ACM Class: I.2.4; I.1.3; I.2.2

    Journal ref: Transactions in GIS, 2020

  34. arXiv:2003.06561  [pdf, other

    cs.IR cs.CL

    Semantically-Enriched Search Engine for Geoportals: A Case Study with ArcGIS Online

    Authors: Gengchen Mai, Krzysztof Janowicz, Sathya Prasad, Meilin Shi, Ling Cai, Rui Zhu, Blake Regalia, Ni Lao

    Abstract: Many geoportals such as ArcGIS Online are established with the goal of improving geospatial data reusability and achieving intelligent knowledge discovery. However, according to previous research, most of the existing geoportals adopt Lucene-based techniques to achieve their core search functionality, which has a limited ability to capture the user's search intentions. To better understand a user'… ▽ More

    Submitted 14 March, 2020; originally announced March 2020.

    Comments: 18 pages; Accepted to AGILE 2020 as a full paper GitHub Code Repository: https://github.com/gengchenmai/arcgis-online-search-engine

    ACM Class: H.3.3

    Journal ref: AGILE 2020, Jun. 16 - 19, 2020, Chania, Crete, Greece

  35. arXiv:2003.00824  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Multi-Scale Representation Learning for Spatial Feature Distributions using Grid Cells

    Authors: Gengchen Mai, Krzysztof Janowicz, Bo Yan, Rui Zhu, Ling Cai, Ni Lao

    Abstract: Unsupervised text encoding models have recently fueled substantial progress in NLP. The key idea is to use neural networks to convert words in texts to vector space representations based on word positions in a sentence and their contexts, which are suitable for end-to-end training of downstream tasks. We see a strikingly similar situation in spatial analysis, which focuses on incorporating both ab… ▽ More

    Submitted 15 February, 2020; originally announced March 2020.

    Comments: 15 pages; Accepted to ICLR 2020 as a spotlight paper

    ACM Class: I.2.0; I.2.6; I.5.1; J.2

    Journal ref: ICLR 2020, Apr. 26 - 30, 2020, Addis Ababa, ETHIOPIA

  36. arXiv:1910.00702  [pdf, other

    cs.LG cs.CL stat.ML

    TransGCN:Coupling Transformation Assumptions with Graph Convolutional Networks for Link Prediction

    Authors: Ling Cai, Bo Yan, Gengchen Mai, Krzysztof Janowicz, Rui Zhu

    Abstract: Link prediction is an important and frequently studied task that contributes to an understanding of the structure of knowledge graphs (KGs) in statistical relational learning. Inspired by the success of graph convolutional networks (GCN) in modeling graph data, we propose a unified GCN framework, named TransGCN, to address this task, in which relation and entity embeddings are learned simultaneous… ▽ More

    Submitted 1 October, 2019; originally announced October 2019.

  37. arXiv:1910.00084  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Contextual Graph Attention for Answering Logical Queries over Incomplete Knowledge Graphs

    Authors: Gengchen Mai, Krzysztof Janowicz, Bo Yan, Rui Zhu, Ling Cai, Ni Lao

    Abstract: Recently, several studies have explored methods for using KG embedding to answer logical queries. These approaches either treat embedding learning and query answering as two separated learning tasks, or fail to deal with the variability of contributions from different query paths. We proposed to leverage a graph attention mechanism to handle the unequal contribution of different query paths. Howev… ▽ More

    Submitted 30 September, 2019; originally announced October 2019.

    Comments: 8 pages, 3 figures, camera ready version of article accepted to K-CAP 2019, Marina del Rey, California, United States

    ACM Class: I.2.4; I.1.3

    Journal ref: K-CAP 2019, Nov. 19 - 21, 2019, Marina del Rey, CA, USA

  38. arXiv:1810.02802  [pdf, ps, other

    cs.AI cs.CL cs.IR

    POIReviewQA: A Semantically Enriched POI Retrieval and Question Answering Dataset

    Authors: Gengchen Mai, Krzysztof Janowicz, Cheng He, Sumang Liu, Ni Lao

    Abstract: Many services that perform information retrieval for Points of Interest (POI) utilize a Lucene-based setup with spatial filtering. While this type of system is easy to implement it does not make use of semantics but relies on direct word matches between a query and reviews leading to a loss in both precision and recall. To study the challenging task of semantically enriching POIs from unstructured… ▽ More

    Submitted 5 October, 2018; originally announced October 2018.

    Journal ref: 12th Workshop on Geographic Information Retrieval (GIR 2018)

  39. On the Reconstruction of Face Images from Deep Face Templates

    Authors: Guangcan Mai, Kai Cao, Pong C. Yuen, Anil K. Jain

    Abstract: State-of-the-art face recognition systems are based on deep (convolutional) neural networks. Therefore, it is imperative to determine to what extent face templates derived from deep networks can be inverted to obtain the original face image. In this paper, we study the vulnerabilities of a state-of-the-art face recognition system based on template reconstruction attack. We propose a neighborly de-… ▽ More

    Submitted 28 April, 2018; v1 submitted 2 March, 2017; originally announced March 2017.

    Comments: To appear in TPAMI, IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018