Zum Hauptinhalt springen

Showing 1–2 of 2 results for author: Golac, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.01876  [pdf, other

    cs.DB cs.AI cs.CL cs.IR cs.LG

    GRAM: Generative Retrieval Augmented Matching of Data Schemas in the Context of Data Security

    Authors: Xuanqing Liu, Luyang Kong, Runhui Wang, Patrick Song, Austin Nevins, Henrik Johnson, Nimish Amlathe, Davor Golac

    Abstract: Schema matching constitutes a pivotal phase in the data ingestion process for contemporary database systems. Its objective is to discern pairwise similarities between two sets of attributes, each associated with a distinct data table. This challenge emerges at the initial stages of data analytics, such as when incorporating a third-party table into existing databases to inform business insights. G… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: KDD 2024 Camera Ready; 11 pages, 8 figures

  2. arXiv:2401.18064  [pdf, other

    cs.IR cs.DB

    Neural Locality Sensitive Hashing for Entity Blocking

    Authors: Runhui Wang, Luyang Kong, Yefan Tao, Andrew Borthwick, Davor Golac, Henrik Johnson, Shadie Hijazi, Dong Deng, Yongfeng Zhang

    Abstract: Locality-sensitive hashing (LSH) is a fundamental algorithmic technique widely employed in large-scale data processing applications, such as nearest-neighbor search, entity resolution, and clustering. However, its applicability in some real-world scenarios is limited due to the need for careful design of hashing functions that align with specific metrics. Existing LSH-based Entity Blocking solutio… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.