
Showing 1–17 of 17 results for author: Lam, H T

Searching in archive cs.
  1. arXiv:2406.09837  [pdf, other]

    cs.LG

    TabularFM: An Open Framework For Tabular Foundational Models

    Authors: Quan M. Tran, Suong N. Hoang, Lam M. Nguyen, Dzung Phan, Hoang Thanh Lam

    Abstract: Foundational models (FMs), pretrained on extensive datasets using self-supervised techniques, are capable of learning generalized patterns from large amounts of data. This reduces the need for extensive labeled datasets for each new task, saving both time and resources by leveraging the broad knowledge base established during pretraining. Most research on FMs has primarily focused on unstructured…

    Submitted 17 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

  2. arXiv:2406.02245  [pdf, other]

    cs.CL cs.IR cs.LG

    Description Boosting for Zero-Shot Entity and Relation Classification

    Authors: Gabriele Picco, Leopold Fuchs, Marcos Martínez Galindo, Alberto Purpura, Vanessa López, Hoang Thanh Lam

    Abstract: Zero-shot entity and relation classification models leverage available external information of unseen classes -- e.g., textual descriptions -- to annotate input text data. Thanks to the minimum data requirement, Zero-Shot Learning (ZSL) methods have high value in practice, especially in applications where labeled data is scarce. Even though recent research in ZSL has demonstrated significant resul…

    Submitted 4 June, 2024; originally announced June 2024.

  3. arXiv:2307.13497  [pdf, other]

    cs.CL cs.AI cs.LG

    Zshot: An Open-source Framework for Zero-Shot Named Entity Recognition and Relation Extraction

    Authors: Gabriele Picco, Marcos Martínez Galindo, Alberto Purpura, Leopold Fuchs, Vanessa López, Hoang Thanh Lam

    Abstract: The Zero-Shot Learning (ZSL) task pertains to the identification of entities or relations in texts that were not seen during training. ZSL has emerged as a critical research area due to the scarcity of labeled data in specific domains, and its applications have grown significantly in recent years. With the advent of large pretrained language models, several novel methods have been proposed, result…

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: Accepted at ACL 2023

    Journal ref: Association for Computational Linguistics. 3 (2023) 357-368

  4. arXiv:2306.12802  [pdf, other]

    cs.LG cs.AI q-bio.BM

    Otter-Knowledge: benchmarks of multimodal knowledge graph representation learning from different sources for drug discovery

    Authors: Hoang Thanh Lam, Marco Luca Sbodio, Marcos Martínez Galindo, Mykhaylo Zayats, Raúl Fernández-Díaz, Víctor Valls, Gabriele Picco, Cesar Berrospi Ramis, Vanessa López

    Abstract: Recent research on predicting the binding affinity between drug molecules and proteins use representations learned, through unsupervised learning techniques, from large databases of molecule SMILES and protein sequences. While these representations have significantly enhanced the predictions, they are usually based on a limited set of modalities, and they do not exploit available knowledge about e…

    Submitted 19 October, 2023; v1 submitted 22 June, 2023; originally announced June 2023.

  5. arXiv:2302.03525  [pdf, other]

    cs.IR

    Multi-Task Deep Recommender Systems: A Survey

    Authors: Yuhao Wang, Ha Tsz Lam, Yi Wong, Ziru Liu, Xiangyu Zhao, Yichao Wang, Bo Chen, Huifeng Guo, Ruiming Tang

    Abstract: Multi-task learning (MTL) aims at learning related tasks in a unified model to achieve mutual improvement among tasks considering their shared knowledge. It is an important topic in recommendation due to the demand for multi-task prediction considering performance and efficiency. Although MTL has been well studied and developed, there is still a lack of systematic review in the recommendation comm…

    Submitted 8 February, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

  6. arXiv:2202.03558  [pdf, other]

    cs.LG cs.AI

    Attacking c-MARL More Effectively: A Data Driven Approach

    Authors: Nhan H. Pham, Lam M. Nguyen, Jie Chen, Hoang Thanh Lam, Subhro Das, Tsui-Wei Weng

    Abstract: In recent years, a proliferation of methods were developed for cooperative multi-agent reinforcement learning (c-MARL). However, the robustness of c-MARL agents against adversarial attacks has been rarely explored. In this paper, we propose to evaluate the robustness of c-MARL agents via a model-based approach, named c-MBA. Our proposed formulation can craft much stronger adversarial state perturb…

    Submitted 10 September, 2023; v1 submitted 7 February, 2022; originally announced February 2022.

  7. arXiv:2110.09131  [pdf, other]

    cs.CL cs.AI

    Ensembling Graph Predictions for AMR Parsing

    Authors: Hoang Thanh Lam, Gabriele Picco, Yufang Hou, Young-Suk Lee, Lam M. Nguyen, Dzung T. Phan, Vanessa López, Ramon Fernandez Astudillo

    Abstract: In many machine learning tasks, models are trained to predict structure data such as graphs. For example, in natural language processing, it is very common to parse texts into dependency trees or abstract meaning representation (AMR) graphs. On the other hand, ensemble methods combine predictions from multiple models to create a new one that is more robust and accurate than individual predictions.…

    Submitted 24 January, 2022; v1 submitted 18 October, 2021; originally announced October 2021.

    Comments: Published at NeurIPS 2021

  8. arXiv:2109.08460  [pdf, other]

    cs.CL

    Neural Unification for Logic Reasoning over Natural Language

    Authors: Gabriele Picco, Hoang Thanh Lam, Marco Luca Sbodio, Vanessa Lopez Garcia

    Abstract: Automated Theorem Proving (ATP) deals with the development of computer programs being able to show that some conjectures (queries) are a logical consequence of a set of axioms (facts and rules). There exists several successful ATPs where conjectures and axioms are formally provided (e.g. formalised as First Order Logic formulas). Recent approaches, such as (Clark et al., 2020), have proposed trans…

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: Accepted at EMNLP 2021 Findings

  9. arXiv:1806.05886  [pdf, other]

    cs.CV

    Automated Image Data Preprocessing with Deep Reinforcement Learning

    Authors: Tran Ngoc Minh, Mathieu Sinn, Hoang Thanh Lam, Martin Wistuba

    Abstract: Data preparation, i.e. the process of transforming raw data into a format that can be used for training effective machine learning models, is a tedious and time-consuming task. For image data, preprocessing typically involves a sequence of basic transformations such as cropping, filtering, rotating or flipping images. Currently, data scientists decide manually based on their experience which trans…

    Submitted 29 April, 2021; v1 submitted 15 June, 2018; originally announced June 2018.

  10. arXiv:1802.03628  [pdf, other]

    cs.LG stat.ML

    Learning Correlation Space for Time Series

    Authors: Han Qiu, Hoang Thanh Lam, Francesco Fusco, Mathieu Sinn

    Abstract: We propose an approximation algorithm for efficient correlation search in time series data. In our method, we use Fourier transform and neural network to embed time series into a low-dimensional Euclidean space. The given space is learned such that time series correlation can be effectively approximated from Euclidean distance between corresponding embedded vectors. Therefore, search for correlate…

    Submitted 15 May, 2018; v1 submitted 10 February, 2018; originally announced February 2018.

  11. arXiv:1801.05372  [pdf, other]

    cs.AI cs.LG

    Neural Feature Learning From Relational Database

    Authors: Hoang Thanh Lam, Tran Ngoc Minh, Mathieu Sinn, Beat Buesser, Martin Wistuba

    Abstract: Feature engineering is one of the most important but most tedious tasks in data science. This work studies automation of feature learning from relational database. We first prove theoretically that finding the optimal features from relational data for predictive tasks is NP-hard. We propose an efficient rule-based approach based on heuristics and a deep neural network to automatically learn approp…

    Submitted 15 June, 2019; v1 submitted 16 January, 2018; originally announced January 2018.

  12. arXiv:1706.00327  [pdf, other]

    cs.DB cs.AI

    One button machine for automating feature engineering in relational databases

    Authors: Hoang Thanh Lam, Johann-Michael Thiebaut, Mathieu Sinn, Bei Chen, Tiep Mai, Oznur Alkan

    Abstract: Feature engineering is one of the most important and time consuming tasks in predictive analytics projects. It involves understanding domain knowledge and data exploration to discover relevant hand-crafted features from raw data. In this paper, we introduce a system called One Button Machine, or OneBM for short, which automates feature discovery in relational databases. OneBM automatically perform…

    Submitted 1 June, 2017; originally announced June 2017.

  13. arXiv:1509.05257  [pdf, other]

    stat.ML cs.AI cs.CY cs.LG

    (Blue) Taxi Destination and Trip Time Prediction from Partial Trajectories

    Authors: Hoang Thanh Lam, Ernesto Diaz-Aviles, Alessandra Pascale, Yiannis Gkoufas, Bei Chen

    Abstract: Real-time estimation of destination and travel time for taxis is of great importance for existing electronic dispatch systems. We present an approach based on trip matching and ensemble learning, in which we leverage the patterns observed in a dataset of roughly 1.7 million taxi journeys to predict the corresponding final destination and travel time for ongoing taxi trips, as a solution for the EC…

    Submitted 17 September, 2015; originally announced September 2015.

    Comments: ECML/PKDD Discovery Challenge 2015

    ACM Class: I.2.6; I.5.2

  14. arXiv:1412.7990  [pdf, other]

    cs.IR cs.CY cs.LG

    Predicting User Engagement in Twitter with Collaborative Ranking

    Authors: Ernesto Diaz-Aviles, Hoang Thanh Lam, Fabio Pinelli, Stefano Braghin, Yiannis Gkoufas, Michele Berlingerio, Francesco Calabrese

    Abstract: Collaborative Filtering (CF) is a core component of popular web-based services such as Amazon, YouTube, Netflix, and Twitter. Most applications use CF to recommend a small set of items to the user. For instance, YouTube presents to a user a list of top-n videos she would likely watch next based on her rating and viewing history. Current methods of CF evaluation have been focused on assessing the q…

    Submitted 26 December, 2014; originally announced December 2014.

    Comments: RecSysChallenge'14 at RecSys 2014, October 10, 2014, Foster City, CA, USA

    ACM Class: H.3.3; I.2.6

    Journal ref: In Proceedings of the 2014 Recommender Systems Challenge (RecSysChallenge'14). ACM, New York, NY, USA, Pages 41, 6 pages

  15. arXiv:1412.4218  [pdf]

    physics.soc-ph cs.NE cs.SI

    Optimization of Reliability of Network of Given Connectivity using Genetic Algorithm

    Authors: Ho Tat Lam, Kwok Yip Szeto

    Abstract: Reliability is one of the important measures of how well the system meets its design objective, and mathematically is the probability that a system will perform satisfactorily for at least a given period of time. When the system is described by a connected network of N components (nodes) and their L connection (links), the reliability of the system becomes a difficult network design problem which…

    Submitted 13 December, 2014; originally announced December 2014.

    Comments: 9 pages, 10 figures, 3 tables

  16. arXiv:1403.4165  [pdf, ps, other]

    cs.CR math.GR

    Heisenberg Groups as Platform for the AAG key-exchange protocol

    Authors: Delaram Kahrobaei, Ha T. Lam

    Abstract: Garber, Kahrobaei, and Lam studied polycyclic groups generated by number field as platform for the AAG key-exchange protocol. In this paper, we discuss the use of a different kind of polycyclic groups, Heisenberg groups, as a platform group for AAG by submitting Heisenberg groups to one of AAG's major attacks, the length-based attack.

    Submitted 17 March, 2014; originally announced March 2014.

    Comments: arXiv admin note: text overlap with arXiv:1305.0548

  17. arXiv:1305.0548  [pdf, ps, other]

    math.GR cs.CR

    Length-based attacks in polycyclic groups

    Authors: David Garber, Delaram Kahrobaei, Ha T. Lam

    Abstract: After the Anshel-Anshel-Goldfeld (AAG) key-exchange protocol was introduced in 1999, it was implemented and studied with braid groups and with the Thompson group as its underlying platforms. The length-based attack, introduced by Hughes and Tannenbaum, has been used to extensively study AAG with the braid group as the underlying platform. Meanwhile, a new platform, using polycyclic groups, was pro…

    Submitted 22 November, 2014; v1 submitted 2 May, 2013; originally announced May 2013.

    Comments: J. Math. Crypt. 2014