TAROT: A Hierarchical Framework with Multitask Co-Pretraining on Semi-Structured Data towards Effective Person-Job Fit

Cao, Yihan; Chen, Xu; Du, Lun; Chen, Hao; Fu, Qiang; Han, Shi; Du, Yushu; Kang, Yanbin; Lu, Guangming; Li, Zi

arXiv:2401.07525 (cs)

[Submitted on 15 Jan 2024 (v1), last revised 17 Jan 2024 (this version, v2)]

Abstract: Person-job fit is an essential part of online recruitment platforms in serving various downstream applications like Job Search and Candidate Recommendation. Recently, pretrained large language models have further enhanced the effectiveness by leveraging richer textual information in user profiles and job descriptions apart from user behavior features and job metadata. However, the general domain-oriented design struggles to capture the unique structural information within user profiles and job descriptions, leading to a loss of latent semantic correlations. We propose TAROT, a hierarchical multitask co-pretraining framework, to better utilize structural and semantic information for informative text embeddings. TAROT targets semi-structured text in profiles and jobs, and it is co-pretrained with multi-grained pretraining tasks to constrain the acquired semantic information at each level. Experiments on a real-world LinkedIn dataset show significant performance improvements, demonstrating its effectiveness in person-job fit tasks.

Comments:	ICASSP 2024 camera ready. 5 pages, 1 figure, 3 tables
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2401.07525 [cs.CL]
	(or arXiv:2401.07525v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2401.07525

Submission history

From: Yihan Cao
[v1] Mon, 15 Jan 2024 07:57:58 UTC (327 KB)
[v2] Wed, 17 Jan 2024 23:06:15 UTC (328 KB)
