230
Views
0
CrossRef citations to date
0
Altmetric
Research Article

An Empirical Job Matching Model based on Expert Human Knowledge: A Mixed-Methods Approach

ORCID Icon, ORCID Icon, ORCID Icon & ORCID Icon
Article: 2364158 | Received 08 Feb 2024, Accepted 30 May 2024, Published online: 27 Jun 2024

Figures & data

Figure 1. General methodological diagram.

The flowchart is divided into three main sections, each represented by a different background color. CV Preference of Human Experts (light blue background), Human Level Performance Determination (light green background) and Model for Job-Candidates Matching (light red background).

Figure 1. General methodological diagram.The flowchart is divided into three main sections, each represented by a different background color. CV Preference of Human Experts (light blue background), Human Level Performance Determination (light green background) and Model for Job-Candidates Matching (light red background).

Table 1. Distribution of respondents working in Mexico according to data from the Stack Overflow 2023 Developer Survey in segmentation used for maximum variation sampling.

Table 2. Number of professionals to sample according to maximum variation technique through segmentation criteria of years of experience and number of employees in their current company.

Table 3. Number of interviewed professionals organized according to the criteria of the maximum variation sample.

Table 4. Key names of the CVs used in the different sets of the evaluation experiment.

Table 5. Description of the features that each of the CVs used in the evaluation experiment had to meet.

Figure 2. Baseline model architecture. The vi represents embedding vectors. Possible values of i are jp (job posting), and cv (curriculum vitae). The acronym UE stands for “Universal encoder”.

Figure 2. Baseline model architecture. The vi represents embedding vectors. Possible values of i are jp (job posting), and cv (curriculum vitae). The acronym UE stands for “Universal encoder”.

Figure 3. Experimental model architecture. The vi represents embedding vectors. Possible values of i are jp (job posting), and cv (curriculum vitae). The acronym UE stands for “Universal encoder”.

The experimental model considers seniority requirements, semantic evaluation, and Choquet integral computation.

Figure 3. Experimental model architecture. The vi represents embedding vectors. Possible values of i are jp (job posting), and cv (curriculum vitae). The acronym UE stands for “Universal encoder”.The experimental model considers seniority requirements, semantic evaluation, and Choquet integral computation.

Figure 4. Siamese network architecture used for training the sentence embedding model. The vi represents embedding vectors. Possible values of i are a (the job anchor), p (the positive example), and n (the negative example). The acronym UE stands for “Universal encoder”.

Figure 4. Siamese network architecture used for training the sentence embedding model. The vi represents embedding vectors. Possible values of i are a (the job anchor), p (the positive example), and n (the negative example). The acronym UE stands for “Universal encoder”.

Figure 5. Sentence embedding model architecture.

Figure 5. Sentence embedding model architecture.

Figure 6. t-SNE 2-dimensionality reduction of the test set’s sentence embeddings with CVs and job descriptions hue by occupational areas.

Figure 6. t-SNE 2-dimensionality reduction of the test set’s sentence embeddings with CVs and job descriptions hue by occupational areas.

Figure 7. Ranking data’s 2-dimensional singular value decomposition for every set evaluation.

Figure 7. Ranking data’s 2-dimensional singular value decomposition for every set evaluation.

Table 6. Descriptive statistics for analysis of ranking evaluation data. On the top: pair CV frequencies used on the different evaluation sets. On the bottom: marginal frequency distribution for each CV used on the different evaluation sets (RP stands for “Relative Position”).

Table 7. Qualities of the factor space X and the reason behind its inclusion.

Figure 8. Mean precision and mean recall for the CVs classification as suitable or unsuitable, using baseline and experimental models, for every set evaluation.

Mean precision and mean recall for the CVs classification for every set evaluation.
Figure 8. Mean precision and mean recall for the CVs classification as suitable or unsuitable, using baseline and experimental models, for every set evaluation.

Data Availability Statement

The data that support the findings of this study are openly available in figshare at https://doi.org/10.6084/m9.figshare.25127627 and https://doi.org/10.6084/m9.figshare.25127564.