Search in:

Applied Artificial Intelligence

An International Journal

Volume 38, 2024 - Issue 1

Submit an article Journal homepage

Open access

230

Views

CrossRef citations to date

Altmetric

Research Article

An Empirical Job Matching Model based on Expert Human Knowledge: A Mixed-Methods Approach

María Elena Martínez-ManzanaresDepartamento de Matemáticas, Universidad de Sonora, Hermosillo, MéxicoCorrespondence[email protected]

https://orcid.org/0009-0008-7121-8265 View further author information

Jordan Joel Urias-ParamoDepartamento de Matemáticas, Universidad de Sonora, Hermosillo, México

https://orcid.org/0009-0003-2565-8562 View further author information

Julio Waissman-VilanovaDepartamento de Matemáticas, Universidad de Sonora, Hermosillo, México

https://orcid.org/0000-0002-1040-1081 View further author information

Gudelia Figueroa-PreciadoDepartamento de Matemáticas, Universidad de Sonora, Hermosillo, México

https://orcid.org/0000-0002-0758-2061 View further author information

Article: 2364158 | Received 08 Feb 2024, Accepted 30 May 2024, Published online: 27 Jun 2024

Cite this article
https://doi.org/10.1080/08839514.2024.2364158
CrossMark

Full Article
Figures & data
References
Citations
Metrics
Licensing
Reprints & Permissions
View PDF PDF View EPUB EPUB

Figures & data

Figure 1. General methodological diagram.

The flowchart is divided into three main sections, each represented by a different background color. CV Preference of Human Experts (light blue background), Human Level Performance Determination (light green background) and Model for Job-Candidates Matching (light red background).

Table 1. Distribution of respondents working in Mexico according to data from the Stack Overflow 2023 Developer Survey in segmentation used for maximum variation sampling.

Download CSV Display Table

Table 2. Number of professionals to sample according to maximum variation technique through segmentation criteria of years of experience and number of employees in their current company.

Download CSV Display Table

Table 3. Number of interviewed professionals organized according to the criteria of the maximum variation sample.

Download CSV Display Table

Table 4. Key names of the CVs used in the different sets of the evaluation experiment.

Download CSV Display Table

Table 5. Description of the features that each of the CVs used in the evaluation experiment had to meet.

Download CSV Display Table

Figure 2. Baseline model architecture. The $v_{i}$ represents embedding vectors. Possible values of $i$ are $jp$ (job posting), and $cv$ (curriculum vitae). The acronym UE stands for “Universal encoder”.

The diagram illustrates the baseline model architecture for matching job postings with CVs using sentence embedding models and cosine similarity. Input Elements: Job Posting: The process starts with a job posting (blue box) which is processed by a Universal Encoder (black box). Sentence Embedding Models: Both the job posting and CV are converted into sentence embeddings by the Sentence Embedding Model (green boxes). Embedding Vectors: The output of the sentence embedding models are two vectors: embedding vector for the job posting and embedding vector for the CV. Cosine Similarity: These vectors are then compared using cosine similarity (blue box), which measures the similarity between the job posting and the CV. Threshold Evaluation: The cosine similarity score is evaluated against a threshold (blue box) to determine the suitability of the candidate. Classification: The final step is classification (green box) where candidates are labeled as suitable (1) or unsuitable (0) based on the threshold evaluation. Arrows indicate the flow of information through the various steps of the model architecture.

Figure 3. Experimental model architecture. The $v_{i}$ represents embedding vectors. Possible values of $i$ are $jp$ (job posting), and $cv$ (curriculum vitae). The acronym UE stands for “Universal encoder”.

The experimental model considers seniority requirements, semantic evaluation, and Choquet integral computation.

Figure 4. Siamese network architecture used for training the sentence embedding model. The $v_{i}$ represents embedding vectors. Possible values of $i$ are $a$ (the job anchor), $p$ (the positive example), and $n$ (the negative example). The acronym UE stands for “Universal encoder”.

Figure 5. Sentence embedding model architecture.

Figure 6. t-SNE 2-dimensionality reduction of the test set’s sentence embeddings with CVs and job descriptions hue by occupational areas.

Figure 7. Ranking data’s 2-dimensional singular value decomposition for every set evaluation.

Table 6. Descriptive statistics for analysis of ranking evaluation data. On the top: pair CV frequencies used on the different evaluation sets. On the bottom: marginal frequency distribution for each CV used on the different evaluation sets (RP stands for “Relative Position”).

Download CSV Display Table

Table 7. Qualities of the factor space $X$ and the reason behind its inclusion.

Display Table

Figure 8. Mean precision and mean recall for the CVs classification as suitable or unsuitable, using baseline and experimental models, for every set evaluation.

Mean precision and mean recall for the CVs classification for every set evaluation.

Data Availability Statement

The data that support the findings of this study are openly available in figshare at https://doi.org/10.6084/m9.figshare.25127627 and https://doi.org/10.6084/m9.figshare.25127564.

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

An Empirical Job Matching Model based on Expert Human Knowledge: A Mixed-Methods Approach

Table 1. Distribution of respondents working in Mexico according to data from the Stack Overflow 2023 Developer Survey in segmentation used for maximum variation sampling.

Table 2. Number of professionals to sample according to maximum variation technique through segmentation criteria of years of experience and number of employees in their current company.

Table 3. Number of interviewed professionals organized according to the criteria of the maximum variation sample.

Table 4. Key names of the CVs used in the different sets of the evaluation experiment.

Table 5. Description of the features that each of the CVs used in the evaluation experiment had to meet.

Table 6. Descriptive statistics for analysis of ranking evaluation data. On the top: pair CV frequencies used on the different evaluation sets. On the bottom: marginal frequency distribution for each CV used on the different evaluation sets (RP stands for “Relative Position”).

Table 7. Qualities of the factor space $X$ and the reason behind its inclusion.

Information for

Open access

Opportunities

Help and information

Your download is now in progress and you may close this window

Login or register to access this feature

An Empirical Job Matching Model based on Expert Human Knowledge: A Mixed-Methods Approach

Figures & data

Table 1. Distribution of respondents working in Mexico according to data from the Stack Overflow 2023 Developer Survey in segmentation used for maximum variation sampling.

Table 2. Number of professionals to sample according to maximum variation technique through segmentation criteria of years of experience and number of employees in their current company.

Table 3. Number of interviewed professionals organized according to the criteria of the maximum variation sample.

Table 4. Key names of the CVs used in the different sets of the evaluation experiment.

Table 5. Description of the features that each of the CVs used in the evaluation experiment had to meet.

Table 6. Descriptive statistics for analysis of ranking evaluation data. On the top: pair CV frequencies used on the different evaluation sets. On the bottom: marginal frequency distribution for each CV used on the different evaluation sets (RP stands for “Relative Position”).

Table 7. Qualities of the factor space X and the reason behind its inclusion.

Data Availability Statement

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date

Table 7. Qualities of the factor space $X$ and the reason behind its inclusion.