Description

Feature Name Description Form¹ Stat² Text³
locale CV Language Language used to write CV. Derivation
country Country Country of residence. Derivation
birth_year Birth Year Year of birth. Derivation
gender Gender Female, male or missing value. Derivation
nationality Nationality Nationality. Derivation
mother_tongue Mother Tongue Native language. Derivation
headline_job Headline Job Job-related type of application. Derivation
headline_isco Headline ISCO 3 ISCO level 3 classification for job declared in the type of application. Derivation
latest_job_esco Latest Job ESCO ESCO classification for latest job declared. Derivation
latest_job_isco Latest Job ISCO 3 ISCO level 3 classification for latest job declared. Derivation
is_employed Employment Status Estimation of employment status based on recruitment and termination year. Derivation
num_jobs Number of Jobs Number of jobs declared. Derivation
total_work_years Total Work Years Sum of all work years. Derivation
min_work_years Min Work Years Minimum number of years spent in a single job. Derivation
max_work_years Max Work Years Maximum number of years spent in a single job. Derivation
mean_work_years Mean Work Years Average number of years spent in a single job. Derivation
eqf_completed EQF Completed Highest qualification completed with respect to EQF level (excluding ongoing). Derivation
eqf_highest EQF Highest Highest qualification reported with respect to EQF level (including ongoing). Derivation
is_student Student Status Estimation of student status based on enrollment and graduation year. Derivation
studied_abroad Studied Abroad Estimation of whether or not at least one qualification was granted outside country of origin. Derivation
respondents Respondents Number of respondents with a particular combination of values for the above variables. Derivation
¹ Variable derives directly from a Europass CV.
² Variable is a statistical transformation of one or more Europass CV variables.
³ Variable is the result of information retrieval using text mining.

Summary Statistics

locale

Feature Result
Variable type character (nominal)
Number of missing values* 0 ( 0.00% )
Number of unique values 29
* SkillsPassport.Locale missing.

country

Feature Result
Variable type character (nominal)
Number of missing values* 13008 ( 3.97% )
Number of unique values 185
* SkillsPassport…Country.Code missing.

birth_year

Feature Result
Variable type character (nominal)
Number of missing values* 128709 ( 39.27% )
Number of unique values 84
* SkillsPassport…Birthdate.Year missing.

gender

Feature Result
Variable type character (nominal)
Number of missing values* 149959 ( 45.76% )
Number of unique values 3
* SkillsPassport…Gender.Code missing.
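
The counts in these tables appear to treat the missing marker as a value of its own: gender reports three unique values (female, male, missing), and the logical variables further down also report three. A minimal Python sketch of such a summary follows, assuming None stands for a missing value. The function name summarize_nominal is hypothetical and is not part of the original processing.

```python
def summarize_nominal(values):
    """Return (n_missing, pct_missing, n_unique) for a list of values.

    None marks a missing value. Assumption: the missing marker is counted
    as one distinct value, which matches the gender table (3 unique values)
    but is not stated explicitly in the source.
    """
    n_total = len(values)
    n_missing = sum(v is None for v in values)
    pct_missing = round(100.0 * n_missing / n_total, 2) if n_total else 0.0
    n_unique = len(set(values))  # None is included in the set if present
    return n_missing, pct_missing, n_unique


# Illustrative data only, not taken from the dataset.
gender = ["F", "M", None, "F", None, "M", "F", "M"]
print(summarize_nominal(gender))
```

The same reading would explain why num_jobs ranges from 1 to 13 yet lists 14 unique values.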

nationality

Feature Result
Variable type character (nominal)
Number of missing values* 162649 ( 49.63% )
Number of unique values 171
* SkillsPassport…Nationality.Code missing.

mother_tongue

Feature Result
Variable type character (nominal)
Number of missing values* 63005 ( 19.22% )
Number of unique values 63
* Skills.Linguistic.MotherTongue.Description missing.

headline_job

Feature Result
Variable type character (nominal)
Number of missing values* 200192 ( 61.08% )
Number of unique values 4
* SkillsPassport…Headline.Type.Code missing.

headline_isco

Feature Result
Variable type character (nominal)
Number of missing values* 222153 ( 67.78% )
Number of unique values 126
* Missing features required for estimation, or unidentified from classification algorithm.

latest_job_esco

Feature Result
Variable type character (nominal)
Number of missing values* 60425 ( 18.44% )
Number of unique values 2865
* Missing features required for estimation, or unidentified from classification algorithm.

latest_job_isco

Feature Result
Variable type character (nominal)
Number of missing values* 60425 ( 18.44% )
Number of unique values 126
* Missing features required for estimation, or unidentified from classification algorithm.

is_employed

Feature Result
Variable type logical
Number of missing values* 24110 ( 7.36% )
Number of unique values 3
* No work experience entries filled, or both WorkExperience.Period.From.Year and WorkExperience.Period.To.Year missing.
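
The source says only that employment status is estimated from recruitment and termination years and does not give the rule. One plausible reading is sketched below in Python, purely as an assumption: a job with a start year and no end year is treated as ongoing. The name estimate_is_employed and the (from_year, to_year) tuple layout are hypothetical.

```python
from typing import Optional, Sequence, Tuple

# One work experience entry as (from_year, to_year); either may be None.
Period = Tuple[Optional[int], Optional[int]]


def estimate_is_employed(periods: Sequence[Period]) -> Optional[bool]:
    """Return True/False, or None when the status cannot be estimated."""
    # Entries with both years missing carry no information.
    usable = [(f, t) for f, t in periods if f is not None or t is not None]
    if not usable:
        # Mirrors the footnote: no entries filled, or both years missing.
        return None
    # Assumption (not confirmed by the source): a start year without an
    # end year means the job is still held.
    return any(f is not None and t is None for f, t in usable)


print(estimate_is_employed([(2010, 2014), (2015, None)]))
print(estimate_is_employed([(None, None)]))
```

Returning None rather than False for uninformative entries keeps three possible outcomes, in line with the three unique values reported for this variable.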

num_jobs

Feature Result
Variable type integer
Number of missing values* 24110 ( 7.36% )
Number of unique values 14
Min. 1
1st Qu. 2
Median 4
Mean 4.22
3rd Qu. 6
Max. 13
NA’s 24110
* No work experience entries filled.

total_work_years

Feature Result
Variable type numeric
Number of missing values* 33008 ( 10.07% )
Number of unique values 173
Min. 1
1st Qu. 3
Median 6
Mean 9.41
3rd Qu. 12
Max. 312
NA’s 33008
* No work experience entries filled, or both WorkExperience.Period.From.Year and WorkExperience.Period.To.Year missing.

min_work_years

Feature Result
Variable type numeric
Number of missing values* 33008 ( 10.07% )
Number of unique values 52
Min. 1
1st Qu. 1
Median 1
Mean 1.64
3rd Qu. 1
Max. 55
NA’s 33008
* No work experience entries filled, or both WorkExperience.Period.From.Year and WorkExperience.Period.To.Year missing.

max_work_years

Feature Result
Variable type numeric
Number of missing values* 33008 ( 10.07% )
Number of unique values 54
Min. 1
1st Qu. 1
Median 3
Mean 4.43
3rd Qu. 6
Max. 57
NA’s 33008
* No work experience entries filled, or both WorkExperience.Period.From.Year and WorkExperience.Period.To.Year missing.

mean_work_years

Feature Result
Variable type numeric
Number of missing values* 33008 ( 10.07% )
Number of unique values 246
Min. 1
1st Qu. 1
Median 1.5
Mean 2.56
3rd Qu. 2.8
Max. 55
NA’s 33008
* No work experience entries filled, or both WorkExperience.Period.From.Year and WorkExperience.Period.To.Year missing.
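
The four work-years variables (total, min, max, mean) summarise the per-job durations of one CV. The Python sketch below shows how such figures could be obtained, under assumptions the source does not confirm: a duration is To.Year minus From.Year, floored at 1 (chosen only because no value below 1 appears in the tables), open-ended jobs run up to a reference year, and entries without a start year are skipped. The name work_year_stats and the reference_year parameter are hypothetical.

```python
def work_year_stats(periods, reference_year):
    """Summarise job durations for one CV.

    periods: list of (from_year, to_year) tuples; either year may be None.
    Returns a dict with total, min, max and mean, or None if no duration
    can be computed.
    """
    durations = []
    for from_year, to_year in periods:
        if from_year is None:
            continue  # assumption: no start year, no usable duration
        end = to_year if to_year is not None else reference_year
        durations.append(max(1, end - from_year))  # assumed floor of 1 year
    if not durations:
        return None
    return {
        "total": sum(durations),
        "min": min(durations),
        "max": max(durations),
        "mean": round(sum(durations) / len(durations), 2),
    }


# Illustrative data only: three jobs, the last one still ongoing.
jobs = [(2010, 2013), (2013, 2013), (2014, None)]
print(work_year_stats(jobs, reference_year=2016))
```

Summing durations without checking for overlapping jobs is one way a total can exceed the length of a career, which may be relevant to the very large maximum reported for total_work_years.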

eqf_completed

Feature Result
Variable type character (nominal)
Number of missing values* 89124 ( 27.19% )
Number of unique values 9
* Missing features required for estimation, or unidentified from classification algorithm.

eqf_highest

Feature Result
Variable type character (nominal)
Number of missing values* 99148 ( 30.25% )
Number of unique values 9
* Missing features required for estimation, or unidentified from classification algorithm.

is_student

Feature Result
Variable type logical
Number of missing values* 16917 ( 5.16% )
Number of unique values 3
* No education entries filled, or both Education.Period.From.Year and Education.Period.To.Year missing.

studied_abroad

Feature Result
Variable type logical
Number of missing values* 196638 ( 60.00% )
Number of unique values 3
* No education entries filled, or SkillsPassport…Nationality.Code missing, or Education.Organisation…Country.Code missing from all entries.

respondents

Feature Result
Variable type integer
Number of missing values* 0 ( 0.00% )
Number of unique values 88
Min. 1
1st Qu. 1
Median 1
Mean 1.08
3rd Qu. 1
Max. 401
* No missing values permitted.
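
Because each row stands for a combination of values and respondents gives how many people share it, person-level figures need to weight rows by respondents rather than count rows. A minimal Python sketch follows, assuming rows are held as dictionaries keyed by feature name. The function name weighted_share and the sample rows are hypothetical.

```python
def weighted_share(rows, feature, value):
    """Share of respondents whose `feature` equals `value`.

    Each row is a dict with the feature columns plus an integer
    'respondents' count, as described in the feature table.
    """
    total = sum(row["respondents"] for row in rows)
    if total == 0:
        return 0.0
    matching = sum(row["respondents"] for row in rows if row[feature] == value)
    return matching / total


# Illustrative rows only, not taken from the dataset.
rows = [
    {"gender": "F", "respondents": 3},
    {"gender": "M", "respondents": 1},
    {"gender": None, "respondents": 4},
]
print(weighted_share(rows, "gender", "F"))
```

Counting rows instead would give each combination equal weight, so a row with 401 respondents would count the same as a row with one.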