Description

Feature Name Description Form1 Stat2 Text3
skill_type Skill Category Broad category of a classified skill response. Derivation
skill_value Skill Title ESCO classification for skill free-text or drop down response. Derivation
locale CV Language Language used to write CV. Derivation
country Country Country of residence. Derivation
age_group1 Broad Age Group Broad age groups with three categories. Derivation
age_group2 Narrow Age Group Narrow age groups with five categories. Derivation
gender Gender Female, male or missing value. Derivation
nationality Nationality Nationality. Derivation
mother_tongue Mother Tongue Native language. Derivation
latest_job_isco1 Latest Job ISCO 1 ISCO level 1 classification for latest job declared. Derivation
latest_job_isco2 Latest Job ISCO 2 ISCO level 2 classification for latest job declared. Derivation
latest_job_isco3 Latest Job ISCO 3 ISCO level 3 classification for latest job declared. Derivation
total_work_years Total Work Years Sum of all work years. Derivation
responses Responses Number of responses with a particular combination of values for the above variables. Derivation
1 Variable derives directly from a Europass CV;
2 Variable is a statistical transformation of one or more Europass CV variables;
3 Variable is a result of information retrieval using text mining

Summary Statistics

skill_type

Feature Result
Variable type factor (nominal)
Number of missing values* 0 ( 0.00% )
Number of unique values 7
* No missing values permitted.

skill_value

Feature Result
Variable type factor (nominal)
Number of missing values* 0 ( 0.00% )
Number of unique values 8066
* No missing values permitted.

locale

Feature Result
Variable type factor (nominal)
Number of missing values* 0 ( 0.00% )
Number of unique values 29
* SkillsPassport.Locale missing.

country

Feature Result
Variable type factor (nominal)
Number of missing values* 46517 ( 6.00% )
Number of unique values 183
* SkillsPassport…Country.Code missing.

age_group1

Feature Result
Variable type ordered (ordinal)
Number of missing values* 252025 ( 32.53% )
Number of unique values 4
* SkillsPassport…Birthdate.Year missing, or over 65 yea
rs old.

age_group2

Feature Result
Variable type ordered (ordinal)
Number of missing values* 250790 ( 32.37% )
Number of unique values 6
* SkillsPassport…Birthdate.Year missing.

gender

Feature Result
Variable type factor (nominal)
Number of missing values* 317736 ( 41.02% )
Number of unique values 3
* SkillsPassport…Gender.Code missing.

nationality

Feature Result
Variable type factor (nominal)
Number of missing values* 356501 ( 46.02% )
Number of unique values 162
* SkillsPassport…Nationality.Code missing.

mother_tongue

Feature Result
Variable type factor (nominal)
Number of missing values* 115805 ( 14.95% )
Number of unique values 63
* Skills.Linguistic.MotherTongue.Description missing.

latest_job_isco1

Feature Result
Variable type factor (nominal)
Number of missing values* 74666 ( 9.64% )
Number of unique values 11
* Missing features required for estimation, or unidentifi
ed from classification algorithm.

latest_job_isco2

Feature Result
Variable type factor (nominal)
Number of missing values* 74666 ( 9.64% )
Number of unique values 43
* Missing features required for estimation, or unidentifi
ed from classification algorithm.

latest_job_isco3

Feature Result
Variable type factor (nominal)
Number of missing values* 74666 ( 9.64% )
Number of unique values 126
* Missing features required for estimation, or unidentifi
ed from classification algorithm.

total_work_years

Feature Result
Variable type ordered (ordinal)
Number of missing values* 36400 ( 4.70% )
Number of unique values 6
* No work experience entries filled, or both WorkExperien
ce.Period.From.Year and WorkExperience.Period.To.Year m
issing.

responses

Feature Result
Variable type integer
Number of missing values* 0 ( 0.00% )
Number of unique values 323
Min. 1
1st Qu. 1
Median 1
Mean 1.78
3rd Qu. 1
Max. 3093
* No missing values permitted.