Description

Feature | Name | Description | Form¹ Stat² Text³
from_education_field | From Education Field | ISCED FoET classification of the free-text qualification response for the initial education field. | Derivation
to_education_field | To Education Field | ISCED FoET classification of the free-text qualification response for the subsequent education field. | Derivation
enrollment_year | Enrollment Year | Year of enrollment. | Derivation
graduation_year | Graduation Year | Year of graduation. | Derivation
study_years | Study Years | Number of years spent studying for a specific qualification. | Derivation
country | Country | Country of residence. | Derivation
age_group1 | Broad Age Group | Broad age groups with three categories. | Derivation
age_group2 | Narrow Age Group | Narrow age groups with five categories. | Derivation
gender | Gender | Female, male or missing value. | Derivation
responses | Responses | Number of responses with a particular combination of values for the above variables. | Derivation
¹ Variable derives directly from a Europass CV;
² Variable is a statistical transformation of one or more Europass CV variables;
³ Variable is the result of information retrieval using text mining.

Summary Statistics

from_education_field

Feature Result
Variable type factor (nominal)
Number of missing values* 0 ( 0.00% )
Number of unique values 116
* Missing when the features required for estimation are absent, or when the classification algorithm could not identify a field.

to_education_field

Feature Result
Variable type factor (nominal)
Number of missing values* 0 ( 0.00% )
Number of unique values 116
* Missing when the features required for estimation are absent, or when the classification algorithm could not identify a field.

enrollment_year

Feature Result
Variable type ordered (ordinal)
Number of missing values* 0 ( 0.00% )
Number of unique values 63
* Education.Period.From.Year missing.

graduation_year

Feature Result
Variable type ordered (ordinal)
Number of missing values* 257644 ( 18.40% )
Number of unique values 62
* Education.Period.To.Year missing.

study_years

Feature Result
Variable type numeric
Number of missing values* 31607 ( 2.26% )
Number of unique values 16
Min. 1
1st Qu. 1
Median 2
Mean 3.08
3rd Qu. 4
Max. 15
NA’s 31607
* Education.Period.From.Year missing, or value less than 0 years, or value more than 15 years.
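The missing-value rule in the footnote above can be sketched as follows. This is a minimal illustration, not the documented derivation: it assumes the duration is the difference between the graduation and enrollment years, and it assumes a missing end year also yields a missing value. The source states neither the formula nor how ongoing studies are handled, and the function name `study_years` and the constant `MAX_STUDY_YEARS` are hypothetical.

```python
from typing import Optional

MAX_STUDY_YEARS = 15  # upper bound taken from the footnote


def study_years(from_year: Optional[int], to_year: Optional[int]) -> Optional[int]:
    """Return the study duration in years, or None when it counts as missing."""
    # A missing start year makes the value missing (stated in the footnote).
    # Treating a missing end year the same way is an assumption of this sketch.
    if from_year is None or to_year is None:
        return None
    duration = to_year - from_year  # assumed formula, not confirmed by the source
    # Durations below 0 or above 15 years are treated as missing.
    if duration < 0 or duration > MAX_STUDY_YEARS:
        return None
    return duration
```

For example, `study_years(2010, 2013)` gives 3, while `study_years(2013, 2010)` and `study_years(1990, 2010)` both give `None` because they fall outside the 0 to 15 year range.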

country

Feature Result
Variable type factor (nominal)
Number of missing values* 69763 ( 4.98% )
Number of unique values 178
* SkillsPassport…Country.Code missing.

age_group1

Feature Result
Variable type ordered (ordinal)
Number of missing values* 470706 ( 33.62% )
Number of unique values 4
* SkillsPassport…Birthdate.Year missing, or over 65 years old.

age_group2

Feature Result
Variable type ordered (ordinal)
Number of missing values* 469043 ( 33.50% )
Number of unique values 6
* SkillsPassport…Birthdate.Year missing.

gender

Feature Result
Variable type factor (nominal)
Number of missing values* 607590 ( 43.40% )
Number of unique values 3
* SkillsPassport…Gender.Code missing.
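The figures reported in these tables (missing count, missing percentage, number of unique values) could be reproduced with a sketch like the one below. It rests on one inference that the source does not state outright: the unique-value counts appear to include the missing value as its own level, since three age categories are reported as 4 unique values, five as 6, and gender (female, male, missing) as 3. The function name `summarize` and the sample data are hypothetical.

```python
from typing import Any, Dict, Sequence


def summarize(values: Sequence[Any]) -> Dict[str, Any]:
    """Compute missing count, missing percentage and unique-value count.

    None stands for a missing value. It is counted as one of the unique
    values, which is an assumption inferred from the reported figures.
    """
    total = len(values)
    missing = sum(1 for v in values if v is None)
    percent = round(100.0 * missing / total, 2) if total else 0.0
    unique = len(set(values))  # None, if present, counts as one level
    return {"missing": missing, "missing_percent": percent, "unique": unique}


# Illustrative data only, not taken from the dataset.
sample = ["F", "M", None, "F"]
summary = summarize(sample)
```

With the illustrative sample above, one of four values is missing, so the summary reports 1 missing value (25.0%) and 3 unique values.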

responses

Feature Result
Variable type integer
Number of missing values* 0 ( 0.00% )
Number of unique values 64
Min. 1
1st Qu. 1
Median 1
Mean 1.23
3rd Qu. 1
Max. 151
* No missing values permitted.
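Since `responses` is described as the number of responses sharing a particular combination of the other variables, it behaves like a group-by count. The sketch below shows that idea on plain dictionaries. The record layout, the function name `count_responses` and the sample rows are assumptions for illustration, and the actual aggregation procedure is not given in the source.

```python
from collections import Counter
from typing import Any, Dict, Iterable, List

# Grouping variables, as listed in the description table.
KEYS = (
    "from_education_field", "to_education_field", "enrollment_year",
    "graduation_year", "study_years", "country",
    "age_group1", "age_group2", "gender",
)


def count_responses(records: Iterable[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Collapse individual records into one row per combination of KEYS."""
    # Missing keys become None, so missing values form their own group.
    counts = Counter(tuple(r.get(k) for k in KEYS) for r in records)
    return [dict(zip(KEYS, combo), responses=n) for combo, n in counts.items()]


# Illustrative records only, not taken from the dataset.
records = [
    {"from_education_field": "Law", "to_education_field": "Economics", "country": "EL"},
    {"from_education_field": "Law", "to_education_field": "Economics", "country": "EL"},
    {"from_education_field": "Law", "to_education_field": "Economics", "country": "IT"},
]
rows = count_responses(records)
```

Every output row has a count of at least 1, which matches the reported minimum of 1 and the note that no missing values are permitted for this variable.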