Description

Feature Name Description Form1 Stat2 Text3
term Term Unigram or bigram of keywords in a free-text response. Derivation
eqf_granted EQF Granted EQF classification of a free-text qualification response. Derivation
institution_country Institution Country Country of instutution where qualification was granted. Derivation
enrollment_year Enrollment Year Year of enrollment. Derivation
graduation_year Graduation Year Year of graduation. Derivation
locale CV Language Language used to write CV. Derivation
country Country Country of residence. Derivation
birth_year Birth Year Year of birth. Derivation
gender Gender Female, male or missing value. Derivation
responses Responses Number of responses with a particular combination of values for the above variables. Derivation
1 Variable derives directly from a Europass CV;
2 Variable is a statistical transformation of one or more Europass CV variables;
3 Variable is a result of information retrieval using text mining

Summary Statistics

term

Feature Result
Variable type factor (nominal)
Number of missing values* 0 ( 0.00% )
Number of unique values 79516
* No missing values permitted.

eqf_granted

Feature Result
Variable type character (nominal)
Number of missing values* 2654084 ( 39.06% )
Number of unique values 9
* Missing features required for estimation, or unidentifi
ed from classification algorithm.

institution_country

Feature Result
Variable type character (nominal)
Number of missing values* 1398208 ( 20.58% )
Number of unique values 189
* Education.Organisation…Country.Code missing.

enrollment_year

Feature Result
Variable type character (nominal)
Number of missing values* 531491 ( 7.82% )
Number of unique values 76
* Education.Period.From.Year missing.

graduation_year

Feature Result
Variable type character (nominal)
Number of missing values* 1623268 ( 23.89% )
Number of unique values 72
* Education.Period.To.Year missing.

locale

Feature Result
Variable type character (nominal)
Number of missing values* 0 ( 0.00% )
Number of unique values 29
* SkillsPassport.Locale missing.

country

Feature Result
Variable type character (nominal)
Number of missing values* 341258 ( 5.02% )
Number of unique values 182
* SkillsPassport…Country.Code missing.

birth_year

Feature Result
Variable type character (nominal)
Number of missing values* 2215112 ( 32.60% )
Number of unique values 83
* SkillsPassport…Birthdate.Year missing.

gender

Feature Result
Variable type character (nominal)
Number of missing values* 2846659 ( 41.89% )
Number of unique values 3
* SkillsPassport…Gender.Code missing.

responses

Feature Result
Variable type integer
Number of missing values* 0 ( 0.00% )
Number of unique values 354
Min. 1
1st Qu. 1
Median 1
Mean 1.31
3rd Qu. 1
Max. 894
* No missing values permitted.