Using big data & AI for identifying Labour market information in - - PowerPoint PPT Presentation

using big data ai for identifying labour market
SMART_READER_LITE
LIVE PREVIEW

Using big data & AI for identifying Labour market information in - - PowerPoint PPT Presentation

Using big data & AI for identifying Labour market information in Austria Claudia Plaimauer 3s Unternehmensberatung GmbH www.3s.co.at ILO Workshop Geneva, 19-20/09/2019 Our incentives for using big data & AI Save time and money;


slide-1
SLIDE 1

Using big data & AI for identifying Labour market information in Austria

Claudia Plaimauer 3s Unternehmensberatung GmbH www.3s.co.at ILO Workshop

Geneva, 19-20/09/2019

slide-2
SLIDE 2

2

  • Save time and money;
  • Gain a little more independence from traditional data collections;
  • Expand our possibilities of data analysis e.g. when
  • Monitoring national labour market demand;
  • Analysing occupational skills profiles;
  • Generating input for the maintenance of labour market

taxonomies.

Our incentives for using big data & AI

3s Unternehmensberatung, www.3s.co.at

slide-3
SLIDE 3

3

  • Test AI methods for
  • normalising free text survey results;
  • processing online vacancies;
  • validating occupational skills profiles;
  • maintaining labour market taxonomies.
  • Support Textkernel in setting up Jobfeed AT

(www.jobfeed.com/at/home.php), a big data platform for systematically querying Austria‘s online job market.

  • Conduct annual analysis of Austrian online vacancy market since

survey year 2016.

  • Use Jobfeed-based analysis to inform curriculum development;
  • Investigate potential of automation for a comparison of European

VET qualifications.

Preliminary project experience, e.g.

3s Unternehmensberatung, www.3s.co.at

slide-4
SLIDE 4

Testing AI methods and big data in taxonomy management

Goal 1: Validation of taxonomy terms

  • Are the taxonomy terms chosen to represent occupational

requirements (‚skills‘) actually used in vacancies?

  • Which formats occur frequently in vacancies, which rarely, or

never? Goal 2: Amendment of taxonomy

  • Are skills terms commonly used in vacancies missing?
  • Is there any indication that skills concepts are missing?

Goal 3: Does the current format of the taxonomy in any way impede its application in NLP?

4 3s Unternehmensberatung, www.3s.co.at

slide-5
SLIDE 5

3s Unternehmensberatung, www.3s.co.at

The Austrian PES’ central LM taxonomies

Occupational taxonomy ‚Skills‘ taxonomy

__Occupations

__ 1999/2000- __ 13.500+ concepts __ 84.000+ terms

__Occ. requirements

__ 2000/2001-

__ 17.500+ concepts __ 29.000+ terms

__Goal: Comprehensiveness, high actuality, clarity, descriptiveness,

uniformity, proximity to everyday language; __ Structure: Thesaurus & taxonomy; __ Usage context: Labour market information / matching / research.

slide-6
SLIDE 6

3s Unternehmensberatung, www.3s.co.at

Method used for validating ‘skills’ terms

29,000+ ‚skills‘ designations from PES‘ taxonomy Requirement descriptions taken from 850,000+ vacancies Text string matching Frequency counts

slide-7
SLIDE 7

3s Unternehmensberatung, www.3s.co.at

Insights from validating ‘skills’ terms

1. Clear, descriptive, consistent and unambiguous taxonomy terms have a format that doesn‘t fully correspond to the language used in vacancies: 56% of our ‚skills‘ terms never appeared in job ads. 2. The longer the taxonomy term the less frequently it is used in vacancies. 3. Preferred (PT) and non-preferred (NPT) taxonomy terms mostly show the same frequency distribution. 4. Some naming strategies are a hindrance in NLP and should be avoided, e.g. excessive contextualisation (‚Use 3-dimensional printing in car construction‘), or explanatory adjuncts in parentheses (‚determination of optimum selling price (basics)‘).

slide-8
SLIDE 8

Mix of methods used for supplementing the ‚skills‘ thesaurus

Automated methods:

  • Key word extraction;
  • Frequency counts;
  • Data cleansing (detection of

spelling variants, declensions and typing errors);

  • Key word classification;
  • Text string matching;
  • Co-occurrence analysis.

Editorial methods:

  • Supplemental investigations to

clarify content and use;

  • Semantic analysis;
  • Terminology control;
  • Conceptual modeling.

8

3s Unternehmensberatung, www.3s.co.at

slide-9
SLIDE 9

3s Unternehmensberatung, www.3s.co.at

Insights from supplementing ‘skills’ terms

1. The Austrian ‘skills’ taxonomy is fairly comprehensive: only 1900+ words/phrases/text segments were identified as missing (of these only approx. 900 specialist skills); in the end 97 new specialist skill concepts and a total of 635 new specialist skill terms were supplemented. 2. Text mining is a highly effective method for identifying evidence- based amendment needs for thesauri, but it comes at a price. 3. Automated skills mining delivers amendment candidates only; considerable additional human processing is necessary to fully integrate new terms, or even concepts. 4. In order to improve its applicability in NLP our ‘skills’ taxonomy should also include formats predominately found in vacancy text, at least as hidden search terms. 5. Certain naming strategies should be avoided to improve the taxonomies applicability in NLP.

slide-10
SLIDE 10

Thank you for your attention!

Claudia Plaimauer 3s Unternehmensberatung Wiedner Hauptstraße 18 1040 Vienna, Austria Tel +43-1-5850915/33 plaimauer@3s.co.at