Statistics Eurostat contract no. 30501.2012.001-2012.452 Explore - - PowerPoint PPT Presentation

statistics
SMART_READER_LITE
LIVE PREVIEW

Statistics Eurostat contract no. 30501.2012.001-2012.452 Explore - - PowerPoint PPT Presentation

Feasibility Study on the Use of Mobile Positioning Data for Tourism Statistics Eurostat contract no. 30501.2012.001-2012.452 Explore the possibilities and limits of using mobile positioning data in the production of tourism statistics Project


slide-1
SLIDE 1

Feasibility Study on the Use of Mobile Positioning Data for Tourism Statistics

Eurostat contract no. 30501.2012.001-2012.452

slide-2
SLIDE 2

Explore the possibilities and limits of using mobile positioning data in the production

  • f tourism statistics

Project time: January 2013 – June 2014 Project website: mobfs.positium.ee

slide-3
SLIDE 3

Main project objectives

  • Assess feasibility to access databases with mobile positioning data in

European countries

  • Assess the feasibility to use mobile positioning data for tourism statistics

in the European context

  • Identify, discuss and address the main challenges for implementation
  • Assess the potential impact on cost-efficiency of data production
  • Assess the possibility to expand the methodology to other domains and

define joint algorithms Can the technology/methodology be applied to the particular case of tourism statistics, across a wide group of countries in a similar way? Can the

  • utcomes be generalised to all countries?
slide-4
SLIDE 4

Project tasks

Task 1: Stock-taking Task 2: Feasibility of Access Task 3a: Methodolgoy Task 3b: Coherence Task 4: Opportunities and Benefits Task 5: Visibility and Consolidated Report Task 6: Project Management

slide-5
SLIDE 5

Task 1: Stock-taking

Map the relevant use cases of mobile positioning data

Official tourism statistics Other official statistics Private initiatives and applications Scientific research

Documented 31 significant cases

slide-6
SLIDE 6

Applications

Research

2002 – Estonia - MPS tracking in urban studies, University of Tartu 2004 – Estonia - CDR data collection, Positium LBS 2005 – Austria – „Graz in real time“, MIT Sensible City Lab 2006 – Portugal – „Socio-Geography of Human Mobility“, Orange Lab 2006 – Italy - „Rome in Real Time“, MIT Sensible City Lab 2009 – France – „Paris Tourism with CDR“, Orange Labs 2009 - Ireland - "Utilising Mobile Phone RSSI Metric...“ University of Ireland Maynooth, IBM Research“ 2009 - Switzerland – „Mobile Data Challenge“, Nokia 2010 – Czech Republic – CE Traffic, traffic analysis 2012, 2013 - Telefonica, Orange – commercial offerings ...

Tourism Statistics 2008 – Estonia – Central Bank started to use mobile data for „Balance of payment calculation“ Positium LBS 2010 – the Netherlands – „Time patterns, geospatial clustering“ Statistics Netherlands 2012 - Czech Republic – Czech Tourism 2014 – Ireland – „Mobile data for tourism Statistics“ The Central Statistics Office Ireland (CSO) ...

slide-7
SLIDE 7

Stock-taking conclusions

Number of projects in tourism statistics increasing Mostly aggregated data in public sectors Longitudinal data in research Some business initiatives, but business models difficult MNOs looking for new revenues

slide-8
SLIDE 8

Task 2: Feasibility of Access

Online survey Interviews with stakeholders Privacy and Regulations (EU & national) Technology Financial and Business barriers Practical Experience on Accessing the Data

slide-9
SLIDE 9

Awareness of possibilities of mobile positioning data

Expectations Better temporal and spatial accuracy New statistical indicators Volumes of travellers, event visitors Duration of trips Travel routes Point of entry Places visited Plausibility checks of tourism data Faster data generation Reduced respondent burden

slide-10
SLIDE 10

Main Obstacles to Access

MNOs Mostly understand the idea, but have concerns with

  • Legal restriction and obligations to provide the data
  • Public opinion and a possible loss of reputation and customers
  • Value for the MNOs if they provide the data
slide-11
SLIDE 11

Regulations

The first and main ‘barrier’ for accessing the data Regulations governing the subject:

  • Data Protection Directive (Directive 1995/46/EC and its successor,

the General Data Protection Regulation)

  • Electronic Privacy Directive (Directive 2002/58/EC)
  • Data Retention Directive (Directive 2006/24/EC)
  • European Statistics Regulation and European Statistics Regulation
  • n tourism statistics (Regulations 223/2009/EC and 692/2011/EU)
  • the opinions of the Article 29 Data Protection Working Party
slide-12
SLIDE 12

For NSIs

National Statistics Act (weak ... strong) Need for harmonised EU regulations on legal frame, methodology, technology, setup

slide-13
SLIDE 13

Threats

Legal - No clear legal framework to access Technological capability - Overall, is not seen as a hard barrier to access Financial and business barriers

  • MNOs expect a mutually beneficial relationship: a) a remuneration scheme or b)

being able to use the resulting data themselves for other (including internal and profit-making) purposes Continuity of data access

  • Major global shift in mobile technology; Changes in the characteristics of the

data; Administrative changes (e.g. changed number of providing MNOs) - Can have positive, negative or unforeseen effects on data quality. It is necessary to remain flexible in methodology and estimation to adapt to changes.

Practical experience on accessing the data from FI, FR, DE and other MNOs across Europe was negative - available data not usable (initial low value aggregates) and too expensive

slide-14
SLIDE 14

Task 3a: Methodology

slide-15
SLIDE 15

Task 3a: Methodology

slide-16
SLIDE 16

Task 3a: Methodology

slide-17
SLIDE 17

Task 3a: Methodology

slide-18
SLIDE 18

Task 3a: Methodology

slide-19
SLIDE 19

Task 3a: Methodology

slide-20
SLIDE 20

Task 3a: Methodology

slide-21
SLIDE 21

Task 3a: Methodology

DATA EXTRACTION FRAME FORMATION DATA COMPILATION ESTIMATION COMBINING

slide-22
SLIDE 22

Tourism Statistics Indicators

Breakdown: Country of residence/place of residence Aggregation of time (day, week, month) Aggregation of space (different level of

  • admin. units, grid)

Duration of trip/stay (length, same- day/overnight) Destination, secondary destination, transit pass-through Collective movement patterns Repeat visits Indicators: Number of trips/visits Number of nights spent Number of days present Duration of trips Number of unique visitors

Many indicators coincide with traditional indicators but lacking several classification aspects that are required for tourism statistics

slide-23
SLIDE 23

Identifying Usual Environment

Limitations due to the lack of data from

  • ther countries.

Not possible to ask. Large differences due to definitions

Using LAU-1 for defining usual environment Using LAU-2 for defining usual environment

slide-24
SLIDE 24

Limitations of the data source

No accommodation Mostly unknown purpose of the trip No expenditure information Mostly unknown means of transportation Usually no socio-demographic breakdown

slide-25
SLIDE 25

Quality

Validity - How well does mobile positioning represent real-world facts? - Looking at the official definitions

  • Minor differences with likely negligible effect.
  • Main issue with definition of ‘usual environment’

Accuracy: Coverage, measurement and processing

  • Over- and under-coverage of aspects like:
  • Mobile phone not used; more than one mobile device; visitors not actually crossing the

border, etc.

  • Problems are inherent in mobile usage data
  • Missing values, incorrect formats, duplicated data - not more

problematic than other data sources

  • Defining usual environment

Comparability: Over time

  • Depends on changes in data quality
  • Depends on changes in the telecommunication market (e.g. cost of

calls/SMS, emerging of new MNOs, merging of MNOs)

slide-26
SLIDE 26

Methodology: Conclusions

Quality assessment relies heavily on existing external information No easy estimation as no reliable reference data Indicators do not comply to requirements

  • f the regulation (692/2011) fully

Longitudinal data required Coverage issues most important

slide-27
SLIDE 27

Task 3b: Coherence

Tourism Domain Mobile Positioning Data Reference (Mirror) Statistics Combined inbound and outbound tourism Total trips Inbound+outbound Ferry passengers Inbound tourism Total trips Total trips Demand Statistics (FI) Border Control (EE) Overnight trips Overnight trips Demand Statistics (FI) Supply Statistics (EE) Same-day trips Same-day trips Demand Statistics (FI) Nights spent on overnight trips Overnight trips Supply Statistics (EE) Outbound tourism Total trips Total trips Demand Statistics (EE) Border Interview (FI) Overnight trips Overnight trips Demand Statistics (EE) Supply Statistics (EU) Same-day trips Same-day trips Not available (begins 2014) Domestic tourism Demand A Total trips Total trips Demand Statistics (EE) Overnight trips Overnight trips Demand Statistics (EE) Supply Statistics Same-day trips Same-day trips Not available (begins 2018)

slide-28
SLIDE 28

Very good

50 000 100 000 150 000 200 000 250 000 300 000 350 000 Jan-09 Mar-09 May-09 Jul-09 Sep-09 Nov-09 Jan-10 Mar-10 May-10 Jul-10 Sep-10 Nov-10 Jan-11 Mar-11 May-11 Jul-11 Sep-11 Nov-11 Jan-12 Mar-12 May-12 Jul-12 Sep-12 Nov-12

MOB_IN(EU-27)_OVERNIGHT SUPPLY_EE(EU-27)_ARR

Inbound Overnight Trips: Accommodation Statistics Inbound + Outbound: Ferry passengers, FI <-> EE

slide-29
SLIDE 29

Moderate

50 000 100 000 150 000 200 000 250 000 300 000 350 000 400 000 450 000 500 000 Q1-09 Q2-09 Q3-09 Q4-09 Q1-10 Q2-10 Q3-10 Q4-10 Q1-11 Q2-11 Q3-11 Q4-11 Q1-12 Q2-12 Q3-12 Q4-12

MOB_OUT(EU-27)_OVERNIGHT DEMAND_EE(EU-27)_OVERNIGHT

Outbound Overnight Trips: Demand Statistics, EE>EU27

slide-30
SLIDE 30

Low Coherence

20 000 40 000 60 000 80 000 100 000 120 000 140 000 160 000 180 000 Jan-09 Mar-09 May-09 Jul-09 Sep-09 Nov-09 Jan-10 Mar-10 May-10 Jul-10 Sep-10 Nov-10 Jan-11 Mar-11 May-11 Jul-11 Sep-11 Nov-11 Jan-12 Mar-12 May-12 Jul-12 Sep-12 Nov-12

MOB_EE(RU) BORDCONT_EE(RU)

Inbound Overnight Trips: Border Control, RU>EE

slide-31
SLIDE 31

Task 4: Opportunities & Benefits

Completeness

No complete coverage of any sector relevant for tourism statistics  No replacement of traditional sources

Timeliness

Full integration and automatisation  Much quicker than traditional sources

Validity

No specific advantages/disadvantages

Accuracy

Advantages over traditional sources (smaller sampling error, no memory gaps). ‘Usual environment’ needs redefining

Consistency

High grade of consistency compared to traditional sources.

Resolution

Finer granulation of space and time  new possibilities (again, ‘usual environment’ needs redefining) Basis for assessment: Regulation (EU) 692/2011

At present, mobile positioning data cannot replace traditional sources of tourism statistics but could deliver additional information … 1. Quick indicators (key tourism statistics indicators faster than today) 2. Finer spatial and timely resolution than possible today 3. Source of calibration for traditional sources (to quantify bias)

slide-32
SLIDE 32

Findings: Cost

Example country with a population of 16 million, 3 MNOs (10M, 5M, 1M subscribers), 15-day latency.

Data processing carried out by MNOs Data processing carried out by NSI Figures in ,000 EUR

1. High implementation costs – low annual running cost 2. Processing within the NSI less costly than when done at the MNOs

slide-33
SLIDE 33

Findings: Synergies

Analysis has shown that specific opportunities can be found with regard to

  • 1. The Balance of Payments Statistics
  • 2. Transport Statistics
  • 3. Population statistics: migration and commuting

statistics But more possibilities seen also in other domains (non-

  • fficial)
slide-34
SLIDE 34

Strengths and weaknesses of mobile positioning data

  • Very good consistency
  • Superior coverage compared

to supply statistics

  • Breakdowns by region and

nationality

  • Various quantitative criteria

for definitions

  • Improved timeliness
  • Automation level of statistical

production

  • Possible positive cost effects
  • Pan-European travel network

statistics

  • Access/continuity of access
  • No information on the purpose,

expenditure, means of transport

  • Bias between some

classifications (e.g. same- day/overnight)

  • Possible misclassification of

actual tourism events

  • Over- and under-coverage issues

concerning the phone usage patterns

  • Difficulty to assess the accuracy
  • f data as mobile phone usage
  • n travel is unknown
slide-35
SLIDE 35

Thank You!

mobfs.positium.ee