Analytics in Revenue Daniel Sinnott Chief Analytics Officer, - - PowerPoint PPT Presentation

analytics in revenue
SMART_READER_LITE
LIVE PREVIEW

Analytics in Revenue Daniel Sinnott Chief Analytics Officer, - - PowerPoint PPT Presentation

Analytics in Revenue Daniel Sinnott Chief Analytics Officer, Revenue What I will cover this morning Governance & infrastructure Organisational set-up Capability development Data sources Analytical approaches


slide-1
SLIDE 1

Analytics in Revenue

Daniel Sinnott Chief Analytics Officer, Revenue

slide-2
SLIDE 2

What I will cover this morning

  • Data sources
  • Analytical methods
  • Governance & infrastructure
  • Capability development

Organisational set-up Analytical approaches

  • Data quality & representativeness
  • Natural taxation

Challenges &

  • pportunities

Theme: Just because it’s quantitative, doesn’t mean it’s informative!

slide-3
SLIDE 3

Types of operational data use in Revenue

Hypothesis-based Rules-based Analytics-based

Data query tools Targeted projects REAP SNA Anomaly detection Predictive modelling RCTs

Increasing complexity More reliant

  • n data

More reliant on experts

slide-4
SLIDE 4

Organisational set-up

slide-5
SLIDE 5

Strong governance ensures IT, operations, and analytics work together effectively

5

IT Analytics Operations Revenue Analytics Group

Outputs that are technically robust, statistically sound, and operationally useful

slide-6
SLIDE 6

Data processes & warehouse designed specifically for analytics

  • Metadata key to realising value from

diverse data-holdings – Full tracking of data lineage in place – Populate metadata as tables are created – Working with dev teams to ensure metadata is created at source where possible

  • Software platform meets specific

needs of analytics function – Performance & reliability – Handles unstructured and semi- structured data – Access to a wide range of tools for data exploration and modelling – Strong data governance

slide-7
SLIDE 7

Developing capabilities in-house

7

Identify and Recruit: ‐ Seek out suitable talent in–house ‐ Look for enthusiasm, and a background in natural or social sciences Develop and Retain: ‐ Focus on developing programming skills ‐ Blend online and classroom training ‐ Provide diverse opportunities

slide-8
SLIDE 8

Analytical approaches

slide-9
SLIDE 9

Overview of Revenue Data Sources

  • Tax returns
  • Intervention
  • utcomes
  • Filing behaviour
  • Registrations
  • Payments
  • Automatic exchange
  • f information:
  • Income & assets
  • Breakdown of

corporate activities

  • Government bodies
  • Banks
  • Merchant acquirers
  • Letting agents
  • General

requirements – e.g., Form 46G

  • Phone calls
  • Emails
  • Letters
  • Case notes
  • Tax rulings
  • Spontaneous

exchanges

  • Sundry other (eg.

Panama Papers)

  • Suspicious

Transaction Reports

  • Good Citizen

Reports

Structured Unstructured Internal External - Domestic External - Foreign

Revenue draws in millions of records annually – only selected sources shown here

slide-10
SLIDE 10

Our ideal project: Models supervised by past intervention outcomes

Deploy Validate Train But case selection process may introduce substantial bias…

  • Work recommended

cases and review model performance

  • Integrate model into

Revenue case selection process

  • Currently used for VAT

& PAYE repayments; three new models ready for testing

x y x y

Yield > €5k Yield < €5k

slide-11
SLIDE 11

Peer Groups (Mineral Oils, Construction) Predicted Values (Income-Consumption)

Analytics allows us to make sophisticated comparisons between taxpayers to identify

  • utliers

A compromise: Models for anomaly detection

slide-12
SLIDE 12

A sideline: Use analytics to predict response to intervention

12

% claiming online Age Mailed group Control group Response

Predicting outcomes is not the same as predicting response

Model output Approach taken

  • Business objective to target

campaigns aimed at persuading taxpayers to claim expenses online

  • Initial hypothesis was that younger

taxpayers should be targeted

  • Controlled experiment run to assess

incremental impact; model(s) built to ‘predict’ experimental results

  • Found that older taxpayers responded

more strongly

slide-13
SLIDE 13

Challenges & opportunities

slide-14
SLIDE 14

Why we are embracing a ‘low-tech’ approach

  • Unrepresentative training sets
  • Variation in system usage, etc
  • Many relationships are just

artefacts of data

  • Can’t just automate search for

predictive patterns ‘Low-tech’ methods that business experts can review and understand make it much easier to weed out spurious patterns

Challenges Implications (Attempted) Solution

slide-15
SLIDE 15

Data without analytics?

Traditional Taxation Burden Convenience

  • Periodic reporting

Timeliness

  • Real-time exchange

Natural Taxation Already under way through eRCT and PAYE real-time; Opportunity to make compliance the default setting

  • Manual submission
  • Automatic submission

Corroboration

  • Self-reported
  • Immediate checking

against counter-party and 3rd party returns