IMPACT OF DE-IDENTIFICATION ON MASTER PATIENT INDEX AND DATA LINKAGES
CENTER FOR HEALTH INFORMATION AND ANALYSIS
MASTER PATIENT INDEX AND DATA LINKAGES August 2020 Kathy Hines, - - PowerPoint PPT Presentation
IMPACT OF DE-IDENTIFICATION ON MASTER PATIENT INDEX AND DATA LINKAGES August 2020 Kathy Hines, Senior Director of Partner Operations & Data Compliance Scott Curley, Manager of Privacy & Compliance CENTER FOR HEALTH INFORMATION AND
CENTER FOR HEALTH INFORMATION AND ANALYSIS
3 Impact of De-identification on MPI and Data Linkages | Scott Curley, Kathy Hines| August 2020
4 Impact of De-identification on MPI and Data Linkages | Scott Curley, Kathy Hines| August 2020
5 Impact of De-identification on MPI and Data Linkages | Scott Curley, Kathy Hines| August 2020
(insurance carriers and hospitals) that replaces key PII fields with pseudonymized equivalents
from the data warehouse and are not released to internal users or external data applicants.
to link to external data
7 Impact of De-identification on MPI and Data Linkages | Scott Curley, Kathy Hines| August 2020
*Slide courtesy of ONPOINT Health Data
8 Impact of De-identification on MPI and Data Linkages | Scott Curley, Kathy Hines| August 2020
* Slide courtesy of ONPOINT Health Data
Graphic should fit approximately in this space
9 Impact of De-identification on MPI and Data Linkages | Scott Curley, Kathy Hines| August 2020
* Slide courtesy of ONPOINT Health Data
11 Impact of De-identification on MPI and Data Linkages | Scott Curley, Kathy Hines| August 2020
12 Impact of De-identification on MPI and Data Linkages | Scott Curley, Kathy Hines| August 2020
14 Impact of De-identification on MPI and Data Linkages | Scott Curley, Kathy Hines| August 2020
15 Impact of De-identification on MPI and Data Linkages | Scott Curley, Kathy Hines| August 2020
Eligibility Removed:
Medical Claims Removed: PII Product Provider Dental Removed: PII Rx Claims Removed: PII CHIA File Secure Software
Nickname Table
and Last name
Function
known dummy values
population (small ZIP codes) Eligibility Final HASH
Clear
Encrypted for Transport to CHIA
CHIA Landing Zone APCD Submission Files (“in the clear”)
17 Impact of De-identification on MPI and Data Linkages | Scott Curley, Kathy Hines| August 2020
18 Impact of De-identification on MPI and Data Linkages | Scott Curley, Kathy Hines| August 2020
CHIA Landing Zone Data Preparation CHIA APCD Algorithm
Probabilistic matching method using:
Links records within and across carriers. Filter Known Data Issues
CHIA Master Patient Index Hub (MEID) Data Load
Records where:
are the same are considered the same person. The last 5 valid values of each input field are stored to capture name changes, people moving etc. CHIA MPI Org ID Insurance ID First Name Last Name DOB Gender SSN ZIP Code 111111 30 BBY00002211 ABCD QRSTUVWXYZ POIUYT F HFHDSFH 02116 KDFGJKDFKFK 02461 02090 112233 22 HVD00000122 QWDD DGFGDFFGFG GFGDFF M FGDDDFG 02118 112233 30 BBY000034234 QWDD DGFGDFFGFG GFGDFF M FGDDDFG 01056 13116 01025
20 Impact of De-identification on MPI and Data Linkages | Scott Curley, Kathy Hines| August 2020 Customer File
The more complete the file, the better the match results however not all fields are needed for each record for a confident match
CHIA File Secure Software Linking File Prepped HASH
In the Clear
CHIA APCD Algorithm
High Score Matches Lower Score Matches Custom Match Threshold based
the matches need to be. For example: Higher = All fields present and up to 1 mismatch Any number of additional match scenarios can be added and separated from the High Score Matches based
For example: Lower = SSN Missing and up to 1 mismatch Scores each input record against likely candidates in the MPI Hub. Used by customer CHIA use Master Enterprise ID to identify corresponding claims, this ID is then replaced with the project’s unique Study ID and claims returned to customer
21 Impact of De-identification on MPI and Data Linkages | Scott Curley, Kathy Hines| August 2020
Input Row from Customer - Hashed Equivalent Study ID First Name Last Name DOB SSN Zip Code Gender 8888 ABCD QRSTUVWXYZ POIUYT 02116 F
APCD Linking Scenarios
CHIA ID (MPI) First Name Last Name DOB SSN Zip Code Gender Match Result Match Score Disposition 4455544 ABCD QRSTUVWXYZ POIUYT 02116 F 5 Matches, 0 Mismatch Highest Input Row links to these APCD records 4455544 ABCD QRSTUVWXYZ POIUYT 02119 F 4 Matches, 1 Mismatch Higher 4455544 ABCD HIJKLMNOPQ POIUYT 02116 F 4 Matches, 1 Mismatch 4455544 ABCD QRSTUVWXYZ POIUYT 02116 M 4 Matches, 1 Mismatch 4455544 MNOP QRSTUVWXYZ POIUYT 02116 F 4 Matches, 1 Mismatch 2332332 ABCD QRSTUVWXYZ LKJHGD 02116 F 4 Matches, 1 Mismatch, DOB weighted stronger Based on Study Requirements, Input Row may link to these APCD Records 4455544 ABCD HIJKLMNOPQ POIUYT 02116 M 3 Matches, 1 Mismatch Lower 5755542 ABCD MNBCDVSWX LKJHGD 02119 F 2 Matches, 3 Mismatch Input Row does not link to these APCD records 7886655 MNOP HIJKLMNOPQ POIUYT 02116 M 2 Matches, 3 Mismatch Too Low
datasets)
Housing & Urban Development housing data
postpartum depression
22 Impact of De-identification on MPI and Data Linkages | Scott Curley, Kathy Hines| August 2020
23
Impact of De-identification on MPI and Data Linkages | Scott Curley, Kathy Hines| August 2020