Customer Data Unification at TME

Plumb differently

Created by Wouter Dullaert / @wouterdullaert

Overview

How did we get here?
Data Integration
Data Unification

How did we get here?

Organic growth

1 Retailer

A few retailers

A lot of retailers and 1 distributor

A lot of retailers and a few distributors

TME to centralise operations across Europe

Result

All customer data lives close to the customer

But relevant data increasingly arrives centrally

How do we link all this data together and feed it back?

Data Integration - Ingestion

Mapping 2 sources

Linking 2 sources

Mapping 3 sources

Linking 3 sources

After some more time and sources

Y U NO WORK!!1!11

Data Integration - Consumption

Which values do you retain?

What is the origin of the data?

What if multiple consumers of the data have different requirements for the merged entity?

How do you handle data updates in source systems?

How do you handle data updates in consuming systems?

Data Integration

Slow

Write a ton of scripts
Manually profile, clean and map the data

Costly

Only scales with people
Very often outsourced

Opaque

No documentation
No audit trail

Inefficient

Low quality results
Hard to keep up to date

Data Unification

Forward flow
Feedback flow

Forward flow

Ingest the data

Ingest in the schema of the source system (removes friction)

Map the data

Map fields from source systems into the target schema.

Machine learning assisted

Based on your earlier mappings and the statistical profile of the attributes, Tamr offers mapping recommendations

Effort of mapping the data goes down as more sources are integrated
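To make the idea of profile-based mapping recommendations concrete, here is a minimal sketch. It is not Tamr's algorithm: the `profile`, `score` and `recommend` functions, the four statistics and the 50/50 weighting are all assumptions chosen for illustration. Each target attribute is represented by values pooled from columns that were already mapped to it, which is one way the effort could drop as more sources are integrated.

```python
from difflib import SequenceMatcher


def profile(values):
    """Summarise a column with a few cheap statistics, each in [0, 1]."""
    texts = [str(v) for v in values if v not in (None, "")]
    chars = "".join(texts)
    n_values = max(len(values), 1)
    n_texts = max(len(texts), 1)
    n_chars = max(len(chars), 1)
    return {
        "fill_rate": len(texts) / n_values,
        "distinct_ratio": len(set(texts)) / n_texts,
        "digit_ratio": sum(c.isdigit() for c in chars) / n_chars,
        "alpha_ratio": sum(c.isalpha() for c in chars) / n_chars,
    }


def score(source_name, source_values, target_name, target_values):
    """Blend attribute-name similarity with profile similarity (hypothetical weights)."""
    name_sim = SequenceMatcher(None, source_name.lower(), target_name.lower()).ratio()
    p, q = profile(source_values), profile(target_values)
    profile_sim = 1 - sum(abs(p[k] - q[k]) for k in p) / len(p)
    return 0.5 * name_sim + 0.5 * profile_sim


def recommend(source_name, source_values, targets):
    """Rank target attributes for one source column, best candidate first.

    `targets` maps a target attribute name to values pooled from the
    source columns that were previously mapped to it.
    """
    return sorted(
        targets,
        key=lambda t: score(source_name, source_values, t, targets[t]),
        reverse=True,
    )
```

A user would then confirm or reject the top suggestion, and confirmed columns would be added to the pooled values for that target attribute.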

Link the data

Use an ML model to link entities across all the records

Train model by evaluating record pairs

A business user can do this!

Regularly train new pairs to keep the model in sync with changes in the sources
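The linking step can be sketched as follows, under strong simplifying assumptions. A real system would train a proper classifier on many pair features; here the "model" is a single similarity threshold fitted on pairs that a user has labelled as match or no match, and linked records are grouped with union-find. The field names (`name`, `email`, `city`) and all function names are hypothetical.

```python
from difflib import SequenceMatcher
from itertools import combinations


def similarity(a, b, fields=("name", "email", "city")):
    """Average string similarity over a few (assumed) fields."""
    sims = [
        SequenceMatcher(None, str(a.get(f, "")).lower(), str(b.get(f, "")).lower()).ratio()
        for f in fields
    ]
    return sum(sims) / len(sims)


def fit_threshold(labelled_pairs):
    """Pick the threshold that classifies the most labelled pairs correctly.

    labelled_pairs: list of (record_a, record_b, is_match) tuples,
    i.e. the answers a business user gave when reviewing pairs.
    """
    scored = [(similarity(a, b), is_match) for a, b, is_match in labelled_pairs]
    candidates = sorted({s for s, _ in scored})
    return max(candidates, key=lambda t: sum((s >= t) == m for s, m in scored))


def cluster(records, threshold):
    """Group record indices whose pairwise similarity reaches the threshold."""
    parent = list(range(len(records)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    for i, j in combinations(range(len(records)), 2):
        if similarity(records[i], records[j]) >= threshold:
            parent[find(i)] = find(j)

    groups = {}
    for i in range(len(records)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())
```

Comparing all pairs is quadratic in the number of records, so anything beyond a toy data set would need a blocking step to limit which pairs are compared. Refitting the threshold on newly labelled pairs corresponds to the regular retraining described above.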

Consume/Merge the data

Multiple views that essentially describe how individual fields are merged

Views are functions over the data -> flexibility
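A minimal sketch of "views as functions", assuming each cluster is a list of source records carrying `source` and `updated` fields. The merge rules, the two example views and the source name `dealer_crm` are made up for illustration; the point is only that different consumers can apply different rules to the same linked records.

```python
from collections import Counter


def most_common(field):
    """Merge rule: keep the value that occurs most often in the cluster."""
    def pick(records):
        values = [r[field] for r in records if r.get(field)]
        return Counter(values).most_common(1)[0][0] if values else None
    return pick


def most_recent(field):
    """Merge rule: keep the value from the most recently updated record."""
    def pick(records):
        filled = [r for r in records if r.get(field)]
        return max(filled, key=lambda r: r["updated"])[field] if filled else None
    return pick


def prefer_source(field, source):
    """Merge rule: trust one source, fall back to the most recent value."""
    def pick(records):
        for r in records:
            if r["source"] == source and r.get(field):
                return r[field]
        return most_recent(field)(records)
    return pick


def apply_view(view, records):
    """A view maps each output field to a merge rule over the cluster."""
    return {field: pick(records) for field, pick in view.items()}


# Two consumers, two views over the same underlying records.
marketing_view = {"name": most_common("name"), "email": most_recent("email")}
service_view = {"name": most_common("name"), "email": prefer_source("email", "dealer_crm")}
```

Because the source records are never overwritten, a new consumer with different requirements only needs a new view, not a new copy of the data.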

Feedback flow

Save all updates as immutable events

Create "pseudo" sources by provenance

The ML model groups the events into clusters
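One possible reading of the feedback flow, sketched under assumptions: every update made in a consuming system is stored as an immutable event, and the events are folded into one "pseudo" source per provenance, so they can go through the same mapping and linking steps as any other source. The `UpdateEvent` fields and the `EventLog` class are hypothetical, not taken from the original design.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)  # frozen: an event cannot be changed after creation
class UpdateEvent:
    provenance: str  # system in which the update was made
    entity_id: str   # identifier used by that system
    field: str
    value: str
    timestamp: str   # ISO 8601, so string order is chronological order


class EventLog:
    """Append-only store of update events."""

    def __init__(self):
        self._events = []

    def append(self, event):
        self._events.append(event)

    def pseudo_sources(self):
        """Fold events into one record set per provenance (later events win)."""
        sources = defaultdict(dict)
        for e in sorted(self._events, key=lambda e: e.timestamp):
            record = sources[e.provenance].setdefault(e.entity_id, {"id": e.entity_id})
            record[e.field] = e.value
        return {name: list(records.values()) for name, records in sources.items()}
```

Each pseudo source can then be ingested like a regular source, so the linking model assigns the updated records to the existing clusters without anyone writing back into the original systems.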

Integrates legacy sources
Scales well as a function of sources
Obtains knowledge where it resides
Supports multiple views/uses of the same base data
Avoids the creation of additional sources
Allows creation of new processes on top of consolidated data

Questions?
