Householding Use Case
Householding MDM in CluedIn — Step-by-Step Implementation Guide
Householding MDM creates a reliable, privacy-aware view of the people who live together and their shared attributes (address, contact channels, preferences, relationships). This guide walks you through the end-to-end build in CluedIn: ingest → standardize → match/link (person & address) → household formation → golden record → governance → publish.
Outcomes
- One golden Household per dwelling (with effective dating for moves).
- Linked entities: Household ↔ Person(s) ↔ Address(es) ↔ Accounts/Orders/Policies ↔ Preferences/Consents.
- Accurate roles (e.g., Head, Partner, Dependent, Roommate) and membership periods.
- Better targeting, deduplication, and compliance (opt-outs and legal notices aggregated).
Prerequisites
- Source access: CRM, billing/subscriptions, e-commerce/loyalty, service/support, marketing lists, (optional) property/geo sources.
- RACI: Ensure stewards have Accountable access to governed objects.
- For heavy matching runs, use a dedicated server/VM (not a laptop) to avoid timeouts.
- Agreed business rules for:
- How to define a “household” (shared address + time overlap ± surname/relationship evidence).
- How to aggregate preferences/consents (“most restrictive wins” recommended).
Reference Data Model
Core entities
Person
(individual)Address
(standardized, geocoded; includes unit/apartment)Household
(grouping)HouseholdMembership
(Person ↔ Household with role & date range)Preference
/Consent
(per Person; optionally aggregated to Household)Account
/Order
/Policy
(linked for analytics)
Key attributes (examples)
- Address: Line1, Line2/Unit, City, Region, Postcode, Country, GeoPoint, GeoID/PropertyID, DPV/AV status.
- Household: HouseholdId (stable key), PrimaryAddressId, Type (Family, Shared, Multi-Family, Dorm), Size, Start/End dates.
- Membership: Role, StartDate, EndDate, Evidence (AddressOverlap, Relationship, FinancialLink).
Step 1 — Connect & Ingest Sources
Portal: Data Sources → Add Source
- CRM persons/contacts, service/install addresses, billing/ship addresses.
- Marketing lists (email/phone), loyalty/e-com profiles.
- Optional: property datasets (parcel/building IDs), geocoding service outputs.
- Load ad-hoc spreadsheets for legacy segments.
Capture provenance on ingest:
SourceSystem
,SourceRecordID
,LoadTimestamp
.
Step 2 — Define Entities & Mappings
Portal: Entity Explorer → + Create Entity Type
- Create
Person
,Address
,Household
,HouseholdMembership
,Preference/Consent
. - Map source fields to canonical attributes (normalize naming; keep
Unit/Apt
explicit). - Persist
Address.Raw
alongside parsed components for lineage.
Step 3 — Address Standardization & Geocoding
Portal: Governance → Rules (Data Part Rules)
Create rules to normalize addresses before matching:
- Parse Line1/Line2, extract Unit/Apt; normalize street types and directional suffixes.
- Normalize Postcode/Region (ISO where applicable).
- Geocode to GeoPoint; attach GeoID/PropertyID when available.
- Validate deliverability (e.g., DPV/AV flags) and tag issues.
Recommended tags: StandardizedAddress
, InvalidAddress
, MissingUnit
, POBox
.
Step 4 — Person Identity Resolution
Portal: Entity Matching → Person
- Blocking keys: (Email), (Phone), (LastName + DOB), (CustomerId).
- High-confidence: exact email OR exact phone OR government/customer ID.
- Medium: fuzzy name + same birthdate OR same email domain + strong address similarity.
- Review in Data Stewardship; approve merges; Unmerge when needed.
Avoid over-merging roommates with same last name but different emails/phones.
Step 5 — Address Deduplication
Portal: Entity Matching → Address
- Blocking: (Postcode + HouseNumber), (GeoID), (GeoPoint rounded).
- High-confidence: exact normalized address (including Unit/Apt).
- Candidate: same building (GeoID) + similar line components; send to stewardship.
- Keep P.O. Boxes separate; never merge PO Box with street delivery points.
Step 6 — Household Formation Rules
Portal: Governance → Rules (Golden Record & Data Part Rules)
Create Household keys and membership logic:
- Household Key (example)
HHKey = hash(GeoID or (Address.Line1+Line2+Postcode+Country) + CohabitationWindow)
- Formation conditions (any of):
- ≥2
Person
entities share the same normalized Address (including Unit) with overlapping residency periods. - A
Person
moves in (address change) to an existing household and shares a relationship (spouse/child) or financial link (shared account).
- ≥2
- Membership periods: set
StartDate
when first evidence appears; setEndDate
when address changes or account closes.
Tags: CandidateHousehold
, ConfirmedHousehold
, SharedResidence
, Dormitory
, MultiUnit
.
Step 7 — Roles & Evidence
Portal: Data Stewardship → Household Review
- Assign roles per member:
Head
,Partner
,Dependent
,Roommate
. - Evidence priority: Shared last name & age gap (family), shared contract/account (financial), tenancy record, emergency contact links.
- Store
EvidenceType
onHouseholdMembership
.
Step 8 — Household Golden Survivorship
Portal: Governance → Rules (Golden Record Rules)
Define how household-level attributes are chosen:
Attribute | Precedence (high → low) | Tie-breaker |
---|---|---|
PrimaryAddress | Verified service/billing > CRM mailing > marketing | Most recent verified |
PrimaryPhone/Email | Landline at address > Head’s preferred > shared contact | Consent state = granted |
Language/Locale | Majority of members > Head’s preference | Most recent update |
HouseholdType | Evidence-based (Family > Shared > Dorm > Multi-Family) | Steward override |
Preferences/Consents | Most restrictive wins (opt-out propagates) | N/A |
- Compute
HouseholdSize
,HasChildren
(derived), andCompletenessScore
. - Tag
DoNotContact
at household if any member hasDoNotContact=true
(policy).
Step 9 — Consent & Preference Aggregation
Portal: Rules (Data Part & Golden)
- Household suppression: if any member opt-outs, set household
DoNotContact=true
. - Channel logic: e.g., suppress SMS if no members consent to SMS; allow email if any consenting adult (confirm legal stance).
- Keep person-level consent for precision; use household-level only for targeting/ops.
Step 10 — Quality Scoring & Dashboards
Portal: Data Quality → Profiling / Dashboards
Track KPIs:
- Address deliverability %, unit capture rate, geocoding success.
- Duplicate person rate, duplicate address rate.
- Household coverage (people with a HouseholdId).
- False-merge & unmerge counts; membership with missing dates.
Step 11 — Governance, Security & Lifecycle
Portal: Governance → Permissions / Retention / Policies
- Restrict access to sensitive person attributes (DOB, IDs).
- Retention: keep ended memberships and prior addresses for N years (audit & analytics).
- Enable rollback: merges, household formation, and role changes captured in History (undo/restore supported).
Step 12 — Publish / Integrate
Portal: Exports
- CRM/Marketing: export Household table (id, primary address/contact, size, type) + HouseholdMembership for segmentation.
- Service/Billing: publish golden addresses to reduce delivery failures.
- Analytics/DW: wide conformed model (Person ↔ Household ↔ Address).
- APIs/Eventing: emit household-changed events for downstream sync.
Start read-only, validate mapping & suppression behavior; then make CluedIn authoritative where appropriate.
Step 13 — Scheduling & Operations
- Schedule person/address matching, household refresh, consent aggregation, and exports.
- Run heavy matching and rebuilds off-peak on a dedicated VM/server to avoid timeouts.
- Alert on spikes: sudden household churn can indicate address parsing issues upstream.
Step 14 — Validate & UAT
- Sample 100–200 households across multi-unit buildings, single-family homes, dorms/aged-care.
- Verify roles, membership dates, and suppression logic.
- Business sign-off from CRM/Marketing/Service & Privacy teams.
Go-Live Checklist
- Address standardization + geocoding > 95% success; unit capture validated.
- Person & Address matching thresholds tuned; unmerge tested.
- Household formation rules approved; role assignment guidelines documented.
- Consent aggregation (“most restrictive wins”) validated end-to-end.
- Exports/APIs tested with downstream systems.
- Permissions & retention configured; audit/rollback verified.
- Product Toolkit package built & promoted from staging to prod (run on dedicated box with Accountable access).
Example Rules (Snippets)
Address Normalize (Data Part Rule)
- Condition:
Address.Raw present
- Actions: parse components; standardize abbreviations; set
Address.Normalized
; geocode; tagStandardizedAddress
orInvalidAddress
.
Create Household Key
- Condition:
Address.Valid=true
- Action:
HouseholdKey = hash(Address.Normalized+Postcode+Country)
; (optionally addUnit
andGeoID
).
Membership Inference
- Condition: two Persons share
HouseholdKey
with residence overlap ≥ 60 days - Action: create
HouseholdMembership
for both with role defaults (Head
for account holder; elseUnknown
); tagCandidateHousehold
.
Consent Aggregation
- Condition: any member
DoNotContact=true
- Action: set
Household.DoNotContact=true
; storeAggregatedFrom=PersonIds
.
Golden Survivorship
- PrimaryAddress: verified service address > billing address > CRM; tie-break newest verified.
- PrimaryPhone: shared landline > Head’s preferred > most recent; require consent.
Common Pitfalls & How to Avoid Them
- Merging different apartments: always include Unit/Apt or GeoID in address matching.
- Dorms & aged-care: many unrelated people at one address — require extra evidence (relationship/financial link) to form a household.
- Surname-only logic: causes roommate conflation — use multi-evidence (address + email/phone + relationship).
- PO Boxes: do not use for household formation; treat separately from street delivery points.
- Churn from upstream formatting changes: lock down address normalization and monitor drift.
- Running big rebuilds on laptops: use dedicated compute to prevent timeouts.
Success Metrics
- Household precision ≥ 98% (low false merges); recall ≥ 95% for eligible addresses.
- Mail bounce rate ↓; delivery success ↑ after golden address adoption.
- Suppression accuracy ≥ 99% for household-level opt-outs.
- Duplicate Person rate ↓; average time to household new mover ↓.
- Unmerge rate trending down month-over-month after tuning.
Summary
Householding MDM in CluedIn depends on clean addresses, robust person/address matching, and clear household formation rules with dates and roles. Keep golden household attributes canonical, aggregate preferences conservatively, and validate across tricky dwelling types (multi-unit, dorms, aged-care) before go-live. .