Four key risk areas requiring attention before platform build proceeds.
High
Call Recordings Through External AI
Voice calls across 260 practices (~65,000 call-minutes/day) are routed through Peer Logic, RingCentral, 8x8, and Weave, then transcribed via OpenAI Whisper. Audio recordings contain unfiltered verbal PHI — patient names, conditions, insurance details — sent directly to external API without redaction.
Action Required
Verify OpenAI Whisper BAA explicitly covers audio data. Research real-time audio redaction or on-premise transcription alternatives.
High
Four Vendors Without BAAs
Cloudinary (dental photos including facial images), Swell (patient reviews with health details), Cinnamon (call analytics with conversation content), and Dental Intel (practice analytics with patient data) all process PHI without confirmed Business Associate Agreements.
Action Required
Execute or verify BAAs with all four vendors before any PHI flows are established. If BAAs are unavailable, identify HIPAA-compliant alternatives.
Medium
PHI Surface Area Breadth
The platform spans 6 PHI sources, 8 processing services, and 7 destination categories across 260 dental practices. Daily volume: ~13,000 patient messages and ~65,000 call-minutes. Each integration point is a potential compliance exposure.
Action Required
Implement comprehensive PHI flow monitoring. Add PHI detection to the Kafka event bus with automated alerting for unexpected data exposure.
Medium
PHI Filter as Single Point of Trust
The AI Service PHI Filter uses regex + NER to strip protected health information before prompts reach the Claude API. This is the sole barrier preventing PHI from reaching external LLM services. A single bypass or failure mode could expose patient data.
Action Required
Implement defense-in-depth: evaluate OursPrivacy or Freshpaint as a secondary PHI check. Conduct penetration testing focused on filter bypass scenarios.
PHI Surface Area
Data sources, processing services, and destinations handling protected health information.
6PHI Sources
High PMS Systems (NexHealth + Sikka)
High Phone Systems (4 vendors)
High Staff Content Capture
Med Patient Communications
Med Review Platforms (Swell)
Med Phone Analytics (3 vendors)
8Processing Services
Crit AI Service + PHI Filter
High Voice AI Service (Whisper)
High Content Engine
Med Practice Data Hub
Med Communications Service
Med H-Score Engine
Med Event Bus (Kafka/MSK)
Low Compliance & Audit Layer
7Destinations
High External AI APIs (Claude, Whisper)
High Cloudinary (no BAA)
High Vendors Without BAAs (3)
Med Patient-Facing Messages
Low Executive Dashboards
Low Audit Logs (S3/Glacier)
Low Analytics Data Lake
PHI Data Flow Map
Click any node to expand details. Flows move from sources through processing to destinations.
High Risk
Medium Risk
Low Risk
Critical Control
⚠Critical Path: Phone Systems → Voice AI (Whisper) → Claude API — unfiltered audio with verbal PHI sent to external transcription service.
Obtain BAA or migrate to HIPAA-compliant image CDN (e.g., AWS S3 + CloudFront)
Swell
Patient reviews mentioning procedures, conditions
✗ No BAA — Verify
High
Contact vendor to verify if BAA is available; evaluate alternatives if not
Cinnamon
Call analytics with conversation content
✗ No BAA — Verify
High
Contact vendor to verify if BAA is available; evaluate alternatives if not
Dental Intel
Practice analytics with patient-level data
✗ No BAA — Verify
Med
Contact vendor to verify if BAA is available; assess PHI exposure scope
Needs Verification
4 vendors
Vendor
PHI Exposure
BAA Status
Risk
Action Needed
OpenAI (Whisper)
Audio transcription — receives raw call recordings with verbal PHI
⚠ Required — Verify
High
Verify BAA explicitly covers audio data, not just text
Anthropic (Claude)
Receives filtered text prompts via PHI Filter
⚠ Required — Verify
Med
Verify enterprise BAA covers filtered text prompt use case
NexHealth
Practice management data — patient records, scheduling
⚠ Required — Verify
Med
Obtain and review BAA terms for data integration use case
Sikka ONE
Practice management data — patient records, procedures
⚠ Required — Verify
Med
Obtain and review BAA terms for data integration use case
Evaluate
2 vendors
Vendor
PHI Exposure
BAA Status
Risk
Action Needed
ClickHouse Cloud
Analytics data lake — de-identified data
⚠ Evaluate
Low
Evaluate BAA availability; low risk given de-identified data
Temporal Cloud
Workflow orchestration — may handle PHI in task payloads
⚠ Evaluate
Low
Evaluate BAA availability; assess whether PHI passes through workflows
Confirmed
4 vendors
Vendor
PHI Exposure
BAA Status
Risk
Notes
AWS
Infrastructure — hosts all core services, databases, storage
✓ Confirmed
Low
Standard AWS BAA covers all HIPAA-eligible services
Twilio
Patient communications — SMS, voice, forms
✓ Confirmed
Low
HIPAA-eligible product; BAA available for enterprise tier
Datadog
Application monitoring — may capture PHI in logs
✓ Confirmed
Low
BAA available; ensure log scrubbing to minimize PHI exposure
Canva
Marketing content creation
N/A
Low
No PHI processed; BAA not required
Prioritized Recommendations
Action items organized by urgency. All items must be resolved before platform build proceeds to production.
Immediate — Before Build
Immediate
1
Execute BAAs with Cloudinary, Swell, Cinnamon, and Dental Intel before any PHI flows are established. If BAAs are unavailable, identify and migrate to HIPAA-compliant alternatives.
Legal + CTO
Not Started
2
Verify Anthropic Claude enterprise BAA explicitly covers filtered text prompts generated by the PHI Filter service.
Legal + Engineering
Not Started
3
Verify OpenAI Whisper BAA explicitly covers audio data containing verbal PHI — not just text-based inputs.
Legal + Engineering
Not Started
4
Verify NexHealth and Sikka ONE BAAs cover the data integration use case (bidirectional patient record sync across 260 practices).
Legal + Integrations
Not Started
Short-Term — First 90 Days
90 Days
5
Implement defense-in-depth for the PHI Filter: evaluate OursPrivacy or Freshpaint as a secondary PHI detection layer to eliminate single-point-of-failure risk.
Engineering
Not Started
6
Research and implement audio redaction for call recordings before transcription — even imperfect redaction reduces exposure surface for Whisper.
Engineering + AI Team
Not Started
7
Add PHI detection scanning to the Kafka event bus with automated alerting for unexpected PHI in event payloads.