DATAP.AI HEALTH

AI Governance & Compliance

Privacy-aware AI routing, full audit trail, and immutable session archival. No black boxes.

Total Requests: 12,847 (+8.2% vs last month)
PHI Detected: 342 (+2.1% vs last month)
Models Active: 7
Compliance Score: 94% (+1.5% vs last month)

DATAP.AI Document Processing Pipeline

DATAP.AI Health processes clinical documents through 4 layers of privacy protection. Raw text containing patient identifiers (Medicare, IHI, MRN) is processed exclusively by HIPAA-compliant AI providers. De-identified text uses frontier models for the best clinical reasoning quality.

[Diagram: DATAP.AI Document Processing Pipeline]

Pipeline stages:

  • 1. Document Upload: PDF, FHIR, HL7, DOCX
  • 2. Text Extraction: PyMuPDF, OCR, FHIR
  • 3. PHI Detection: Medicare, IHI, MRN
  • 4. De-identification: Safe Harbour

Provider by PHI risk:

  • HIGH PHI RISK: Fireworks AI (HIPAA-compliant; BAA signed, zero data retention, SOC2) for Clinical NER, PHI Scanning, Form Extraction
  • LOW PHI RISK: Google Gemini (frontier; highest reasoning quality) for Document Q&A, Clinical Copilot

Defence-in-Depth: 4 Layers of Privacy Protection

  • Layer 1: PHI Detection
  • Layer 2: De-identification
  • Layer 3: HIPAA Provider
  • Layer 4: Audit Trail
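Stages 3 and 4 of the pipeline (PHI detection, then de-identification) can be illustrated with a minimal Python sketch. The regular expressions and the `detect_phi` / `deidentify` names are illustrative assumptions, not DATAP.AI's actual implementation. The patterns are deliberately simplified, skip checksum validation, and cover only the three identifiers named above, whereas a full Safe Harbour process removes many more identifier types.

```python
import re

# Hypothetical, simplified patterns (assumptions for illustration only).
# IHI is listed first so that a 16-digit IHI is masked before the shorter patterns run.
PHI_PATTERNS = {
    "IHI": re.compile(r"\b8003[ -]?60\d{2}[ -]?\d{4}[ -]?\d{4}\b"),
    "MEDICARE": re.compile(r"\b\d{4}[ -]?\d{5}[ -]?\d(?:[ -]?\d)?\b"),
    "MRN": re.compile(r"\bMRN[:\s#]*\d{6,10}\b", re.IGNORECASE),
}


def detect_phi(text):
    """Return (label, matched_text) pairs for every identifier found."""
    findings = []
    for label, pattern in PHI_PATTERNS.items():
        for match in pattern.finditer(text):
            findings.append((label, match.group()))
    return findings


def deidentify(text):
    """Replace each detected identifier with a bracketed placeholder."""
    for label, pattern in PHI_PATTERNS.items():
        text = pattern.sub(f"[{label}]", text)
    return text


sample = "Patient MRN: 00123456, Medicare 2123 45670 1, IHI 8003 6012 3456 7890."
print(detect_phi(sample))
print(deidentify(sample))
```

In this sketch, any text for which `detect_phi` returns findings would be treated as HIGH PHI risk, and only the output of `deidentify` would be eligible for a frontier model.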

DATAP.AI Privacy-Aware LLM Router

DATAP.AI classifies every healthcare AI task by PHI risk level, then routes to the appropriate provider. Fireworks AI (HIPAA-compliant, BAA signed) handles 8 of 11 tasks. Google Gemini handles 3 patient-facing tasks where reasoning quality is paramount.

[Diagram: DATAP.AI Privacy-Aware LLM Router]

Healthcare Task | PHI Risk
PHI Scanning | HIGH
Document NER | HIGH
Form Extraction | HIGH
Embeddings | MEDIUM
Signal Detection | NONE
Signal Classification | NONE
Cross-Validation | NONE
Investigation | NONE
Clinical Copilot | LOW
Document Q&A | LOW
Compliance Report | LOW

Providers:

  • Fireworks AI: BAA signed, zero data retention, SOC2. Models: DeepSeek V3, Qwen3. Handles 8/11 tasks.
  • Google Gemini: highest reasoning quality. Model: Gemini 2.5 Flash. Handles 3/11 tasks.

Why this routing? HIGH/MEDIUM PHI risk tasks send raw patient data to the AI model, so the HIPAA provider is the compliance safety net.
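The routing rule described above can be sketched as a small lookup in Python. The task keys, the `PhiRisk` enum, and the `route` function are hypothetical names. The task-to-provider split is inferred from the diagram's 8/11 and 3/11 counts (the three LOW-risk tasks go to Gemini and everything else to Fireworks AI), and the fail-closed default for unknown tasks is an assumption rather than documented behaviour.

```python
from enum import Enum


class PhiRisk(Enum):
    NONE = "NONE"
    LOW = "LOW"
    MEDIUM = "MEDIUM"
    HIGH = "HIGH"


# Risk levels as listed in the router diagram; the key names are illustrative.
TASK_RISK = {
    "phi_scanning": PhiRisk.HIGH,
    "document_ner": PhiRisk.HIGH,
    "form_extraction": PhiRisk.HIGH,
    "embeddings": PhiRisk.MEDIUM,
    "signal_detection": PhiRisk.NONE,
    "signal_classification": PhiRisk.NONE,
    "cross_validation": PhiRisk.NONE,
    "investigation": PhiRisk.NONE,
    "clinical_copilot": PhiRisk.LOW,
    "document_qa": PhiRisk.LOW,
    "compliance_report": PhiRisk.LOW,
}


def route(task):
    """Pick a provider for a task based on its PHI risk level."""
    risk = TASK_RISK.get(task)
    if risk is None:
        # Assumption: unknown tasks fail closed to the HIPAA-compliant provider.
        return "fireworks"
    # Inferred split: LOW-risk tasks use Gemini; all other levels use Fireworks AI.
    return "gemini" if risk is PhiRisk.LOW else "fireworks"


for name in TASK_RISK:
    print(f"{name}: {TASK_RISK[name].value} -> {route(name)}")
```

Keeping the mapping in data rather than in branching logic means the same table can be served unchanged to a governance reviewer.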

Healthcare AI — LLM Routing Table

Each healthcare AI task is routed to a specific provider and model based on PHI risk, clinical reasoning requirements, and cost.

Task | Provider | Model | Why
Customer chat (copilot widget) | Gemini | 2.5 Flash | Fast, cheap, good UX
Clinical copilot (backend) | Bedrock | Claude Sonnet 4.6 | Best clinical reasoning, AU data residency
Document Q&A | Bedrock | Claude Sonnet 4.6 | Long-document comprehension
Compliance reports | Bedrock | Claude Opus 4.6 | Highest writing + regulatory accuracy
PHI scanning | Fireworks | DeepSeek V3 | HIPAA BAA, sees raw patient data
Document NER | Fireworks | DeepSeek V3 | HIPAA BAA, raw clinical text
Regulatory signals | Fireworks | DeepSeek V3 | Public data, high volume, cheap
Embeddings | Fireworks | Nomic Embed | HIPAA, residual PHI risk

Healthcare AI Architecture — Four Intelligences

Every healthcare AI interaction at DATAP.AI runs on four intelligence layers. Each layer is auditable, configurable, and compliant by default.

Pillar | Intelligence | Components | Tools & Services
AI | Artificial Intelligence | Multi-agent LLM orchestration, privacy-aware routing, clinical NER, PHI detection | Gemini, Fireworks AI, ag2, LanceDB
BI | Business Intelligence | Dashboards, Text-to-SQL, AI-generated clinical insights | Lightdash, Vanna, Recharts
CI | Customer Intelligence | CRM-backed AI chatbot, patient & practitioner management, interaction audit | ERPNext Healthcare, REST API
DI | Data Intelligence | Chat archival to S3 Parquet, Glue catalog, Athena queries, immutable audit trail | pyarrow, boto3, Athena, Glue

AI Audit Trail — Three-Tier Traceability

Every AI conversation is persisted to three independent stores. The S3 cold tier is immutable — once a Parquet file is written, it is never modified or deleted. This guarantees a tamper-proof audit trail for regulatory review.

Hot Tier

PostgreSQL (framework_db) · 90 days

Every AI chat message — user question + AI response + model + tokens — persisted in real-time for active UI and LLM context recall.

Access: Live queries via clinical chat endpoint

CRM Tier

ERPNext (crm-health.datap.ai) · Forever

Each chat interaction logged as a Note on the practitioner timeline. Human reviewers see the full conversation history in the CRM admin.

Access: ERPNext admin UI + REST API

Cold Tier

S3 Parquet (codepais3 bucket) · Forever (immutable)

Weekly archival exports chat history as Hive-partitioned Parquet files. Snappy-compressed, verified read-after-write. Once written, files are never modified — immutable audit trail.

Access: AWS Athena / DuckDB / Glue Crawler
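The write-once, read-after-write behaviour of the cold tier can be illustrated with a short Python sketch. It uses the local filesystem as a stand-in for S3, and the `write_once_verified` name is hypothetical. The source does not say how immutability is enforced on the bucket. On S3 this kind of guarantee is commonly backed by bucket-level controls such as Object Lock, which is an assumption here and not a documented part of the DATAP.AI setup.

```python
import hashlib
import tempfile
from pathlib import Path


def write_once_verified(path, payload):
    """Write payload to a new file, refuse to overwrite, then verify by re-reading."""
    path.parent.mkdir(parents=True, exist_ok=True)
    # Mode "xb" raises FileExistsError if the file already exists (write-once).
    with open(path, "xb") as handle:
        handle.write(payload)
    expected = hashlib.sha256(payload).hexdigest()
    actual = hashlib.sha256(path.read_bytes()).hexdigest()
    if actual != expected:
        raise IOError(f"read-after-write verification failed for {path}")
    return actual


with tempfile.TemporaryDirectory() as root:
    target = Path(root) / "year=2026" / "month=04" / "day=12" / "part-00000.parquet"
    print(write_once_verified(target, b"placeholder bytes, not real Parquet"))
    try:
        write_once_verified(target, b"attempted overwrite")
    except FileExistsError:
        print("overwrite rejected")
```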

S3 Immutable Session Archive — Hive Partitioning

s3://codepais3/stock/raw/chat_history/year=2026/month=04/day=12/part-00000-*.parquet
s3://codepais3/health/raw/chat_history/year=2026/month=04/day=12/part-00000-*.parquet

Each file contains: message_id, session_id, user_id, role, content, model_used, tokens_used, context_sources, created_at. Queryable via AWS Athena or DuckDB. Glue Crawler auto-discovers new partitions.
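As a rough illustration of this layout, the sketch below models one archived row and derives the Hive partition prefix from its timestamp. The field names come from the list above, but the Python types, the `ChatRecord` class, and the `partition_prefix` helper are assumptions made for the example.

```python
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class ChatRecord:
    # Field names follow the archive schema; the types are assumed.
    message_id: str
    session_id: str
    user_id: str
    role: str
    content: str
    model_used: str
    tokens_used: int
    context_sources: str
    created_at: datetime


def partition_prefix(domain, created_at, bucket="codepais3"):
    """Build the Hive-style year=/month=/day= prefix for a record's date."""
    return (
        f"s3://{bucket}/{domain}/raw/chat_history/"
        f"year={created_at:%Y}/month={created_at:%m}/day={created_at:%d}/"
    )


record = ChatRecord(
    message_id="m-1", session_id="s-1", user_id="u-1", role="user",
    content="example question", model_used="example-model", tokens_used=42,
    context_sources="", created_at=datetime(2026, 4, 12, tzinfo=timezone.utc),
)
print(partition_prefix("health", record.created_at))
```

Zero-padding the month and day keeps the partitions in lexical order and matches the `month=04/day=12` form shown in the example paths.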

Healthcare Data Standards

DATAP.AI processes clinical data using international and Australian healthcare standards.

Standard | Full Name | What It Does | Australian Equivalent
FHIR R4 | Fast Healthcare Interoperability Resources (HL7) | Standard format for exchanging clinical data between healthcare systems | Australian Digital Health Agency adopted FHIR as national standard. My Health Record uses FHIR R4.
HL7 v2 | Health Level Seven (messaging protocol) | Legacy messaging format used between hospital systems | Still widely used in Australian hospitals and pathology labs
HIPAA | US Health Insurance Portability and Accountability Act | US law governing protection of patient health data. Requires BAA with vendors who handle PHI. | Australian Privacy Act 1988 + Health Records Act. Australian Privacy Principles (APPs) govern health data.
PHI | Protected Health Information | Any data that can identify a patient: names, Medicare numbers, medical record numbers, dates of birth | In Australia: Medicare number, IHI (Individual Healthcare Identifier), MRN (Medical Record Number), DVA numbers
SOC2 | System and Organization Controls 2 | Independent security audit verifying data protection controls | IRAP or ISO 27001 are the Australian equivalents for government/healthcare
TGA | Therapeutic Goods Administration | Australia's regulatory body for medical devices, medicines, and biologicals. | Equivalent to US FDA, EU EMA. DATAP.AI monitors TGA but does NOT require TGA approval.
BAA | Business Associate Agreement | Legal contract with AI/cloud vendors ensuring they protect patient data. | No direct AU equivalent, but APP 8 and contractual privacy clauses serve similar purpose under the Privacy Act.

DATAP.AI Technology Partners

Fireworks AI

$4B valuation | Sequoia Capital-backed

  • HIPAA + SOC2 compliant with signed BAA
  • Zero data retention — patient data never stored
  • 140B+ tokens/day, 99.99% uptime
  • 5-10x cheaper than proprietary models
  • Handles 8 of 11 healthcare AI tasks

Google Gemini

Frontier reasoning model

  • Highest quality clinical reasoning
  • 2M token context window
  • Google Search grounding for real-time data
  • Used for patient-facing responses only
  • Handles 3 of 11 healthcare AI tasks

How DATAP.AI Addresses Healthcare AI Governance

Patient Privacy (Australian Privacy Act)

DATAP.AI detects Australian healthcare identifiers (Medicare, IHI, MRN), de-identifies via Safe Harbour method, and routes high-risk tasks to HIPAA-compliant providers. 4-layer defence-in-depth ensures no single point of failure.

AI Transparency (TGA Feb 2026 Guidance)

Every AI decision is logged with model name, provider, data classification level, and full audit trail. The LLM routing table is exposed via API for governance review. DATAP.AI builds governance INTO the platform from day 1.
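A per-decision log entry of the kind described here might look like the following Python sketch. The JSON field names and the `audit_entry` function are assumptions for illustration, because the source does not specify the log format.

```python
import json
from datetime import datetime, timezone


def audit_entry(task, provider, model, data_classification, now=None):
    """Serialise one routing decision as a single JSON line."""
    timestamp = now or datetime.now(timezone.utc)
    return json.dumps(
        {
            "timestamp": timestamp.isoformat(),
            "task": task,
            "provider": provider,
            "model": model,
            "data_classification": data_classification,
        },
        sort_keys=True,
    )


print(audit_entry("phi_scanning", "Fireworks", "DeepSeek V3", "HIGH"))
```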

Bias Detection (AI Ethics)

DATAP.AI monitors statistical parity and equalised odds across demographic dimensions and 8 Asia-Pacific languages. Bias reports are generated automatically and available via the governance dashboard.
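The two fairness metrics named above can be computed as in this minimal Python sketch. The function names and the toy data are illustrative assumptions, and the language codes are placeholders for whichever groups are being compared. Statistical parity compares positive-prediction rates between two groups, while equalised odds compares their true-positive and false-positive rates.

```python
def _rate(values):
    """Fraction of 1s in a list of 0/1 values (0.0 for an empty list)."""
    return sum(values) / len(values) if values else 0.0


def statistical_parity_difference(y_pred, groups, a, b):
    """P(pred=1 | group=a) minus P(pred=1 | group=b)."""
    preds_a = [p for p, g in zip(y_pred, groups) if g == a]
    preds_b = [p for p, g in zip(y_pred, groups) if g == b]
    return _rate(preds_a) - _rate(preds_b)


def equalised_odds_gaps(y_true, y_pred, groups, a, b):
    """Return (TPR gap, FPR gap) between groups a and b."""
    def rates(group):
        on_positives = [p for t, p, g in zip(y_true, y_pred, groups) if g == group and t == 1]
        on_negatives = [p for t, p, g in zip(y_true, y_pred, groups) if g == group and t == 0]
        return _rate(on_positives), _rate(on_negatives)

    tpr_a, fpr_a = rates(a)
    tpr_b, fpr_b = rates(b)
    return tpr_a - tpr_b, fpr_a - fpr_b


# Toy data: four requests per language group.
y_true = [1, 1, 0, 0, 1, 1, 0, 0]
y_pred = [1, 0, 1, 0, 1, 1, 0, 0]
groups = ["en"] * 4 + ["vi"] * 4

print(statistical_parity_difference(y_pred, groups, "en", "vi"))  # 0.0
print(equalised_odds_gaps(y_true, y_pred, groups, "en", "vi"))    # (-0.5, 0.5)
```

In the toy data both groups receive positive predictions at the same rate, so statistical parity holds, yet the error rates differ sharply. That gap is why the two metrics are tracked separately.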

Cost Control (Operational Governance)

CostGuard enforces daily LLM spend limits per provider. Multi-provider routing optimises cost-per-task — $0.56/1M tokens for bulk work, frontier models only where clinical reasoning demands it. Critical for B2B pricing in APAC markets.
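A daily per-provider spend cap can be sketched in a few lines of Python. `DailySpendGuard` is a hypothetical stand-in and not the actual CostGuard code. The limit values are invented for the example, and rejecting providers with no configured limit is an assumed policy.

```python
from collections import defaultdict
from datetime import date


class DailySpendGuard:
    """Track spend per (provider, day) and reject calls that would exceed the cap."""

    def __init__(self, limits_usd):
        self.limits = dict(limits_usd)
        self.spent = defaultdict(float)

    def charge(self, provider, tokens, usd_per_million, day=None):
        """Record the cost and return True, or return False if over the limit."""
        day = day or date.today()
        cost = tokens / 1_000_000 * usd_per_million
        key = (provider, day)
        # Assumption: providers without a configured limit are not allowed to spend.
        if self.spent[key] + cost > self.limits.get(provider, 0.0):
            return False
        self.spent[key] += cost
        return True


guard = DailySpendGuard({"fireworks": 1.00})  # hypothetical limit of $1.00 per day
today = date(2026, 4, 12)
print(guard.charge("fireworks", 1_000_000, 0.56, day=today))  # True: $0.56 spent
print(guard.charge("fireworks", 1_000_000, 0.56, day=today))  # False: would reach $1.12
```

Keying the running total by day means the budget resets on its own, with no scheduled job.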

Governance Modules

DATAP.AI Live Routing Table (API)

DATAP.AI exposes the full LLM routing table as a live API endpoint for governance audit and compliance review:

GET https://healthapi.datap.ai/agent/llm-routing
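A governance reviewer could call this endpoint with nothing beyond the Python standard library, as in the sketch below. It assumes the endpoint returns JSON and needs no authentication, neither of which the source confirms, and the function names are hypothetical. The response structure is not documented here, so the parsed payload is returned as-is.

```python
import json
import urllib.request

ROUTING_URL = "https://healthapi.datap.ai/agent/llm-routing"


def build_routing_request(url=ROUTING_URL):
    """Create the GET request; asking for JSON is an assumption about the API."""
    return urllib.request.Request(
        url, headers={"Accept": "application/json"}, method="GET"
    )


def fetch_routing_table(url=ROUTING_URL, timeout=10.0):
    """Fetch and decode the routing table; raises on network or HTTP errors."""
    with urllib.request.urlopen(build_routing_request(url), timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


request = build_routing_request()
print(request.get_method(), request.full_url)
```

Calling `fetch_routing_table()` performs the actual network request. It is left out of the example run so the sketch works offline.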