Architecture Overview

This section provides a high-level overview of Deepdesk's architecture, including system components, data flows, and integration patterns.

Purpose

The Architecture Overview documentation is designed to help:

Technical Leaders: Understand the system architecture and design decisions
Developers: Get oriented with the overall system structure before diving into specific components
Operations Teams: Understand deployment architecture and infrastructure components
Integration Partners: Learn how Deepdesk integrates with external systems

System Architecture

Deepdesk is built as a cloud-native, multi-tenant platform deployed on Google Cloud Platform (GCP). The architecture follows microservices principles with clear separation between:

Global Services: Shared infrastructure and management services
Account-Specific Services: Isolated per-customer deployments
Integration Layer: Components that connect to external CX platforms

Key Architectural Principles

Multi-Tenancy

Each customer account runs in its own isolated Kubernetes namespace, ensuring:

Data isolation and security
Independent scaling and resource allocation
Customized configuration per account

Microservices Architecture

Services are organized by function:

Admin: Account configuration and management
Backend: Core API and business logic
Engine: AI/ML processing and suggestions
Integration Components: Platform-specific widgets and data ingestion

GitOps Deployment

Infrastructure and configuration are managed as code:

Terraform: Infrastructure provisioning
FluxCD: Continuous deployment from Git repositories
Helm: Service packaging and deployment

High-Level System Diagram

Core Services (Agent Assist)

The Deepdesk core services deliver autocomplete response recommendations and autocomplete suggestions to customer service agents. The training pipeline and model deployment are described in Next Best Message, Search & Autocomplete; the flow of data and services involved in daily operations is described below and in Data Ingestion.

Components:

Messaging platform (CX platform) — The third-party messaging application used by agents (e.g. Salesforce, LiveEngage, RingCentral Engage). For supported platforms, see Platform Integrations.
Admin — Account and platform configuration (feature flags, training parameters, platform API config), data ingestion (webhooks, converting platform-specific data to a generic format), and authentication (e.g. OAuth2).
Backend — Delivers live suggestions to the agent: receives messages from Admin, stores them, and requests new suggestions from Engine.
Engine — Model inference and autocomplete suggestions. One account can have one or more Engines, mapped to account profiles.

For a detailed flow diagram of these components and data flow, see the Core Services diagram (draw.io).

Component Overview

Global Services

Onboarding Service

Manages account provisioning and deployment across all regions and clusters.

Location: Cloud Run (global)
Purpose: Account lifecycle management
Access: Internal Deepdesk staff only

Account-Specific Services

Admin Service

Account-specific configuration and management interface.

Configuration: Service settings, credentials, integrations
Deployment: One instance per account namespace
Access: Account administrators and Deepdesk staff

Backend Service

Core API and business logic for the account.

Functions: Message handling, suggestion retrieval, user management
Integration: Connects to CX platforms and frontend widgets
Data: Stores conversations and account data
Webhooks: Process incoming events from CX platforms.

Engine Service

AI/ML processing and suggestion generation.

Processing: Real-time suggestion generation
Models: Text suggestions, URL suggestions, GPT models
Training: Continuous learning from conversation data

LLM Gateway Service

Centralized gateway for GenAI features.

Purpose: Proxy for all LLM API calls
Provider Integration: Connects to Azure OpenAI and other LLM providers
Functions: Request routing, resilience (retry, failover routing)
Usage: Used by Backend for LLM-powered features (CX Assistants, Voice agents)

Integration Layer

Frontend Widgets

UI components embedded in CX platforms.

Types: Native integrations, browser extensions, iframe widgets
Platforms: Genesys Cloud, Salesforce, LivePerson, RingCentral
Features: Agent Assist UI, autocomplete, suggestions display

Data Flow Patterns

Real-Time Suggestions Flow

Agent types in CX platform
Frontend widget captures input
Backend processes request
Engine generates suggestions
Suggestions displayed in widget

Webhook-Based Ingestion Flow

Conversation event occurs in CX platform
Platform sends webhook to Deepdesk
Webhook handler processes and validates
Data stored in Backend database
Engine processes for future suggestions

Training Pipeline Flow

Conversations accumulated in BigQuery
Training pipeline triggered (manual or scheduled)
ML models trained on conversation data
Models deployed to Engine service
New suggestions available for agents

Summarizer Assistant Flow

Conversation ends
Frontend widget triggers summary request
Backend receives conversation data and summary request
Backend sends summarization prompt to LLM Gateway
LLM Gateway calls Azure OpenAI with summarization prompt
OpenAI generates structured summary (key points, sentiment, resolution)
Summary returned to Backend
Summary displayed in widget

Deployment Architecture

Regional Clusters

Deepdesk deploys to regional Google Kubernetes Engine (GKE) clusters:

Europe: Primary deployment region
Other Regions: Available based on customer requirements

Account Isolation

Each account runs in an isolated namespace:

Namespace: Dedicated Kubernetes namespace per account
Resources: CPU, memory, and storage limits
Network: Network policies for traffic isolation
Secrets: Account-specific credentials and keys

Infrastructure as Code

All infrastructure is managed through:

Terraform: GKE clusters, Cloud Storage, Cloud Functions
Helm Charts: Kubernetes service deployments
FluxCD: GitOps-based continuous deployment
Git Repositories: Source of truth for configuration

Security Architecture

Authentication & Authorization

User Authentication: SSO via OAuth2
Service-to-Service: API keys and service accounts
Platform Integration: Platform-specific OAuth2

Data Security

Encryption at Rest: All stored data encrypted
Encryption in Transit: TLS for all connections
Anonymization: Optional PII anonymization
Data Isolation: Per-account database isolation

Network Security

Private Clusters: GKE clusters not publicly accessible
Firewall Rules: Restricted ingress/egress
Cloud Armor: DDoS protection and WAF

Scalability & Reliability

Horizontal Scaling

Services scale independently based on load:

Backend: Scales with API request volume
Engine: Scales with suggestion generation demand
Frontend: Distributed via CDN

High Availability

Multi-Zone Deployment: Services distributed across availability zones
Health Checks: Kubernetes liveness and readiness probes
Auto-Recovery: Automatic pod restart on failure

Monitoring & Observability

Metrics: Prometheus for service metrics
Logging: Centralized logging to Cloud Logging
Tracing: Distributed tracing for request flows
Alerting: Automated alerts for service health

Next Steps

To dive deeper into specific aspects of the architecture:

Onboarding Architecture: Account provisioning and deployment
Platform Integrations: Integration developer guides
Authentication: SSO and authentication implementation

For operational documentation:

Onboarding User Guide: Setting up new accounts
Platform Integrations User Guide: Integration setup and configuration

Purpose​

System Architecture​

Key Architectural Principles​

Multi-Tenancy​

Microservices Architecture​

GitOps Deployment​

High-Level System Diagram​

Core Services (Agent Assist)​

Component Overview​

Global Services​

Onboarding Service​

Account-Specific Services​

Admin Service​

Backend Service​

Engine Service​

LLM Gateway Service​

Integration Layer​

Frontend Widgets​

Data Flow Patterns​

Real-Time Suggestions Flow​

Webhook-Based Ingestion Flow​

Training Pipeline Flow​

Summarizer Assistant Flow​

Deployment Architecture​

Regional Clusters​

Account Isolation​

Infrastructure as Code​

Security Architecture​

Authentication & Authorization​

Data Security​

Network Security​

Scalability & Reliability​

Horizontal Scaling​

High Availability​

Monitoring & Observability​

Next Steps​