Health Guide Assistant: Amazon Nova Sonic with Amazon Bedrock Knowledge Bases

Health Guide AI Voice Agent using Amazon Nova Sonic with Amazon Bedrock Knowledge Bases.

June 20, 2025 medium

bedrock demo voice-agent speech-to-speech agents real-time-data javascript health

Health Guide Assistant: Amazon Nova Sonic with Amazon Bedrock Knowledge Bases

Health Guide AI Voice Agent using Amazon Nova Sonic with Amazon Bedrock Knowledge Bases.

Overview

This project demonstrates how to build an intelligent conversational health assistant by integrating Amazon Nova Sonic model with Amazon Bedrock Knowledge Base. The application enables natural speech-to-speech interactions while leveraging a health knowledge base to provide informational responses about health topics.

Technologies

Node.js v16+
Amazon Bedrock
Amazon Bedrock Knowledge Bases
Amazon Nova Sonic
Javascript
WebSockets
CSS

Difficulty

Medium

Prerequisites

Node.js (v16 or higher)
AWS Account with Bedrock access
Amazon Nova Sonic model enabled in Bedrock:
1. Go to AWS Bedrock Console
2. Navigate to “Model access” in the left sidebar
3. Click “Manage model access”
4. Find “Amazon Nova Sonic” and enable it
5. Wait for the status to show “Access granted”
IAM permissions configured (see Required IAM Permissions section below)
AWS CLI configured with appropriate credentials
Modern web browser with WebAudio API support

Solution Design

Architecture

Architecture Overview

This application implements a real-time speech-to-speech health AI Agent using a WebSocket-based architecture that enables bidirectional audio streaming between the browser and the Amazon Nova Sonic model.

Key Architectural Components

Frontend (Browser)
- WebAudio API for audio capture and playback
- Socket.IO client for real-time WebSocket communication
- Web-based UI for conversation monitoring and agent actions
Backend (Node.js Server)
- Express.js HTTP server with Socket.IO for WebSocket management
- TypeScript-based AI agent orchestration engine
- Direct integration with AWS Bedrock services
- Session management for concurrent users
AWS Services
- Amazon Nova Sonic for speech-to-speech AI capabilities
- Amazon Bedrock Knowledge Base for health information retrieval
- Vector database for semantic search

Security Requirements for Remote Deployment

Important: This application requires secure contexts (HTTPS) for microphone access when deployed beyond localhost.

Why SSL/TLS is Required

Modern browsers enforce strict security policies for accessing sensitive APIs like getUserMedia() (microphone access):

Localhost Exception: Browsers allow microphone access over HTTP only on localhost and 127.0.0.1
Remote Access Requirement: Any other hostname (including EC2 public IPs, custom domains, or local network IPs) requires HTTPS
Browser Security Model: This is a fundamental browser security feature to protect users from unauthorized audio/video capture

⚠️ Important Disclaimers

This application is for educational and informational purposes only. It is NOT a substitute for professional medical advice, diagnosis, or treatment.

This application is a DEMO and should not be used in production environments.

Always consult with qualified healthcare professionals for medical concerns
Never disregard professional medical advice or delay seeking it because of information from this application
This system has built-in safety measures to redirect emergency situations to appropriate resources
The AI assistant will not provide medical diagnoses or specific treatment recommendations
This demo is intended for testing and evaluation purposes only
Production use would require additional security, compliance, and reliability considerations

By using this application, you acknowledge that you understand these limitations.

Security Limitations

This is not a production application, therefore keep in mind the following:

No Input Validation: The application lacks proper input sanitization and validation
No Authentication: There is no user authentication or authorization system
No Data Encryption: Data is not encrypted in transit or at rest
No Rate Limiting: The application is vulnerable to abuse and DoS attacks

Application Interface

Health Guide Assistant UI

The application features a modern, intuitive interface with:

Real-time chat interface with speech-to-text capabilities
Agent Actions panel showing AI tool usage and analytics
Audio controls for seamless voice interaction
Live conversation monitoring with turn-by-turn analysis
Safety metrics tracking emergency and off-topic redirects

Key Features

AI Agentic Architecture: Intelligent tool selection and orchestration using Amazon Nova Sonic’s advanced reasoning capabilities
Health Knowledge Base Integration: Retrieves accurate information from health resources stored in Amazon Bedrock Knowledge Base
Real-time Speech-to-Speech: Bidirectional WebSocket-based audio streaming with Amazon Nova Sonic model
Advanced Tool System: 7 specialized tools for health information, appointments, and safety responses
Natural Conversational Experience: Seamless interaction through a responsive web interface
Contextual Health Information: AI-generated responses informed by knowledge base content
Safety Guardrails: Built-in redirects for emergency situations and medical advice boundaries
Appointment Management: Complete scheduling system with availability checking and booking
Multi-platform Support: Web interface with comprehensive agent action monitoring and analytics

AI Agentic Architecture

This application demonstrates advanced AI agent capabilities through Nova Sonic’s intelligent tool selection and orchestration:

Tool System Overview

The AI agent has access to 7 specialized tools that it selects autonomously based on user intent:

Health Information Tools

retrieve_health_info - Searches the health knowledge base for medical information
greeting - Provides personalized introductions and welcomes
safety_response - Handles inappropriate requests with proper boundaries

Appointment Management Tools

check_doctor_availability - Queries doctor schedules by specialty or ID
check_appointments - Retrieves existing appointments for patients or doctors
schedule_appointment - Books new appointments after collecting required information
cancel_appointment - Cancels existing appointments with proper confirmation

Intelligent Tool Orchestration

The Nova Sonic model demonstrates sophisticated reasoning by:

Context-aware tool selection: Automatically chooses appropriate tools based on user queries
Multi-step workflows: Chains tools together (e.g., check availability → collect info → schedule appointment)
Information validation: Ensures all required data is collected before executing actions
Safety prioritization: Always applies safety checks before processing requests

Agentic Behavior Examples

User: "I need to see a cardiologist next week"
Agent Process:
Uses check_doctor_availability with specialty="Cardiology"
Presents available options with calendar formatting
Collects patient information systematically
Uses schedule_appointment only after all data is gathered
Confirms booking with appointment details

The agent maintains conversation context across tool calls and provides natural, flowing interactions while ensuring all safety and business logic requirements are met.

Health Knowledge Base Workflow

User Speech → Amazon Nova Sonic → Safety Check → Tool Use Detection → Bedrock KB Query → 
                             ↓                                         ↓
                     Emergency Response                            Vector DB
                             ↓                                         ↓
User ← Audio Output ← Amazon Nova Sonic ← Safety Response ← Retrieved Health Context

Repository Structure

.
├── backend/                # Backend TypeScript application
│   ├── src/                # TypeScript source files
│   │   ├── client.ts       # AWS Bedrock client implementation
│   │   ├── bedrock-kb-client.ts # AWS Bedrock Knowledge Base client
│   │   ├── server.ts       # Express server implementation
│   │   ├── consts.ts       # Constants including tool schemas and configurations
│   │   ├── types.ts        # TypeScript type definitions
│   │   ├── appointment-service.ts # Backend appointment management
│   │   └── appointment-tools.ts # Backend appointment tools
│   ├── dist/               # Compiled JavaScript (auto-generated)
│   └── tsconfig.json       # TypeScript configuration
├── frontend/               # Frontend JavaScript application
│   ├── src/                # Frontend source code
│   │   ├── main.js         # Main application entry point
│   │   ├── audio-handler.js # Audio processing and streaming
│   │   ├── chat-ui.js      # Chat interface management
│   │   ├── action-panel.js # Agent actions and analytics
│   │   ├── socket-events.js # WebSocket event handling
│   │   ├── ui-manager.js   # UI interaction management
│   │   ├── appointment-service.js # Frontend appointment management
│   │   ├── AppointmentDatabase.js # Client-side appointment data
│   │   └── lib/            # Utility libraries
│   ├── css/                # Stylesheets
│   └── index.html          # Main application entry point
├── kb/                     # Knowledge Base source files
│   └── health-documents/   # Sample health information documents for KB
└── package.json            # Project configuration and scripts

Full-Stack Architecture

This application uses a full-stack TypeScript/JavaScript architecture:

Backend (TypeScript) - AI Agent Engine

Source: src/*.ts files
Compiled: dist/*.js files (via TypeScript compiler)
Purpose: AI agent orchestration, AWS integration, tool management, business logic
Key Components:
- client.ts - Nova Sonic bidirectional streaming and tool processing
- consts.ts - Tool schemas and AI agent configuration
- appointment-tools.ts - Appointment management business logic
- bedrock-kb-client.ts - Knowledge base integration
- server.ts - WebSocket server and session management
Commands:
- npm run dev - Development server with hot reload
- npm run build - Compile TypeScript to JavaScript
- npm start - Production server

Frontend (JavaScript)

Source: public/src/*.js files
Purpose: Browser-side UI, audio handling, real-time communication
Features: Speech recognition, audio playback, agent action monitoring

Setting Up the Health Knowledge Base

Required IAM Permissions

The IAM user or role running this application needs the following permissions:

Minimum Required Permissions

Create an IAM policy with these permissions:

{
    "Version": "2012-10-17",
    "Statement": [
        {
  "Sid": "BedrockModelAccess",
  "Effect": "Allow",
  "Action": [
      "bedrock:InvokeModel",
      "bedrock:InvokeModelWithResponseStream"
  ],
  "Resource": [
      "arn:aws:bedrock:*:*:foundation-model/amazon.nova-sonic-v1:0"
  ]
        },
        {
  "Sid": "BedrockKnowledgeBaseAccess",
  "Effect": "Allow",
  "Action": [
      "bedrock:Retrieve",
      "bedrock:RetrieveAndGenerate"
  ],
  "Resource": [
      "arn:aws:bedrock:*:*:knowledge-base/*"
  ]
        },
        {
  "Sid": "BedrockAgentRuntimeAccess",
  "Effect": "Allow",
  "Action": [
      "bedrock-agent-runtime:Retrieve",
      "bedrock-agent-runtime:RetrieveAndGenerate"
  ],
  "Resource": "*"
        },
        {
  "Sid": "S3KnowledgeBaseAccess",
  "Effect": "Allow",
  "Action": [
      "s3:GetObject",
      "s3:ListBucket"
  ],
  "Resource": [
      "arn:aws:s3:::your-kb-bucket-name",
      "arn:aws:s3:::your-kb-bucket-name/*"
  ]
        },
        {
  "Sid": "OpenSearchServerlessAccess",
  "Effect": "Allow",
  "Action": [
      "aoss:APIAccessAll"
  ],
  "Resource": [
      "arn:aws:aoss:*:*:collection/*"
  ]
        }
    ]
}

Creating Your Health Knowledge Base

Before running the application, you must create a Knowledge Base in Amazon Bedrock:

Access the AWS Bedrock Console:
- Navigate to the AWS Management Console
- Search for “Amazon Bedrock” and open the service
Create a New Knowledge Base:
- In the left navigation pane, select “Knowledge bases”
- Click “Create knowledge base”
- Follow the wizard to create a new knowledge base with vector store
- Choose a name like “HealthGuideKB”
Configure Data Source:
- Select “Upload files” as your data source using S3
- Upload health information documents to your knowledge base
- Configure chunking settings with semantic chunking
- Select all files available on kb/files directory, which include a few markdown and metadata files (JSON)
Complete Setup:
- Review your settings and create the knowledge base
- Once created, note your Knowledge Base ID for the next step

Installation and Setup

Clone the repository:

git clone https://github.com/aws-samples/sample-ai-possibilities.git
cd sample-ai-possibilities/demos/health-voice-ai-agent-websocket-nodejs

Install dependencies:
```
npm install
```
Update Application Configuration: Create the .env file and replace YOUR_KB_ID_HERE with your actual Amazon Bedrock Knowledge Bases ID.

# Create .env file from example
cp .env.example .env
nano .env

Configure AWS credentials:

# Configure AWS CLI with your credentials - we recommend using IAM role whenever possible
aws configure

Build the TypeScript backend:
```
npm run build
```

Running the Application

npm start

Access the Application

Open your browser to: http://localhost:4000
For Amazon EC2 deployment, you may want to create an SSH tunnel before opening your browser so you dont need to expose your app to the internet or add a certificate:
```
ssh -i /your/key.pem -L 4000:localhost:4000 ec2-user@your-ec2-ip
```

Note: If you are using Amazon EC2, make sure SSH is allowed to your workstation.

Grant microphone permissions when prompted
Start asking health-related questions to see the Knowledge Base in action:
- “What are the symptoms of the common cold?”
Test other tools:
- “I would like to schedule an appointment”
Check the “Agent Actions” panel for more details about the AI Agent tools and logs

Safety Features

The application includes several safety mechanisms:

Emergency Detection: Automatically detects emergency situations and provides 911 guidance
Medical Advice Boundaries: Redirects requests for medical diagnoses or treatment
Off-Topic Handling: Politely redirects non-health questions back to health topics
Appropriate Disclaimers: All responses include appropriate health disclaimers

Agent Actions Monitoring

The application includes a comprehensive monitoring panel that tracks:

Conversation Turns: Number of user interactions
Knowledge Base Searches: Queries to the health knowledge base
Emergency Redirects: Emergency situations detected
Off-Topic Attempts: Non-health questions asked
Medical Advice Redirects: Inappropriate medical advice requests

Testing Health Knowledge Base Retrieval

To verify the Knowledge Base integration:

Ask a health-related question
The system should:
- Recognize the question requires knowledge base information
- Query the knowledge base for relevant content
- Provide an accurate response with appropriate disclaimers
Check server logs:
```
npm start | grep "Knowledge Base"
```

Project Scripts

{
  "scripts": {
    "build": "tsc",                    // Compile TypeScript
    "start": "node dist/server.js",    // Start production server
    "dev": "ts-node src/server.ts",    // Start development server
    "clean": "rm -rf dist/",           // Clean compiled files
    "rebuild": "npm run clean && npm run build" // Full rebuild
  }
}

Deployment Considerations

This application is primarily designed for local development on your laptop, which minimizes costs and complexity. However, it can also be deployed on Amazon EC2 or other computing platforms with proper SSL configuration.

Recommended Deployment

Primary: Local laptop/desktop (localhost:4000)
- No SSL certificates needed
- No hosting costs
- Immediate development and testing
- Full microphone access
Secondary: Amazon EC2 or cloud instances
- Requires SSL setup
- Additional hosting costs
- Suitable for demos and sharing

💰 Cost Considerations

AWS Service Costs

When running this application, you will incur costs for:

Amazon Bedrock
- Nova Sonic model: Charged per input/output tokens
- AMazon Bedrock Knowledge Bases: Storage and retrieval costs
- Vector database (e.g. Amazon OpenSearch Serverless): Minimum charges apply even when idle
Amazon S3 (for Amazon Bedrock Knowledge Base documents)
- Storage costs for uploaded health documents
- Generally minimal for demo purposes
Amazon EC2 Instance (if deployed remotely)
- Instance hourly rates based on type
- You may want to use eligible instance types for free tier
- Costs vary by instance type and region

Cost Optimization Tips

Development: Use localhost to avoid Amazon EC2 costs
Testing: Limit conversation length to reduce token usage
Knowledge Base: Use minimum documents needed for testing
Shut down resources when not in use

Cleanup Instruction

To avoid ongoing charges, follow these steps to delete all resources:

1. Stop the Application

# Stop the Node.js server
# Press Ctrl+C in the terminal running the server

# If running on Amazon EC2, also stop the instance
aws ec2 stop-instances --instance-ids <your-instance-id>

2. Delete Bedrock Knowledge Base

# List Amazon Bedrock knowledge bases
aws bedrock-agent list-knowledge-bases

# Delete the Amazon Bedrock knowledge base (replace with your KB ID)
aws bedrock-agent delete-knowledge-base --knowledge-base-id YOUR_KB_ID

# Note: This may take several minutes

3. Clean Up S3 Bucket

# List and delete objects in your KB bucket
aws s3 rm s3://your-kb-bucket-name --recursive

# Delete the bucket
aws s3 rb s3://your-kb-bucket-name

4. Delete OpenSearch Serverless Collection

If you created an OpenSearch Serverless collection for the Knowledge Base:

Go to AWS Console → OpenSearch Service
Select “Serverless collections”
Find your collection and delete it
Also delete any associated security policies

Note: If you are using a different vector store, please check our document pages for more details.

5. Terminate Amazon EC2 Instance (if used)

# Terminate EC2 instance permanently
aws ec2 terminate-instances --instance-ids <your-instance-id>

# Delete associated security groups
aws ec2 delete-security-group --group-id <security-group-id>

# Release Elastic IP (if allocated)
aws ec2 release-address --allocation-id <allocation-id>

6. Clean Up Local Files

Delete the repository files from your workstation

7. Verify Resource Deletion

Check AWS Cost Explorer after 24 hours to ensure no resources are still running:

# Check for any remaining Bedrock resources
aws bedrock-agent list-knowledge-bases
aws bedrock-agent list-data-sources --knowledge-base-id YOUR_KB_ID

# Check S3 buckets
aws s3 ls

# Check EC2 instances
aws ec2 describe-instances --query 'Reservations[].Instances[?State.Name!=`terminated`]'

Important Cost Notes

⚠️ OpenSearch Serverless Minimum Charges: Even when idle, OpenSearch Serverless collections have minimum charges. Delete them when not in use.

⚠️ Amazon Bedrock Model Costs: Conversations are charged per token. Long conversations can accumulate costs quickly.

⚠️ Free Tier Limits: Be aware of AWS Free Tier limits, especially for Amazon EC2 and Amazon S3.

Troubleshooting

Knowledge Base Issues

Knowledge Base Not Responding:
- Verify your Knowledge Base ID in .env
- Check AWS credentials and permissions
- Ensure knowledge base status is “Available”
Incorrect Health Information:
- Verify health documents were properly ingested
- Check chunking settings in AWS console
- Ensure source documents are from reputable health sources

Audio Issues

Microphone Not Working:
- Check browser permissions
- Ensure HTTPS (need to install/add certificate) or localhost
- Try different browser (recommended to use Chrome)
No Audio Output:
- Check browser audio settings
- Verify WebSocket connection in browser console
Error Error: {"source":"bidirectionalStream","error":{"name":"CredentialsProviderError","tryNextLink":false}}
- Check your AWS credentials

General Connection Issues

Check server logs for errors

Verify WebSocket connection:

socket.on('connect_error', (error) => {
  console.error('Connection failed:', error);
});

Customizing the AI Agent

Adding New Tools

To extend the agent’s capabilities with new tools:

Define Tool Schema (in src/consts.ts):

export const NewToolSchema = JSON.stringify({
  "type": "object",
  "properties": {
 "parameter": {
   "type": "string",
   "description": "Parameter description"
 }
  },
  "required": ["parameter"]
});

Add Tool to Configuration (in src/consts.ts, within setupPromptStartEvent):

{
  toolSpec: {
 name: "new_tool_name",
 description: "Tool description for the AI agent",
 inputSchema: {
   json: NewToolSchema
 }
  }
}

Implement Tool Logic (in src/client.ts, within processToolUse method):

case "new_tool_name":
 console.log(`Processing new tool: ${JSON.stringify(toolUseContent)}`);
 return this.processNewTool(toolUseContent);

Create Tool Function:

private processNewTool(toolUseContent: any): Object {
 // Parse tool content
 const content = JSON.parse(toolUseContent.content || "{}");
    
 // Implement your tool logic here
 return {
     success: true,
     result: "Tool execution result"
 };
}

Modifying Agent Behavior

System Prompt (in src/consts.ts):

Modify DefaultSystemPrompt to change the agent’s personality, capabilities, and conversation flow
Add new guidelines for tool usage and conversation structure

Tool Selection Logic:

The AI agent automatically selects appropriate tools based on the tool descriptions and system prompt
Modify tool descriptions to influence when each tool is used

Safety Boundaries:

Update the safety_response tool schema to handle new types of inappropriate requests
Modify the safety response generation logic in generateSafetyResponse method

Knowledge Base Configuration

Knowledge Base Query Parameters:

Modify queryHealthKnowledgeBase method to adjust search parameters
Change numberOfResults for more or fewer search results

Audio and Voice Configuration

Voice Settings (in src/consts.ts):

export const DefaultAudioOutputConfiguration = {
    sampleRateHertz: 24000,
    voiceId: "tiffany", // Change voice here
};

Available voice options: tiffany, amy and matthew.

Data Flow Architecture

User Health Question → Browser → Server → AI Agent → Tool Selection & Orchestration
                                   ↓              ↓
                          Safety Check    Knowledge Base Query
                                   ↓              ↓
                          Emergency Check   Amazon Nova Sonic
                                   ↓              ↓
                          Tool Execution    Response Generation
                                   ↓              ↓
         Audio Response ← Browser ← Server ← Generated Response + Disclaimers

Infrastructure Requirements

Backend: Node.js server with Express.js and Socket.IO
AI Agent Engine: Amazon Nova Sonic with bidirectional streaming
Frontend: Modern browser with WebAudio API support
AWS Services: Amazon Bedrock and Amazon Bedrock Knowledge Bases
Real-time Communication: WebSocket-based bidirectional streaming
Tool Management: JSON schema-based tool definitions with automatic orchestration

Security

See CONTRIBUTING for more information.

License

This library is licensed under the MIT-0 License. See the LICENSE file.

This project is for educational purposes and not designed for production use.

Full Source Code

View on GitHub

Health Guide Assistant: Amazon Nova Sonic with Amazon Bedrock Knowledge Bases

Overview

Tags

Technologies

Difficulty

Prerequisites

Solution Design

Architecture Overview

Key Architectural Components

Security Requirements for Remote Deployment

Why SSL/TLS is Required

⚠️ Important Disclaimers

Security Limitations

Application Interface

Key Features

AI Agentic Architecture

Tool System Overview

Health Information Tools

Appointment Management Tools

Intelligent Tool Orchestration

Agentic Behavior Examples

Health Knowledge Base Workflow

Repository Structure

Full-Stack Architecture

Backend (TypeScript) - AI Agent Engine

Frontend (JavaScript)

Setting Up the Health Knowledge Base

Required IAM Permissions

Minimum Required Permissions

Creating Your Health Knowledge Base

Installation and Setup

Running the Application

Access the Application

Safety Features

Agent Actions Monitoring

Testing Health Knowledge Base Retrieval

Project Scripts

Deployment Considerations

Recommended Deployment

💰 Cost Considerations

AWS Service Costs

Cost Optimization Tips

Cleanup Instruction

1. Stop the Application

2. Delete Bedrock Knowledge Base

3. Clean Up S3 Bucket

4. Delete OpenSearch Serverless Collection

5. Terminate Amazon EC2 Instance (if used)

6. Clean Up Local Files

7. Verify Resource Deletion

Important Cost Notes

Troubleshooting

Knowledge Base Issues

Audio Issues

General Connection Issues

Customizing the AI Agent

Adding New Tools

Modifying Agent Behavior

Knowledge Base Configuration

Audio and Voice Configuration

Data Flow Architecture

Infrastructure Requirements

Security

License

Full Source Code