AI Agent with Strands SDK, Kong AI/MCP Gateway & Amazon Bedrock
Claudio Acquaviva
Principal Architect, Kong
Jason Matis
Staff Solutions Engineer, Kong
In one of our posts, Kong AI/MCP Gateway and Kong MCP Server technical breakdown, we described the new capabilities added to Kong AI Gateway to support MCP (Model Context Protocol). That post focused exclusively on consuming MCP servers and MCP tools through Kong MCP Gateway. Now, it's time to see how an AI agent can leverage the AI and MCP infrastructure exposed and protected by Kong AI/MCP Gateway.
Strands Agents, the Python-based framework for building agents, was introduced by AWS in May 2025. Strands makes integration with tools, GenAI models, and services straightforward by providing a consistent way for agents to interact with external systems. It simplifies how developers orchestrate tools, gather context, and structure reasoning, turning complex multi-service workflows into maintainable, event-driven agent logic.
At the infrastructure layer, the Kong AI/MCP Gateway extends these capabilities by securely exposing GenAI model endpoints, AI tools and enterprise APIs through standardized GenAI and MCP interfaces. When combined with Amazon Bedrock, which offers scalable access to leading foundation models, the result is an architecture where agents can reliably call Bedrock models and MCP based tools through a unified Gateway.
Kong AI/MCP Gateway and Kong MCP Server technical breakdown also discussed the fundamental MCP abstractions and how Kong AI/MCP Gateway addresses them. However, that post doesn't show how to leverage the gateway to implement an AI agent.
In this post, we explore how Strands SDK, Kong AI/MCP Gateway and Amazon Bedrock can work together to build robust, production-ready AI agents.
Kong MCP Gateway introduction
In October 2025, Kong announced Kong Gateway 3.12 with several new capabilities. In a nutshell, this version opens a new chapter in Kong's AI story: the introduction of the Kong MCP Gateway. The following diagram illustrates the new component:
Just as Kong AI Gateway protects and controls the consumption of GenAI models (the LLMs we consume), Kong MCP Gateway protects and controls the consumption of MCP servers.
The new MCP Gateway allows you, just like you do with LLMs and other GenAI models with Kong AI Gateway, to host your MCP servers behind the Gateway and therefore leverage the capabilities it provides, including:
Specific MCP-based security mechanisms like OAuth 2.1.
Combining long-standing Kong API Gateway plugins to implement policies like transformations, security, etc.
Taking advantage of the observability plugins provided by Kong API Gateway and Konnect.
Kong AI Gateway and Amazon Bedrock
Let's discuss the benefits Kong AI Gateway brings to Amazon Bedrock for an LLM-based application. As mentioned before, Kong AI Gateway leverages the existing Kong API Gateway extensibility model to provide specific AI-based plugins. Here are some values provided by Kong AI Gateway and its plugins:
AI Proxy and AI Proxy Advanced plugins: The Multi-LLM capability allows the AI Gateway to abstract Amazon Bedrock (and other LLM providers as well), load balancing across models based on several policies including latency, model usage, semantics, etc. These plugins extract LLM observability metrics (like the number of requests, latencies, and errors for each LLM provider) as well as the number of incoming prompt tokens and outgoing response tokens. All this is in addition to the hundreds of metrics Kong Gateway already provides on the underlying API requests and responses. Finally, Kong AI Gateway leverages the observability capabilities provided by Konnect to track Amazon Bedrock usage out of the box, as well as generate reports based on monitoring data.
Prompt Engineering:
AI Prompt Template plugin, responsible for pre-configuring AI prompts to users
AI Prompt Decorator plugin, which injects messages at the start or end of a caller's chat history
AI Prompt Compressor
AI Prompt Guard plugin lets you configure a series of PCRE-compatible regular expressions to allow or block specific prompts, words, or phrases, giving you more control over the LLM services managed by Amazon Bedrock.
AI Semantic Prompt Guard plugin to self-configure semantic (or natural-language based pattern-matching) prompt protection
AI RAG Injector
AI Semantic Response Guard
AI Semantic Cache plugin caches responses based on a similarity threshold, to improve performance (and therefore end-user experience) and reduce cost
AI Rate Limiting Advanced: You can tailor per-user or per-model policies based on the tokens returned by the LLM provider under Amazon Bedrock management or craft a custom function to count the tokens for requests.
AI Request Transformer and AI Response Transformer plugins seamlessly integrate with the LLM on Amazon Bedrock, enabling introspection and transformation of the request's body before proxying it to the upstream service and prior to forwarding the response to the client.
AI AWS Guardrails
AI LLM as Judge
AI PII Sanitizer
In addition, Kong AI/MCP Gateway use cases can combine policies implemented by hundreds of Kong Gateway plugins, such as:
Authentication and authorization: OIDC, mTLS, API Key, LDAP, SAML, Open Policy Agent (OPA)
Traffic control: Request Validator and Size Limiting, WebSocket support, Route by Header, etc.
Observability: OpenTelemetry (OTel), Prometheus, TCP-Log, etc.
From the architecture perspective, in brief, the Konnect Control Plane and Data Plane node topology remains the same.
By leveraging the same underlying core of Kong API Gateway, we're reducing complexity in deploying the AI/MCP Gateway capabilities as well. And of course, it works on Konnect, Kubernetes, self-hosted, or across multiple clouds.
Kong AI/MCP Gateway and Amazon Bedrock integration and APIs
Let's take a look at the specific integration point between Kong AI/MCP Gateway and Amazon Bedrock. To get a better understanding, here's the architecture cut isolating both:
The consumer can be any RESTful-based component; in our case, it will be a Strands agent.
As you can see, there are two important topics here.
OpenAI API specification
Amazon Bedrock Converse API and EKS Pod Identity
Let's discuss each one of them.
OpenAI API support
Kong AI Gateway supports the OpenAI API specification. That means the consumer can send standard OpenAI requests to the Kong AI Gateway. As a basic example, consider this OpenAI request:
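A minimal sketch of such a request, built with Python's standard library (the model name and key placeholder are illustrative, not from the original post):

```python
import json
import urllib.request

# Illustrative OpenAI-style chat completion request (model name is an example).
payload = {
    "model": "gpt-4o-mini",
    "messages": [{"role": "user", "content": "Tell me about John Coltrane"}],
}

req = urllib.request.Request(
    "https://api.openai.com/v1/chat/completions",
    data=json.dumps(payload).encode("utf-8"),
    headers={
        "Content-Type": "application/json",
        "Authorization": "Bearer <YOUR_OPENAI_API_KEY>",
    },
    method="POST",
)
print(req.get_method(), req.full_url)
```

The point is the shape: a POST to a /chat/completions path with a model and a messages array, authenticated with an API key header.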
When we add Kong AI Gateway, sitting in front of Amazon Bedrock, we're not just exposing it but also allowing consumers to use the same mechanism, in this case the OpenAI API, to consume it. That leads to a very flexible and powerful capability when it comes to development processes. In other words, Kong AI Gateway normalizes the consumption of any LLM infrastructure, including Amazon Bedrock, Mistral, OpenAI, Cohere, etc.
As an exercise, the new request should be something like this. The request has some minor differences:
It sends a request to the Kong API Gateway Data Plane Node.
It replaces the OpenAI endpoint with a Kong API Gateway route.
The API Key is actually managed by the Kong API Gateway now.
We're using an Amazon Bedrock Model, us.amazon.nova-micro-v1:0.
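Putting those differences together, the Kong-routed request might look like this sketch (the host placeholders stand for your Data Plane load balancer, and the "apikey" header name is an assumption based on Kong's Key Auth plugin defaults):

```python
import json
import urllib.request

# Same OpenAI-style payload, now pointed at the Kong route and a Bedrock model.
payload = {
    "model": "us.amazon.nova-micro-v1:0",
    "messages": [{"role": "user", "content": "Tell me about John Coltrane"}],
}

req = urllib.request.Request(
    "http://<DATA_PLANE_LB>:<DATA_PLANE_PORT>/bedrock-route/chat/completions",
    data=json.dumps(payload).encode("utf-8"),
    headers={
        "Content-Type": "application/json",
        # The key is now validated by Kong, not by the upstream LLM provider.
        "apikey": "<YOUR_KONG_API_KEY>",
    },
    method="POST",
)
print(req.full_url)
```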
The Konnect Data Plane Node, where the Kong AI Gateway runs, has to send requests to Amazon Bedrock on behalf of the Gateway consumer. In order to do it, we need to grant permissions to the Data Plane deployment to access the Amazon Bedrock API, more precisely the Converse API, used by the AI Gateway to interact with it.
For example, here's a request to Amazon Bedrock using the AWS CLI. Run "aws configure" first to set your access and secret keys as well as the AWS region you want to use, then run the command.
aws bedrock-runtime converse \
--model-id us.amazon.nova-micro-v1:0 \
--messages '[{"role":"user","content":[{"text":"Tell me about John Coltrane"}]}]'
It's reasonable to consume the Bedrock service with local CLI commands. However, for Amazon EKS deployments, the recommended approach is EKS Pod Identity instead of simple long-term credentials like AWS access and secret keys. In short, EKS Pod Identity allows the Data Plane Pod's containers to use the AWS SDK and send API requests to AWS services using AWS Identity and Access Management (IAM) permissions.
Amazon EKS Pod Identity associations provide the ability to manage credentials for your applications, similar to the way that Amazon EC2 instance profiles provide credentials to Amazon EC2 instances.
As another best practice, we recommend storing the Private Key and Digital Certificate pair used for Konnect Control Plane and Data Plane connectivity in AWS Secrets Manager. The Data Plane deployment then refers to those secrets during installation.
Amazon Elastic Kubernetes Service (EKS) installation and preparation
Now, we're ready to get started with our Kong AI Gateway deployment. As the installation architecture defines, it'll be running on an EKS Cluster.
Amazon EKS Cluster creation
In order to create the EKS Cluster, you can use eksctl, the official CLI for Amazon EKS, like this:
Note that the command creates an EKS Cluster, version 1.34, with a single node based on the g6.xlarge instance type, powered by NVIDIA GPUs. That is particularly interesting if you're planning to deploy and run LLMs locally in the EKS Cluster.
Cluster preparation
After the installation, we should prepare the cluster to receive the other components:
EKS Pod Identity Agent to be able to define Pod Identity Association and grant permissions to the Kong AI Gateway Data Plane Pod to access both Amazon Bedrock and AWS Secrets Manager.
Refer to the official documentation to learn more about the components and their installation processes.
Kubernetes Kong Operator and Konnect Control Plane/Data Plane installation
The Kong AI/MCP Gateway deployment process can be divided into two steps:
Pod Identity configuration
Kong Control Plane and Data Plane deployment using the Kubernetes Kong Operator
Pod Identity configuration
In this first step, we configure EKS Pod Identity describing which AWS Services the Data Plane Pods should be allowed to access. In our case, we need to consume Amazon Bedrock and AWS Secrets Manager.
IAM Policy
Pod Identity relies on IAM policies to check which AWS services can be consumed. Our policy should allow access to Amazon Bedrock actions so the Data Plane can send requests to the Bedrock APIs, more precisely the Converse and ConverseStream APIs. The Converse API requires permission for the InvokeModel action, while ConverseStream needs access to InvokeModelWithResponseStream.
Also, we're going to use AWS Secrets Manager to store the Private Key and Digital Certificate pair that the Konnect Control Plane and Data Plane use to communicate.
Considering all this, let's create the IAM policy with the following request:
aws iam create-policy \
--policy-name bedrock-policy \
--policy-document '{"Version":"2012-10-17","Statement":[{"Effect":"Allow","Action":["bedrock:InvokeModel","bedrock:InvokeModelWithResponseStream","secretsmanager:ListSecrets","secretsmanager:GetSecretValue"],"Resource":"*"}]}'
Pod Identity Association
Pod Identity uses a Kubernetes Service Account to manage permissions. So create the Kubernetes namespace for the Kong Data Plane deployment and a simple service account inside it.
kubectl create namespace kong
kubectl create sa kaigateway-podid-sa -n kong
Now we're ready to create the Pod Identity Association. We use eksctl again to do it; the command takes care of:
IAM Role creation based on the IAM Policy we previously defined
Associating the IAM Role to the existing Kubernetes Service Account
You can check the Pod Identity Association with:
eksctl get podidentityassociation \
--cluster kong-aws \
--region us-east-2 \
--namespace kong
Check the IAM Role and Policies attached with:
aws iam get-role --role-name kaigateway-aws-podid-role
aws iam list-attached-role-policies --role-name kaigateway-aws-podid-role
Kong Operator and Control Plane/Data Plane deployment
The Data Plane deployment comprises the following steps:
Konnect subscription
Kong Operator installation
Konnect Control Plane creation
Konnect Data Plane deployment
Konnect subscription
This fundamental step is required to get access to Konnect. Click on the Registration link and present your credentials. Or, if you already have a Konnect subscription, log in to it.
Any Konnect subscription has a "default" Control Plane defined. You can proceed using it or optionally create a new one. The following instructions are based on a new Control Plane.
Kong Operator installation
The Konnect Control Plane and Data Plane creation and deployments are totally managed by the Kong Operator (KO) which is fully compliant with the Kubernetes Operator standards. First, we need to install it. Check the documentation to learn more.
In order to start using the Kong Operator, you need to issue a Konnect Personal Access Token (PAT) or a System Access Token (SAT). To generate your PAT, go to the Konnect UI, click on your initials in the upper right corner of the Konnect home page, then select "Personal Access Tokens." Click on "+ Generate Token," name your PAT, set its expiration time, and be sure to copy and save it as an environment variable, also named PAT. Konnect won't display your PAT again.
export PAT=<YOUR_PAT>
Now, you can create your Control Plane with the first Kong Operator declaration. The first CRD tells the Operator which Konnect region you’re using, and what token (PAT or SAT) to use to authenticate.
Finally, in the last step we will deploy the Data Plane. The following KonnectExtension CRD allows you to define your Konnect Control Plane details. The DataPlane CRD actually creates the Konnect Data Plane, attaching it to the Control Plane.
Note the DataPlane declaration:
Adds the service annotation to request a public NLB for the Data Plane.
Uses the Kubernetes Service Account that has been used to create the Pod Identity Association, so the Data Plane can have access to both Amazon Bedrock and Secrets Manager.
Use the Load Balancer created during the deployment:
export DATA_PLANE_LB=$(kubectl get service -n kong proxy-kong-aws --output=jsonpath='{.status.loadBalancer.ingress[].hostname}')
export DATA_PLANE_PORT=$(kubectl get service -n kong proxy-kong-aws --output=jsonpath='{.spec.ports[].port}')
You should get a response like this:
% curl -i $DATA_PLANE_LB:$DATA_PLANE_PORT
HTTP/1.1 404 Not Found
Date: Tue, 09 Dec 2025 12:55:42 GMT
Content-Type: application/json; charset=utf-8
Connection: keep-alive
Content-Length: 103
X-Kong-Response-Latency: 0
Server: kong/3.12.0.1-enterprise-edition
X-Kong-Request-Id: 95b09b2a40dc745d59ea6d684f9c4a13

{"message":"no Route matched with those values","request_id":"95b09b2a40dc745d59ea6d684f9c4a13"}
Now we can define the Kong Objects necessary to expose and control Bedrock, including Kong Gateway Service, Routes, and Plugins.
decK
With decK (declarations for Kong) you can manage Kong Konnect configuration and create Kong objects declaratively. decK state files encapsulate the complete configuration of Kong in a declarative format, including services, routes, plugins, consumers, and other entities that define how requests are processed and routed through Kong. Check the decK documentation to learn how to install it.
You can ping Konnect using your PAT with:
% deck gateway ping --konnect-control-plane-name kong-aws --konnect-token $PAT
Successfully Konnected to the Kong organization!
Strands Agent SDK - AI agent and MCP fundamentals
With all the components we need for our AI agent in place, it's time for the most exciting part of this post: the Strands agent itself. Let's start with a very basic code:
from strands import Agent
from strands.models.bedrock import BedrockModel
import boto3
import os
aws_access_key_id = os.getenv("AWS_ACCESS_KEY_ID")
aws_secret_access_key = os.getenv("AWS_SECRET_ACCESS_KEY")

session = boto3.Session(
    aws_access_key_id=aws_access_key_id,
    aws_secret_access_key=aws_secret_access_key,
)
bedrock_model = BedrockModel(
model_id="us.anthropic.claude-sonnet-4-20250514-v1:0", boto_session=session
)
agent = Agent(model=bedrock_model)
agent("Who is Aldous Huxley?")
The code is straightforward. The diagram shows the two components:
Strands has an extensive list of Model Providers where Amazon Bedrock is one of them. The provider requires AWS credentials which, for this code, are set in the environment variables.
The code instantiates an agent which sends a simple prompt to the model. Nothing is really special until you send prompts like:
agent("What is the weather like in my location?")
If you do, you'll get a response like:
"I don't have access to your current location or real-time weather data, so I can't tell you what the weather is like where you are.
To get current weather information, you could:
- Check a weather app on your phone
- Visit weather websites like Weather.com, AccuWeather, or your local meteorological service
- Ask a voice assistant with location access
- Look outside your window for immediate conditions
If you'd like to tell me your city or region, I could provide general information about the typical climate there, but I wouldn't be able to give you current conditions."
MCP principles
That's one of the main reasons why we should add tools to our agent. In fact, LLMs are not able to process prompts like this without some context. By context, we mean artifacts like transcripts, documents, presentations, or functions, so the LLM can respond accordingly. That's the purpose of MCP: to provide a standardized mechanism for LLMs to access such context.
As defined in the documentation, there are three core primitives that an MCP server can expose:
Tools: Executable functions that AI applications can invoke to perform actions (e.g., file operations, API calls, database queries)
Resources: Data sources that provide contextual information to AI applications (e.g., file contents, database records, API responses)
Prompts: Reusable templates that help structure interactions with language models (e.g., system prompts, few-shot examples)
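To make the tool primitive concrete: MCP clients and servers exchange JSON-RPC 2.0 messages. Listing the available tools and calling one look roughly like the sketch below (message shapes follow the MCP specification; the tool name and arguments are illustrative):

```python
import json

# Sketch of MCP JSON-RPC 2.0 messages; the tool name and
# arguments below are illustrative, not from a real server.
list_tools = {"jsonrpc": "2.0", "id": 1, "method": "tools/list"}

call_tool = {
    "jsonrpc": "2.0",
    "id": 2,
    "method": "tools/call",
    "params": {"name": "get_weather", "arguments": {"q": "Tokyo"}},
}

# Messages are serialized as JSON on the wire.
print(json.dumps(call_tool))
```

The host application discovers tools via tools/list and then dispatches tools/call messages on the model's behalf.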
Now, before adding tools to our agent, let's create a Kong version for it.
Kong version
Kong declarations
Kong needs to be configured to understand how to connect and consume Bedrock. Here's the decK declaration saying so:
Kong Gateway Service named "agent-service". The service doesn’t need to map to any real upstream URL. In fact, it can point somewhere empty, for example, http://localhost:32000. This is because the AI Proxy plugin, also configured in the declaration, overwrites the upstream URL. This requirement will be removed in a later Kong revision.
Kong Route: The gateway Service has a route defined with the "/bedrock-route" path. That's the route we're going to consume to reach out to Bedrock.
Kong Route Plugins: The Kong Route has some plugins configured. Note that only the AI Proxy and Key Auth Plugins are enabled. The other ones are configured but disabled.
Kong AI Proxy Advanced Plugin: That's the plugin that allows us to connect to the LLM infrastructure. For Bedrock, among other things, we need to configure which AWS region we should connect to.
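As a rough illustration of the pieces described above (this is a skeleton, not the complete declaration; the plugin configuration bodies are elided):

```yaml
_format_version: "3.0"
_konnect:
  control_plane_name: kong-aws
services:
  - name: agent-service
    # Placeholder upstream; the AI Proxy Advanced plugin overwrites it.
    url: http://localhost:32000
    tags: [agent]
    routes:
      - name: bedrock-route
        paths: [/bedrock-route]
        plugins:
          - name: ai-proxy-advanced
            # ... Bedrock provider configuration (AWS region, model) ...
          - name: key-auth
            # ... key auth configuration ...
```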
The declaration has been tagged as "agent" so you can manage its objects without impacting any other ones you might have created previously. Also, note the declaration is saying it should be applied to the "kong-aws" Konnect Control Plane.
You can submit the declaration to your Konnect Control Plane with the "deck gateway sync" command.
A very noticeable improvement in this updated Kong version is that, since we have Pod Identity configured in our EKS Cluster, we don't need to set up AWS credentials. Besides, as Kong exposes itself as an OpenAI-compliant server, the agent uses the OpenAI Model Provider to talk to it.
from strands import Agent
from strands.models.openai import OpenAIModel
import os
data_plane_lb = os.getenv("DATA_PLANE_LB")
data_plane_port = os.getenv("DATA_PLANE_PORT")

kong_dp = f"http://{data_plane_lb}:{data_plane_port}"
kong_dp_route = f"http://{data_plane_lb}:{data_plane_port}/bedrock-route"

print("kong_dp_route")
print(kong_dp_route)
openai_model = OpenAIModel(
    client_args={"base_url": kong_dp_route, "api_key": "dummy"},
    model_id="us.anthropic.claude-sonnet-4-20250514-v1:0",
)
agent = Agent(model=openai_model)
agent("Who is Aldous Huxley?")
Here's the new topology:
As all requests are reported back to the Konnect Control Plane, you can check them in the Analytics tab.
However, we don't have any tools defined, so we still get a similar response if we send a weather-related prompt.
Strands Tools Decorator
Tools can be added to agents using several techniques. The most basic one is the tool decorator, which transforms a Python function into a Strands tool. Our next Python code has some tools defined with decorators.
from strands import Agent, tool
from strands.models.openai import OpenAIModel
from strands.types.tools import ToolResult, ToolUse
import os
from typing import Any
import httpx
data_plane_lb = os.getenv("DATA_PLANE_LB")
data_plane_port = os.getenv("DATA_PLANE_PORT")

kong_dp = f"http://{data_plane_lb}:{data_plane_port}"
kong_dp_route = f"http://{data_plane_lb}:{data_plane_port}/bedrock-route"
print("kong_dp_route")
print(kong_dp_route)

model_id = "us.anthropic.claude-sonnet-4-20250514-v1:0"
print("model_id")
print(model_id)

@tool
def get_user_geolocation() -> str:
    """Get the user's geolocation

    description: Location query. Takes no parameter and returns lat and lng of current location.
    """
    city = kong_dp + "/geolocation"
    result = httpx.post(city)
    lat = result.json()["location"]["lat"]
    lng = result.json()["location"]["lng"]
    return f"{lat},{lng}"

@tool
def get_user_geocode(location: str) -> str:
    """Get the user's geocode

    description: Geo Code query. Takes a location defined as "lat,lng" and returns the name of the city.
    """
    city = kong_dp + "/geocode"
    result = httpx.get(city, params={"latlng": location})
    result = result.json()["results"][0]["address_components"][-3]["long_name"]
    print(f"city: {result}")
    return result

@tool
def get_weather(geocode: str) -> str:
    """Get the geocode's weather

    description: Weather query. Accepts US Zipcode, UK Postcode, Canada Postalcode, IP address, latitude/longitude, or city name.
    """
    weatherapi_url = kong_dp + "/weather"
    result = httpx.get(weatherapi_url, params={"q": geocode})
    return result.json()

openai_model = OpenAIModel(
    client_args={"base_url": kong_dp_route, "api_key": "dummy"},
    model_id=model_id,
)

tools = [get_user_geolocation, get_user_geocode, get_weather]
agent = Agent(model=openai_model, tools=tools)
# Use the agent with the custom tools
prompt = "What is the weather like in Tokyo?"
print("prompt")
print(prompt)
agent(prompt)
Here's the new diagram:
decK

All tools are sitting behind the Kong Data Plane, so, before running the code, we need to define the new Kong objects. Here's the new decK declaration:
The configuration defines regular Kong Gateway Services for each of the external services:
Geolocation: Based on Google's service. It can be consumed directly with requests like:

curl -X POST "https://www.googleapis.com/geolocation/v1/geolocate?key=<YOUR_GOOGLE_APIKEY>"

You should get a response similar to:

{"location":{"lat":-23.5732992,"lng":-46.6550784},"accuracy":2377.7249247988957}
Geocode: Based on Google's service. For example, if you submit the following request, with the latitude and longitude you got from the previous one, you should get a response like:
curl -sX POST "https://maps.googleapis.com/maps/api/geocode/json?latlng=-23.5732992,-46.6550784&key=<YOUR_GOOGLE_APIKEY>" | jq -r '.plus_code.compound_code'
C8GV+MXM São Paulo, State of São Paulo, Brazil
WeatherAPI: That's the actual service responsible for returning the weather in a given city. Here's an example:
Each Kong Route defined for the Kong Gateway Services is configured with the Request Transformer Advanced plugin. Each plugin instance injects the corresponding API key defined in the decK environment variables: key:${{ env "DECK_GOOGLEAPI_API_KEY" }} and key:${{ env "DECK_WEATHERAPI_API_KEY" }}.
Lastly, we have configured the Post Function plugin globally so we can check the raw bodies of all incoming requests and responses.
Observability
From the Observability perspective, it's interesting to check the metrics and logs the Data Plane reports back to its Control Plane. For example, if you execute the code, invoking the agent with the prompt "What is the weather like in Tokyo?", you should see in the Konnect Analytics Explorer page the consumption of two Kong Gateway Services, “agent-service” and “weather-service”:
Note that the “agent-service” was called twice. That's where the Post Function plugin can be really helpful. If you check the Data Plane log, you should see all requests that have been processed. Let's check them now. You can do it with the following “kubectl” command:
kubectl logs -f $(kubectl get pod -n kong -o json | jq -r '.items[].metadata | select(.name | startswith("kong-kong"))' | jq -r '.name') -n kong
Request and Response #1
In this first request sent through the Kong AI Gateway, the agent asks the Bedrock model, in our case us.anthropic.claude-sonnet-4-20250514-v1:0, as specified in the code, to tell which tool should be called to answer the prompt.
Note the Kong AI Gateway Proxy Advanced Plugin sends a request to Bedrock using the expected “/chat/completions". Here's the request body:
2025/12/08 15:06:10 [error] 2680#0: *10917 [kong] [string "kong.log.err("request.body", kong.request.get..."]:1 [post-function] request.body
{"messages":[{"role":"user","content":[{"type":"text","text":"What is the weather like in Tokyo?"}]}],"inferenceConfig":{},"anthropic_version":"bedrock-2023-05-31","toolConfig":{"tools":[{"toolSpec":{"description":"Get the user's geolocation\ndescription: Location query. Takes no parameter and returns lat and lng of current location.","inputSchema":{"json":{"type":"object","required":[],"properties":{}}},"name":"get_user_geolocation"}},{"toolSpec":{"description":"Get the user's geocode\ndescription: Geo Code query. Take a location defined as \"lat,lng\" and return the name of the city.","inputSchema":{"json":{"type":"object","properties":{"location":{"description":"Parameter location","type":"string"}},"required":["location"]}},"name":"get_user_geocode"}},{"toolSpec":{"description":"Get the geocode's weather\ndescription: Weather query. Accepts US Zipcode, UK Postcode, Canada Postalcode, IP address, latitude/longitude, or city name.","inputSchema":{"json":{"type":"object","properties":{"geocode":{"description":"Parameter geocode","type":"string"}},"required":["geocode"]}},"name":"get_weather"}}]}}, client:192.168.77.15, server: kong, request:"POST /bedrock-route/chat/completions HTTP/1.1", host:"k8s-kong-kongkong-fae959197b-6192eafba1fb8e62.elb.us-east-2.amazonaws.com", request_id:"21303c3016a136366e17ed07c40bfd2c"
Bedrock replies with a stream-based response. If you check it, you'll see it instructs the agent to call the "get_weather" tool, passing "Tokyo" as the expected "geocode" parameter.
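The shape of that tool-use instruction follows the Converse API response format; here's an illustrative sketch (the toolUseId value is invented, and the payload is truncated):

```python
# Illustrative shape of a Converse-style tool-use response (values invented).
response = {
    "stopReason": "tool_use",
    "output": {
        "message": {
            "role": "assistant",
            "content": [
                {
                    "toolUse": {
                        "toolUseId": "tooluse_example123",  # hypothetical ID
                        "name": "get_weather",
                        "input": {"geocode": "Tokyo"},
                    }
                }
            ],
        }
    },
}

# The agent inspects stopReason and dispatches the named tool.
tool_use = response["output"]["message"]["content"][0]["toolUse"]
print(tool_use["name"], tool_use["input"]["geocode"])
```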
Request and Response #2
This request actually calls the second Kong Route “/weather”, which exposes the “weather-service” Gateway Service and, therefore, sends requests to the WeatherAPI external Service, injecting its API Key.
Request and Response #3
In the final request, the agent invokes the Kong AI Gateway, which routes the request to the Bedrock model, to get the final response, now considering the context for the city included in the prompt:
The response is also a stream and you can see it after executing the code.
$ python3 <YOUR_CODE>.py
kong_dp_route
http://k8s-kong-kongkong-fae959197b-6192eafba1fb8e62.elb.us-east-2.amazonaws.com:80/bedrock-route
model_id
us.anthropic.claude-sonnet-4-20250514-v1:0
prompt
What is the weather like in Tokyo?
I'll get the weather information for Tokyo for you.
Tool #1: get_weather
Based on the current weather data for Tokyo, here's what it's like right now:

**Current Weather in Tokyo:**
- **Temperature:** 12.2°C (54°F)
- **Condition:** Partly cloudy
- **Time:** 2:23 AM local time (December 9, 2025)
- **Feels like:** 10.3°C (50.6°F)
- **Wind:** 18 km/h (11.2 mph) from the North
- **Humidity:** 47%
- **Cloud coverage:** 75%
- **Visibility:** 10 km (6 miles)
- **Pressure:** 1017 mb
It's currently nighttime in Tokyo with partly cloudy skies and cool temperatures. The weather is quite pleasant for this time of year, with moderate humidity and good visibility. There's a gentle northerly wind making it feel a bit cooler than the actual temperature.
Note that, considering the direct prompt, the agent called only the "get_weather" tool. You may find it interesting to run the same agent code with different prompts like:
prompt = "What is the weather like in my location?"
prompt = "What is the city I'm currently located in?"
prompt = "What is the city located at latitude 52.52000659999999 and longitude 13.404954?"
Depending on the prompt, the agent will invoke other tools to respond properly. For example, for the prompt:
prompt = "What is the weather in the city located at latitude 52.52000659999999 and longitude 13.404954?"
The execution of the agent should look like this. This time the agent had to call two tools: "get_user_geocode" and "get_weather".
kong_dp_route
http://k8s-kong-kongkong-fae959197b-6192eafba1fb8e62.elb.us-east-2.amazonaws.com:80/bedrock-route
model_id
us.anthropic.claude-sonnet-4-20250514-v1:0
prompt
What is the weather in the city located at latitude 52.52000659999999 and longitude 13.404954

I'll help you get the weather for that location. Let me first find out what city is at those coordinates, and then get the weather information.
Tool #1: get_user_geocode
city: Berlin
Now let me get the weather for Berlin:

Tool #2: get_weather
The coordinates you provided (latitude 52.52000659999999, longitude 13.404954) are in **Berlin, Germany**.
Here's the current weather in Berlin:

**Current Conditions (as of 6:45 PM local time):**
- **Temperature:** 12.2°C (54°F)
- **Condition:** Partly cloudy
- **Feels like:** 11.0°C (51.8°F)
- **Humidity:** 94%
- **Wind:** 11.9 kph (7.4 mph) from the SSW
- **Pressure:** 1013.0 mb
- **Visibility:** 10 km (6 miles)
- **Cloud coverage:** 75%
- **No precipitation** currently
It's currently nighttime in Berlin with partly cloudy skies and relatively mild temperatures for December.
Strands MCP client and Kong AI MCP Proxy plugin
The current agent code is helpful, but the tools are defined using decorators and are not MCP-based. That is, the agent integrates with Kong AI Gateway and Bedrock through the Strands OpenAI Model Provider but consumes the tools using the "httpx" Python package, which sends regular REST/HTTP requests to the Kong Data Plane.
What we really want is to take the existing Kong Gateway Services and expose them as MCP tools. Moreover, the agent should call them as regular tools rather than managing REST-based calls itself. To illustrate the scenario, here's a new diagram:
On the Kong AI Gateway side, it's time to configure the AI MCP Proxy Plugin to take the existing Kong Gateway Service and create MCP Tools based on it. In order to do it, check the new decK declaration:
decK
The main difference here is that we have enabled the AI MCP Proxy plugin for each of our Kong Gateway Services related to the external services "geolocation", "geocode", and "weather".
In summary, just as the AI Proxy Advanced plugin takes care of the LLM model connection, the AI MCP Proxy plugin provides similar capabilities for the MCP tools and external services.
Inside the “config” section, you can see the “mode: conversion-listener” configuration. That means the plugin will not just convert RESTful API paths into MCP tools but also accept incoming MCP requests on the Route path.
The “tools” section of the AI MCP Proxy declaration is an OpenAPI snippet. It instructs the plugin how to integrate with the external service. Its main configuration parameters are:
method: the HTTP method the plugin should use to consume the external service.
parameters: maps the API parameters the external service expects.
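As an illustration of how those two fields work together, here's one tool entry rendered as a Python dict. The field names mirror the OpenAPI-style description above and are our assumptions rather than the exact plugin schema:

```python
# Illustrative tool entry; field names follow the OpenAPI-style description
# above and the parameter "q" matches the WeatherAPI query shown later.
weather_tool = {
    "description": "Current weather conditions for a city",
    "method": "GET",  # HTTP method used to consume the external service
    "parameters": [
        # The API parameters the external service expects
        {"name": "q", "in": "query", "required": True},
    ],
}

# Conceptually, the plugin uses "method" and "parameters" to translate an
# MCP tools/call into the corresponding REST request upstream.
required_query_params = [
    p["name"]
    for p in weather_tool["parameters"]
    if p["required"] and p["in"] == "query"
]
```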
A second update is that we have added a vault to our Konnect environment to read the API keys from AWS Secrets Manager. It provides a more secure, recommended place to store secrets like the API keys the Gateway Services use.
This is done in the “vault” section of the declaration. You can check that the Data Plane resolves the secrets stored in AWS Secrets Manager with:
kubectl exec -ti $(kubectl get pod -n kong -o json | jq -r '.items[].metadata | select(.name | startswith("kong-kong"))' | jq -r '.name') -n kong -- kong vault get {vault://aws-vault/google-apikey/google-key}
Or by consuming the WeatherAPI Service directly:
curl "$DATA_PLANE_LB/weather?q=Milan"
The Code
This time the code reads the Bedrock model to be consumed from an environment variable. The most important update is the construction of the Streamable HTTP-based MCP clients with the MCPClient class. Note that we have one client for each MCP tool defined by the AI MCP Proxy plugin. To reach them, we use the same Route paths, e.g. “/weather”.
The agent object is created the same way we did before. The main difference is that the tools are based on the Streamable HTTP based MCP Client and not on Tool Decorators.
from strands import Agent
from strands.models.openai import OpenAIModel
from strands.tools.mcp import MCPClient
from mcp.client.streamable_http import streamablehttp_client
import os

data_plane_lb = os.getenv("DATA_PLANE_LB")
data_plane_port = os.getenv("DATA_PLANE_PORT")
model_id = os.getenv("MODEL_ID")

kong_dp = f"http://{data_plane_lb}:{data_plane_port}"
kong_dp_route = f"http://{data_plane_lb}:{data_plane_port}/bedrock-route"

openai_model = OpenAIModel(
    client_args={"base_url": kong_dp_route, "api_key": "dummy"},
    model_id=model_id
)

# Define the Streamable HTTP MCP Clients
streamable_http_mcp_weather_client = MCPClient(lambda: streamablehttp_client(f"{kong_dp}/weather"))
streamable_http_mcp_geolocation_client = MCPClient(lambda: streamablehttp_client(f"{kong_dp}/geolocation"))
streamable_http_mcp_geocode_client = MCPClient(lambda: streamablehttp_client(f"{kong_dp}/geocode"))

# Call the Agent
with streamable_http_mcp_weather_client, streamable_http_mcp_geolocation_client, streamable_http_mcp_geocode_client:
    tools = streamable_http_mcp_weather_client.list_tools_sync() + \
            streamable_http_mcp_geolocation_client.list_tools_sync() + \
            streamable_http_mcp_geocode_client.list_tools_sync()

    agent = Agent(model=openai_model, tools=tools)

    prompt = "What is the weather like in my location?"
    print("prompt")
    print(prompt)
    agent(prompt)
If you run the code above, you should see something like this. Note that the agent had to call two tools this time: the first to resolve your location and the second to fetch its actual weather.
% python3 <YOUR_CODE>.py
prompt
What is the weather like in my location?
I'll help you get the weather for your current location. First, let me find out where you are, and then I'll get the weather information for that location.
Tool #1: geolocation-route-1
Now let me get the weather for your location using those coordinates:
Tool #2: weather-route-1
Based on your current location in Pickerington, Ohio, here's the current weather:
**Current Weather Conditions:**
- **Temperature:** 29°F (-2°C)
- **Condition:** Sunny ☀️
- **Feels like:** 20°F (-7°C)
- **Wind:** Northeast at 10 mph (16 km/h)
- **Humidity:** 39%
- **Visibility:** 9 miles (16 km)
- **Pressure:** 30.22 in (1023 mb)
- **UV Index:** 0.8 (Low)
It's a cold but sunny day in your area! The temperature is below freezing, so make sure to dress warmly if you're heading outside. The wind chill is making it feel even colder at around 20°F.
Aggregate MCP tools
One last enhancement remains in our code. As you can see, there is one MCP client for each MCP tool enabled by the Kong AI MCP Proxy plugin. That's not optimal, and it does not abstract all the tools behind a single MCP server. That's exactly what the AI MCP Proxy plugin “conversion-only” and “listener” modes are for. Here's the new abstraction:
decK
The decK declaration sets the AI MCP Proxy plugin instances to “conversion-only” mode. That means the tools will be created but not exposed individually. Moreover, each AI MCP Proxy instance has been tagged as “mcp-tools”.
At the same time, we create a new, serviceless Kong Route with the AI MCP Proxy plugin also enabled. The only difference is that this instance is configured with “mode: listener” and aggregates all AI MCP Proxy instances tagged with “mcp-tools”.
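Conceptually, the listener instance collects the tools from every “conversion-only” instance carrying the “mcp-tools” tag and exposes them behind a single MCP endpoint. The sketch below models that behavior in plain Python (it is not Kong's implementation, and the instance and tool names are illustrative):

```python
# Conceptual sketch of tag-based aggregation: each dict stands in for a
# conversion-only AI MCP Proxy instance and the tool(s) it generated.
instances = [
    {"tags": ["mcp-tools"], "tools": ["geolocation-route-1"]},
    {"tags": ["mcp-tools"], "tools": ["geocode-route-1"]},
    {"tags": ["mcp-tools"], "tools": ["weather-route-1"]},
    {"tags": ["other"],     "tools": ["not-exposed-tool"]},  # different tag: excluded
]

# The listener-mode instance serves only the tools whose plugin instance
# carries the "mcp-tools" tag, all behind one Route.
aggregated_tools = [
    tool
    for inst in instances if "mcp-tools" in inst["tags"]
    for tool in inst["tools"]
]
```

With that single endpoint in place, the agent needs only one MCP client instead of three.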
With the aggregation in place, the code is even simpler:
from strands import Agent
from strands.models.openai import OpenAIModel
from strands.tools.mcp import MCPClient
from mcp.client.streamable_http import streamablehttp_client
import os

data_plane_lb = os.getenv("DATA_PLANE_LB")
data_plane_port = os.getenv("DATA_PLANE_PORT")
model_id = os.getenv("MODEL_ID")

kong_dp = f"http://{data_plane_lb}:{data_plane_port}"
kong_dp_route = f"http://{data_plane_lb}:{data_plane_port}/bedrock-route"

print("kong_dp_route")
print(kong_dp_route)
print("model_id")
print(model_id)

openai_model = OpenAIModel(
    client_args={"base_url": kong_dp_route, "api_key": "dummy"},
    model_id=model_id
)

# A single MCP client pointing to the listener Route that aggregates all tools
streamable_http_mcp_listener_client = MCPClient(lambda: streamablehttp_client(f"{kong_dp}/mcp-listener"))

with streamable_http_mcp_listener_client:
    tools = streamable_http_mcp_listener_client.list_tools_sync()
    agent = Agent(model=openai_model, tools=tools)
    agent("What is the weather like in my location?")
The result should be the same:
% python3 <YOUR_CODE>.py
kong_dp_route
http://k8s-kong-kongkong-fae959197b-6192eafba1fb8e62.elb.us-east-2.amazonaws.com:80/bedrock-route
model_id
us.anthropic.claude-sonnet-4-20250514-v1:0
I'll help you get the weather for your current location. Let me first find out where you are, and then get the weather information for that location.
Tool #1: geolocation-route-1
Now let me get the weather for your location using those coordinates:
Tool #2: weather-route-1
Based on your current location in Pickerington, Ohio, here's the current weather:
**Current Weather:**
- **Temperature:** 28.9°F (-1.7°C)
- **Condition:** Sunny ☀️
- **Feels like:** 20.6°F (-6.3°C)
- **Wind:** 8.7 mph from the northeast
- **Humidity:** 39%
- **Visibility:** 9 miles
- **Pressure:** 30.22 inches
It's a cold but sunny day! The wind chill is making it feel quite a bit colder than the actual temperature, so make sure to dress warmly if you're heading outside.
You can also check the Konnect Analytics’ Explorer dashboard once again:
Conclusion
We presented a basic AI agent using Kong AI Gateway, Strands, and Amazon Bedrock, including LLM models and MCP servers. It's entirely feasible to implement advanced AI agents with query transformation, multiple data sources, multiple retrieval stages, RAG, etc. Moreover, Kong AI Gateway provides other plugins to enrich the relationship with LLM providers and MCP servers, including Semantic Cache, Semantic Routing, Request and Response Transformation, etc.
Also, keep in mind that you can combine other API Gateway plugins with your AI-based use cases, like using the OIDC plugin to secure your Foundation Models with AWS Cognito, or the Prometheus plugin to monitor your AI Gateway with Amazon Managed Prometheus and Grafana.
Finally, the architectural flexibility provided natively by Konnect and Kong AI Gateway allows us to deploy the Data Planes in a variety of platforms including AWS EC2 VMs, Amazon ECS, and Kong Dedicated Cloud Gateway, Kong's SaaS service for the Data Planes running in AWS.
You can discover all the features available on the Kong AI Gateway product page, or you can check the Kong and AWS landing page to learn more!