Portkey vs Bedrock Latency Benchmark

Measure the real-world latency overhead of routing requests through Portkey

Quick Start

Prerequisites

Node.js 18+ (uses native fetch, zero npm dependencies)
AWS Bedrock access (Access Keys or Bearer token)
Portkey API key

Step 1: Clone & Setup

git clone <repo-url>
cd benchmark-test
cp config.example.json config.json

Step 2: Add Your Credentials

Open config.json and edit these fields:

{
  "amazonRegion": "us-east-1",
  "portkeyApiKey": "...",
  "portkeyProviderSlug": "@your-bedrock-slug"
}

Then choose one of the two Bedrock authentication methods below:

Option 1: AWS Access Keys (Recommended)

Use IAM credentials directly. Best for programmatic access and automation.

{
  "awsAccessKeyId": "AKIA...",
  "awsSecretAccessKey": "...",
  "awsSessionToken": ""
}

Getting Access Keys:

IAM User credentials: Create from IAM Console → Users → Security credentials

AWS CLI: Run aws configure to set up credentials, then export them:

# If using environment variables
export AWS_ACCESS_KEY_ID="your-key"
export AWS_SECRET_ACCESS_KEY="your-secret"

Temporary credentials with STS:
```
aws sts get-session-token --duration-seconds 3600
```
Copy AccessKeyId, SecretAccessKey, and SessionToken from the output.

Note: awsSessionToken is optional. Only required when using temporary credentials (e.g., from sts get-session-token or assumed roles).

Option 2: Bearer Token

Use HTTP Bearer auth. Best for quick testing with AWS SSO.

{
  "bedrockBearerToken": "..."
}

From AWS SSO Portal:

Log into your organization's AWS SSO
Select your Bedrock-enabled account
Choose "Command line or programmatic access"
Copy the Bearer token

Set amazonRegion to match where your Bedrock model is deployed (e.g., us-east-1, us-west-2)

Portkey API Key

Log in to app.portkey.ai
Click API Keys in the left sidebar
Copy your API key

Provider Slug

Log in to app.portkey.ai
Go to Models in the left sidebar
You'll see a list of providers — copy the slug for the one you are using (e.g., @bedrock-prod)

Step 3: Run

npm start

Sample Output

AGGREGATED PERFORMANCE COMPARISON:
┌─────────────────────┬──────────────┬──────────────┬──────────────┐
│ Metric              │ Bedrock      │ Portkey      │ Difference   │
├─────────────────────┼──────────────┼──────────────┼──────────────┤
│ Avg Total Time      │  1104.71ms   │  1197.87ms   │   +93.16ms   │
│ Median Time         │   949.00ms   │   974.00ms   │   +25.00ms   │
│ P95 Time            │  1999.95ms   │  2276.00ms   │  +276.05ms   │
│ P99 Time            │  2487.62ms   │  2624.08ms   │  +136.46ms   │
│ Success Rate        │      96.0%   │      89.0%   │     -7.0%    │
└─────────────────────┴──────────────┴──────────────┴──────────────┘

KEY INSIGHTS:
• Portkey adds an average of 93.16ms latency (8.4% overhead)
• Median overhead: 25.00ms

Results are saved to results/benchmark_results_<timestamp>.json.

Default Configuration

The benchmark comes pre-configured with these defaults:

Setting	Default Value
Model	`us.anthropic.claude-3-5-sonnet-20241022-v2:0`
Mode	`comparison` (Bedrock vs Portkey)
Prompt	`"What is the capital of France?"`
Max Tokens	`100`
Temperature	`0.7`
Concurrency	`2` workers
Max Requests	`3` per iteration
Iterations	`2`

Changing Models

To use a different model, update bedrockModelId and model in config.json:

{
  "bedrockModelId": "us.anthropic.claude-3-haiku-20240307-v1:0",
  "model": "us.anthropic.claude-3-haiku-20240307-v1:0"
}

Configuration Reference

All Options

Field	Type	Default	Description
`mode`	string	`"comparison"`	`"comparison"` (Bedrock vs Portkey) or `"loadtest"` (Portkey only)
`prompt`	string	—	The prompt to send to the model
`maxTokens`	number	`100`	Maximum tokens in response
`temperature`	number	`0.7`	Model temperature
`concurrency`	number	`2`	Number of parallel request workers
`maxRequests`	number	`3`	Total requests per iteration
`testDuration`	number	`60`	Maximum test duration (seconds)
`iterations`	number	`2`	Number of test runs to average

Credentials

Credential	Required For	Description
`awsAccessKeyId`	`comparison` mode (Option 1)	AWS Access Key ID (IAM credentials)
`awsSecretAccessKey`	`comparison` mode (Option 1)	AWS Secret Access Key
`awsSessionToken`	`comparison` mode (Optional)	Session token (only for temporary credentials)
`bedrockBearerToken`	`comparison` mode (Option 2)	AWS Bearer token with Bedrock invoke permissions
`portkeyApiKey`	Both modes	Your Portkey API key
`amazonRegion`	Both modes	AWS region (e.g., `us-east-1`)
`bedrockModelId`	Both modes	Model ID for Bedrock
`portkeyProviderSlug`	Both modes	Provider slug (e.g., `@bedrock-prod`)
`strictSSL`	Both modes (Optional)	Set `false` to bypass SSL verification (fix proxy errors)

Note: For Bedrock authentication, use either Option 1 (Access Keys) OR Option 2 (Bearer Token), not both.

How It Works

1. PREFLIGHT    Validate credentials, test connectivity
       ↓
2. WARMUP       5 requests per provider (establish connections)
       ↓
3. BENCHMARK    Concurrent workers fire parallel requests
       ↓         - Randomized order eliminates bias
       ↓         - Measures total round-trip time
       ↓
4. AGGREGATE    Calculate avg, median, P95, P99
       ↓
5. REPORT       Console summary + JSON artifact

Understanding Overhead

The latency overhead represents round-trip network latency through the proxy:

Direct Bedrock:     Client → Bedrock → Client
                           (1 hop each direction)

Through Portkey:    Client → Portkey → Bedrock → Portkey → Client
                           (2 hops each direction)

Typical overhead: ~50-150ms depending on geographic distance and network conditions.

When Portkey Shows Lower Latency than Direct Bedrock

This can happen due to:

Network variance — With fewer iterations, jitter can skew results. Run 10+ iterations for accuracy.
Connection reuse — Portkey maintains persistent connections, reducing handshake overhead.

Test Modes

Mode	Description
`comparison`	Bedrock vs Portkey side-by-side
`loadtest`	Portkey only (stress test)

{
  "mode": "loadtest"
}

Project Structure

benchmark-test/
├── benchmark.js         # Main benchmark script
├── config.example.json  # Template configuration
├── config.json          # Your configuration (gitignored)
├── results/             # Output directory (gitignored)
└── README.md

Troubleshooting

"SELF_SIGNED_CERT_IN_CHAIN" Error

If you see this error (or fetch failed), you are likely behind a corporate firewall/proxy (like Zscaler, Netskope) that performs SSL inspection.

To fix this:

Open your config.json.
Add "strictSSL": false to the top level JSON object.
Run the benchmark again.

Example:

{
  "mode": "comparison",
  "strictSSL": false,
  ...
}

License

MIT License - see LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
benchmark.js		benchmark.js
config.example.json		config.example.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Portkey vs Bedrock Latency Benchmark

Quick Start

Prerequisites

Step 1: Clone & Setup

Step 2: Add Your Credentials

Step 3: Run

Sample Output

Default Configuration

Changing Models

Configuration Reference

How It Works

Understanding Overhead

When Portkey Shows Lower Latency than Direct Bedrock

Test Modes

Project Structure

Troubleshooting

"SELF_SIGNED_CERT_IN_CHAIN" Error

License

About

Uh oh!

Releases

Packages

Languages

License

Portkey-AI/benchmark-test

Folders and files

Latest commit

History

Repository files navigation

Portkey vs Bedrock Latency Benchmark

Quick Start

Prerequisites

Step 1: Clone & Setup

Step 2: Add Your Credentials

Step 3: Run

Sample Output

Default Configuration

Changing Models

Configuration Reference

How It Works

Understanding Overhead

When Portkey Shows Lower Latency than Direct Bedrock

Test Modes

Project Structure

Troubleshooting

"SELF_SIGNED_CERT_IN_CHAIN" Error

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages