Back to The EDiT Journal

Report

Report. A guide on pricing for Amazon Bedrock

A guide on pricing for Amazon Bedrock, including high-performing foundation models (FMs) through an API, to build generative AI applications

Ammar Mohanna

AI Engineering Lead, EDT&Partners

Benito Castellanos

VP of Technology, EDT&Partners

January 21, 2025

3 min

AI in Education

Cloud & Infrastructure

EdTech

Publishers

Pricing overview

Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) through a single API, along with a broad set of capabilities you need to build generative AI applications with security, privacy, and responsible AI.

‍

With Amazon Bedrock, you will be charged for:

Model inference
Model customization
You have a choice of two pricing plans for inference:
- 1. On-Demand and Batch
  Pay-as-you-go pricing without time-based commitments.
- 2. Provisioned Throughput:
  Provision throughput to meet performance requirements in exchange for a time-based commitment.

‍

Pricing Models

On Demand and Batch

On-Demand Mode: Pay only for what you use with no term commitments.
- Text-Generation Models: Charged per input and output token.
- Embeddings Models: Charged per input token.
- Image-Generation Models: Charged per image generated.
- Cross-Region Inference: Supports using compute across different AWS Regions to manage traffic bursts, with no extra charge.

‍

Batch Mode: Submit prompts as a single input file and receive responses in an output file.
- Responses are stored in an Amazon S3 bucket for future access.
- Batch inference pricing is 50% lower than on-demand pricing for select models from providers like Anthropic, Meta, Mistral AI, and Amazon.

‍

Provisioned Throughput Mode

Purchase model units for a specific base or custom model.

Designed for large, consistent inference workloads needing guaranteed throughput.
Custom models are only available with this mode.
Model Unit: Provides a defined throughput (tokens processed per minute).
Pricing: Charged by the hour with a choice of 1-month or 6-month commitment terms.

‍

Custom Model Import

Import your customized models into Amazon Bedrock to use them like other hosted models.

No Charge: Importing a custom model to Bedrock is free.
On-Demand Serving: Imported models are available on-demand with no control plane actions required.
Inference Pricing: Charged based on the number of model copies needed for inference and their active duration (billed in 5-minute increments).
Model Copy Cost: Pricing depends on factors like architecture, context length, AWS Region, compute version, and model size tier.

‍

To keep reading, please download the report.

‍

Ammar Mohanna

AI Engineering Lead, EDT&Partners

An AI consultant and engineering leader, Ammar is passionate about building ethical and accessible AI, specializing in graph learning, prompt engineering, and real-time systems.

Benito Castellanos

VP of Technology, EDT&Partners

Beni is the VP of Technology at EDT Partners, combining deep software development expertise with agile strategy to deliver high-quality solutions and technology-driven transformation.

Get in touch

Join our newsletter

Be part of our global community — receive the latest articles, perspectives, and resources from The EDiT Journal.

Report. A guide on pricing for Amazon Bedrock

A guide on pricing for Amazon Bedrock, including high-performing foundation models (FMs) through an API, to build generative AI applications

Ammar Mohanna

AI Engineering Lead, EDT&Partners

An AI consultant and engineering leader, Ammar is passionate about building ethical and accessible AI, specializing in graph learning, prompt engineering, and real-time systems.

Benito Castellanos

VP of Technology, EDT&Partners

Beni is the VP of Technology at EDT Partners, combining deep software development expertise with agile strategy to deliver high-quality solutions and technology-driven transformation.

January 21, 2025

3 min

AI in Education

Cloud & Infrastructure

EdTech

Publishers

This report offers a concise overview of Amazon Bedrock’s pricing model, detailing how educational organizations and EdTech providers can optimize costs when building generative AI applications.

Pricing overview

Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) through a single API, along with a broad set of capabilities you need to build generative AI applications with security, privacy, and responsible AI.

‍

With Amazon Bedrock, you will be charged for:

Model inference
Model customization
You have a choice of two pricing plans for inference:
- 1. On-Demand and Batch
  Pay-as-you-go pricing without time-based commitments.
- 2. Provisioned Throughput:
  Provision throughput to meet performance requirements in exchange for a time-based commitment.

‍

Pricing Models

On Demand and Batch

On-Demand Mode: Pay only for what you use with no term commitments.
- Text-Generation Models: Charged per input and output token.
- Embeddings Models: Charged per input token.
- Image-Generation Models: Charged per image generated.
- Cross-Region Inference: Supports using compute across different AWS Regions to manage traffic bursts, with no extra charge.

‍

Batch Mode: Submit prompts as a single input file and receive responses in an output file.
- Responses are stored in an Amazon S3 bucket for future access.
- Batch inference pricing is 50% lower than on-demand pricing for select models from providers like Anthropic, Meta, Mistral AI, and Amazon.

‍

Provisioned Throughput Mode

Purchase model units for a specific base or custom model.

Designed for large, consistent inference workloads needing guaranteed throughput.
Custom models are only available with this mode.
Model Unit: Provides a defined throughput (tokens processed per minute).
Pricing: Charged by the hour with a choice of 1-month or 6-month commitment terms.

‍

Custom Model Import

Import your customized models into Amazon Bedrock to use them like other hosted models.

No Charge: Importing a custom model to Bedrock is free.
On-Demand Serving: Imported models are available on-demand with no control plane actions required.
Inference Pricing: Charged based on the number of model copies needed for inference and their active duration (billed in 5-minute increments).
Model Copy Cost: Pricing depends on factors like architecture, context length, AWS Region, compute version, and model size tier.

‍

To keep reading, please download the report.

‍

Ammar Mohanna

AI Engineering Lead, EDT&Partners

An AI consultant and engineering leader, Ammar is passionate about building ethical and accessible AI, specializing in graph learning, prompt engineering, and real-time systems.

Benito Castellanos

VP of Technology, EDT&Partners

Beni is the VP of Technology at EDT Partners, combining deep software development expertise with agile strategy to deliver high-quality solutions and technology-driven transformation.

Get in touch

Join our newsletter

Be part of our global community — receive the latest articles, perspectives, and resources from The EDiT Journal.

Report. A guide on pricing for Amazon Bedrock

A guide on pricing for Amazon Bedrock, including high-performing foundation models (FMs) through an API, to build generative AI applications

Ammar Mohanna

AI Engineering Lead, EDT&Partners

Benito Castellanos

VP of Technology, EDT&Partners

January 21, 2025

3 min

AI in Education

Cloud & Infrastructure

EdTech

Publishers

This report offers a concise overview of Amazon Bedrock’s pricing model, detailing how educational organizations and EdTech providers can optimize costs when building generative AI applications.

Pricing overview

Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) through a single API, along with a broad set of capabilities you need to build generative AI applications with security, privacy, and responsible AI.

‍

With Amazon Bedrock, you will be charged for:

Model inference
Model customization
You have a choice of two pricing plans for inference:
- 1. On-Demand and Batch
  Pay-as-you-go pricing without time-based commitments.
- 2. Provisioned Throughput:
  Provision throughput to meet performance requirements in exchange for a time-based commitment.

‍

Pricing Models

On Demand and Batch

On-Demand Mode: Pay only for what you use with no term commitments.
- Text-Generation Models: Charged per input and output token.
- Embeddings Models: Charged per input token.
- Image-Generation Models: Charged per image generated.
- Cross-Region Inference: Supports using compute across different AWS Regions to manage traffic bursts, with no extra charge.

‍

Batch Mode: Submit prompts as a single input file and receive responses in an output file.
- Responses are stored in an Amazon S3 bucket for future access.
- Batch inference pricing is 50% lower than on-demand pricing for select models from providers like Anthropic, Meta, Mistral AI, and Amazon.

‍

Provisioned Throughput Mode

Purchase model units for a specific base or custom model.

Designed for large, consistent inference workloads needing guaranteed throughput.
Custom models are only available with this mode.
Model Unit: Provides a defined throughput (tokens processed per minute).
Pricing: Charged by the hour with a choice of 1-month or 6-month commitment terms.

‍

Custom Model Import

Import your customized models into Amazon Bedrock to use them like other hosted models.

No Charge: Importing a custom model to Bedrock is free.
On-Demand Serving: Imported models are available on-demand with no control plane actions required.
Inference Pricing: Charged based on the number of model copies needed for inference and their active duration (billed in 5-minute increments).
Model Copy Cost: Pricing depends on factors like architecture, context length, AWS Region, compute version, and model size tier.

‍

To keep reading, please download the report.

‍

Ammar Mohanna

AI Engineering Lead, EDT&Partners

An AI consultant and engineering leader, Ammar is passionate about building ethical and accessible AI, specializing in graph learning, prompt engineering, and real-time systems.

Benito Castellanos

VP of Technology, EDT&Partners

Beni is the VP of Technology at EDT Partners, combining deep software development expertise with agile strategy to deliver high-quality solutions and technology-driven transformation.

Get in touch

Join our newsletter

Be part of our global community — receive the latest articles, perspectives, and resources from The EDiT Journal.

Report. A guide on pricing for Amazon Bedrock

A guide on pricing for Amazon Bedrock, including high-performing foundation models (FMs) through an API, to build generative AI applications

Ammar Mohanna

AI Engineering Lead, EDT&Partners

An AI consultant and engineering leader, Ammar is passionate about building ethical and accessible AI, specializing in graph learning, prompt engineering, and real-time systems.

Benito Castellanos

VP of Technology, EDT&Partners

Beni is the VP of Technology at EDT Partners, combining deep software development expertise with agile strategy to deliver high-quality solutions and technology-driven transformation.

January 21, 2025

3 min

AI in Education

Cloud & Infrastructure

EdTech

Publishers

This report offers a concise overview of Amazon Bedrock’s pricing model, detailing how educational organizations and EdTech providers can optimize costs when building generative AI applications.

Pricing overview

Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) through a single API, along with a broad set of capabilities you need to build generative AI applications with security, privacy, and responsible AI.

‍

With Amazon Bedrock, you will be charged for:

Model inference
Model customization
You have a choice of two pricing plans for inference:
- 1. On-Demand and Batch
  Pay-as-you-go pricing without time-based commitments.
- 2. Provisioned Throughput:
  Provision throughput to meet performance requirements in exchange for a time-based commitment.

‍

Pricing Models

On Demand and Batch

On-Demand Mode: Pay only for what you use with no term commitments.
- Text-Generation Models: Charged per input and output token.
- Embeddings Models: Charged per input token.
- Image-Generation Models: Charged per image generated.
- Cross-Region Inference: Supports using compute across different AWS Regions to manage traffic bursts, with no extra charge.

‍

Batch Mode: Submit prompts as a single input file and receive responses in an output file.
- Responses are stored in an Amazon S3 bucket for future access.
- Batch inference pricing is 50% lower than on-demand pricing for select models from providers like Anthropic, Meta, Mistral AI, and Amazon.

‍

Provisioned Throughput Mode

Purchase model units for a specific base or custom model.

Designed for large, consistent inference workloads needing guaranteed throughput.
Custom models are only available with this mode.
Model Unit: Provides a defined throughput (tokens processed per minute).
Pricing: Charged by the hour with a choice of 1-month or 6-month commitment terms.

‍

Custom Model Import

Import your customized models into Amazon Bedrock to use them like other hosted models.

No Charge: Importing a custom model to Bedrock is free.
On-Demand Serving: Imported models are available on-demand with no control plane actions required.
Inference Pricing: Charged based on the number of model copies needed for inference and their active duration (billed in 5-minute increments).
Model Copy Cost: Pricing depends on factors like architecture, context length, AWS Region, compute version, and model size tier.

‍

To keep reading, please download the report.

‍

Download PDF

Ammar Mohanna

AI Engineering Lead, EDT&Partners

An AI consultant and engineering leader, Ammar is passionate about building ethical and accessible AI, specializing in graph learning, prompt engineering, and real-time systems.

Benito Castellanos

VP of Technology, EDT&Partners

Beni is the VP of Technology at EDT Partners, combining deep software development expertise with agile strategy to deliver high-quality solutions and technology-driven transformation.

Get in touch

Join our newsletter

Be part of our global community — receive the latest articles, perspectives, and resources from The EDiT Journal.

Report. A guide on pricing for Amazon Bedrock

In this article

Pricing overview

Pricing Models

On Demand and Batch

Provisioned Throughput Mode

Custom Model Import

Related Posts

Report. A guide on pricing for Amazon Bedrock

Pricing overview

Pricing Models

On Demand and Batch

Provisioned Throughput Mode

Custom Model Import

Related Posts

Report. A guide on pricing for Amazon Bedrock

Pricing overview

Pricing Models

On Demand and Batch

Provisioned Throughput Mode

Custom Model Import

Related Posts

Report. A guide on pricing for Amazon Bedrock

Pricing overview

Pricing Models

On Demand and Batch

Provisioned Throughput Mode

Custom Model Import

Related Posts

Who we are

What we do

Who we help

Our insights

Contact

Report. A guide on pricing for Amazon Bedrock

In this article

Pricing overview

Pricing Models

On Demand and Batch

Provisioned Throughput Mode

Custom Model Import

Related Posts

The EDiT talks. We're All in It Together: What Vendors and Schools Both Need to Get Right on Technology Adoption

Cumplimiento normativo en IA y accesibilidad: 5 preguntas que toda editorial debe responder antes de que sea tarde

AI & Accessibility Compliance: 5 Questions Every Publisher Needs to Answer Before the Deadline

Report. A guide on pricing for Amazon Bedrock

Pricing overview

Pricing Models

On Demand and Batch

Provisioned Throughput Mode

Custom Model Import

Related Posts

The EDiT talks. We're All in It Together: What Vendors and Schools Both Need to Get Right on Technology Adoption

Cumplimiento normativo en IA y accesibilidad: 5 preguntas que toda editorial debe responder antes de que sea tarde

AI & Accessibility Compliance: 5 Questions Every Publisher Needs to Answer Before the Deadline

Report. A guide on pricing for Amazon Bedrock

Pricing overview

Pricing Models

On Demand and Batch

Provisioned Throughput Mode

Custom Model Import

Related Posts

The EDiT talks. Beyond the Pilot: What Happens When AI Is Designed Around Learning

The EDiT talks. We're All in It Together: What Vendors and Schools Both Need to Get Right on Technology Adoption

Cumplimiento normativo en IA y accesibilidad: 5 preguntas que toda editorial debe responder antes de que sea tarde

Report. A guide on pricing for Amazon Bedrock

Pricing overview

Pricing Models

On Demand and Batch

Provisioned Throughput Mode

Custom Model Import

Related Posts

The EDiT talks. We're All in It Together: What Vendors and Schools Both Need to Get Right on Technology Adoption

Cumplimiento normativo en IA y accesibilidad: 5 preguntas que toda editorial debe responder antes de que sea tarde

AI & Accessibility Compliance: 5 Questions Every Publisher Needs to Answer Before the Deadline

Who we are

What we do

Who we help

Our insights

Contact