Foundation-Models | mikePietsch.com

41.8 Bedrock Pricing: On-Demand vs Provisioned Throughput

Right, let’s talk money. Because as much as I love playing with billion-parameter AI models, I’m not the one paying Amazon’s AWS bill, and I’m guessing you are. Bedrock’s pricing model is actually one of its better features—it’s designed to be flexible, but that flexibility means you have a choice to make: pay as you go, or commit like you’re in a serious relationship. Let’s break down the two modes so you don’t end up with a bill that makes you gasp.

41.7 Bedrock Fine-Tuning and Continued Pre-Training

Alright, let’s talk about making these foundation models actually yours. Because let’s be honest, out-of-the-box models are impressive, but they’re like a brilliant intern who’s read every book in the library yet has no clue about your specific business, your internal jargon, or your weirdly named projects from 2014. That’s where fine-tuning and continued pre-training come in. Think of it as giving that intern a intensive, hyper-focused crash course in your world.

41.6 Bedrock Model Evaluation: Automatic and Human-Based Benchmarks

Right, let’s talk about evaluating these foundation models. You don’t just pick one from the Bedrock menu like you’re ordering a burger. “I’ll have the Claude, medium-rare, with a side of extra parameters.” If you do that, you’re going to have a bad time. These models are incredibly powerful, but they’re not all the same. They have different strengths, weaknesses, weird quirks, and, let’s be honest, prices that can make your CFO’s eye twitch. So how do you choose? You put them through their paces. You run benchmarks.

41.5 Bedrock Guardrails: Content Filtering and PII Redaction

Right, let’s talk about guardrails. You’ve got this incredibly powerful, creative, borderline-ungovernable model sitting in Bedrock. It’s like a genius intern who’s read the entire internet—the good parts, the weird parts, and the parts that would get you a visit from HR. You need to let them do their brilliant work, but you also need to stop them from accidentally writing a sonnet about your company’s AWS secret keys. That’s where Bedrock Guardrails come in. They’re your system of polite, but firm, bouncers for generative AI.

41.4 Bedrock Agents: Multi-Step Reasoning and Action Group Integration

Right, so you’ve played with a single foundation model, maybe through the playground, and you’ve thought, “Cool trick. But my actual problems require more than one step.” You don’t just need a paragraph written; you need to get something done. You need to look up a policy, cross-reference a support ticket, and then file a request—all based on a user’s vague, rambling question. This is where Bedrock Agents come in. They’re your automated interns that don’t need coffee breaks, capable of multi-step reasoning and actually taking actions in the world.

41.3 Bedrock Knowledge Bases: RAG with S3 and Vector Stores

Right, so you’ve got a big pile of documents in S3—PDFs, text files, maybe some Word docs from that one colleague who refuses to join the 21st century. You want to query them intelligently with a Large Language Model (LLM), but we all know the problem: LLMs are brilliant idiots. They have vast knowledge but are utterly clueless about your specific data. That’s where Bedrock’s Knowledge Bases come in. Think of it as giving your model a pair of glasses and a very, very good filing system. It’s Retrieval Augmented Generation (RAG) without you having to build the entire plumbing system from scratch.

41.2 Bedrock Converse API and InvokeModel API

Right, let’s talk about how you actually get these models to do your bidding. Forget the flashy demos for a second; we’re getting into the API trenches. Bedrock offers two primary ways to have a chat: the newer, more capable Converse API and the older, more granular InvokeModel (and InvokeModelWithResponseStream) API. One is for having a conversation, the other is for sending a precisely crafted note and hoping for the best. You can probably guess which one I prefer.

41.1 Bedrock Overview: Accessing Claude, Titan, Llama, Mistral, and Cohere via API

Right, let’s get this out of the way: you’re not here to train a multi-billion parameter model from scratch. You’d need a VC’s entire bank account, a few PhDs, and the patience of a saint. You’re here to use them. Amazon Bedrock is your all-access pass to the most capable foundation models on the planet, without the soul-crushing infrastructure overhead. Think of it as the world’s most powerful API cocktail menu, and you’re the bartender. Your job is to pick the right ingredients (models), mix them correctly (prompting), and serve the drink (the API response). No cleaning the glasses.