358: AI Spend Limits Because Frontier Models Aren't Free Therapy

Episode 358 June 19, 2026 01:22:50
358: AI Spend Limits Because Frontier Models Aren't Free Therapy
The Cloud Pod | Weekly AI & Cloud News on AWS, Azure & GCP
358: AI Spend Limits Because Frontier Models Aren't Free Therapy

Jun 19 2026 | 01:22:50

/

Hosted By

Jonathan Baker Justin Brodley Matthew Kohn Ryan Lucas

Show Notes

Welcome to episode 358 of The Cloud Pod, where the weather is always cloudy! 

Justin, Matt, and Ryan (who, rumour has it, was working on an Eagles music podcast) are in the studio this week to bring you all the latest in AI and cloud news (and begging for a AI spend limit increase), including anthropic wanting everyone – except themselves – to slow down AI development, GitHub’s insane number of commits, and even an announcement from CoreWeave, plus so much more. Let’s get started! 

Titles we almost went with this week

A big thanks to this week’s sponsors:

There are many cloud cost management tools out there, but only Archera provides insured commitments. It sounds fancy, but it’s really simple. Archera gives you the cost savings of a 1 or 3-year AWS Savings Plan with a commitment as short as 30 days. If you do not use all the cloud resources you have committed to, Archera will literally cover the difference. Other cost management tools may say they offer “insured commitments”, but remember to ask: Will you actually give me my rebate? Because Archera will. 

Check out thecloudpod.net/archera to schedule a demo today. 

General News

01:27 How GitHub plans to win developers back

03:0 Ryan – “I’d actually like to see AI coding take this up a little bit, because I think it is a ridiculous sort of growth that I don’t think is sustainable, and so much of vibe-coded garbage is really bloated…But there are definitely functionality things that it can do a lot more efficiently, and doesn’t.” 

AI Is Going Great – or How ML Makes Money 

07:44 The Interoperable Lakehouse: Agency Over Your Data

09:25 Snowflake CoCo: AI Coding Agent for the Modern Data Stack

09:59 Snowflake CoWork: The Personal Work Agent for Every Knowledge Worker

10:29 Justin – “I assume Anthropic will be suing them any moment for trademark infringement, but nice to see that you’re getting some smartness for the data friends who desperately need all the DevOps help they can get. So I appreciate they’re getting these tools.”

16:00 Anthropic urges global pause in AI development

16:41 Ryan – “This has been what people have been sort of warning for ages with AI development, and this isn’t anything new. I’m surprised by the timing of it because it doesn’t make sense to me that they’re doing this now, but this is a huge concern. And I know just from trying to secure workloads in my day job, you try to put human and loop flows in place, but you know, people don’t really want to be in the loop. The whole advantage of using AI is the advantage the velocity gains. So having a human that does all the approval is problematic.”

20:04 Claude Fable 5 and Claude Mythos 5

23:34 Matt – “I would also say you gotta get the foundation of your house set up. So if you are patching, it’s not that you’re patching, it’s how you’re patching… I don’t want somebody, to use a very simple example, I have fifty EC2 instances or VMs, and to do patching, I can’t have somebody log into fifty VMs. That’s not sustainable, and that’s not gonna work. Ryan in security here will check the box saying you are doing patching, but I’ve wasted three people’s days on this. But if you build it out so that each thing is an auto scaling group and everything else, which is where you’re going with the CICD stuff, and you build that proper workflow out, then patching is just release the new image.”

Security 

29:46 Dashlane explains how attackers managed to download encrypted password vaults

30:55 Ryan – “And right now, it’s the strength of that master password. But with quantum encryption, it’s going to be able to break through the algorithm generally.” 

Cloud Tools

36:30  Hashicorp: rethinking infrastructure access in the age of agentic AI

37:52 Ryan – “I’m so annoyed by this because they’re like, this is rethinking an age of agentic AI. No, this is what we should do for all authentication, not just AI. It doesn’t treat anything about AI. It doesn’t identify AI agents. And it’s just setting up a user within HashiCorp boundary and then assigning that user to an agentic AI, just like a human. So this doesn’t actually address anything agentic. And these things should be patterns we need to be moving to in general.” 

AWS

42:46 Improve your application resilience with Amazon Cognito multi-Region replication 

43:54 Matt – “… it’s just a nice quality of life improvement to actually get this out.”

45:36 Customize federated sign-in with new Amazon Cognito Lambda trigger

47:26 AWS Step Functions adds AgentCore-powered agentic reasoning step

48:25 Ryan – “You know I lust over state machines, so I find it funny because this is all I think about when I’m putting an agent workflow together. This would be so much easier in a state machine. And so now they’ve done it. I will absolutely use this so much, because it’s something I already kind of do with lambda functions. It’s just now that I won’t have to define the logic as specifically. It’ll just be like four pages of markdown in my lab.”

51:29 Amazon Bedrock AgentCore Runtime introduces interactive shells for terminal access into agent sessions

52:46 Matt – “Somebody needed it to debug some environment variable or working directory, and they were like, we could just quickly do this thing because it’s running ECS under the hood. We’ll just literally change the CLI call from AWS ECS exec to AWS Agent Core exec, and we’ve added a whole new feature, guys.” 

53:12 AWS Cost Explorer launches intelligent cost explanations powered by Amazon Q

54:10 Matt – “That will forever be my goal in life – understand what’s an EC2 other.” 

54:20 AWS FinOps Agent is now available in preview

55:02 Justin – “This is kind of nice. I don’t know if it’s a full-featured solution for everybody, but it’s definitely something that’s gonna help you get started.”

GCP

56:52 Introducing Gemma 4 12B

57:36 Gemma 4 with quantization-aware training

58:17 Ryan – “These are things we need Jonathan for.” 

58:45 Gemini models for Apple developers

1:00:01 Ryan – “I love the Apple Google partnership on this. You know, I’m really happy that Apple didn’t decide to develop its own frontier model and just muddy that space.” 

Azure

1:03:27 New Azure Cobalt 200 VMs deliver 50% performance improvement, fully optimized for modern agentic AI workloads

1:04:44 Matt – “It’s great that they added this; I feel like they’re finally getting into the game of ARM. Getting capacity for them might require some twisting of your account team’s arm, especially if you want them at any scale. But the other problem is, which I still find comical, is that you can’t run Windows Server on ARM.”

1:06:58 Foundry IQ: Build smarter agents faster with unified knowledge and serverless retrieval

1:10:26 Generally Available: Azure Database for PostgreSQL – Flexible Server: DuckDB extension

1:10:50 Justin – “I remember when there were companies that made nothing but columnar databases. Now you just get it as an extension on top of PostgreSQL. Kind of impressive. I bet those companies aren’t doing well these days.”

51:03 Global PTU Reservations Are Now Region-Agnostic

1:12:02 Justin – “Good! Glad you learned what the word ‘global’ means.” 

1:15:30 Generally Available: Azure API Management Premium v2 and Standard v2 now support wildcard custom hostnames

Emerging Clouds 

1:22:25 Full Stack Observability for AI | CoreWeave Solution Brief

Closing

And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod

Other Episodes

Episode 161

April 21, 2022 00:23:40
Episode Cover

161: The Cloud Pod Observes Its Databases With Google Cloud SQL Insights

On The Cloud Pod this week and with half the team gone fishin’, Justin and Peter hash it out short and sweet. Plus Google...

Listen

Episode 246

February 16, 2024 01:03:25
Episode Cover

246: The CloudPod Will Never Type localllm Correctly

Welcome to episode 246 of The CloudPod podcast, where the forecast is always cloudy! This week we’re discussion localllm and just why they’ve saddled...

Listen

Episode 305

May 28, 2025 01:12:49
Episode Cover

305: AWS Breaks Up with Unpopular Services - "It's Not You, It's Me"

Welcome to episode 305 of The Cloud Pod – where the forecast is always cloudy! How did you do on your Microsoft Build Predictions?...

Listen