All upcoming events
WORKSHOP
•
VIRTUALDeploy and Manage LLMs on Google Cloud Run GPUs with Pulumi
DATE
Oct 17, 2024
TIME
DURATION
90 minutes
This workshop is taught with Pulumi Cloud. Sign up for free to follow along.
If you’ve ever wanted to inference with an LLM in under a minute while paying only for what you consume, then Google Cloud Run GPUs are for you! In this hands-on workshop, we will demonstrate how Pulumi can seamlessly stand up an environment for deploying your LLMs and custom models on Google Cloud Run GPUs. Participants will learn how to create scalable, cost-efficient infrastructure that allows for rapid LLM inference, leveraging the power of Pulumi to automate and manage their deployments. Whether you’re deploying pre-trained LLMs or custom models, this workshop will provide the tools and knowledge you need to optimize your AI workloads on the cloud.
You'll Learn:
How to efficiently deploy and manage LLMs and custom models on Google Cloud Run GPUs.
Best practices for setting up scalable and cost-effective infrastructure for fast LLM inference.
How to automate cloud infrastructure and streamline AI workload management using Pulumi.
Register today
Event Speakers
Jay Smith
Sr. Cloud Customer Engineer, Google
Mitch Gerdisch
Solutions Architect, Pulumi