
Posts Tagged deepseek

Run Open-Source LLMs on AWS EC2 with Ollama and Pulumi

TL;DR. Want to self-host an open-source LLM on AWS? Use a g4dn.xlarge ($0.526/hr on-demand, 16 GB GPU memory) for 7B/8B models, a g5.xlarge ($1.006/hr, 24 GB) for 13B–14B models, a g5.2xlarge ($1.212/hr, 24 GB) for 32B models, or a g6e.2xlarge ($2.242/hr, 48 GB) for 70B models. Deploy with the Pulumi program below, and Ollama will run any model from its library (DeepSeek-R1, Llama 3, Qwen, or Mistral) with a one-line change.
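The full post contains the actual deployment program; the sketch below is only an illustration of the general shape such a Pulumi program might take, written in TypeScript against the Pulumi AWS provider. The AMI filter, security group rules, user-data script, and the `deepseek-r1:8b` model tag are assumptions for the example, not the post's exact choices; swapping the model tag is the "one-line change" mentioned above.

```typescript
import * as pulumi from "@pulumi/pulumi";
import * as aws from "@pulumi/aws";

// Look up a recent Ubuntu 22.04 AMI (assumed base image for this sketch).
const ami = aws.ec2.getAmiOutput({
    mostRecent: true,
    owners: ["099720109477"], // Canonical
    filters: [
        { name: "name", values: ["ubuntu/images/hvm-ssd/ubuntu-jammy-22.04-amd64-server-*"] },
        { name: "architecture", values: ["x86_64"] },
    ],
});

// Security group exposing SSH and Ollama's default API port (11434).
const sg = new aws.ec2.SecurityGroup("ollama-sg", {
    ingress: [
        { protocol: "tcp", fromPort: 22, toPort: 22, cidrBlocks: ["0.0.0.0/0"] },
        { protocol: "tcp", fromPort: 11434, toPort: 11434, cidrBlocks: ["0.0.0.0/0"] },
    ],
    egress: [{ protocol: "-1", fromPort: 0, toPort: 0, cidrBlocks: ["0.0.0.0/0"] }],
});

// User data installs GPU drivers and Ollama, then pulls a model.
// Change the model tag (e.g. to llama3, qwen, mistral) to run a different LLM.
const userData = `#!/bin/bash
apt-get update -y
apt-get install -y ubuntu-drivers-common
ubuntu-drivers autoinstall
curl -fsSL https://ollama.com/install.sh | sh
systemctl enable --now ollama
ollama pull deepseek-r1:8b
`;

// g4dn.xlarge (16 GB GPU memory) fits 7B/8B models; use a larger
// instance type from the table above for bigger models.
const server = new aws.ec2.Instance("ollama-server", {
    instanceType: "g4dn.xlarge",
    ami: ami.id,
    vpcSecurityGroupIds: [sg.id],
    userData: userData,
    rootBlockDevice: { volumeSize: 100 }, // extra disk for model weights
    tags: { Name: "ollama-server" },
});

export const publicIp = server.publicIp;
export const ollamaEndpoint = pulumi.interpolate`http://${server.publicIp}:11434`;
```

Once the instance is up, the exported endpoint can be queried with Ollama's HTTP API, for example `curl http://<publicIp>:11434/api/generate -d '{"model": "deepseek-r1:8b", "prompt": "Hello"}'`.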

