---
title: Ollama Model Management
description: Pulling, verifying, and managing models on the Gremlin stack
published: true
date: 2026-04-12T00:00:00.000Z
tags: gremlin, ollama, models, runbook
editor: markdown
dateCreated: 2026-04-12T00:00:00.000Z
---

# Ollama Model Management

## Pull Required Models

Run on `docker4` after any fresh deploy or after the Ollama container is recreated:

```bash
docker exec "$(docker ps -qf name=gremlin_ollama)" ollama pull llama3.2:3b
docker exec "$(docker ps -qf name=gremlin_ollama)" ollama pull qwen2.5-coder:7b
```
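
The two pull commands can also be wrapped in a small helper that resolves the container once and stops with a clear message if it is not running. This is a minimal sketch, not an existing Gremlin script: the function name `pull_models` is illustrative, while the container filter and model tags are the ones used above.

```bash
# Sketch: pull each model named in the arguments into the Ollama container.
# pull_models is an illustrative helper name (an assumption, not Gremlin tooling).
pull_models() {
  local container model
  # Resolve the container ID once; head -n 1 guards against multiple matches.
  container="$(docker ps -qf name=gremlin_ollama | head -n 1)"
  if [ -z "$container" ]; then
    echo "no running container matches gremlin_ollama" >&2
    return 1
  fi
  for model in "$@"; do
    echo "Pulling $model"
    # Stop at the first failed pull so the error is not buried in later output.
    docker exec "$container" ollama pull "$model" || return 1
  done
}
```

With the function defined in the shell, `pull_models llama3.2:3b qwen2.5-coder:7b` would pull both required models in order.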
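
## Verify Models Loaded

Both required models should appear in the output of `ollama list`, which reports the models stored on disk rather than the models currently loaded into memory:

```bash
docker exec "$(docker ps -qf name=gremlin_ollama)" ollama list
```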
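
To turn the visual check into a scripted one, the listing can be compared against the expected model names. The sketch below assumes the usual `ollama list` layout (a header row, then one model per line with the name in the first column); `missing_models` is a hypothetical helper name.

```bash
# Sketch: print each expected model that is absent from `ollama list` output.
# Reads the listing on stdin; the expected model names are the arguments.
# missing_models is an illustrative helper name (an assumption).
missing_models() {
  local listing model status=0
  listing="$(cat)"
  for model in "$@"; do
    # Column 1 holds the model name; NR > 1 skips the header row.
    if ! printf '%s\n' "$listing" | awk 'NR > 1 { print $1 }' | grep -qxF "$model"; then
      echo "$model"
      status=1
    fi
  done
  # Non-zero exit when at least one expected model is missing.
  return "$status"
}
```

Piping the verification command into it, as in `docker exec "$(docker ps -qf name=gremlin_ollama)" ollama list | missing_models llama3.2:3b qwen2.5-coder:7b`, would print only the models that still need pulling and return a non-zero status if any are missing.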

## Model Reference

| Model | Size | Pull Time (CPU) | Used By |
|-------|------|----------------|---------|
| `llama3.2:3b` | ~2 GB | ~5 min | Kuma triage, Open WebUI |
| `qwen2.5-coder:7b` | ~5 GB | ~15 min | Forgejo audit, Open WebUI |

## Models Storage Path

Models are stored in `/DockerVol/ollama`, which survives container restarts and redeployments.

## ⚠ Pull Before Workflows Run

n8n workflows fail silently if the models aren't present: Ollama returns a model-not-found response, but n8n may not surface it as an obvious error. Always pull the models immediately after a deploy, before enabling workflows.
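
One way to enforce this rule is a pre-flight check that a deploy script runs before workflows are enabled. The sketch below queries Ollama's `/api/tags` endpoint, which lists the locally available models. It assumes the API is reachable on Ollama's default port 11434 from wherever the check runs; `OLLAMA_URL` and `model_available` are illustrative names, not part of the Gremlin stack.

```bash
# Sketch: succeed only if the Ollama HTTP API reports the given model.
# OLLAMA_URL and model_available are illustrative names (assumptions); the
# default URL assumes Ollama's standard port 11434 is reachable from here.
model_available() {
  local model="$1" base_url="${OLLAMA_URL:-http://localhost:11434}"
  # -f makes curl fail on HTTP errors; stripping whitespace keeps the match
  # independent of how the JSON response is formatted.
  curl -fsS "$base_url/api/tags" | tr -d '[:space:]' | grep -qF "\"name\":\"$model\""
}
```

A deploy script could then guard the enable step with something like `model_available llama3.2:3b && model_available qwen2.5-coder:7b`, and refuse to continue when either check fails.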