- Set HIP_VISIBLE_DEVICES=0 to use only the discrete GPU (gfx1201).
llama.cpp was splitting layers across the iGPU (gfx1036), which
caused segfaults when loading the multimodal projector.
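A minimal sketch of the device pinning described above; HIP_VISIBLE_DEVICES is the standard ROCm/HIP env var, but the echo is only here for illustration:

```shell
# Restrict HIP-visible devices to index 0 (the discrete gfx1201),
# so llama.cpp cannot split layers onto the iGPU (gfx1036).
export HIP_VISIBLE_DEVICES=0
echo "HIP_VISIBLE_DEVICES=$HIP_VISIBLE_DEVICES"
```

This must be set in the environment before launching the llama.cpp binary; setting it after the process starts has no effect.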
- Restore --mmproj for both HF models (multimodal works correctly with
single GPU).
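A hedged sketch of the restored launch command; the binary and GGUF paths below are placeholders, not taken from this change:

```shell
# Example multimodal launch on the single discrete GPU.
# Paths are hypothetical; -m and --mmproj are real llama.cpp flags.
HIP_VISIBLE_DEVICES=0 ./llama-server \
  -m ./models/model.gguf \
  --mmproj ./models/mmproj.gguf
```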
- Keep qwen3.5:9b disabled (Ollama-extracted GGUF uses old mrope_sections
key format incompatible with this llama.cpp build).