2 Commits

3034f987d7 feat(ollama): migrate from Ollama to llama.cpp + llama-swap
Replace the Ollama service with a custom ROCm image combining
ghcr.io/ggml-org/llama.cpp:server-rocm and llama-swap v199.

Main motivations:
- Unblock qwen35 HF GGUFs (qwen35 architecture not supported in
  Ollama 0.20.4 for HF-imported models)
- Stay current with llama.cpp upstream without waiting for Ollama releases

Changes (config sketches follow this entry):
- ollama/Dockerfile: build llama-swap on top of llama.cpp:server-rocm
- ollama/llama-swap.yaml: define 4 models with full sampler config,
  GPU offload, and mmproj for the two multimodal HF fine-tunes
- ollama/docker-compose.yml: replace Ollama image with local build;
  fix broken volume mount (was /ubuntu/.ollama, now explicit /models)
- ollama/Caddyfile: update upstream port 11434→8080 (llama-swap default)
- ai/docker-compose.yml: switch Open WebUI from OLLAMA_BASE_URL to
  OPENAI_API_BASE_URL pointing at llama-swap /v1 endpoint
2026-04-09 23:14:43 +02:00
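
A minimal sketch of the ollama/Dockerfile layering this commit describes, assuming llama-swap is pulled from its GitHub releases (github.com/mostlygeek/llama-swap); the release asset name, the binary path inside the upstream image, and the tarball layout are assumptions to verify against the actual v199 release:

    # Layer the llama-swap proxy on top of the upstream ROCm llama.cpp server image
    FROM ghcr.io/ggml-org/llama.cpp:server-rocm

    # Asset name is an assumption -- check the actual v199 release for your platform
    ADD https://github.com/mostlygeek/llama-swap/releases/download/v199/llama-swap_linux_amd64.tar.gz /tmp/llama-swap.tar.gz
    RUN tar -xzf /tmp/llama-swap.tar.gz -C /usr/local/bin llama-swap \
        && rm /tmp/llama-swap.tar.gz

    COPY llama-swap.yaml /app/llama-swap.yaml

    # llama-swap listens on :8080 by default and spawns llama-server per model
    EXPOSE 8080
    ENTRYPOINT ["/usr/local/bin/llama-swap", "--config", "/app/llama-swap.yaml"]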
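
The llama-swap.yaml entries could look roughly like this; model names, GGUF paths, and sampler values are placeholders, and /app/llama-server assumes the upstream server image's binary location. The models/cmd/ttl layout and the ${PORT} macro follow llama-swap's documented config format:

    models:
      # Multimodal HF fine-tune: mmproj supplies the vision projector
      "qwen35-vl":
        cmd: >
          /app/llama-server --port ${PORT}
          --model /models/qwen35-vl-Q5_K_M.gguf
          --mmproj /models/qwen35-vl-mmproj-F16.gguf
          --n-gpu-layers 99 --ctx-size 16384
          --temp 0.7 --top-p 0.8 --top-k 20 --min-p 0.05
        ttl: 300   # unload after 5 idle minutes so another model can swap in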
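
For the compose change, the local build plus an explicit models mount; the host path and the ROCm device passthrough are the usual pattern, not taken from the actual file:

    services:
      llama-swap:
        build: .
        volumes:
          - /srv/models:/models      # explicit mount; replaces the broken /ubuntu/.ollama path
        devices:
          - /dev/kfd                 # ROCm compute device
          - /dev/dri                 # GPU render nodes
        group_add:
          - video
        restart: unless-stopped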
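
The Caddyfile change is just the upstream port swap (site address and service name here are placeholders):

    ai.example.com {
        # previously: reverse_proxy ollama:11434
        reverse_proxy llama-swap:8080
    }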
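
And on the Open WebUI side, pointing the UI at llama-swap's OpenAI-compatible /v1 endpoint is a one-variable change in ai/docker-compose.yml; OPENAI_API_BASE_URL is a real Open WebUI setting, while the service hostname and the dummy key are assumptions:

    services:
      open-webui:
        image: ghcr.io/open-webui/open-webui:main
        environment:
          # previously: OLLAMA_BASE_URL=http://ollama:11434
          - OPENAI_API_BASE_URL=http://llama-swap:8080/v1
          - OPENAI_API_KEY=none   # llama-swap ignores it, but the UI may expect a value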
d856a8704b feat(ai): add Open WebUI with Tailscale and Caddy reverse proxy
Set up a complete AI services stack with Open WebUI as the main interface,
secured behind a Caddy reverse proxy with automatic HTTPS via the Namecheap
DNS challenge. Tailscale integration provides secure remote access. (Config
sketches follow this entry.)

- Configure Open WebUI to connect to Ollama backend
- Set up Caddy with Namecheap DNS plugin for wildcard SSL certificates
- Add Tailscale for secure networking with health checks
- Configure reverse proxy to forward requests to Open WebUI
- Enable proper header forwarding for client IP preservation
- Set up persistent volumes for data, config, and SSL certificates
- Configure JSON logging with rotation for all services
2026-03-18 22:16:58 +01:00
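
A sketch of the Caddyfile shape this commit describes; the domain is a placeholder, and the option names inside the dns block are assumptions to verify against the caddy-dns/namecheap plugin's README:

    *.example.com {
        tls {
            # DNS-01 challenge via the Namecheap API for a wildcard certificate
            dns namecheap {
                api_key {env.NAMECHEAP_API_KEY}
                user {env.NAMECHEAP_API_USER}
            }
        }
        reverse_proxy open-webui:8080 {
            # Caddy adds X-Forwarded-For itself; pass the client IP explicitly too
            header_up X-Real-IP {remote_host}
        }
    }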
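
And the compose wiring for the Tailscale health check and JSON log rotation might look like this; auth key handling, volume names, and the health check command are illustrative:

    services:
      tailscale:
        image: tailscale/tailscale:latest
        hostname: ai
        cap_add: [NET_ADMIN]
        devices:
          - /dev/net/tun:/dev/net/tun
        environment:
          - TS_AUTHKEY=${TS_AUTHKEY}
          - TS_STATE_DIR=/var/lib/tailscale
        volumes:
          - tailscale-state:/var/lib/tailscale
        healthcheck:
          test: ["CMD", "tailscale", "status"]
          interval: 30s
          timeout: 10s
          retries: 3
        logging: &default-logging       # JSON logs with rotation, reused by every service
          driver: json-file
          options:
            max-size: "10m"
            max-file: "3"

      open-webui:
        image: ghcr.io/open-webui/open-webui:main
        environment:
          - OLLAMA_BASE_URL=http://ollama:11434   # the Ollama backend this commit targets
        volumes:
          - open-webui-data:/app/backend/data
        logging: *default-logging

    volumes:
      tailscale-state:
      open-webui-data: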