7a40ff629e
Build and Deploy GooSeek / build-and-deploy (push) Failing after 8m25s
feat: LLM routing by tier (free→Ollama, pro→Timeweb)
- Add tier-based provider routing in llm-svc
  - free tier → Ollama (local qwen3.5:9b)
  - pro/business → Timeweb Cloud AI
- Add /api/v1/embed endpoint for embeddings via Ollama
- Update Ollama client: qwen3.5:9b default, remove auth
- Add GenerateEmbedding() function for qwen3-embedding:0.6b
- Add Ollama K8s deployment with GPU support (RTX 4060 Ti)
- Add monitoring stack (Prometheus, Grafana, Alertmanager)
- Add Grafana dashboards for LLM and security metrics
- Update deploy.sh with monitoring and Ollama deployment
Made-with: Cursor
2026-03-03 02:25:22 +03:00
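
The tier-based routing described in the commit message could look roughly like the sketch below. This is not the actual llm-svc code: the `Provider` type, the constant names, and `ProviderForTier` are hypothetical, and returning an error for an unrecognized tier is an assumption, since the commit message does not say how unknown tiers are handled.

```go
package main

import (
	"fmt"
	"strings"
)

// Provider identifies an LLM backend. The type and constant names are
// illustrative assumptions, not identifiers taken from llm-svc.
type Provider string

const (
	ProviderOllama  Provider = "ollama"  // local Ollama instance (free tier)
	ProviderTimeweb Provider = "timeweb" // Timeweb Cloud AI (pro and business tiers)
)

// ProviderForTier maps a subscription tier to a backend, following the
// mapping in the commit message: free -> Ollama, pro/business -> Timeweb.
// Rejecting unknown tiers with an error is an assumption of this sketch.
func ProviderForTier(tier string) (Provider, error) {
	switch strings.ToLower(strings.TrimSpace(tier)) {
	case "free":
		return ProviderOllama, nil
	case "pro", "business":
		return ProviderTimeweb, nil
	default:
		return "", fmt.Errorf("unknown tier %q", tier)
	}
}

func main() {
	// "trial" is a made-up tier used only to exercise the error path.
	for _, tier := range []string{"free", "pro", "business", "trial"} {
		p, err := ProviderForTier(tier)
		if err != nil {
			fmt.Printf("%s: %v\n", tier, err)
			continue
		}
		fmt.Printf("%s -> %s\n", tier, p)
	}
}
```

Keeping the mapping in one pure function means the routing rule can be unit-tested without a running Ollama or Timeweb backend. A real service might instead fall back to a default provider for unknown tiers.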