Files
gooseek/backend/deploy/k8s/api-gateway.yaml
home 7a40ff629e
Some checks failed
Build and Deploy GooSeek / build-and-deploy (push) Failing after 8m25s
feat: LLM routing by tier (free→Ollama, pro→Timeweb)
- Add tier-based provider routing in llm-svc
  - free tier → Ollama (local qwen3.5:9b)
  - pro/business → Timeweb Cloud AI
- Add /api/v1/embed endpoint for embeddings via Ollama
- Update Ollama client: qwen3.5:9b default, remove auth
- Add GenerateEmbedding() function for qwen3-embedding:0.6b
- Add Ollama K8s deployment with GPU support (RTX 4060 Ti)
- Add monitoring stack (Prometheus, Grafana, Alertmanager)
- Add Grafana dashboards for LLM and security metrics
- Update deploy.sh with monitoring and Ollama deployment

Made-with: Cursor
2026-03-03 02:25:22 +03:00

73 lines
1.5 KiB
YAML

apiVersion: apps/v1
kind: Deployment
metadata:
name: api-gateway
namespace: gooseek
labels:
app: api-gateway
app.kubernetes.io/name: api-gateway
app.kubernetes.io/part-of: gooseek
spec:
replicas: 2
selector:
matchLabels:
app: api-gateway
template:
metadata:
labels:
app: api-gateway
annotations:
prometheus.io/scrape: "true"
prometheus.io/port: "3015"
prometheus.io/path: "/metrics"
spec:
containers:
- name: api-gateway
image: gooseek/backend:latest
env:
- name: SERVICE
value: "api-gateway"
- name: PORT
value: "3015"
envFrom:
- configMapRef:
name: gooseek-config
- secretRef:
name: gooseek-secrets
ports:
- containerPort: 3015
name: http
livenessProbe:
httpGet:
path: /health
port: 3015
initialDelaySeconds: 10
periodSeconds: 15
readinessProbe:
httpGet:
path: /ready
port: 3015
initialDelaySeconds: 5
periodSeconds: 10
resources:
requests:
cpu: 100m
memory: 128Mi
limits:
cpu: 500m
memory: 256Mi
---
apiVersion: v1
kind: Service
metadata:
name: api-gateway
namespace: gooseek
spec:
type: ClusterIP
selector:
app: api-gateway
ports:
- port: 3015
targetPort: 3015
name: http