Documentation Index
Fetch the complete documentation index at: https://docs.trainy.ai/llms.txt
Use this file to discover all available pages before exploring further.
Konduktor Config Yaml
The schema defaults to what is stored in~/.konduktor/config.yaml, but can be customized
with the environment variable KONDUKTOR_CONFIG='/path/to/custom_konduktorconf.yaml'
Schema
Details
allowed_clouds - which storage backends Konduktor uses for uploads/downloads
logs: timeout - timeout in seconds for querying logs via konduktor logs
logs: backend - which log stack your cluster exposes
tailscale: secret_name - Kubernetes secret containing your Tailscale auth key so Konduktor can auto-connect jobs/containers to your Tailnet
ssh: enable - flips on SSH support (and key management) for launched pods so you can debug or port-forward into them
serving: endpoint - choice between direct IPs or .trainy.us endpoints for accesing deployments
kubernetes: allowed_contexts - pins Konduktor to a specific kube context from KUBECONFIG i.e. ~/.kube/config. Only use a 1 item list
kubernetes: pod_config: spec - raw PodSpec overrides merged into every JobSet pod (handy for imagePullSecrets, volumes, tolerations, etc.)
kubernetes: pod_config: spec: imagePullSecrets - supply registry creds for pulling private images
kubernetes: jobset_config - JobSet-level metadata (annotations and labels) merged into every JobSet. Useful for setting Kueue annotations like maxRunDurationSeconds as a default for all jobs. See the JobSet documentation for more details on JobSet configuration
kubernetes: jobset_config: metadata: annotations: kueue.x-k8s.io/maxRunDurationSeconds - maximum job runtime in seconds before Kueue terminates it (defaults to 604800 = 7 days). Can be overridden per-task via resources.labels.maxRunDurationSeconds
kubernetes: provision_timeout - timeout in seconds for pods/services/deployments to become ready before giving up (set -1 to disable)