Which instance should I choose for my workers?

It depends on what runs on the nodes. For classic web applications and microservices, s1 (1:2 ratio) is the right starting point. For databases, data pipelines or GPU workers, prefer u1 (1:4 ratio), which gives more RAM per core. Reserve m1 (1:8 ratio) for genuinely memory-hungry workloads — monitoring, caches, analytics.

Which storageClass should I use in production?

replicated is the default recommendation for production. It replicates your persistent volumes across the three Swiss data centres synchronously — your data stays accessible even if a site fails. local is faster (a single copy, less latency) but with no storage-level high availability — reserve it for development or genuinely temporary data.
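As an illustration, a workload could request a replicated volume with a standard PersistentVolumeClaim. This is a minimal sketch: the claim name and size are hypothetical, and it assumes the class is exposed inside your cluster under the same name, replicated (check with kubectl get storageclass).

```yaml
# Sketch only: name and size are illustrative.
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: app-data                 # hypothetical claim name
spec:
  accessModes:
    - ReadWriteOnce
  storageClassName: replicated   # assumed to match the class name used above
  resources:
    requests:
      storage: 10Gi
```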

How do I add GPU nodes to an existing cluster?

Add a new node group with the gpus[] field in your manifest, enable the gpuOperator add-on, and apply. The GPU Operator automatically installs the NVIDIA drivers on the new nodes. Note: each node in the GPU node group consumes one physical GPU. A node group with minReplicas: 4 ties up 4 GPUs permanently. Consider minReplicas: 0 if your GPU workloads are intermittent.
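The fragment below sketches what such a change might look like. Only the gpus[] field, the gpuOperator add-on and minReplicas come from the text above. The group name, the instance type, the shape of each gpus[] entry and the GPU identifier are assumptions to check against the platform reference.

```yaml
# Fragment to merge into your existing cluster manifest.
# Keep each key where it already sits in that manifest.
nodeGroups:
  gpu:                                     # hypothetical group name
    minReplicas: 0                         # no GPU held while nothing is scheduled
    maxReplicas: 2                         # at most 2 physical GPUs in use
    instanceType: "u1.2xlarge"             # assumed instance size
    resources: {}
    gpus:
      - name: "nvidia.com/AD102GL_L40S"    # assumed entry shape and identifier
addons:
  gpuOperator:
    enabled: true
```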

How do I retrieve my kubeconfig?

The kubeconfig is generated automatically when the cluster is created and stored in a Kubernetes Secret.

Can I use FluxCD to deploy my applications?

Yes. Enable the fluxcd add-on in the cluster manifest with your Git repository URL. FluxCD automatically syncs all the YAML manifests from the repository into your cluster. A push to your main branch triggers a deployment with no manual action. For private repositories, configure SSH or token authentication in the cluster after Flux's initial deployment.
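For a private repository, the objects involved would typically look like the sketch below, written with the standard Flux GitRepository and Kustomization resources. All names, the namespace, the URL and the Secret are hypothetical, and the page does not say which of these objects the add-on creates for you, so treat this as an illustration rather than the platform's exact setup.

```yaml
# Sketch only: names, namespace and URL are illustrative.
apiVersion: source.toolkit.fluxcd.io/v1
kind: GitRepository
metadata:
  name: my-apps
  namespace: flux-system
spec:
  interval: 1m                   # how often Flux checks the repository
  url: ssh://git@example.com/my-org/my-apps.git
  ref:
    branch: main
  secretRef:
    name: my-apps-ssh            # Secret holding the SSH key or token
---
apiVersion: kustomize.toolkit.fluxcd.io/v1
kind: Kustomization
metadata:
  name: my-apps
  namespace: flux-system
spec:
  interval: 1m
  sourceRef:
    kind: GitRepository
    name: my-apps
  path: ./                       # apply the manifests at the repository root
  prune: true                    # delete objects that were removed from Git
```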

MANAGED KUBERNETES

A Kubernetes cluster
production-ready in 5 minutes

Control plane fully managed by Hikube, worker nodes in your project. Replicated storage across 3 datacenters. You keep control of your workloads, we take care of the rest.


Addons available

kubectl

Helm

FluxCD

cert-manager

Cilium

Maintenance-free, managed control plane
kube-apiserver, etcd, scheduler and controller-manager are hosted and operated by Hikube. Replicated on multiple sites.
Worker nodes in your project
Your workers are VMs in your space. You configure them via declarative node groups - instance type, scaling, roles, GPU.
Replicated storage across 3 datacenters
Your persistent volumes are automatically replicated to Geneva, Gland and Lucerne. Native high availability at storage level.
Standard Kubernetes API
Access via kubectl, client SDKs, Terraform or any compatible tool. No proprietary tools required.
Cilium as default CNI
Pod-to-pod network, NetworkPolicies and Hubble observability included. Default-allow policy modifiable at any time.
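To tighten the default-allow behaviour for one namespace, a common pattern is a default-deny policy plus explicit allow rules. The sketch below uses the standard Kubernetes NetworkPolicy API. The namespace and policy names are hypothetical.

```yaml
# Sketch only: namespace and names are illustrative.
# 1. Deny all incoming traffic to every pod in the namespace.
apiVersion: networking.k8s.io/v1
kind: NetworkPolicy
metadata:
  name: default-deny-ingress
  namespace: my-app
spec:
  podSelector: {}                # empty selector = all pods in the namespace
  policyTypes:
    - Ingress
---
# 2. Re-allow traffic between pods of the same namespace.
apiVersion: networking.k8s.io/v1
kind: NetworkPolicy
metadata:
  name: allow-same-namespace
  namespace: my-app
spec:
  podSelector: {}
  policyTypes:
    - Ingress
  ingress:
    - from:
        - podSelector: {}        # any pod in this namespace
```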

CONTROL PLANE - MANAGED BY HIKUBE

kube-apiserver etcd x3 scheduler controller-manager

NODE GROUPS

Workers (your project)

VMs with min/max autoscaling. Separated by role and load type...

NETWORK

Cilium CNI

NetworkPolicy, Hubble observability, LoadBalancer, Ingress NGINX.

STORAGE

Replicated PVCs

Dynamic provisioning. 3 backends: Geneva - Gland - Lucerne.

yaml

apiVersion: apps.cozystack.io/v1alpha1
kind: Kubernetes
metadata:
  name: my-first-cluster
spec:
  controlPlane:
    replicas: 2
    storageClass: "replicated"
  nodeGroups:
    general:
      minReplicas: 1
      maxReplicas: 5
      instanceType: "s1.large" # 4 vCPU, 8 GB RAM
      ephemeralStorage: 50Gi
      resources: {}
      roles:
        - ingress-nginx
  addons:
    certManager:
      enabled: true
    ingressNginx:
      enabled: true

bash

# Deploy the cluster
kubectl apply -f my-first-cluster.yaml

# Retrieve the kubeconfig
kubectl get secret my-first-cluster-admin-kubeconfig \
  -o jsonpath='{.data.super-admin}' \
  | base64 -d > kubeconfig.yaml

# Check the nodes
export KUBECONFIG=kubeconfig.yaml
kubectl get nodes

A few points to remember to get started:

replicas: 3 for production
An odd number of replicas is recommended to guarantee the etcd quorum and high availability of the control plane.
resources: {} is mandatory
Each node group must declare this field, even if empty. Without it, CPU/RAM values are not inherited from instanceType.
Separate your node groups by role
Web, compute, monitoring, GPU - separate groups enable independent autoscaling and better cost control.
Scale to zero for GPU workloads
minReplicas: 0 on a GPU node group allows resources to be consumed only when GPU pods are actually scheduled.
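Put together, the first three points could translate into a manifest along these lines. This is a sketch, not a reference configuration: the cluster and group names are hypothetical, and the camelCase ephemeralStorage key is assumed.

```yaml
# Sketch only: cluster and group names are illustrative.
apiVersion: apps.cozystack.io/v1alpha1
kind: Kubernetes
metadata:
  name: my-prod-cluster
spec:
  controlPlane:
    replicas: 3                  # odd number, keeps etcd quorum if one replica fails
  nodeGroups:
    web:                         # scales with traffic
      minReplicas: 2
      maxReplicas: 10
      instanceType: "s1.large"
      resources: {}              # declared even when empty
      roles:
        - ingress-nginx
    monitoring:                  # isolated from application workloads
      minReplicas: 1
      maxReplicas: 2
      instanceType: "m1.xlarge"
      ephemeralStorage: 200Gi    # assumed key spelling
      resources: {}
```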

s1 - Standard (vCPU:RAM 1:2): general workloads, web servers

u1 - Universal (vCPU:RAM 1:4): business applications, databases

m1 - Memory (vCPU:RAM 1:8): cache, analytics, monitoring, ML

local: a single datacenter. No replication, minimal latency.

replicated: synchronous multi-datacenter. Each write is confirmed on 3 sites before it returns.

replicated-async: asynchronous multi-datacenter. Writes land locally first, then propagate with a delay.

Cert-Manager

Automatic TLS certificates via Let's Encrypt or your private authority. Automatic renewal.

Gateway API

Advanced HTTP/HTTPS routing based on the Kubernetes standard. Fine-grained incoming traffic management and multi-protocol support.

Monitoring

Node Exporter, Fluent Bit, Kube-State-Metrics. Grafana and VictoriaMetrics integration.

GPU Operator

Automatic NVIDIA drivers on GPU nodes. Required to run GPU workloads in Kubernetes.

FluxCD

Native GitOps, ensuring continuous synchronization from your Git repository. Automated deployment and rollback.

Application stack with Ingress and TLS

Node group web with ingress-nginx role, cert-manager for Let's Encrypt certificates, autoscaling between 2 and 10 nodes depending on traffic.

s1.large ingress-nginx cert-manager replicated
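On the application side, this stack is typically consumed through a standard Ingress object annotated for cert-manager. The sketch below assumes a ClusterIssuer named letsencrypt-prod already exists and that the ingress class is called nginx. Both names, the host and the Service are hypothetical.

```yaml
# Sketch only: issuer, class, host and Service names are assumptions.
apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: my-app
  annotations:
    cert-manager.io/cluster-issuer: letsencrypt-prod   # assumed issuer name
spec:
  ingressClassName: nginx        # assumed class name
  tls:
    - hosts:
        - my-app.example.com
      secretName: my-app-tls     # cert-manager stores the certificate here
  rules:
    - host: my-app.example.com
      http:
        paths:
          - path: /
            pathType: Prefix
            backend:
              service:
                name: my-app
                port:
                  number: 80
```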

Continuous deployment from Git with FluxCD

FluxCD synchronizes your Git repository every minute. Push on main = automatic deployment in the cluster. Rollback via Git revert.

fluxcd cert-manager ingress-nginx

Hybrid CPU + GPU cluster with scale to zero

GPU node group with minReplicas: 0 - GPU nodes only run when jobs are scheduled. Permanent CPU node group for orchestration.

u1.2xlarge L40S GPU gpuOperator 500Gi
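A job that would wake such a GPU node group could look like the sketch below. It relies on the nvidia.com/gpu resource that the NVIDIA device plugin normally advertises once the GPU Operator is running. The job name and the container image are illustrative choices, not platform requirements.

```yaml
# Sketch only: job name and image are illustrative.
apiVersion: batch/v1
kind: Job
metadata:
  name: gpu-smoke-test
spec:
  backoffLimit: 0
  template:
    spec:
      restartPolicy: Never
      containers:
        - name: cuda
          image: nvidia/cuda:12.4.1-base-ubuntu22.04   # example image
          command: ["nvidia-smi"]                      # prints the visible GPU
          resources:
            limits:
              nvidia.com/gpu: 1                        # requests one GPU
```

While no GPU node exists, the pod stays Pending until a node has been provisioned for it.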

Node group dedicated to monitoring

Node group m1.xlarge with monitoring role and generous ephemeral storage (200Gi). VictoriaMetrics, Grafana and Fluent Bit isolated from the rest of the workloads.

m1.xlarge monitoringAgents 200Gi

Why managed Kubernetes

Less ops, more time for your workloads

Maintaining a Kubernetes control plane in production means managing version upgrades, monitoring the health of etcd, managing internal certificates, and ensuring that quorum is preserved at all times. It's not complicated, but it does take time and attention.

With Hikube, this operational burden disappears. You declare what you want (cluster size, node groups, add-ons) and the platform takes care of the rest. You retain full access via the standard Kubernetes API: kubectl, Helm, Terraform and FluxCD work exactly as on a self-managed cluster.

Multi-datacenter replication across three Swiss sites is transparent to your workloads. Your PVCs are available even if one datacenter is inaccessible, without any additional configuration on your part.

How does autoscaling work?

Autoscaling is controlled by minReplicas and maxReplicas on each node group. When pods stay in the Pending state for lack of resources, new nodes are provisioned automatically. When load decreases, under-utilized nodes are removed, without ever going below minReplicas.

You can adjust the bounds at any time by editing the manifest and re-running kubectl apply. For GPU or development workloads, minReplicas: 0 lets the group scale down to zero nodes when there is nothing to run; allow a few minutes of cold start for the first scheduled pod.
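Because scaling reacts to pods that cannot be scheduled, it helps to declare resource requests on your workloads: the scheduler places pods based on those requests. The Deployment below is a minimal sketch with hypothetical names and values.

```yaml
# Sketch only: names, image and sizes are illustrative.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: web
spec:
  replicas: 6
  selector:
    matchLabels:
      app: web
  template:
    metadata:
      labels:
        app: web
    spec:
      containers:
        - name: web
          image: nginx:1.27
          resources:
            requests:
              cpu: "1"           # the scheduler reserves 1 vCPU per pod
              memory: 1Gi
```

If the six pods do not fit on the current nodes, the ones left over stay Pending, which is the signal described above for adding nodes up to maxReplicas.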

3 DC

5 mins

0 Masters

K8s API

Turnkey managed Kubernetes

Fully declarative configuration

A cluster in a few lines of YAML

Three ranges for every type of workload

Examples:

PVCs replicated across three datacenters

Activate what you need

Cert-Manager

Gateway API

Monitoring

GPU Operator

FluxCD

Common Hikube architectures

WEB APPLICATION

GITOPS

ML / AI

OBSERVABILITY

No control plane maintenance

GPU and CPU on the same cluster

Declarative and automatic scaling

Swiss data, sovereign architecture
