Registry Access
You’ll need credentials from our support team to access Membrane artifacts: Registry Credentials- Username format:
robot$<your-company-name> - Access to:
harbor.getmembrane.com
Setting Up Registry Access
- Login to Helm registry
- Pull and unpack the Integration.app Helm chart:
Prerequisites
Before installing Membrane using Helm, ensure you have the following components set up:Prometheus Stack
The kube-prometheus stack provides Prometheus, Grafana dashboards, and necessary Prometheus rules:KEDA
If you plan to use autoscaling features, install KEDA:Installation
- Configure Container Registry Access
- Prepare Configuration
values.yaml file with the values for your setup:
yaml
- Validate Chart
- Select Cluster Context
- Deploy
Autoscaling Configuration
The following components support autoscaling:- API
- Instant Tasks Worker
- Queued Tasks Worker
- Custom Code Runner
| Parameter | Type | Description |
|---|---|---|
.autoscaling.enabled | Boolean | Enables/disables autoscaling for the component. If autoscaling is a number of replicas will taken from.replicas property. IF autoscaling is enabled, .replicas is ignored. |
.autoscaling.minReplicaCount | Number | Minimum number of replicas |
.autoscaling.maxReplicaCount | Number | Maximum number of replicas |
.autoscaling.cooldownPeriod | Number | Cooldown period between scaling operations |
.autoscaling.pollingInterval | Number | How often to check scaling metrics |
Component-Specific Scaling
Each component has specific scaling parameters that control its autoscaling behavior:| Parameter | Type | Default | Description |
|---|---|---|---|
api.autoscaling. scalingTargets. cpuUtilizationPercent | Number | 50 | Defines the target CPU utilization percentage. Adjusting this value will influence how aggressively the API scales in response to CPU load |
instantTasksWorker. autoscaling. scalingTargets. utilizationRate | Number | 0.75 | Defines the expected percentage of time (0.0-1.0) that workers should be actively processing tasks. Higher values minimize worker idle time but can cause processing delays during high load periods |
customCodeRunner. autoscaling. scalingTargets. capacityRate | Number | 0.45 | Defines the capacity rate of available to total slots. A higher value increases the likelihood of custom code execution waiting for a slot, potentially slowing down API requests. A lower value ensures that custom code requests are processed promptly, but it may result in a higher number of idle pods. |
queuedTasksWorker. autoscaling. scalingTargets. utilizationRate | Number | 0.85 | Defines the expected percentage of time (0.0-1.0) that workers should be actively processing tasks. Higher values minimize worker idle time but can cause processing delays during high load periods |