Architecture

Scaling

How Datamotive scales management, replication, and recovery using horizontal node expansion and adaptive flow control.

Product: Datamotive Platform
Version: v1.0
Documentation status: Published
Last updated: Updated May 28, 2026
Reading time: 2 min read

Datamotive scales horizontally across the data plane and independently scales management, replication, and recovery orchestration. Add nodes when a single node cannot sustain the required RPO, replication throughput, or recovery concurrency.

Scalability model

Datamotive scalability is achieved through:

Parallel replication streams
Sliding-window replication
Vertical or horizontal replication-node expansion
Distributed recovery orchestration

Small environments can operate with minimal infrastructure. Large environments can add Replication Nodes to distribute data movement and recovery workloads. Recovery throughput scales independently from management operations, and replication throughput scales independently from orchestration operations.

Data plane scaling

Add Replication Nodes

Add Replication Nodes when cycle duration approaches or exceeds the configured RPO, node CPU or memory pressure stays elevated, or cloud-write latency increases under concurrency.

Each node operates independently while the control plane aggregates status across the site.

Environment size	Recommended Replication Nodes
Fewer than 50 workloads	1
50 to 300 workloads	2 to 4
300 to 1000 workloads	4 to 8
More than 1000 workloads	8+

Horizontal scaling provides increased aggregate throughput, better API distribution, improved recovery concurrency, reduced node-level contention, and better operational isolation.

Platform-specific scaling

VMware environments are commonly constrained by CBT fragmentation, snapshot performance, VDDK read concurrency, datastore latency, and WAN throughput. AWS and Azure environments are often constrained by cloud API throughput, snapshot or disk operation concurrency, instance network bandwidth, and regional quota behavior.

Control plane scaling

The Management Server handles UI, API, CLI, orchestration, scheduling, monitoring, reporting, inventory, and metadata workflows. Increase Management Server capacity as protected workload count, recovery orchestration concurrency, reporting retention, API activity, and multi-site deployment scale grow.

Environment scale	vCPU	Memory	Storage
Very small	4 vCPU	8 GB	120 GB SSD
Small	4 vCPU	16 GB	200 GB SSD
Medium	8 vCPU	24 GB	500 GB SSD
Large	16 vCPU	32 GB	1 TB SSD
Very large	32 vCPU	64 GB	2 TB SSD

Recovery scaling

Recovery orchestration is parallelized but must respect cloud API limits, quota constraints, disk provisioning concurrency, storage write behavior, and network provisioning limits.

For Windows workloads, Prep Nodes provide recovery preparation and boot remediation. A Prep Node supports up to 20 parallel Windows recoveries; add Prep Nodes horizontally for larger recovery events.

Scale signals

Add capacity or reduce concurrency when you observe:

Replication cycle duration consistently exceeding 80 percent of the RPO interval.
CPU or memory pressure on a Replication Node during replication windows.
Cloud storage write latency increasing under parallel replication.
API throttling responses during replication or recovery.
Job queue depth growing over time.
Recovery batches waiting on VM, disk, NIC, or IP quota.

Related docs

Was this page helpful?