Product comparison matrix

When StoatFlow vs. each

vs. Kafka Streams

Kafka Streams is battle-tested and has one of the most ergonomic topology APIs on the JVM. StoatFlow keeps a Kafka Streams DSL-compatible programming model, but trades Kafka Streams' horizontal scale-out architecture for a simpler single-instance runtime. That removes whole categories of distributed overhead: inter-instance rebalancing, state migration, and repartition-topic round-trips. For Kafka-native workloads that fit comfortably on one modern machine, this can translate into lower compute, memory, network, and storage usage — and often lower latency, higher throughput on the same hardware, and materially lower operating cost. The trade-off is explicit: StoatFlow gives up open-ended horizontal scale in favour of simpler, more resource-efficient vertical scale.

vs. Apache Flink (self-hosted)

Flink is the right tool for non-Kafka sources and sinks, analytics, ML, unified streaming + batch, or massive-scale workloads (Netflix, Uber, Stripe, Alibaba). StoatFlow targets the wide middle of Kafka-native stream processing applications — workloads Flink can also handle, but where the JobManager/TaskManager runtime, checkpoint tuning, and platform expertise can become a significant part of the cost structure.

vs. Managed Flink

Managed Flink removes much of the operational burden, but you still pay for a managed distributed data-processing platform. That cost is justified when you need Flink's scale, elasticity, SQL, or connector ecosystem. It can be disproportionate for Kafka-native workloads that would otherwise fit comfortably on one machine. StoatFlow targets that middle ground: simpler deployment and service-like cost structure, with explicit single-machine limits.

Measured single-machine limit

On a basic 8-core Hetzner dedicated VM, StoatFlow saturates CPU or network at ~200–300 MB/s uncompressed throughput — in events, ~124K/sec on a 1KB stateless transform, up to ~2.1M/sec output on word-count-style aggregation. That is the practical ceiling to plan against when sizing a single instance; sustained workloads above it are the ones where horizontal scale-out is the right tool.

Two reference workloads anchor that envelope.

stateless-simple: 1 KB strings → map(uppercase()) → sink

124k ops/s in -> 124k ops/s out

word-count: phrases dictionary → split + groupBy + count → sink

52k ops/s in -> 2.1M ops/s out

Last measured 2026-05-10. More benchmarks — on stronger hardware — coming soon.