CometBFT QA Results v0.34.x

v0.34.x - From Tendermint Core to CometBFT

This section reports on the QA process we followed before releasing the first v0.34.x version from our CometBFT repository.

The changes with respect to the last version of v0.34.x (namely v0.34.26, released from the Informal Systems’ Tendermint Core fork) are minimal, and focus on rebranding our fork of Tendermint Core to CometBFT at places where there is no substantial risk of breaking compatibility with earlier Tendermint Core versions of v0.34.x.

Indeed, CometBFT versions of v0.34.x (v0.34.27 and subsequent) should fulfill the following compatibility-related requirements.

Operators can easily upgrade a v0.34.x version of Tendermint Core to CometBFT.
Upgrades from Tendermint Core to CometBFT can be uncoordinated for versions of the v0.34.x branch.
Nodes running CometBFT must be interoperable with those running Tendermint Core in the same chain, as long as all are running a v0.34.x version.

These QA tests focus on the third bullet, whereas the first two bullets are tested using our e2e tests.

It would be prohibitively time consuming to test mixed networks of all combinations of existing v0.34.x versions, combined with the CometBFT release candidate under test. Therefore our testing focuses on the last Tendermint Core version (v0.34.26) and the CometBFT release candidate under test.

We run the 200 node test, but not the rotating node test. The effort of running the latter is not justified given the amount and nature of the changes we are testing with respect to the full QA cycle run previously on v0.34.x. Since the changes to the system’s logic are minimal, we are interested in these performance requirements:

The CometBFT release candidate under test performs similarly to Tendermint Core (i.e., the baseline)
- when used at scale (i.e., in a large network of CometBFT nodes)
- when used at scale in a mixed network (i.e., some nodes are running CometBFT and others are running an older Tendermint Core version)

Therefore we carry out a complete run of the 200-node test on the following networks:

A homogeneous 200-node testnet, where all nodes are running the CometBFT release candidate under test.
A mixed network where 1/2 (99 out of 200) of the nodes are running the CometBFT release candidate under test, and the rest (101 out of 200) are running Tendermint Core v0.34.26.
A mixed network where 1/3 (66 out of 200) of the nodes are running the CometBFT release candidate under test, and the rest (134 out of 200) are running Tendermint Core v0.34.26.
A mixed network where 2/3 (133 out of 200) of the nodes are running the CometBFT release candidate under test, and the rest (67 out of 200) are running Tendermint Core v0.34.26.

Configuration and Results

In the following sections we provide the results of the 200 node test. Each section reports the baseline results (for reference), the homogeneous network scenario (all CometBFT nodes), and the mixed networks with 1/2, 1/3 and 2/3 of Tendermint Core nodes.

Saturation Point

As the CometBFT release candidate under test has minimal changes with respect to Tendermint Core v0.34.26, other than the rebranding changes, we can confidently reuse the results from the v0.34.x baseline test regarding the saturation point.

Therefore, we will simply use a load of (r=200,c=2) (see the explanation here) on all experiments.

We also include the baseline results for quick reference and comparison.

Experiments

On each of the three networks, the test consists of 4 experiments, with the goal of ensuring the data obtained is consistent across experiments.

On each of the networks, we pick only one representative run to present and discuss the results.

Examining latencies

For each network the figures plot the four experiments carried out with the network. We can see that the latencies follow comparable patterns across all experiments.

Unique identifiers, UUID, for each execution are presented on top of each graph. We refer to these UUID to indicate to the representative runs.

CometBFT Homogeneous network

latencies

1/2 Tendermint Core - 1/2 CometBFT

latencies

1/3 Tendermint Core - 2/3 CometBFT

latencies

2/3 Tendermint Core - 1/3 CometBFT

latencies_all_tm2_3_cmt1_3

Prometheus Metrics

This section reports on the key Prometheus metrics extracted from the following experiments.

Baseline results: v0.34.x, obtained in October 2022 and reported here.
CometBFT homogeneous network: experiment with UUID starting with be8c.
Mixed network, 1/2 Tendermint Core v0.34.26 and 1/2 running CometBFT: experiment with UUID starting with 04ee.
Mixed network, 1/3 Tendermint Core v0.34.26 and 2/3 running CometBFT: experiment with UUID starting with fc5e.
Mixed network, 2/3 Tendermint Core v0.34.26 and 1/3 running CometBFT: experiment with UUID starting with 4759.

We make explicit comparisons between the baseline and the homogenous setups, but refrain from commenting on the mixed network experiment unless they show some exceptional results.

Mempool Size

For each reported experiment we show two graphs. The first shows the evolution over time of the cumulative number of transactions inside all full nodes’ mempools at a given time.

The second one shows the evolution of the average over all full nodes.

Baseline

mempool-cumulative

mempool-avg