pgbench - 20-30% variation in benchmark results (non-repeatable benchmarks)

Problem

I'm trying to use pgbench to help me tune Postgres for my hardware, but I'm seeing a very strange behaviour. I cannot seem to get a stable number for TPS in consecutive runs of pgbench. Since I was running pgbench for 60 secs, I assumed that this is because of checkpointing and auto-vacuuming, so I made the following changes to my config:

autovacuum=off

max_wal_size=5GB

But this led to even more wildly varying numbers! For example, here's the output of the exact same pgbench command run consecutively:

Output 1

starting vacuum...end.

progress: 5.0 s, 566.0 tps, lat 10.577 ms stddev 2.788

progress: 10.0 s, 513.0 tps, lat 11.689 ms stddev 2.907

progress: 15.0 s, 513.8 tps, lat 11.680 ms stddev 2.995

progress: 20.0 s, 519.6 tps, lat 11.546 ms stddev 2.969

progress: 25.0 s, 518.4 tps, lat 11.576 ms stddev 2.929

progress: 30.0 s, 518.2 tps, lat 11.576 ms stddev 2.978

progress: 35.0 s, 522.8 tps, lat 11.472 ms stddev 2.966

progress: 40.0 s, 521.0 tps, lat 11.516 ms stddev 2.962

progress: 45.0 s, 521.2 tps, lat 11.510 ms stddev 2.909

progress: 50.0 s, 581.6 tps, lat 10.313 ms stddev 2.636

progress: 55.0 s, 520.8 tps, lat 11.526 ms stddev 2.919

progress: 60.0 s, 522.2 tps, lat 11.494 ms stddev 2.927

transaction type: <builtin: TPC-B (sort of)>

scaling factor: 2000

query mode: simple

number of clients: 6

number of threads: 6

duration: 60 s

number of transactions actually processed: 31699

latency average = 11.357 ms

latency stddev = 2.938 ms

tps = 528.185674 (including connections establishing)

tps = 528.269291 (excluding connections establishing)

Output 2

starting vacuum...end.

progress: 5.0 s, 528.4 tps, lat 11.318 ms stddev 2.940

progress: 10.0 s, 526.0 tps, lat 11.418 ms stddev 2.884

progress: 15.0 s, 522.8 tps, lat 11.473 ms stddev 2.892

progress: 20.0 s, 525.6 tps, lat 11.409 ms stddev 3.008

progress: 25.0 s, 528.0 tps, lat 11.366 ms stddev 2.858

progress: 30.0 s, 525.6 tps, lat 11.412 ms stddev 2.893

progress: 35.0 s, 521.8 tps, lat 11.503 ms stddev 2.973

progress: 40.0 s, 524.4 tps, lat 11.439 ms stddev 2.966

progress: 45.0 s, 736.6 tps, lat 8.152 ms stddev 3.801

progress: 50.0 s, 1101.2 tps, lat 5.447 ms stddev 0.738

progress: 55.0 s, 1012.2 tps, lat 5.929 ms stddev 0.609

progress: 60.0 s, 723.4 tps, lat 8.285 ms stddev 2.969

transaction type: <builtin: TPC-B (sort of)>

scaling factor: 2000

query mode: simple

number of clients: 6

number of threads: 6

duration: 60 s

number of transactions actually processed: 38886

latency average = 9.257 ms

latency stddev = 3.629 ms

tps = 647.993705 (including connections establishing)

tps = 648.099359 (excluding connections establishing)

That's a 20% variation in TPS for the exact same configuration!

What am I missing here?

Hardware setup

postgres server: 32 GB RAM / 6-core (12 thread) / SSD with RAID1

pgbench server: 32 GB RAM / 4-core (8 thread) / SSD

Relevant Postgres config for the above output

max_connection=100

work_mem=4MB

maintenance_work_mem=64MB

shared_buffers=12288MB

temp_buffers=8MB

effective_cache_size=16GB

wal_buffers=-1

wal_sync_method=fsync

max_wal_size=5GB

autovacuum=off

pgbench settings

Initialisation

pgbench 

  --initialize 

  --init-steps=dtgpf 

  --scale=2000  # Results in approx 30-32GB of data

  --username=benchmarking

Benchmarking

pgbench 

  --builtin=tpcb-like 

  --client=6 

  --jobs=6 

  --time=60 

  --progress=5 

  --username=benchmarking

asked 3 mins ago

Saurabh Nanda

13110

add a comment |

Problem

autovacuum=off

max_wal_size=5GB

But this led to even more wildly varying numbers! For example, here's the output of the exact same pgbench command run consecutively:

Output 1

starting vacuum...end.

progress: 5.0 s, 566.0 tps, lat 10.577 ms stddev 2.788

progress: 10.0 s, 513.0 tps, lat 11.689 ms stddev 2.907

progress: 15.0 s, 513.8 tps, lat 11.680 ms stddev 2.995

progress: 20.0 s, 519.6 tps, lat 11.546 ms stddev 2.969

progress: 25.0 s, 518.4 tps, lat 11.576 ms stddev 2.929

progress: 30.0 s, 518.2 tps, lat 11.576 ms stddev 2.978

progress: 35.0 s, 522.8 tps, lat 11.472 ms stddev 2.966

progress: 40.0 s, 521.0 tps, lat 11.516 ms stddev 2.962

progress: 45.0 s, 521.2 tps, lat 11.510 ms stddev 2.909

progress: 50.0 s, 581.6 tps, lat 10.313 ms stddev 2.636

progress: 55.0 s, 520.8 tps, lat 11.526 ms stddev 2.919

progress: 60.0 s, 522.2 tps, lat 11.494 ms stddev 2.927

transaction type: <builtin: TPC-B (sort of)>

scaling factor: 2000

query mode: simple

number of clients: 6

number of threads: 6

duration: 60 s

number of transactions actually processed: 31699

latency average = 11.357 ms

latency stddev = 2.938 ms

tps = 528.185674 (including connections establishing)

tps = 528.269291 (excluding connections establishing)

Output 2

starting vacuum...end.

progress: 5.0 s, 528.4 tps, lat 11.318 ms stddev 2.940

progress: 10.0 s, 526.0 tps, lat 11.418 ms stddev 2.884

progress: 15.0 s, 522.8 tps, lat 11.473 ms stddev 2.892

progress: 20.0 s, 525.6 tps, lat 11.409 ms stddev 3.008

progress: 25.0 s, 528.0 tps, lat 11.366 ms stddev 2.858

progress: 30.0 s, 525.6 tps, lat 11.412 ms stddev 2.893

progress: 35.0 s, 521.8 tps, lat 11.503 ms stddev 2.973

progress: 40.0 s, 524.4 tps, lat 11.439 ms stddev 2.966

progress: 45.0 s, 736.6 tps, lat 8.152 ms stddev 3.801

progress: 50.0 s, 1101.2 tps, lat 5.447 ms stddev 0.738

progress: 55.0 s, 1012.2 tps, lat 5.929 ms stddev 0.609

progress: 60.0 s, 723.4 tps, lat 8.285 ms stddev 2.969

transaction type: <builtin: TPC-B (sort of)>

scaling factor: 2000

query mode: simple

number of clients: 6

number of threads: 6

duration: 60 s

number of transactions actually processed: 38886

latency average = 9.257 ms

latency stddev = 3.629 ms

tps = 647.993705 (including connections establishing)

tps = 648.099359 (excluding connections establishing)

That's a 20% variation in TPS for the exact same configuration!

What am I missing here?

Hardware setup

postgres server: 32 GB RAM / 6-core (12 thread) / SSD with RAID1

pgbench server: 32 GB RAM / 4-core (8 thread) / SSD

Relevant Postgres config for the above output

max_connection=100

work_mem=4MB

maintenance_work_mem=64MB

shared_buffers=12288MB

temp_buffers=8MB

effective_cache_size=16GB

wal_buffers=-1

wal_sync_method=fsync

max_wal_size=5GB

autovacuum=off

pgbench settings

Initialisation

pgbench 

  --initialize 

  --init-steps=dtgpf 

  --scale=2000  # Results in approx 30-32GB of data

  --username=benchmarking

Benchmarking

pgbench 

  --builtin=tpcb-like 

  --client=6 

  --jobs=6 

  --time=60 

  --progress=5 

  --username=benchmarking

asked 3 mins ago

Saurabh Nanda

13110

add a comment |

Problem

autovacuum=off

max_wal_size=5GB

But this led to even more wildly varying numbers! For example, here's the output of the exact same pgbench command run consecutively:

Output 1

starting vacuum...end.

progress: 5.0 s, 566.0 tps, lat 10.577 ms stddev 2.788

progress: 10.0 s, 513.0 tps, lat 11.689 ms stddev 2.907

progress: 15.0 s, 513.8 tps, lat 11.680 ms stddev 2.995

progress: 20.0 s, 519.6 tps, lat 11.546 ms stddev 2.969

progress: 25.0 s, 518.4 tps, lat 11.576 ms stddev 2.929

progress: 30.0 s, 518.2 tps, lat 11.576 ms stddev 2.978

progress: 35.0 s, 522.8 tps, lat 11.472 ms stddev 2.966

progress: 40.0 s, 521.0 tps, lat 11.516 ms stddev 2.962

progress: 45.0 s, 521.2 tps, lat 11.510 ms stddev 2.909

progress: 50.0 s, 581.6 tps, lat 10.313 ms stddev 2.636

progress: 55.0 s, 520.8 tps, lat 11.526 ms stddev 2.919

progress: 60.0 s, 522.2 tps, lat 11.494 ms stddev 2.927

transaction type: <builtin: TPC-B (sort of)>

scaling factor: 2000

query mode: simple

number of clients: 6

number of threads: 6

duration: 60 s

number of transactions actually processed: 31699

latency average = 11.357 ms

latency stddev = 2.938 ms

tps = 528.185674 (including connections establishing)

tps = 528.269291 (excluding connections establishing)

Output 2

starting vacuum...end.

progress: 5.0 s, 528.4 tps, lat 11.318 ms stddev 2.940

progress: 10.0 s, 526.0 tps, lat 11.418 ms stddev 2.884

progress: 15.0 s, 522.8 tps, lat 11.473 ms stddev 2.892

progress: 20.0 s, 525.6 tps, lat 11.409 ms stddev 3.008

progress: 25.0 s, 528.0 tps, lat 11.366 ms stddev 2.858

progress: 30.0 s, 525.6 tps, lat 11.412 ms stddev 2.893

progress: 35.0 s, 521.8 tps, lat 11.503 ms stddev 2.973

progress: 40.0 s, 524.4 tps, lat 11.439 ms stddev 2.966

progress: 45.0 s, 736.6 tps, lat 8.152 ms stddev 3.801

progress: 50.0 s, 1101.2 tps, lat 5.447 ms stddev 0.738

progress: 55.0 s, 1012.2 tps, lat 5.929 ms stddev 0.609

progress: 60.0 s, 723.4 tps, lat 8.285 ms stddev 2.969

transaction type: <builtin: TPC-B (sort of)>

scaling factor: 2000

query mode: simple

number of clients: 6

number of threads: 6

duration: 60 s

number of transactions actually processed: 38886

latency average = 9.257 ms

latency stddev = 3.629 ms

tps = 647.993705 (including connections establishing)

tps = 648.099359 (excluding connections establishing)

That's a 20% variation in TPS for the exact same configuration!

What am I missing here?

Hardware setup

postgres server: 32 GB RAM / 6-core (12 thread) / SSD with RAID1

pgbench server: 32 GB RAM / 4-core (8 thread) / SSD

Relevant Postgres config for the above output

max_connection=100

work_mem=4MB

maintenance_work_mem=64MB

shared_buffers=12288MB

temp_buffers=8MB

effective_cache_size=16GB

wal_buffers=-1

wal_sync_method=fsync

max_wal_size=5GB

autovacuum=off

pgbench settings

Initialisation

pgbench 

  --initialize 

  --init-steps=dtgpf 

  --scale=2000  # Results in approx 30-32GB of data

  --username=benchmarking

Benchmarking

pgbench 

  --builtin=tpcb-like 

  --client=6 

  --jobs=6 

  --time=60 

  --progress=5 

  --username=benchmarking

asked 3 mins ago

Saurabh Nanda

13110

Problem

autovacuum=off

max_wal_size=5GB

But this led to even more wildly varying numbers! For example, here's the output of the exact same pgbench command run consecutively:

Output 1

starting vacuum...end.

progress: 5.0 s, 566.0 tps, lat 10.577 ms stddev 2.788

progress: 10.0 s, 513.0 tps, lat 11.689 ms stddev 2.907

progress: 15.0 s, 513.8 tps, lat 11.680 ms stddev 2.995

progress: 20.0 s, 519.6 tps, lat 11.546 ms stddev 2.969

progress: 25.0 s, 518.4 tps, lat 11.576 ms stddev 2.929

progress: 30.0 s, 518.2 tps, lat 11.576 ms stddev 2.978

progress: 35.0 s, 522.8 tps, lat 11.472 ms stddev 2.966

progress: 40.0 s, 521.0 tps, lat 11.516 ms stddev 2.962

progress: 45.0 s, 521.2 tps, lat 11.510 ms stddev 2.909

progress: 50.0 s, 581.6 tps, lat 10.313 ms stddev 2.636

progress: 55.0 s, 520.8 tps, lat 11.526 ms stddev 2.919

progress: 60.0 s, 522.2 tps, lat 11.494 ms stddev 2.927

transaction type: <builtin: TPC-B (sort of)>

scaling factor: 2000

query mode: simple

number of clients: 6

number of threads: 6

duration: 60 s

number of transactions actually processed: 31699

latency average = 11.357 ms

latency stddev = 2.938 ms

tps = 528.185674 (including connections establishing)

tps = 528.269291 (excluding connections establishing)

Output 2

starting vacuum...end.

progress: 5.0 s, 528.4 tps, lat 11.318 ms stddev 2.940

progress: 10.0 s, 526.0 tps, lat 11.418 ms stddev 2.884

progress: 15.0 s, 522.8 tps, lat 11.473 ms stddev 2.892

progress: 20.0 s, 525.6 tps, lat 11.409 ms stddev 3.008

progress: 25.0 s, 528.0 tps, lat 11.366 ms stddev 2.858

progress: 30.0 s, 525.6 tps, lat 11.412 ms stddev 2.893

progress: 35.0 s, 521.8 tps, lat 11.503 ms stddev 2.973

progress: 40.0 s, 524.4 tps, lat 11.439 ms stddev 2.966

progress: 45.0 s, 736.6 tps, lat 8.152 ms stddev 3.801

progress: 50.0 s, 1101.2 tps, lat 5.447 ms stddev 0.738

progress: 55.0 s, 1012.2 tps, lat 5.929 ms stddev 0.609

progress: 60.0 s, 723.4 tps, lat 8.285 ms stddev 2.969

transaction type: <builtin: TPC-B (sort of)>

scaling factor: 2000

query mode: simple

number of clients: 6

number of threads: 6

duration: 60 s

number of transactions actually processed: 38886

latency average = 9.257 ms

latency stddev = 3.629 ms

tps = 647.993705 (including connections establishing)

tps = 648.099359 (excluding connections establishing)

That's a 20% variation in TPS for the exact same configuration!

What am I missing here?

Hardware setup

postgres server: 32 GB RAM / 6-core (12 thread) / SSD with RAID1

pgbench server: 32 GB RAM / 4-core (8 thread) / SSD

Relevant Postgres config for the above output

max_connection=100

work_mem=4MB

maintenance_work_mem=64MB

shared_buffers=12288MB

temp_buffers=8MB

effective_cache_size=16GB

wal_buffers=-1

wal_sync_method=fsync

max_wal_size=5GB

autovacuum=off

pgbench settings

Initialisation

pgbench 

  --initialize 

  --init-steps=dtgpf 

  --scale=2000  # Results in approx 30-32GB of data

  --username=benchmarking

Benchmarking

pgbench 

  --builtin=tpcb-like 

  --client=6 

  --jobs=6 

  --time=60 

  --progress=5 

  --username=benchmarking

postgresql postgresql-performance performance-tuning benchmark

asked 3 mins ago

Saurabh Nanda

13110

asked 3 mins ago

Saurabh Nanda

13110

asked 3 mins ago

Saurabh Nanda

13110

asked 3 mins ago

Saurabh Nanda

13110

asked 3 mins ago

Saurabh Nanda

13110

add a comment |

0

active

oldest

votes

Your Answer

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "182"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});

}
});

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdba.stackexchange.com%2fquestions%2f227790%2fpgbench-20-30-variation-in-benchmark-results-non-repeatable-benchmarks%23new-answer', 'question_page');
}
);

Post as a guest

Name

Required, but never shown

0

active

oldest

votes

0

active

oldest

votes

draft saved

draft discarded

Thanks for contributing an answer to Database Administrators Stack Exchange!

Please be sure to answer the question. Provide details and share your research!

But avoid …

Asking for help, clarification, or responding to other answers.

Making statements based on opinion; back them up with references or personal experience.

To learn more, see our tips on writing great answers.

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Name

Required, but never shown

Name

Required, but never shown

This page is only for reference, If you need detailed information, please check here

8kDf,X7mejZCk

搜尋此網誌

Xrhrft