Suboptimal query plan when updating partitioned table
Background
- I have a simple CTE used to update a declaratively-partitioned table.
- The subquery runs quickly (
1.7secperEXPLAIN ANALYZE) and returns3,769records (CTEyin the query below). - The
UPDATEseeks to update non-index columns of a declaratively-partitioned table. A few characteristics of the table:
- Contains
21 millionrecords
184partitions -- yes, too many -- each child partition has primary/ foreign keys and indices
fillfactor=100(plan to reduce to potentially use HOT on same pages, but not certain that query plan affected by lack of page space)
- Contains
Problem arises upon UPDATE, per the incremental query plan shown (EXPLAIN) after the subquery (via CTE) runs (several nested loops of hash joins):
-> Hash Join /* Run for each child partition, based on CTE PK = child table PK */
-> Nested Loop
-> CTE Scan
-> Append
-> Index Scan /* for index on each child partition */
... /* Index scans (for each child partition) */
-> Hash
-> Seq Scan /* on child table */
... /* Hash Joins (for each child partition) */
Query
The following query is the UPDATE statement causing the issue. Basically, the query performs a couple functions using a value from parent_table that cannot be nested into a single SQL statement (so two CTEs used), then UPDATE the same parent_table for the result (the functions are expensive, so the result stored in the table itself).
WITH x AS (
SELECT t."p1", t."p2", f(t."b1") OVER "win_x" AS "c1"
FROM parent_table AS "t"
WHERE t."p1" IN ('val1','val2')
WINDOW "win_x" AS (PARTITION BY "p1" ORDER BY "p1","p2")
), y AS (
SELECT x."p1", x."p2", f(x."c1") OVER "win_y" AS "c2"
FROM x
WINDOW "win_y" AS (PARTITION BY "p1" ORDER BY "p1","p2")
)
UPDATE parent_table AS "t2"
SET ("a1")=(y."c2")
FROM y INNER JOIN parent_table AS "t" USING ("p1","p2")
WHERE t2."p1"=y."p1" AND t2."p2"=y."p2";
Question
How can I perform the UPDATE without the nested loop hash joins for each of the 184 child tables?
System Info
Postgres version 10.3
postgresql
add a comment |
Background
- I have a simple CTE used to update a declaratively-partitioned table.
- The subquery runs quickly (
1.7secperEXPLAIN ANALYZE) and returns3,769records (CTEyin the query below). - The
UPDATEseeks to update non-index columns of a declaratively-partitioned table. A few characteristics of the table:
- Contains
21 millionrecords
184partitions -- yes, too many -- each child partition has primary/ foreign keys and indices
fillfactor=100(plan to reduce to potentially use HOT on same pages, but not certain that query plan affected by lack of page space)
- Contains
Problem arises upon UPDATE, per the incremental query plan shown (EXPLAIN) after the subquery (via CTE) runs (several nested loops of hash joins):
-> Hash Join /* Run for each child partition, based on CTE PK = child table PK */
-> Nested Loop
-> CTE Scan
-> Append
-> Index Scan /* for index on each child partition */
... /* Index scans (for each child partition) */
-> Hash
-> Seq Scan /* on child table */
... /* Hash Joins (for each child partition) */
Query
The following query is the UPDATE statement causing the issue. Basically, the query performs a couple functions using a value from parent_table that cannot be nested into a single SQL statement (so two CTEs used), then UPDATE the same parent_table for the result (the functions are expensive, so the result stored in the table itself).
WITH x AS (
SELECT t."p1", t."p2", f(t."b1") OVER "win_x" AS "c1"
FROM parent_table AS "t"
WHERE t."p1" IN ('val1','val2')
WINDOW "win_x" AS (PARTITION BY "p1" ORDER BY "p1","p2")
), y AS (
SELECT x."p1", x."p2", f(x."c1") OVER "win_y" AS "c2"
FROM x
WINDOW "win_y" AS (PARTITION BY "p1" ORDER BY "p1","p2")
)
UPDATE parent_table AS "t2"
SET ("a1")=(y."c2")
FROM y INNER JOIN parent_table AS "t" USING ("p1","p2")
WHERE t2."p1"=y."p1" AND t2."p2"=y."p2";
Question
How can I perform the UPDATE without the nested loop hash joins for each of the 184 child tables?
System Info
Postgres version 10.3
postgresql
add a comment |
Background
- I have a simple CTE used to update a declaratively-partitioned table.
- The subquery runs quickly (
1.7secperEXPLAIN ANALYZE) and returns3,769records (CTEyin the query below). - The
UPDATEseeks to update non-index columns of a declaratively-partitioned table. A few characteristics of the table:
- Contains
21 millionrecords
184partitions -- yes, too many -- each child partition has primary/ foreign keys and indices
fillfactor=100(plan to reduce to potentially use HOT on same pages, but not certain that query plan affected by lack of page space)
- Contains
Problem arises upon UPDATE, per the incremental query plan shown (EXPLAIN) after the subquery (via CTE) runs (several nested loops of hash joins):
-> Hash Join /* Run for each child partition, based on CTE PK = child table PK */
-> Nested Loop
-> CTE Scan
-> Append
-> Index Scan /* for index on each child partition */
... /* Index scans (for each child partition) */
-> Hash
-> Seq Scan /* on child table */
... /* Hash Joins (for each child partition) */
Query
The following query is the UPDATE statement causing the issue. Basically, the query performs a couple functions using a value from parent_table that cannot be nested into a single SQL statement (so two CTEs used), then UPDATE the same parent_table for the result (the functions are expensive, so the result stored in the table itself).
WITH x AS (
SELECT t."p1", t."p2", f(t."b1") OVER "win_x" AS "c1"
FROM parent_table AS "t"
WHERE t."p1" IN ('val1','val2')
WINDOW "win_x" AS (PARTITION BY "p1" ORDER BY "p1","p2")
), y AS (
SELECT x."p1", x."p2", f(x."c1") OVER "win_y" AS "c2"
FROM x
WINDOW "win_y" AS (PARTITION BY "p1" ORDER BY "p1","p2")
)
UPDATE parent_table AS "t2"
SET ("a1")=(y."c2")
FROM y INNER JOIN parent_table AS "t" USING ("p1","p2")
WHERE t2."p1"=y."p1" AND t2."p2"=y."p2";
Question
How can I perform the UPDATE without the nested loop hash joins for each of the 184 child tables?
System Info
Postgres version 10.3
postgresql
Background
- I have a simple CTE used to update a declaratively-partitioned table.
- The subquery runs quickly (
1.7secperEXPLAIN ANALYZE) and returns3,769records (CTEyin the query below). - The
UPDATEseeks to update non-index columns of a declaratively-partitioned table. A few characteristics of the table:
- Contains
21 millionrecords
184partitions -- yes, too many -- each child partition has primary/ foreign keys and indices
fillfactor=100(plan to reduce to potentially use HOT on same pages, but not certain that query plan affected by lack of page space)
- Contains
Problem arises upon UPDATE, per the incremental query plan shown (EXPLAIN) after the subquery (via CTE) runs (several nested loops of hash joins):
-> Hash Join /* Run for each child partition, based on CTE PK = child table PK */
-> Nested Loop
-> CTE Scan
-> Append
-> Index Scan /* for index on each child partition */
... /* Index scans (for each child partition) */
-> Hash
-> Seq Scan /* on child table */
... /* Hash Joins (for each child partition) */
Query
The following query is the UPDATE statement causing the issue. Basically, the query performs a couple functions using a value from parent_table that cannot be nested into a single SQL statement (so two CTEs used), then UPDATE the same parent_table for the result (the functions are expensive, so the result stored in the table itself).
WITH x AS (
SELECT t."p1", t."p2", f(t."b1") OVER "win_x" AS "c1"
FROM parent_table AS "t"
WHERE t."p1" IN ('val1','val2')
WINDOW "win_x" AS (PARTITION BY "p1" ORDER BY "p1","p2")
), y AS (
SELECT x."p1", x."p2", f(x."c1") OVER "win_y" AS "c2"
FROM x
WINDOW "win_y" AS (PARTITION BY "p1" ORDER BY "p1","p2")
)
UPDATE parent_table AS "t2"
SET ("a1")=(y."c2")
FROM y INNER JOIN parent_table AS "t" USING ("p1","p2")
WHERE t2."p1"=y."p1" AND t2."p2"=y."p2";
Question
How can I perform the UPDATE without the nested loop hash joins for each of the 184 child tables?
System Info
Postgres version 10.3
postgresql
postgresql
asked 47 secs ago
WheeWhee
406
406
add a comment |
add a comment |
0
active
oldest
votes
Your Answer
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "182"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdba.stackexchange.com%2fquestions%2f228740%2fsuboptimal-query-plan-when-updating-partitioned-table%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
0
active
oldest
votes
0
active
oldest
votes
active
oldest
votes
active
oldest
votes
Thanks for contributing an answer to Database Administrators Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdba.stackexchange.com%2fquestions%2f228740%2fsuboptimal-query-plan-when-updating-partitioned-table%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown