“Resource deadlock avoided during COMMIT” when a node leaves the group
I believe it's a bug in MariaDB Galera (10.3) but wanted to confirm. I have 3 nodes in the cluster (A, B, C) and a sysbench running the OLTP write-only test connecting only to node "A". When the node "B" is being shutdown gracefully with systemctl stop mariadb the sysbench is always immediately interrupted, showing error lines below (varying number depending on the run).
I expected sysbench to be completely unaware of node "A" leaving the group.
Sysbench reported error:
Initializing worker threads...
Threads started!
FATAL: mysql_stmt_execute() returned error 1180 (Got error 35 "Resource deadlock avoided" during COMMIT) for query 'COMMIT'
FATAL: `thread_run' function failed: /usr/share/sysbench/oltp_common.lua:405: SQL error, errno = 1180, state = 'HY000': Got error 35 "Resource deadlock avoided" during COMMIT
FATAL: mysql_stmt_execute() returned error 1180 (Got error 35 "Resource deadlock avoided" during COMMIT) for query 'COMMIT'
FATAL: `thread_run' function failed: /usr/share/sysbench/oltp_common.lua:405: SQL error, errno = 1180, state = 'HY000': Got error 35 "Resource deadlock avoided" during COMMIT
FATAL: mysql_stmt_execute() returned error 1180 (Got error 35 "Resource deadlock avoided" during COMMIT) for query 'COMMIT'
FATAL: `thread_run' function failed: /usr/share/sysbench/oltp_common.lua:405: SQL error, errno = 1180, state = 'HY000': Got error 35 "Resource deadlock avoided" during COMMIT
sysbench invocation:
sysbench --mysql-user='sbench'
--mysql-password='test'
--db-driver=mysql
--mysql-host=$TEST_HOST_A
--mysql-db=$TEST_DB_NAME
--time=300
--threads=16
oltp_write_only
run
When this happens one of the other 2-nodes reports:
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: declaring eb2c2526 at tcp://10.4.153.11:4567 stable
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: forgetting f3789450 (tcp://10.4.151.8:4567)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: Node 33129c3a state prim
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: view(view_id(PRIM,33129c3a,32) memb {
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 33129c3a,0
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: eb2c2526,0
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: } joined {
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: } left {
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: } partitioned {
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: f3789450,0
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: })
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: save pc into disk
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: forgetting f3789450 (tcp://10.4.151.8:4567)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: New COMPONENT: primary = yes, bootstrap = no, my_idx = 0, memb_num = 2
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: STATE_EXCHANGE: sent state UUID: fc356030-3326-11e9-b854-fa77c4586cf9
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: STATE EXCHANGE: sent state msg: fc356030-3326-11e9-b854-fa77c4586cf9
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: STATE EXCHANGE: got state msg: fc356030-3326-11e9-b854-fa77c4586cf9 from 0 (ip-10-4-20-211)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: STATE EXCHANGE: got state msg: fc356030-3326-11e9-b854-fa77c4586cf9 from 1 (ip-10-4-153-11)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: Quorum results:
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: version = 4,
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: component = PRIMARY,
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: conf_id = 29,
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: members = 2/2 (joined/total),
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: act_id = 17299676,
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: last_appl. = 17299555,
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: protocols = 0/9/3 (gcs/repl/appl),
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: group UUID = 3c6c83e4-3286-11e9-945b-e253cb1ea9e0
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: Flow-control interval: [23, 23]
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: Trying to continue unpaused monitor
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 23 [Note] WSREP: New cluster view: global state: 3c6c83e4-3286-11e9-945b-e253cb1ea9e0:17299676, view# 30: Primary, number of nodes: 2, my index: 0, protocol version 3
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 23 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 23 [Note] WSREP: REPL Protocols: 9 (4, 2)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 23 [Note] WSREP: Assign initial position for certification: 17299676, protocol version: 4
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: Service thread queue flushed.
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 122 [Warning] Aborted connection 122 to db: 'sbtest' user: 'sbench' host: 'ip-10-4-20-121.us-east-2.compute.internal' (Got an error reading communication packets)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 115 [Warning] Aborted connection 115 to db: 'sbtest' user: 'sbench' host: 'ip-10-4-20-121.us-east-2.compute.internal' (Got an error reading communication packets)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 124 [Warning] Aborted connection 124 to db: 'sbtest' user: 'sbench' host: 'ip-10-4-20-121.us-east-2.compute.internal' (Got an error reading communication packets)
Feb 18 02:43:39 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:39 0 [Note] WSREP: cleaning up f3789450 (tcp://10.4.151.8:4567)
Other details:
- OS: Ubuntu 18.04.2 LTS
- MariaDB: 10.3.12-MariaDB-1:10.3.12+maria~bionic-log mariadb.org binary distribution
galera
add a comment |
I believe it's a bug in MariaDB Galera (10.3) but wanted to confirm. I have 3 nodes in the cluster (A, B, C) and a sysbench running the OLTP write-only test connecting only to node "A". When the node "B" is being shutdown gracefully with systemctl stop mariadb the sysbench is always immediately interrupted, showing error lines below (varying number depending on the run).
I expected sysbench to be completely unaware of node "A" leaving the group.
Sysbench reported error:
Initializing worker threads...
Threads started!
FATAL: mysql_stmt_execute() returned error 1180 (Got error 35 "Resource deadlock avoided" during COMMIT) for query 'COMMIT'
FATAL: `thread_run' function failed: /usr/share/sysbench/oltp_common.lua:405: SQL error, errno = 1180, state = 'HY000': Got error 35 "Resource deadlock avoided" during COMMIT
FATAL: mysql_stmt_execute() returned error 1180 (Got error 35 "Resource deadlock avoided" during COMMIT) for query 'COMMIT'
FATAL: `thread_run' function failed: /usr/share/sysbench/oltp_common.lua:405: SQL error, errno = 1180, state = 'HY000': Got error 35 "Resource deadlock avoided" during COMMIT
FATAL: mysql_stmt_execute() returned error 1180 (Got error 35 "Resource deadlock avoided" during COMMIT) for query 'COMMIT'
FATAL: `thread_run' function failed: /usr/share/sysbench/oltp_common.lua:405: SQL error, errno = 1180, state = 'HY000': Got error 35 "Resource deadlock avoided" during COMMIT
sysbench invocation:
sysbench --mysql-user='sbench'
--mysql-password='test'
--db-driver=mysql
--mysql-host=$TEST_HOST_A
--mysql-db=$TEST_DB_NAME
--time=300
--threads=16
oltp_write_only
run
When this happens one of the other 2-nodes reports:
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: declaring eb2c2526 at tcp://10.4.153.11:4567 stable
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: forgetting f3789450 (tcp://10.4.151.8:4567)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: Node 33129c3a state prim
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: view(view_id(PRIM,33129c3a,32) memb {
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 33129c3a,0
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: eb2c2526,0
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: } joined {
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: } left {
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: } partitioned {
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: f3789450,0
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: })
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: save pc into disk
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: forgetting f3789450 (tcp://10.4.151.8:4567)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: New COMPONENT: primary = yes, bootstrap = no, my_idx = 0, memb_num = 2
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: STATE_EXCHANGE: sent state UUID: fc356030-3326-11e9-b854-fa77c4586cf9
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: STATE EXCHANGE: sent state msg: fc356030-3326-11e9-b854-fa77c4586cf9
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: STATE EXCHANGE: got state msg: fc356030-3326-11e9-b854-fa77c4586cf9 from 0 (ip-10-4-20-211)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: STATE EXCHANGE: got state msg: fc356030-3326-11e9-b854-fa77c4586cf9 from 1 (ip-10-4-153-11)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: Quorum results:
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: version = 4,
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: component = PRIMARY,
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: conf_id = 29,
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: members = 2/2 (joined/total),
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: act_id = 17299676,
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: last_appl. = 17299555,
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: protocols = 0/9/3 (gcs/repl/appl),
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: group UUID = 3c6c83e4-3286-11e9-945b-e253cb1ea9e0
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: Flow-control interval: [23, 23]
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: Trying to continue unpaused monitor
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 23 [Note] WSREP: New cluster view: global state: 3c6c83e4-3286-11e9-945b-e253cb1ea9e0:17299676, view# 30: Primary, number of nodes: 2, my index: 0, protocol version 3
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 23 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 23 [Note] WSREP: REPL Protocols: 9 (4, 2)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 23 [Note] WSREP: Assign initial position for certification: 17299676, protocol version: 4
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: Service thread queue flushed.
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 122 [Warning] Aborted connection 122 to db: 'sbtest' user: 'sbench' host: 'ip-10-4-20-121.us-east-2.compute.internal' (Got an error reading communication packets)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 115 [Warning] Aborted connection 115 to db: 'sbtest' user: 'sbench' host: 'ip-10-4-20-121.us-east-2.compute.internal' (Got an error reading communication packets)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 124 [Warning] Aborted connection 124 to db: 'sbtest' user: 'sbench' host: 'ip-10-4-20-121.us-east-2.compute.internal' (Got an error reading communication packets)
Feb 18 02:43:39 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:39 0 [Note] WSREP: cleaning up f3789450 (tcp://10.4.151.8:4567)
Other details:
- OS: Ubuntu 18.04.2 LTS
- MariaDB: 10.3.12-MariaDB-1:10.3.12+maria~bionic-log mariadb.org binary distribution
galera
add a comment |
I believe it's a bug in MariaDB Galera (10.3) but wanted to confirm. I have 3 nodes in the cluster (A, B, C) and a sysbench running the OLTP write-only test connecting only to node "A". When the node "B" is being shutdown gracefully with systemctl stop mariadb the sysbench is always immediately interrupted, showing error lines below (varying number depending on the run).
I expected sysbench to be completely unaware of node "A" leaving the group.
Sysbench reported error:
Initializing worker threads...
Threads started!
FATAL: mysql_stmt_execute() returned error 1180 (Got error 35 "Resource deadlock avoided" during COMMIT) for query 'COMMIT'
FATAL: `thread_run' function failed: /usr/share/sysbench/oltp_common.lua:405: SQL error, errno = 1180, state = 'HY000': Got error 35 "Resource deadlock avoided" during COMMIT
FATAL: mysql_stmt_execute() returned error 1180 (Got error 35 "Resource deadlock avoided" during COMMIT) for query 'COMMIT'
FATAL: `thread_run' function failed: /usr/share/sysbench/oltp_common.lua:405: SQL error, errno = 1180, state = 'HY000': Got error 35 "Resource deadlock avoided" during COMMIT
FATAL: mysql_stmt_execute() returned error 1180 (Got error 35 "Resource deadlock avoided" during COMMIT) for query 'COMMIT'
FATAL: `thread_run' function failed: /usr/share/sysbench/oltp_common.lua:405: SQL error, errno = 1180, state = 'HY000': Got error 35 "Resource deadlock avoided" during COMMIT
sysbench invocation:
sysbench --mysql-user='sbench'
--mysql-password='test'
--db-driver=mysql
--mysql-host=$TEST_HOST_A
--mysql-db=$TEST_DB_NAME
--time=300
--threads=16
oltp_write_only
run
When this happens one of the other 2-nodes reports:
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: declaring eb2c2526 at tcp://10.4.153.11:4567 stable
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: forgetting f3789450 (tcp://10.4.151.8:4567)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: Node 33129c3a state prim
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: view(view_id(PRIM,33129c3a,32) memb {
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 33129c3a,0
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: eb2c2526,0
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: } joined {
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: } left {
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: } partitioned {
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: f3789450,0
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: })
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: save pc into disk
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: forgetting f3789450 (tcp://10.4.151.8:4567)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: New COMPONENT: primary = yes, bootstrap = no, my_idx = 0, memb_num = 2
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: STATE_EXCHANGE: sent state UUID: fc356030-3326-11e9-b854-fa77c4586cf9
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: STATE EXCHANGE: sent state msg: fc356030-3326-11e9-b854-fa77c4586cf9
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: STATE EXCHANGE: got state msg: fc356030-3326-11e9-b854-fa77c4586cf9 from 0 (ip-10-4-20-211)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: STATE EXCHANGE: got state msg: fc356030-3326-11e9-b854-fa77c4586cf9 from 1 (ip-10-4-153-11)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: Quorum results:
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: version = 4,
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: component = PRIMARY,
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: conf_id = 29,
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: members = 2/2 (joined/total),
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: act_id = 17299676,
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: last_appl. = 17299555,
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: protocols = 0/9/3 (gcs/repl/appl),
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: group UUID = 3c6c83e4-3286-11e9-945b-e253cb1ea9e0
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: Flow-control interval: [23, 23]
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: Trying to continue unpaused monitor
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 23 [Note] WSREP: New cluster view: global state: 3c6c83e4-3286-11e9-945b-e253cb1ea9e0:17299676, view# 30: Primary, number of nodes: 2, my index: 0, protocol version 3
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 23 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 23 [Note] WSREP: REPL Protocols: 9 (4, 2)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 23 [Note] WSREP: Assign initial position for certification: 17299676, protocol version: 4
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: Service thread queue flushed.
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 122 [Warning] Aborted connection 122 to db: 'sbtest' user: 'sbench' host: 'ip-10-4-20-121.us-east-2.compute.internal' (Got an error reading communication packets)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 115 [Warning] Aborted connection 115 to db: 'sbtest' user: 'sbench' host: 'ip-10-4-20-121.us-east-2.compute.internal' (Got an error reading communication packets)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 124 [Warning] Aborted connection 124 to db: 'sbtest' user: 'sbench' host: 'ip-10-4-20-121.us-east-2.compute.internal' (Got an error reading communication packets)
Feb 18 02:43:39 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:39 0 [Note] WSREP: cleaning up f3789450 (tcp://10.4.151.8:4567)
Other details:
- OS: Ubuntu 18.04.2 LTS
- MariaDB: 10.3.12-MariaDB-1:10.3.12+maria~bionic-log mariadb.org binary distribution
galera
I believe it's a bug in MariaDB Galera (10.3) but wanted to confirm. I have 3 nodes in the cluster (A, B, C) and a sysbench running the OLTP write-only test connecting only to node "A". When the node "B" is being shutdown gracefully with systemctl stop mariadb the sysbench is always immediately interrupted, showing error lines below (varying number depending on the run).
I expected sysbench to be completely unaware of node "A" leaving the group.
Sysbench reported error:
Initializing worker threads...
Threads started!
FATAL: mysql_stmt_execute() returned error 1180 (Got error 35 "Resource deadlock avoided" during COMMIT) for query 'COMMIT'
FATAL: `thread_run' function failed: /usr/share/sysbench/oltp_common.lua:405: SQL error, errno = 1180, state = 'HY000': Got error 35 "Resource deadlock avoided" during COMMIT
FATAL: mysql_stmt_execute() returned error 1180 (Got error 35 "Resource deadlock avoided" during COMMIT) for query 'COMMIT'
FATAL: `thread_run' function failed: /usr/share/sysbench/oltp_common.lua:405: SQL error, errno = 1180, state = 'HY000': Got error 35 "Resource deadlock avoided" during COMMIT
FATAL: mysql_stmt_execute() returned error 1180 (Got error 35 "Resource deadlock avoided" during COMMIT) for query 'COMMIT'
FATAL: `thread_run' function failed: /usr/share/sysbench/oltp_common.lua:405: SQL error, errno = 1180, state = 'HY000': Got error 35 "Resource deadlock avoided" during COMMIT
sysbench invocation:
sysbench --mysql-user='sbench'
--mysql-password='test'
--db-driver=mysql
--mysql-host=$TEST_HOST_A
--mysql-db=$TEST_DB_NAME
--time=300
--threads=16
oltp_write_only
run
When this happens one of the other 2-nodes reports:
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: declaring eb2c2526 at tcp://10.4.153.11:4567 stable
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: forgetting f3789450 (tcp://10.4.151.8:4567)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: Node 33129c3a state prim
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: view(view_id(PRIM,33129c3a,32) memb {
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 33129c3a,0
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: eb2c2526,0
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: } joined {
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: } left {
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: } partitioned {
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: f3789450,0
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: })
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: save pc into disk
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: forgetting f3789450 (tcp://10.4.151.8:4567)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: New COMPONENT: primary = yes, bootstrap = no, my_idx = 0, memb_num = 2
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: STATE_EXCHANGE: sent state UUID: fc356030-3326-11e9-b854-fa77c4586cf9
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: STATE EXCHANGE: sent state msg: fc356030-3326-11e9-b854-fa77c4586cf9
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: STATE EXCHANGE: got state msg: fc356030-3326-11e9-b854-fa77c4586cf9 from 0 (ip-10-4-20-211)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: STATE EXCHANGE: got state msg: fc356030-3326-11e9-b854-fa77c4586cf9 from 1 (ip-10-4-153-11)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: Quorum results:
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: version = 4,
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: component = PRIMARY,
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: conf_id = 29,
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: members = 2/2 (joined/total),
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: act_id = 17299676,
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: last_appl. = 17299555,
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: protocols = 0/9/3 (gcs/repl/appl),
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: group UUID = 3c6c83e4-3286-11e9-945b-e253cb1ea9e0
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: Flow-control interval: [23, 23]
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: Trying to continue unpaused monitor
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 23 [Note] WSREP: New cluster view: global state: 3c6c83e4-3286-11e9-945b-e253cb1ea9e0:17299676, view# 30: Primary, number of nodes: 2, my index: 0, protocol version 3
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 23 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 23 [Note] WSREP: REPL Protocols: 9 (4, 2)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 23 [Note] WSREP: Assign initial position for certification: 17299676, protocol version: 4
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 0 [Note] WSREP: Service thread queue flushed.
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 122 [Warning] Aborted connection 122 to db: 'sbtest' user: 'sbench' host: 'ip-10-4-20-121.us-east-2.compute.internal' (Got an error reading communication packets)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 115 [Warning] Aborted connection 115 to db: 'sbtest' user: 'sbench' host: 'ip-10-4-20-121.us-east-2.compute.internal' (Got an error reading communication packets)
Feb 18 02:43:34 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:34 124 [Warning] Aborted connection 124 to db: 'sbtest' user: 'sbench' host: 'ip-10-4-20-121.us-east-2.compute.internal' (Got an error reading communication packets)
Feb 18 02:43:39 ip-10-4-20-211 mysqld[10268]: 2019-02-18 2:43:39 0 [Note] WSREP: cleaning up f3789450 (tcp://10.4.151.8:4567)
Other details:
- OS: Ubuntu 18.04.2 LTS
- MariaDB: 10.3.12-MariaDB-1:10.3.12+maria~bionic-log mariadb.org binary distribution
galera
galera
asked 50 secs ago
SlawomirSlawomir
20119
20119
add a comment |
add a comment |
0
active
oldest
votes
Your Answer
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "182"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdba.stackexchange.com%2fquestions%2f230007%2fresource-deadlock-avoided-during-commit-when-a-node-leaves-the-group%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
0
active
oldest
votes
0
active
oldest
votes
active
oldest
votes
active
oldest
votes
Thanks for contributing an answer to Database Administrators Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdba.stackexchange.com%2fquestions%2f230007%2fresource-deadlock-avoided-during-commit-when-a-node-leaves-the-group%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown