{182998236}: dispatch retry sql to pool #5873
emelialei88 wants to merge 1 commit into bloomberg:main from
0a230d5 to 221983a
roborivers left a comment
Cbuild submission: Success ✓.
Regression testing: Success ✓.
The first 10 failing tests are:
selectv_rcode_serialretry_generated [failed with core dumped]
logfill [db unavailable at finish] **quarantined**
consumer_non_atomic_default_consumer_generated **quarantined**
api_tst
mem_tracker
ad3a581 to f6fb6d5
roborivers left a comment
Cbuild submission: Success ✓.
Regression testing: Success ✓.
The first 10 failing tests are:
comdb2sys_queueodh_generated [db unavailable at finish]
truncatesc_offline_generated **quarantined**
sc_resume_logicalsc_generated **quarantined**
consumer_non_atomic_default_consumer_generated **quarantined**
replay_trans
mem_tracker
roborivers left a comment
Cbuild submission: Success ✓.
Regression testing: Success ✓.
The first 10 failing tests are:
truncatesc_offline_generated **quarantined**
consumer_non_atomic_default_consumer_generated **quarantined**
replay_trans
mem_tracker
sc_truncate_lockorder_generated [timeout] **quarantined**
reco-ddlk-sql [timeout] **quarantined**
3e7966d to 2412e0c
roborivers left a comment
Cbuild submission: Error ⚠.
Regression testing: Success ✓.
The first 10 failing tests are:
sc_truncate_multiddl_generated [db unavailable at finish] **quarantined**
consumer_non_atomic_default_consumer_generated **quarantined**
remsql_locks_rte_connect_generated **quarantined**
remsql_locks **quarantined**
mem_tracker
reco-ddlk-sql [timeout] **quarantined**
roborivers left a comment
Cbuild submission: Success ✓.
Regression testing: Success ✓.
The first 10 failing tests are:
logfill [db unavailable at finish] **quarantined**
scindex
sc_resume_logicalsc_generated **quarantined**
consumer_non_atomic_default_consumer_generated **quarantined**
remsql_locks_rte_connect_generated **quarantined**
remsql_locks **quarantined**
mem_tracker
reco-ddlk-sql [timeout] **quarantined**
failing mem_tracker test
-    if (done_cb_evbuffer(clnt) != 0) {
-        return -1;
+    int rc = done_cb_evbuffer(clnt);
+    if (rc != 0) {
handle rc RC_INTERNAL_RETRY here?
This is handled in newsql_done_cb by returning without doing anything. I still think the failure could be a timing issue but I'm looking into it more.
2412e0c to e265b77
roborivers left a comment
Cbuild submission: Success ✓.
Regression testing: Success ✓.
The first 10 failing tests are:
sc_resume_logicalsc_generated **quarantined**
sc_resume
cldeadlock
consumer_non_atomic_default_consumer_generated **quarantined**
remsql_locks_rte_connect_generated **quarantined**
remsql_locks **quarantined**
mem_tracker
roborivers left a comment
Cbuild submission: Error ⚠.
Regression testing: Success ✓.
The first 10 failing tests are:
scindex_logicalsc_generated
sc_resume_logicalsc_generated **quarantined**
noresetgen
consumer_non_atomic_default_consumer_generated **quarantined**
remsql_locks_rte_connect_generated **quarantined**
remsql_locks **quarantined**
manual_partition
skipscan **quarantined**
reco-ddlk-sql [timeout] **quarantined**
Please check robomark failures.
e31f69e to fb958db
roborivers left a comment
Cbuild submission: Success ✓.
Regression testing: Success ✓.
The first 10 failing tests are:
sc_redo [failed with core dumped]
logfill [db unavailable at finish] **quarantined**
consumer_non_atomic_default_consumer_generated **quarantined**
remsql_locks_rte_connect_generated **quarantined**
remsql_locks **quarantined**
longreq_stats
truncatesc_offline_generated [timeout] **quarantined**
reco-ddlk-sql [timeout] **quarantined**
roborivers left a comment
Cbuild submission: Success ✓.
Regression testing: Success ✓.
The first 10 failing tests are:
sc_truncate [db unavailable at finish]
sc_resume_logicalsc_generated **quarantined**
reco-ddlk-sql **quarantined**
consumer_non_atomic_default_consumer_generated **quarantined**
remsql_locks_rte_connect_generated **quarantined**
remsql_locks **quarantined**
I think it was a GitHub clone issue, and the most recent robomark run is fine.
It's still failing a few tests.
fb958db to 0583eff
roborivers left a comment
Cbuild submission: Error ⚠.
Regression testing: Success ✓.
The first 10 failing tests are:
sc_truncate [db unavailable at finish]
consumer_non_atomic_default_consumer_generated **quarantined**
reco-ddlk-sql [timeout] **quarantined**
roborivers left a comment
Cbuild submission: Success ✓.
Regression testing: Success ✓.
The first 10 failing tests are:
sc_resume [db unavailable at finish]
consumer_non_atomic_default_consumer_generated **quarantined**
remtran_origin
reco-ddlk-sql [timeout] **quarantined**
Signed-off-by: Emelia Lei <wlei29@bloomberg.net>
0583eff to 4d66537
roborivers left a comment
Cbuild submission: Error ⚠.
Regression testing: Success ✓.
The first 10 failing tests are:
logfill [db unavailable at finish] **quarantined**
scindex_logicalsc_generated
sc_resume
consumer_non_atomic_default_consumer_generated **quarantined**
sc_downgrade [timeout] **quarantined**
truncatesc_offline_generated [timeout] **quarantined**
When a verified retry happens, all the statements in the transaction are currently rerun in the COMMIT SQL thread. This blocks that thread and is unfair to other, normal transactions. In this PR, we schedule each statement into its own SQL thread.
When a retry happens, we call `srs_tran_replay_prepare` to do some preparation, such as saving the original `done_cb` and assigning `srs_tran_replay_async` as the new `done_cb`. Then, for each retry, `srs_tran_replay_begin` is called to set some state and dispatch the first statement. If `srs_tran_replay_prepare` returns without failure, we return the error code `RC_INTERNAL_RETRY` so that the callers return the thread without any cleanup.

At the end of each statement's execution, `srs_tran_replay_async` is called. It either dispatches the next statement, starts another retry if this one fails, handles an error, or returns successfully. In the last two cases we restore the `done_cb` before returning.
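
For reviewers who want the control flow in one place, here is a toy, single-threaded model of the callback chain described above. It is only a sketch and not the code in this PR: the function names `srs_tran_replay_prepare`, `srs_tran_replay_begin`, `srs_tran_replay_async` and the code `RC_INTERNAL_RETRY` come from the description, but their signatures and bodies here are assumptions, and everything else (`struct clnt` fields, `dispatch_to_pool`, `drain_pool`, `start_retry`, `replay_demo`, the numeric rc values, and the choice to invoke the restored `done_cb` at the end) is hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical return codes; only the name RC_INTERNAL_RETRY is from the PR. */
enum { RC_OK = 0, RC_VERIFY_ERR = 1, RC_INTERNAL_RETRY = 2 };

struct clnt;
typedef int (*done_cb_fn)(struct clnt *, int rc);

/* Hypothetical client state, reduced to what the sketch needs. */
struct clnt {
    done_cb_fn done_cb;       /* runs when a statement finishes */
    done_cb_fn saved_done_cb; /* original callback, restored at the end */
    int nstmts;               /* statements recorded in the transaction */
    int next_stmt;            /* next statement to replay */
    int retries_left;         /* replay attempts still allowed */
    int verify_failures_left; /* simulated COMMIT verify failures */
    int pending;              /* one statement is queued on the "pool" */
    int dispatched;           /* total statements handed to the pool */
    int finished, final_rc;   /* set by the original done_cb */
};

static int original_done_cb(struct clnt *c, int rc) {
    c->finished = 1;
    c->final_rc = rc;
    return rc;
}

/* Stand-in for queueing one statement onto an SQL thread pool. */
static void dispatch_to_pool(struct clnt *c) {
    c->pending = 1;
    c->dispatched++;
}

/* Start one replay attempt: reset state and dispatch the first statement. */
static int srs_tran_replay_begin(struct clnt *c) {
    if (c->retries_left <= 0) return -1;
    c->retries_left--;
    c->next_stmt = 0;
    dispatch_to_pool(c);
    return 0;
}

/* done_cb during replay: next statement, another retry, or finish. */
static int srs_tran_replay_async(struct clnt *c, int rc) {
    if (rc == RC_OK && c->next_stmt < c->nstmts) {
        dispatch_to_pool(c);
        return RC_INTERNAL_RETRY;
    }
    if (rc == RC_VERIFY_ERR && srs_tran_replay_begin(c) == 0)
        return RC_INTERNAL_RETRY;
    c->done_cb = c->saved_done_cb; /* error or success: restore callback */
    return c->done_cb(c, rc);      /* assumption: hand off to the original */
}

/* Save the original callback and install the replay callback. */
static int srs_tran_replay_prepare(struct clnt *c) {
    c->saved_done_cb = c->done_cb;
    c->done_cb = srs_tran_replay_async;
    return 0;
}

/* Hypothetical caller: on success, return RC_INTERNAL_RETRY so the
 * current thread goes back to the pool without doing any cleanup. */
static int start_retry(struct clnt *c) {
    if (srs_tran_replay_prepare(c) != 0) return -1;
    if (srs_tran_replay_begin(c) != 0) {
        c->done_cb = c->saved_done_cb;
        return -1;
    }
    return RC_INTERNAL_RETRY;
}

/* Simulated worker: run queued statements one at a time. The last
 * statement (COMMIT) fails verification while failures remain. */
static void drain_pool(struct clnt *c) {
    while (c->pending) {
        c->pending = 0;
        int idx = c->next_stmt++;
        int rc = RC_OK;
        if (idx == c->nstmts - 1 && c->verify_failures_left > 0) {
            c->verify_failures_left--;
            rc = RC_VERIFY_ERR;
        }
        assert(c->done_cb != NULL);
        c->done_cb(c, rc);
    }
}

/* Replays a transaction; returns the rc seen by the original done_cb,
 * or -1 if the replay could not start. */
static int replay_demo(int nstmts, int failures, int max_retries, int *dispatched) {
    struct clnt c = {0};
    c.done_cb = original_done_cb;
    c.nstmts = nstmts;
    c.verify_failures_left = failures;
    c.retries_left = max_retries;
    if (start_retry(&c) == RC_INTERNAL_RETRY) drain_pool(&c);
    *dispatched = c.dispatched;
    return c.finished ? c.final_rc : -1;
}
```

In this model, `replay_demo(3, 1, 5, &n)` replays a three-statement transaction whose first replay fails at COMMIT: six statements are dispatched individually (two attempts of three) before the original `done_cb` sees `RC_OK`. The point of the design is that no single thread ever runs more than one statement of the replay, so other transactions can be interleaved between dispatches.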