Always enable check hb on blocked write for chunk transactions#5864
Always enable check hb on blocked write for chunk transactions#5864chands10 wants to merge 1 commit intobloomberg:mainfrom
Conversation
There was a problem hiding this comment.
this timing guess is not robust. can we get cdb2buf_flush to indicate that failure was due to timeout?
There was a problem hiding this comment.
when cdb2buf_flush returns -1 currently, it either means that sb is NULL, or that it was a timeout error. Can I just check that? Or can add new bool param to cdb2buf_flush *timeout_error
There was a problem hiding this comment.
Added commit with new param to explicitly set it
e2e3403 to
7e68096
Compare
roborivers
left a comment
There was a problem hiding this comment.
Cbuild submission: Success ✓.
Regression testing: Success ✓.
The first 10 failing tests are:
sc_resume_logicalsc_generated **quarantined**
consumer_non_atomic_default_consumer_generated **quarantined**
sc_truncate_lockorder_generated [timeout] **quarantined**
incoh_remsql [timeout] **quarantined**
roborivers
left a comment
There was a problem hiding this comment.
Cbuild submission: Success ✓.
Regression testing: Success ✓.
The first 10 failing tests are:
scindex_logicalsc_generated
sc_resume_logicalsc_generated **quarantined**
queuedb_rollover **quarantined**
cdb2api_chunk
consumer_non_atomic_default_consumer_generated **quarantined**
incoh_remsql [timeout] **quarantined**
roborivers
left a comment
There was a problem hiding this comment.
Cbuild submission: Success ✓.
Regression testing: Success ✓.
The first 10 failing tests are:
cdb2api_chunk
consumer_non_atomic_default_consumer_generated **quarantined**
reco-ddlk-sql [timeout] **quarantined**
incoh_remsql [timeout] **quarantined**
| int CDB2BUF_FUNC(cdb2buf_flush)(COMDB2BUF *sb); | ||
| /* flush output. returns # of bytes written or <0 for error. If timeout_error is not NULL, it will be set to 1 if a | ||
| * timeout occurred */ | ||
| int CDB2BUF_FUNC(cdb2buf_flush)(COMDB2BUF *sb, int *timeout_error); |
There was a problem hiding this comment.
Introduce cdb2_flush_chk_timeout (or another suitable name), so all cdb2buf_flush calls don't have to be updated?
| ######################################## | ||
| ## RUN WITHOUT FIX EXPLICITLY ENABLED ## | ||
| ######################################## | ||
| echo 'comdb2_config:check_hb_on_blocked_write=0' >> ${cfg} |
There was a problem hiding this comment.
Note: this means it's impossible to disable this for chunks
roborivers
left a comment
There was a problem hiding this comment.
Cbuild submission: Success ✓.
Regression testing: Success ✓.
The first 10 failing tests are:
sc_truncate [db unavailable at finish]
consumer_non_atomic_default_consumer_generated **quarantined**
api_tst
alias
cdb2api_chunk [timeout]
incoh_remsql [timeout] **quarantined**
| int64_t end_us = end.tv_sec * 1000000 + end.tv_usec; | ||
| int64_t start_us = start.tv_sec * 1000000 + start.tv_usec; | ||
| int64_t elapsed_us = end_us - start_us; | ||
| if (!is_begin && !hndl->is_read && hndl->in_trans && hndl->socket_timeout && |
There was a problem hiding this comment.
Not sure how we haven't gotten seg faults here before because hndl could be NULL
|
Maybe let's wait making more changes till after we get the first cdb2api merged rollout in |
roborivers
left a comment
There was a problem hiding this comment.
Cbuild submission: Success ✓.
Regression testing: Success ✓.
The first 10 failing tests are:
truncatesc_offline_generated **quarantined**
sp_snapshot_generated
reco-ddlk-sql **quarantined**
consumer_non_atomic_default_consumer_generated **quarantined**
api_tst
dc
incoh_remsql [timeout] **quarantined**
roborivers
left a comment
There was a problem hiding this comment.
Cbuild submission: Success ✓.
Regression testing: Success ✓.
The first 10 failing tests are:
truncatesc_offline_generated **quarantined**
sc_resume_logicalsc_generated **quarantined**
sp_snapshot_generated
consumer_non_atomic_default_consumer_generated **quarantined**
api_tst
reco-ddlk-sql [timeout] **quarantined**
roborivers
left a comment
There was a problem hiding this comment.
Cbuild submission: Success ✓.
Regression testing: Success ✓.
roborivers
left a comment
There was a problem hiding this comment.
Cbuild submission: Error ⚠.
Regression testing: Success ✓.
Signed-off-by: Salil Chandra <schandra107@bloomberg.net>
roborivers
left a comment
There was a problem hiding this comment.
Cbuild submission: Success ✓.
Regression testing: Success ✓.
The first 10 failing tests are:
sc_timepart [failed with core dumped] **quarantined**
sc_timepart_multiddl_generated
analyze **quarantined**
consumer_non_atomic_default_consumer_generated **quarantined**
tunables
To help us review your pull request, please consider providing an overview of the following: