Jarrod Johnson
c3c0e1570a
Push quorum state to followers
...
The followers need to know quorum state.
2018-07-19 13:27:21 -04:00
Jarrod Johnson
10c82a72b5
Restore message on unreachable collective member
...
The parallel execution had broken how that message transmits.
Bonus, make it a per node error.
2018-07-18 16:49:54 -04:00
Jarrod Johnson
79cdf65a72
Fix SLES sockapi
...
Previous fix was applied to the incorrect section of code
2018-07-18 15:07:22 -04:00
Jarrod Johnson
497ca40492
Do not abort connecting process on bad cert
...
The target may be non-viable, but don't let that ruin the party
for everyone. Let it keep going as if the system were down.
2018-07-18 14:58:16 -04:00
Jarrod Johnson
fd33e6ae01
Fix non-collective confluent mode
...
list_collective returns an iterator, which will be True...
2018-07-18 14:53:23 -04:00
Jarrod Johnson
32f944e67c
Handle unclean loss of current proxy host
...
If transition is less than gentle, provide a path to restore automatic
if it gets moved.
2018-07-18 14:32:39 -04:00
Jarrod Johnson
dcad9f5a75
Add keepalive and acks to collective
...
Detect unplugged condition (eventually).
2018-07-18 13:45:03 -04:00
Jarrod Johnson
b7fe72673d
Add clear node/group attributes to collective
...
collective was not syncing clear directives.
2018-07-17 15:57:48 -04:00
Jarrod Johnson
0159bf1b1d
Fix typo in error message
2018-07-17 15:39:08 -04:00
Jarrod Johnson
cf9ad11290
Short out operations if in collective mode but no collective.manager
2018-07-17 15:25:12 -04:00
Jarrod Johnson
ddd7ef5eba
Fix proxyconsole break and reopen
2018-07-17 15:05:09 -04:00
Jarrod Johnson
73da8ec8b5
Fix ProxyConsole if self.remote is not yet set
2018-07-17 14:44:59 -04:00
Jarrod Johnson
eac4d97732
Disengage remote console on manager change
...
This results in a more direct treatment of manager change.
2018-07-17 13:10:01 -04:00
Jarrod Johnson
fc5472065a
Catch missing '@' in token as invalid token
2018-07-17 11:46:40 -04:00
Jarrod Johnson
0d936e0059
Ensure no more than one in-flight slave connection from a given follower
...
This will prevent a connection from deregistering itself after the
replacement registers itself.
2018-07-17 10:36:31 -04:00
Jarrod Johnson
a7b8f0ab0c
Parallelize cross-manager requests
...
Rather than doing it at one at a time, parallelize the requests
for improved performance.
2018-07-17 10:07:32 -04:00
Jarrod Johnson
3ab4203104
Explicitly set ECDHE curve
...
Some vintages of the SSL stack require we explicitly request a curve,
so here it is.
2018-07-16 16:23:33 -04:00
Jarrod Johnson
13aa2e9aae
Catch more broad errors
...
Operating on a closed socket is not a socket.error
2018-07-16 11:58:18 -04:00
Jarrod Johnson
7462bc28e8
Use the eventlet socket in configmanager
2018-07-16 10:06:53 -04:00
Jarrod Johnson
18f1c07d65
Change to setting an errstr rather than exception
...
If nodefirmware update has an issue, provide error message instead.
2018-07-16 09:03:02 -04:00
Jarrod Johnson
0016077bee
Ensure that wait_for_sync always does a new sync
...
If a sync is in progress, wait for that to complete.
Then issue the requested *new* sync.
Probably only needed if fullsync, as the one in progress may be a
'dirty' only sync and fullsync would be satisfied by the partial sync
without it, which is bad.
2018-07-13 22:15:38 -04:00
Jarrod Johnson
1dad69097b
Be consistent with sync during load of leader cfg
...
Pass through sync as appropriate.
Also changes meant for previous commit
2018-07-13 21:52:17 -04:00
Jarrod Johnson
fd7c428d1f
Cleanup leftover sockets and more reliably be following or leading
...
Before there was a chance to be in a half state, leading to an inability
to reach consensus on leader.
2018-07-13 21:20:42 -04:00
Jarrod Johnson
80a1bd72e7
Correct arguments for Thread constructor
2018-07-13 15:43:09 -04:00
Jarrod Johnson
042d7ab5cf
Modify clear_commit to use the same thread
...
Additionally, wrap a lock around the dbm operations, in case something
in the future makes a mistake.
2018-07-13 15:27:16 -04:00
Jarrod Johnson
c74fdf5924
More collective join errors
2018-07-13 11:07:39 -04:00
Jarrod Johnson
58bf226d23
Relay error from server about token issue
2018-07-13 10:50:17 -04:00
Jarrod Johnson
6f012b69a1
Provide cleaner message for collective manager being unreachable
2018-07-13 10:43:20 -04:00
Jarrod Johnson
1d16534c16
If replacing a follower stream, ensure the old one closes
2018-07-13 09:37:00 -04:00
Jarrod Johnson
c80ebb0e8d
Explicitly close connection before replacement
...
If an existing follower is stalled out, close the socket explicitly
to avoid leaving it open in lsof.
2018-07-13 09:14:36 -04:00
Jarrod Johnson
efaf1dae70
Make cfgleader modifications more robust
...
If cfgleader is about to forget a socket, explicitly try to close
it first.
2018-07-13 09:05:28 -04:00
Jarrod Johnson
1de82936ed
Add full sync mode
...
For implementing clear config, all data must be presumed dirty.
2018-07-12 17:06:37 -04:00
Jarrod Johnson
b0c384c9ca
Check quorum on attribute read
...
It's too bizarre for attribute read from api to work
without quorum, could be misleading.
2018-07-12 16:05:04 -04:00
Jarrod Johnson
61dd71778f
Never generate new key on crypt read
...
An autogenerated key on read can never be useful. Instead, let it fail
and assume a repair action is coming.
2018-07-12 15:55:05 -04:00
Jarrod Johnson
c925353f02
Fix rollback
...
First the data was not actually being staged in the rollback area.
Secondly, there wasn't an assurance that the disk wouldn't have rollback
committed...
2018-07-12 10:01:32 -04:00
Jarrod Johnson
9edb225bd3
Fix the reference to sync to file
2018-07-12 09:20:41 -04:00
Jarrod Johnson
7cdc3c1400
Implement clear config rollback
...
Should something go awry during config
load, rollback the clear and load.
2018-07-12 08:48:21 -04:00
Jarrod Johnson
bd2a3f14e6
Defer txcount increment until after potential failure
...
This avoids the txcount incrementing when no transaction actually
occurs.
2018-07-12 08:41:07 -04:00
Jarrod Johnson
beedfb0600
If a drone doesn't exist, treat it as if it's an invalid certificate
2018-07-11 16:29:45 -04:00
Jarrod Johnson
ce59a36351
Avoid excessive syncs on connect
...
This removes some redundancy and avoids writing and loading to disk
during the initialization process.
2018-07-11 16:07:56 -04:00
Jarrod Johnson
a1e612968e
Merge branch 'clustertime' of github.com:jjohnson42/confluent into clustertime
2018-07-11 13:44:30 -04:00
Jarrod Johnson
ce1a58bf58
Provide a more specific error when file doesn't exist
...
nodemedia does check locally, but server should also check it, in
case of remote *and* service nodes.
2018-07-11 13:17:37 -04:00
Jarrod Johnson
b386084f2d
Note future enhancement for flattening on collective manager change...
2018-07-11 11:07:59 -04:00
Jarrod Johnson
8e9bcbb44f
Clear txcount on enroll
...
The transaction count on 'join' was being honored as high, when
it never should be.
2018-07-11 09:40:22 -04:00
Jarrod Johnson
704aaeecf9
Tolerate newline in myname
...
vim is quite insistent on adding a newline, tolerate that.
2018-07-11 09:36:51 -04:00
Jarrod Johnson
11968faffc
Numerous fixes to collective
...
If client has higher transaction count, do not close the connection
before extracting peer address.
If our connect session is rudely terminated, abort rather than trying
to continue.
On assimilate failure, ignore a failed assimilate with no data.
Fix problem where a follower getting double deleted was causing an error.
2018-07-10 14:55:57 -04:00
Jarrod Johnson
8769c438c0
Relax cryptodomex requirement
...
We don't *need* the higher version, it just accelerates password auth
if possible. Fix conflict with RH provided package.
2018-07-10 10:10:04 -04:00
Jarrod Johnson
298e11f60f
Allow invite from non-leader role
...
A non-leader transaction is modified such that the enroll node
can be connected to the leader and have validation.
2018-07-09 16:40:43 -04:00
Jarrod Johnson
67d6e9a6c7
Add collective show
...
Provide a harmless way to look at collective state
2018-07-09 15:07:24 -04:00
Jarrod Johnson
a4edf9afb8
Rename confluentutil to collective
...
Also adjust output to be a bit more automation friendly.
2018-07-09 13:33:56 -04:00