Jarrod Johnson
582a4de62d
Add CLI and directory support for nodesupport
2018-08-23 16:36:41 -04:00
Jarrod Johnson
c9959d4082
More of the service data retrieval api
...
More progress is made toward the goal
2018-08-23 16:14:49 -04:00
Jarrod Johnson
fa11fb54cb
Add API support for getting service data
...
Service data retrieval is a common activity required
for interacting with support.
2018-08-23 15:39:25 -04:00
Jarrod Johnson
ee3b824870
Add /description to nodes api
2018-08-23 08:44:41 -04:00
Jarrod Johnson
55f5b30369
Merge branch '21'
2018-08-22 16:29:58 -04:00
Jarrod Johnson
784e4bed2f
Force cleanup if follow thread dies of exception
...
If something killed a follow thread, it was not always able to fire the
recovery off. Wrap the risky code in a try statement.
2018-08-20 15:02:34 -04:00
Jarrod Johnson
df7cba00fd
Amend the message on collective failure
2018-08-17 16:45:45 -04:00
Jarrod Johnson
dfb720d0ee
Have collective command warn if the libssl library is not viable
...
Main example is RedHat providing pyOpenSSL of relatively ancient
vintage.
2018-08-17 13:57:13 -04:00
Jarrod Johnson
f5d5cbd67b
Have collective command warn if the libssl library is not viable
...
Main example is RedHat providing pyOpenSSL of relatively ancient
vintage.
2018-08-17 13:56:38 -04:00
Jarrod Johnson
319fec2145
Add advanced to nodeconfig
2018-08-17 11:16:11 -04:00
Jarrod Johnson
8787d23b3a
Add advanced to API for system configuration.
...
pyghmi makes hiding optional advanced settings.
2018-08-17 10:59:50 -04:00
Jarrod Johnson
9b48110155
Do not proceed a logged, but broken session
...
It shouldn't be possible for this to be the case, but out of an
abundance of caution, check for this. So far only produced this by
forcing broken = True in a debug session. Intended to catch an alleged
scenario where console was managing to use a broken session (fixed in
pyghmi) and have confluent also recognize the situation for non-console
usage).
2018-08-16 14:43:16 -04:00
Jarrod Johnson
3064e7bef6
Ensure path is made prior to creating transactioncount
...
Fresh install will be missing /etc/confluent/cfg. Advance the
_mkpath call to fix this problem.
2018-08-08 18:05:45 -04:00
Jarrod Johnson
1d4df8af3a
Fix extraneous error in log on connectivity loss
2018-08-07 15:43:53 -04:00
Jarrod Johnson
2aba6e469c
Correct variable name in the 'connected' fix
2018-08-07 15:31:41 -04:00
Jarrod Johnson
de58593f14
Fix inability to notice underlying broken layers of the SOL
...
Through an unknown set of circumstances, an solconnection could be
stuck 'connecting'. In every case analyzed, the ipmi_session was
broken. Use that to detect a class of failure and react appropriately.
2018-08-07 15:12:53 -04:00
Jarrod Johnson
ecbe1a86b1
Revert "Have nodeconsole restore term on exit"
...
This reverts commit 2972374da8
.
2018-08-02 10:27:37 -04:00
Jarrod Johnson
2972374da8
Have nodeconsole restore term on exit
2018-08-02 10:07:41 -04:00
Jarrod Johnson
81dd6202d3
Fix when rpc has no 'exc' but has 'xid'
2018-07-30 11:26:09 -04:00
Jarrod Johnson
36a202842a
Fix collective on rpc exception
...
Exceptions on collective calls were not correctly handled, fix
the handling so that collective continues and also the calling function
is correctly given the exception.
2018-07-30 09:33:24 -04:00
Jarrod Johnson
6a8e24dd0e
Prioritize interactive feedback part of console handling.
2018-07-26 08:55:25 -04:00
Jarrod Johnson
d3afeb3414
Fix web shell if user hits enter too fast
2018-07-24 17:20:22 -04:00
Jarrod Johnson
1bf4c0ac0a
Have collective coalesce watched updates
...
Particularly chatty output can make collate be unreasonable in
low quality terminals and links. Throttle to about 4 times a second.
2018-07-24 16:50:46 -04:00
Jarrod Johnson
8e422ef822
Fix ssh access
...
Fixed handler (e.g. ssh) did not return console consistent with
the plugin defined handlers.
2018-07-24 16:48:46 -04:00
Jarrod Johnson
f0edbbad39
Have collective show present some info when not in quorum
2018-07-20 14:11:38 -04:00
Jarrod Johnson
5cf1671350
Make the takeover process more deterministic
...
Try to avoid submitting to be a follower while we are currently
becoming a leader
2018-07-20 13:50:42 -04:00
Jarrod Johnson
e5c4219ee9
Reorder certificate check
...
First order of business is to verify certificate before even thinking
about if the request is possible
2018-07-20 13:34:14 -04:00
Jarrod Johnson
3ff7e42074
Change behavior for fallback handling
...
Fallback would do nothing to fix a persistent problem with an IPMI
session. For lack of knowing how to avoid the situation, at least
make changes so it won't go wrong in the future.
2018-07-20 13:20:50 -04:00
Jarrod Johnson
fab177e077
Fix node[group][attrib|define] handling of =
...
Attributes with = in the value were not handled correctly,
fix by only doing one split.
2018-07-20 09:54:17 -04:00
Jarrod Johnson
a1ba5f59a8
Fix collective show on non-collective
2018-07-19 17:21:01 -04:00
Jarrod Johnson
9bcca6bfad
Provide collective show on all members
2018-07-19 17:08:20 -04:00
Jarrod Johnson
96671ace4e
Correct collective show behavior
2018-07-19 16:48:30 -04:00
Jarrod Johnson
bcff3fc962
Improve collective show readability
2018-07-19 16:39:13 -04:00
Jarrod Johnson
54d93571d1
Have leader provide more data in collective show
2018-07-19 16:26:05 -04:00
Jarrod Johnson
f2f902de7b
Have collective show report when collective inactive
...
Collective show was misleading if not in a collective.
2018-07-19 15:59:15 -04:00
Jarrod Johnson
a09792f969
Schedule periodic attempts to restart collective
...
If collective is lost due to connectivity, this will cause
occasional attempts to bring it back.
2018-07-19 15:49:05 -04:00
Jarrod Johnson
7d16c943a8
Handle updating address of collective member on connect
...
If a collective member changes its IP address, update at the next
possible opportunity.
2018-07-19 15:24:08 -04:00
Jarrod Johnson
b053d41cd8
Error on loss of manager in flight
2018-07-19 14:36:23 -04:00
Jarrod Johnson
200569e7af
Merge branch 'master' into clustertime
2018-07-19 13:32:00 -04:00
Jarrod Johnson
c3c0e1570a
Push quorum state to followers
...
The followers need to know quorum state.
2018-07-19 13:27:21 -04:00
Jarrod Johnson
10c82a72b5
Restore message on unreachable collective member
...
The parallel execution had broken how that message transmits.
Bonus, make it a per node error.
2018-07-18 16:49:54 -04:00
Jarrod Johnson
79cdf65a72
Fix SLES sockapi
...
Previous fix was applied to the incorrect section of code
2018-07-18 15:07:22 -04:00
Jarrod Johnson
497ca40492
Do not abort connecting process on bad cert
...
The target may be non-viable, but don't let that ruin the party
for everyone. Let it keep going as if the system were down.
2018-07-18 14:58:16 -04:00
Jarrod Johnson
fd33e6ae01
Fix non-collective confluent mode
...
list_collective returns an iterator, which will be True...
2018-07-18 14:53:23 -04:00
Jarrod Johnson
32f944e67c
Handle unclean loss of current proxy host
...
If transition is less than gentle, provide a path to restore automatic
if it gets moved.
2018-07-18 14:32:39 -04:00
Jarrod Johnson
dcad9f5a75
Add keepalive and acks to collective
...
Detect unplugged condition (eventually).
2018-07-18 13:45:03 -04:00
Jarrod Johnson
2a34388d09
Add -p to man page for nodepower
2018-07-18 11:02:12 -04:00
Jarrod Johnson
6993e0b496
Fix nodepower argument parsing
...
nodepower was assuming that the second parameter was always the
state regardless of option parsing. Use args instead to fix.
2018-07-18 11:00:01 -04:00
Jarrod Johnson
b7fe72673d
Add clear node/group attributes to collective
...
collective was not syncing clear directives.
2018-07-17 15:57:48 -04:00
Jarrod Johnson
0159bf1b1d
Fix typo in error message
2018-07-17 15:39:08 -04:00