Jarrod Johnson
e40c362743
Fix network configuration changes through collective
2021-03-16 14:54:26 -04:00
Jarrod Johnson
cafbc1d1c2
Have tty2 also run a shell
...
Some may find switching VTs to be more intuitive
than tmux session management. Provide one extra
on tty2 for such a scenario.
2021-03-13 12:50:44 -05:00
Jarrod Johnson
797e197bc7
Have tmux keep reattaching
...
If someone accidentally detaches, then intervene and
reattach.
2021-03-13 12:50:37 -05:00
Jarrod Johnson
19d4a3a83f
Have autocons attempt sizing of serial console
...
If a terminal is open during autocons, that terminal
will be the size of the console.
Otherwise, fall back to 100x31.
2021-03-13 12:50:21 -05:00
Jarrod Johnson
874947076d
Fix local console behavior in genesis
...
This will have the vga console be less concerning.
2021-03-12 13:55:33 -05:00
Jarrod Johnson
ffe0754dd9
Fix typo in confluent profile.yaml
...
The quiet argument was missed due to typo in parameter name.
2021-03-12 12:44:19 -05:00
Jarrod Johnson
bbb8e50c3e
Fix configbmc gateway check
...
The decision whether to check the gateway was erroneously tied to the prefix.
2021-03-11 16:28:01 -05:00
Jarrod Johnson
4f246e6a41
Fix compatibility with some switch configurations
...
While some implementations mess up portid and need portdescr instead, others are
just the opposite.
Tolerate a match by either description or name.
2021-03-10 13:41:59 -05:00
Jarrod Johnson
c53e758170
Work around non-cisco switch crash
...
Querying Cisco MIB on certain
firmware levels of non-cisco switches
causes a crash. Tolerate and
wait a bit to give SNMP a chance to restart.
2021-03-10 13:41:52 -05:00
Jarrod Johnson
1a7c238b54
Handle malformed json data more gracefully
2021-03-10 13:41:45 -05:00
Jarrod Johnson
94a4a7e20b
Fix missing exit code in nodemedia
...
nodemedia was not setting return code on exit properly.
2021-03-08 08:01:30 -05:00
Jarrod Johnson
c9157b90eb
Close other places that may yield false negatives
...
Have checks of the neigh table be preceded by an attempt to refresh,
to mitigate false negatives.
2021-03-05 13:15:37 -05:00
Jarrod Johnson
5d0423c38b
Attempt refresh of neigh table on miss
...
When an address is new it may not be in the last
captured neighbor table. Induce refresh before deciding
that neighbor is unavailable.
2021-03-05 13:09:54 -05:00
Jarrod Johnson
e583d34555
Fix nic index map with bonding
...
The assumption that every entry in /sys/class/net is an interface
is incorrect. When encountering entries that are not interfaces, do not
mess up the call.
2021-03-04 10:49:23 -05:00
Jarrod Johnson
95466392f9
Fix typo in confluentdbutil
...
The restore function would fail
to chown directories due to typo
2021-03-01 10:32:13 -05:00
Jarrod Johnson
b3857f8d33
Fix chained discovery of DWC SMMv2
...
SMMv2 for DWC has more ports. Make the code
not care about which port is which for checking
for matching smm fingerprints.
2021-03-01 10:32:05 -05:00
Jarrod Johnson
7eb06f2722
Improve concurrency of SLP
...
During a scan, unicast TCP
interrogation of candidates
was done serially. Do this
concurrently so that poorly
behaving targets do not prolong
a scan.
2021-02-26 13:31:07 -05:00
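
The serial-to-concurrent change described in the commit above can be sketched roughly as follows. This is an illustration only, not the code from the commit: `interrogate_all` and its `interrogate` callback are hypothetical names, and a thread pool from the standard library stands in for whatever concurrency mechanism the project actually uses.

```python
from concurrent.futures import ThreadPoolExecutor


def interrogate_all(candidates, interrogate, max_workers=32):
    """Run interrogate(candidate) for every candidate concurrently.

    interrogate is a hypothetical callable that performs the unicast
    query for one target; it is expected to apply its own timeout.
    Returns a dict mapping each candidate to its result, or to None
    if the query for that candidate raised an exception.
    """
    results = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Submit everything up front so slow targets overlap
        # instead of being waited on one after another.
        futures = {cand: pool.submit(interrogate, cand)
                   for cand in candidates}
        for cand, future in futures.items():
            try:
                results[cand] = future.result()
            except Exception:
                # A misbehaving target only loses its own result.
                results[cand] = None
    return results
```

With this shape, the scan takes roughly as long as the slowest single target rather than the sum of all of them, provided the pool is large enough for the candidate list.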
Jarrod Johnson
8cf264602d
Update status at end of esxi install
2021-02-05 14:02:28 -05:00
Jarrod Johnson
a2f5b11185
Add Usb to nodesetboot
...
Particularly for redfish, this is a more specific target
2021-02-01 08:43:47 -05:00
Jarrod Johnson
a9e39eab96
Fix path mistake in mofed.post script
...
mofed.post had incorrect path to mofed.tgz after
fetch. Correct the path mistake.
2021-01-28 16:50:50 -05:00
Jarrod Johnson
d29d2bf683
Implement workaround for install over infiniband
...
8.3 implemented NetworkManager to manage the install time
networking, but didn't correctly generate the infiniband
NetworkManager configuration.
Work around by checking for an infiniband install, and
then checking if the configuration is wrong, and
adding an infiniband section if missing.
2021-01-28 13:49:10 -05:00
Jarrod Johnson
6458eac93b
Tolerate multiple forms of 'stringy'
...
The fromstring needs to accept either, so
amend it to do so.
2021-01-22 12:43:15 -05:00
Jarrod Johnson
8df15b3a54
Tolerate different SR635/SR655 response
...
Some SR635/SR655 behave differently. Adjust by adapting as possible,
but never making an assumption about data being present.
2021-01-22 10:30:19 -05:00
Jarrod Johnson
b4f9bb78ae
Address irrelevant call to yaml.load
2021-01-21 17:48:56 -05:00
Jarrod Johnson
c8e1efecdb
Mitigate XML parse risks
...
The intended xml never has !entity tags and
thus we can reject any such XML outright and
avoid billion laughs and similar abuses.
2021-01-21 17:46:21 -05:00
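
A minimal sketch of the rejection approach described in the commit above, assuming the input is a str or an ASCII-compatible byte string. The function name `parse_without_entities` is hypothetical and the actual check in the commit may differ.

```python
import xml.etree.ElementTree as ET


def parse_without_entities(data):
    """Parse XML, refusing any document that declares entities."""
    if isinstance(data, str):
        data = data.encode('utf8')
    # The ENTITY keyword is case sensitive in XML, so a plain byte
    # search catches declarations in ASCII-compatible encodings.
    # Assumption: inputs in other encodings (e.g. UTF-16) are not
    # expected and would need separate handling.
    if b'<!ENTITY' in data:
        raise ValueError('XML with entity declarations is not accepted')
    return ET.fromstring(data)
```

Because legitimate documents never declare entities, refusing them before the parser runs avoids entity expansion entirely instead of trying to bound it.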
Jarrod Johnson
22dc852277
Remove now redundant errors
...
printerror now searches for deeper errors,
no longer need to call it as much.
2021-01-21 16:38:35 -05:00
Jarrod Johnson
784ac5ecba
Fix gitignore to track ronn files
2021-01-21 11:52:22 -05:00
Jarrod Johnson
66c9777b3c
Add man page for stats command
2021-01-21 11:36:08 -05:00
Jarrod Johnson
dba4c40f0e
Fix collective join with empty config
...
Collective join without a key set would fail on
first try.
2021-01-21 11:19:13 -05:00
Jarrod Johnson
6997508a0c
Fall back to forced utf8 on unicodeerror
...
If LC_ALL is set to C, then unicode can be a problem.
2021-01-21 08:41:35 -05:00
Jarrod Johnson
203253e05f
Fix misspelling of exception name in confetty
2021-01-20 11:19:24 -05:00
Jarrod Johnson
edc4804146
Discover larger SMMv2 based chassis
...
SMMv2 can support 12 servers, so increase the limit.
2021-01-20 09:08:56 -05:00
Jarrod Johnson
7cfdf11bf2
Fix collective name return
...
It inadvertently would return None when
rereading from file.
2021-01-19 17:34:28 -05:00
Jarrod Johnson
a3bd21d605
Clean up confetty/nodeconsole exit
...
Unexpected exit from nodeconsole/confetty is now
handled better for feedback and terminal usefulness
2021-01-19 16:24:36 -05:00
Jarrod Johnson
6d8474a16a
Fix node errors being swallowed by print_error
...
Node specific errors were not processed, correct
that oversight.
2021-01-19 12:16:22 -05:00
Jarrod Johnson
5736c41daa
Add more data to discovery
...
Some applications may
want to source more information
from systems to help
identify things, particularly with
partially preconfigured systems.
2021-01-15 10:34:43 -05:00
Jarrod Johnson
a5c4b64c60
Have Genesis better work with BMC install
...
In a BMC install, the certificate may fail and we may
have the ability to port forward 3389 but not 22.
Support normal and enhanced certificate behavior
when possible, but degrade to cert-less ssh and
also port 3389.
2021-01-15 08:25:12 -05:00
Jarrod Johnson
f7a940227d
Move sixel under opportunistic import
...
Do not require sixel
to run stats.
2021-01-14 15:49:30 -05:00
Jarrod Johnson
ebf50359f0
Fix file descriptor leak by web forwarder
...
It failed to close the two sockets when a socket was
done.
2021-01-13 16:50:17 -05:00
Jarrod Johnson
5160023cc4
Update nodedeploy error message
...
We can't tell which argument was omitted,
so warn that both are needed.
2021-01-13 16:43:41 -05:00
Jarrod Johnson
a738b761b4
Fix XCC discovery with Whitley changes
2021-01-12 11:47:02 -05:00
Jarrod Johnson
d27ef81e32
Fix PXE handling of candidate managers
2021-01-11 13:33:26 -05:00
Jarrod Johnson
f5344fabaa
Correct typo in new attribute text
2021-01-11 13:13:03 -05:00
Jarrod Johnson
fa1c2f5c1e
Only offer deployment if a candidate manager
...
If candidate managers are defined, and this node is not in
that set, ignore PXE and SSDP requests to opt out of
deployment.
2021-01-08 16:32:41 -05:00
Jarrod Johnson
25c3f40559
Cache manager name
...
Since the get_myname() may be called much much more
frequently now that it is in the deployment flow,
have it cache results to save a lot of disk I/O
2021-01-08 16:30:51 -05:00
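
The caching idea in the commit above can be illustrated along these lines. This is a sketch under assumptions: the real get_myname() takes no arguments per the commit message, while this version takes a hypothetical `path` parameter so the example is self-contained, and the on-disk format (a single line holding the name) is also assumed.

```python
_name_cache = {}


def get_myname(path):
    """Return the manager name stored at path, reading the file once.

    Later calls for the same path are served from memory, so a hot
    code path does not touch the disk on every invocation.
    """
    if path not in _name_cache:
        with open(path) as namefile:
            _name_cache[path] = namefile.read().strip()
    return _name_cache[path]
```

The trade-off is that a change to the file after the first call is not seen until the cache is cleared or the process restarts.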
Jarrod Johnson
5812a0eef6
Have a rebalance shortly after becoming leader
...
This will handle startup and takeover when the current leader dies.
2021-01-08 16:15:11 -05:00
Jarrod Johnson
086ce9823b
First phase of collective manager candidate implementation
...
This implements recovery on loss of collective member to
the least loaded candidates for the node.
2021-01-08 16:00:24 -05:00
Jarrod Johnson
2d6bdffebe
Finalize the ssh.trustnodes facility
...
This is the confluent approach to handling the same
problem that xCAT SSH Zones do.
2021-01-08 14:05:37 -05:00
Jarrod Johnson
efdbeeae0d
Fix SNMPv2 on non-cisco switches
...
The cisco change was causing problems elsewhere.
2021-01-08 09:16:40 -05:00
Jarrod Johnson
a2a1142f18
Draft implementation of ssh trust segmentation
...
Have equiv optionally be restricted to a subset of nodes
so that node to node ssh may be enabled within subsets
without enabling across the board.
This is akin to 'zones' in xCAT, albeit a bit more flexible
and covering both users and administrative access.
2021-01-06 11:52:43 -05:00