greg
2018-02-02 05:00
- tip has been update with fixes for the deadlock some have been seeing. ALso, DRP will point to the stable UX by default. This includes more unit tests for DHCP.

greg
2018-02-02 05:00
We are getting close to 3.7.0

wdennis
2018-02-02 17:44
OK, victory in preseeding appears within reach! :the_horns:

wdennis
2018-02-02 17:44
But MacGyver would be proud...

ctrees
2018-02-02 17:45
ONLY if it blew up your test lab as you escape with a paper clip... :wink:

wdennis
2018-02-02 17:45
A) Ubuntu has a subsystem called "kickseed" that can take a kickstart file from the PXE command line, and auto-magically translate it into preseed...

wdennis
2018-02-02 17:48
So, from https://code.launchpad.net/~ubuntu-installer/kickseed/master get the code, and go into `~ubuntu-installer/kickseed/master`

wdennis
2018-02-02 17:49
B) Copy in a "Ubuntu-compatible" kickstart file (Ubuntu only supports a subset of kickstart) such as:

wdennis
2018-02-02 17:54

wdennis
2018-02-02 17:55
C) Then, run `./test-kickseed <kickstart_file>`

wdennis
2018-02-02 17:56
D) Take what you need from the preseed file which is generated & output to the screen

wdennis
2018-02-02 17:57
E) Profit!

ctrees
2018-02-02 18:00
Oh... so you get detailed output of all the script-expansion running the https://bazaar.launchpad.net/~ubuntu-installer/kickseed/master/view/head:/test-kickseed

ctrees
2018-02-02 18:11
I THINK I follow... so you've MacGyver'd a kickseed lint for ubuntu so you can pass a 'clean' (no additional expansion) to drp (really sledgehammer) cause it is CentoOS based and does not expand as expected ? (Note: doing as a mental exercise as I need to figure out kickseed execution for an embedded system) ? ... Check my math... pretty COOL if I'm tracking correctly...

wdennis
2018-02-02 18:13
No. It's just a means to an end to generate preseed directives (basically, the `partman` disk-partitioning ones) from a kicstart file, which I understand better, and IMNSHO has a WAY saner config language (especially for disk partitioning!)

wdennis
2018-02-02 18:15
So from this kickstart config section:


wdennis
2018-02-02 18:16
I got this preseed section:


wdennis
2018-02-02 18:18
And, it works!


ctrees
2018-02-02 18:24
So the MacGyver was a 'preseed template creation' not a linter... or more of the 'collect underpants' stage

wdennis
2018-02-02 18:25
It was a "I know how to write the partitioning I want in kickstart, but can't figure out how to do the same in preseed" rosetta stone :slightly_smiling_face:

wdennis
2018-02-02 18:27
Now, the kewl thing with DRP templating is, @greg (or @vlowther maybe?) wrote support in for templates-in-templates (which Go templates don't have native support for, amirite?)

wdennis
2018-02-02 18:28
So now I can have a library of disk partition templates that I can "plug in" to my standard preseed template for Ubuntu

ctrees
2018-02-02 18:30
I know I have to do something like what you've pulled off as I've got a bunch of old HPE gear that I'll need to avoid the RAID stuff with... AND hope to get to use drp on some embedded f/w support stuff... have not done in the mud kickstart in decades... esp all the part stuff... glad your shoveled a path :wink:

shane
2018-02-02 18:36
@wdennis - yes @greg added the ability to have Nested templates, which are not native to Golang Templating ... and collecting a bunch of example partitioning schemes and allowing to select the right Nested Template based on a Param input would be a very very nice thing to have

wdennis
2018-02-02 18:40
Right now for testing, I am setting the `part-scheme` and `select-kickseed` params on the host itself, but in the future, I'll probably set them in the Profile that the hosts are set to

andreas.holmsten
2018-02-06 14:59
@wdennis I've also had some issues with the preseed and disk partitioning. Seems like some preseed options in the default template (btw part-scheme-default.tmpl got incorrect options) are in the wrong order which cause partitioning to not work correctly. Took a couple of hours for me to track it down but not had time to make a pull request yet

wdennis
2018-02-06 15:35
@andreas.holmsten Thx for info; want to share your findings in the meantime?

wdennis
2018-02-06 15:37
I'm slowly piecing together (trial/error due to poor docu) a partitioning sub-template that does /boot, /boot/efi, and then rest of disk for LVM PV that then gets split into multiple LV's (in one VG)

wdennis
2018-02-06 16:50
My as-of-now preseed partitioning is thus:


wdennis
2018-02-06 16:52
Funny thing is, is does the right thing on a HDD of 1TB or more; but on >1TB (tried on 500GB & 250GB) it just makes LV's for swap and root, no /home...)

wdennis
2018-02-06 16:57
I think the weight values are funky, but they are exactly what the "kickseed" tool produced

wdennis
2018-02-06 19:48
So, what's the correct syntax to spec a couple of "Params" values on a machine? Did this, failed: ```$ drpcli machines update 985a9585-1923-491d-b813-1070a3c11f51 '"Params": { "part-scheme": "separate_home-TEST", "select-kickseed": "necla-ubu-seed.tmpl" }' Error: Failed to generate changed machines:985a9585-1923-491d-b813-1070a3c11f51 object: invalid character ':' after top-level value```

greg
2018-02-06 19:52
use the get or set cli commands

greg
2018-02-06 19:53
`drpcli machines set <uuid> param <param-name> to <value>`

wdennis
2018-02-06 20:15

wdennis
2018-02-06 20:15
Thx @greg

wdennis
2018-02-06 20:18
No way to set multiple with the same command?

greg
2018-02-06 20:19
the update you were doing is the way.

greg
2018-02-06 20:19
```drpcli machines update 985a9585-1923-491d-b813-1070a3c11f51 '"{ Params": { "part-scheme": "separate_home-TEST", "select-kickseed": "necla-ubu-seed.tmpl" } }'```

greg
2018-02-06 20:20
Note the extra {}

greg
2018-02-06 20:20
To remove parameters, you have to use the remove subcommand.

wdennis
2018-02-06 20:29
Ah, OK

wdennis
2018-02-06 20:29
`remove` instead of `update`, or instead of `set`?

wdennis
2018-02-06 20:46
OK, it's like `$ drpcli machines remove 985a9585-1923-491d-b813-1070a3c11f51 param "part-scheme"`

wdennis
2018-02-06 20:47
Next issue: using the default DRP preseed partitioning, as so:


wdennis
2018-02-06 20:49
But getting this when the install hits the partitioning step:


wdennis
2018-02-06 21:42
^^^ anyone?

greg
2018-02-06 21:47
umm - start with checking the machine?s aggregate parameters to see what is going on to make sure all is unset.

wdennis
2018-02-06 22:00
@greg You mean this?

wdennis
2018-02-06 22:00

wdennis
2018-02-06 22:01
The `"Params:"` section?

greg
2018-02-06 22:02
if you add `--aggregate` it will include parameters.

greg
2018-02-06 22:03
It appears you are using your own preseed.

wdennis
2018-02-06 22:04
Yes, but pulls in DRP default partitioning (`part-scheme-default.tmpl`)

wdennis
2018-02-06 22:05
Doing it wrong, I guess: ```$ drpcli machines show 985a9585-1923-491d-b813-1070a3c11f51 --aggregate | jq 'del(.Params."gohai-inventory")' Error: unknown flag: --aggregate Usage: drpcli machines show [id] [flags]```

greg
2018-02-06 22:05
sorry

greg
2018-02-06 22:06
`drpcli machines params <uuid> --aggregate`

greg
2018-02-06 22:06
it restricts to just parameters.

greg
2018-02-06 22:06
One tests would be unset kickseed and see what happens.


wdennis
2018-02-06 22:24
OK, trying one machine with stock DRP preseed, we'll see what happens...

wdennis
2018-02-06 22:26
So, when I create/edit my own templates, where do they live in the filesystem? In `saas-content`?

wdennis
2018-02-06 22:27
Because from now on, I'm version-controlling the hell out of them...

greg
2018-02-06 22:28
They are in the writable store. That is why I in general don?t use clone actions, but create my own content bundle.

wdennis
2018-02-06 22:28
Where is that?

greg
2018-02-06 22:30
I create a directory, git init, throw in some files and then use `drpcli contents bundle`

greg
2018-02-06 22:30
to build a content bundle that I upload.

wdennis
2018-02-06 22:46
Sounds like the way to go...

wdennis
2018-02-06 22:47
Any docu on that process?

zehicle
2018-02-06 22:47
@wdennis we are creating a video for this. Our first attempt was pretty close, but needs to be updated for sound quality. https://youtu.be/yy7-2D4jXXg

wdennis
2018-02-06 22:47
Will check out... thx @zehicle

wdennis
2018-02-06 23:01
OK, booted a new machine, only changed the name & profile, NO custom preseed/partitioning set, still getting the "No root file system is defined" error...

wdennis
2018-02-06 23:03

wdennis
2018-02-06 23:04

wdennis
2018-02-06 23:05
Can anyone see any problems in the generated preseed? Should be DRP standard...

greg
2018-02-06 23:28
You could try to boot into sledgehammer and check to see if the disk is already partitioned, if so, wipe it (like in the erase-disk task), and try. See if the LVM pre-existing is getting in the way.

wdennis
2018-02-06 23:32
I actually did do that (the disk was used, did have pre-existing LVM; I did a `vgremove` then `pvremove` and thereafter `dd if=/dev/zero of=/dev/sda bs=512 count=1` to wipe MBR + part tbl

wdennis
2018-02-06 23:33
So to DRP should look like a blank disk.

greg
2018-02-06 23:34
it isn?t DRP - it is ubuntu.

wdennis
2018-02-06 23:34
You are right

wdennis
2018-02-06 23:35
But anyways.

greg
2018-02-06 23:38
You may need to add the erase-hard-disks-for-os-install to your flow.

greg
2018-02-06 23:39
it does this: ``` #!/bin/bash # Nuke it all. declare vg pv maj min blocks name # Make sure that the kernel knows about all the partitions for bd in /sys/block/sd*; do [[ -b /dev/${bd##*/} ]] || continue partprobe "/dev/${bd##*/}" || : done # Zap any volume groups that may be lying around. vgscan --ignorelockingfailure -P while read vg; do vgremove -f "$vg" || : done < <(vgs --noheadings -o vg_name) # Wipe out any LVM metadata that the kernel may have detected. pvscan --ignorelockingfailure while read pv; do pvremove -f -y "$pv" || : done < <(pvs --noheadings -o pv_name) # Now zap any partitions along with any RAID metadata that may exist. while read maj min blocks name; do [[ -b /dev/$name && -w /dev/$name && $name != name ]] || continue [[ $name = loop* ]] && continue [[ $name = dm* ]] && continue [[ $name = fd* ]] && continue mdadm --misc --zero-superblock --force /dev/$name || : if (( blocks >= 2048)); then dd "if=/dev/zero" "of=/dev/$name" "bs=512" "count=2048" dd "if=/dev/zero" "of=/dev/$name" "bs=512" "count=2048" "seek=$(($blocks - 2048))" else dd "if=/dev/zero" "of=/dev/$name" "bs=512" "count=$blocks" fi done < <(tac /proc/partitions) ```

greg
2018-02-06 23:39
We found you also have to blast the end of the disk.

wdennis
2018-02-06 23:39
Is that a new stage?

greg
2018-02-06 23:40
it is a task that can be added to stage.

shane
2018-02-06 23:40
LVM leaves nasty poo all over the place and is a nightmare to get rid of ...

wdennis
2018-02-06 23:40
Yup

wdennis
2018-02-06 23:40
But I thought vgremove/pvremove would get rid of it...

shane
2018-02-06 23:40
nope

greg
2018-02-06 23:40
nope.

wdennis
2018-02-06 23:41
And the "nope"s have it!

wdennis
2018-02-06 23:42
Why does it work tho when I re-install one of my DRP-installed hosts? B/c it has same LVM structure?

greg
2018-02-06 23:43
probably

greg
2018-02-06 23:43
I?ll probably need to post a new flow.

greg
2018-02-06 23:43
at some point that uses stage-chooser and a pre-stage to wipe the disk, but that is later.

wdennis
2018-02-06 23:56
Booted the non-installing node with sledgehammer, and doing a `dd if=/dev/zero of=/dev/sda bs=1M` to wipe the disk..

wdennis
2018-02-06 23:56
Of course, it's a 2TB disk, so that'll run for a while...

wdennis
2018-02-06 23:57
We'll see what I get with the normal install thereafter

andreas.holmsten
2018-02-07 10:00
@wdennis the no root file system is exactly what i had to troubleshoot as well. First off `d-i partman/auto expert_recipe string` in the partitioning scheme is incorrect syntax. It should be `d-i partman-auto/expert_recipe string`. Secondly I moved: ``` d-i partman/confirm_write_new_label boolean true d-i partman/choose_partition select finish d-i partman/confirm boolean true d-i partman/confirm_nooverwrite boolean true ``` to after partitioning scheme. Otherwise the installer will error on no root partition found (as there isnt one yet made)

andreas.holmsten
2018-02-07 10:03
The single reason the default preseed and partitioning work is because of the incorrect syntax. Since no manual scheme is selected the default partman atomic scheme is

andreas.holmsten
2018-02-07 10:21

andreas.holmsten
2018-02-07 10:24
Observe that I'm not really that good with preseed so my assumptions might be wrong but it's what resolved the issue for me

greg
2018-02-07 12:48
@andreas.holmsten seems expert to me :grinning:

greg
2018-02-07 12:48
I?ll review and pull it in

2018-02-07 13:46
hi all - currently kicking the tyres to see if Rebar fits my use-case - looking good so far! I did run into this whilst attempting to pxe my first victim: "[0:1]TFTP: lpxelinux.0: transfer error: sending block 0" and the issue error seems to come from here https://github.com/digitalrebar/provision/blob/master/midlayer/tftp.go#L68. unfortunately i'm not familiar with Go and i'm unsure as to why that error has cropped up - can it just not find lpxelinux.0?

wdennis
2018-02-07 13:55
@andreas.holmsten Thanks - together maybe we can get a good base partitioning template, and then (my hope) maybe collaborate on community partitioning templates

wdennis
2018-02-07 13:59
Also @greg / @vlowther - maybe a good idea to put all of the partitioning (`d-i partman*`) directives into the base partitioning template - right now they are spread out over the preseed template and the partitioning template

shane
2018-02-07 14:02
@analbeard we're in a meeting - but check that your DRP Endpoint doesn't have an asymmetric routing issue. You might try to add --static flag to DRP start up, with the IP address from the interface on the provisioning machine side

2018-02-07 14:03
thanks Shane, that's certainly a possibility - the environment might need a little more work first

wdennis
2018-02-07 14:09
Hey RackN folk - at an Ansible training today, they are still referencing Cobbler on their slide deck:


wdennis
2018-02-07 14:25
Interesting - they keep mentioning Cobbler in their deck - strange for a dying OSS project to get such RHAT mentions?

wdennis
2018-02-07 14:25
@wdennis uploaded a file: https://rackn.slack.com/files/U416T0AAX/F96BHSF9V/cobbler-commits.png and commented: No commits since 1st week of Oct?17?

spector
2018-02-07 14:26
The project is essentially ?dead?. They release about 1 or 2 a year but it isn?t active at all

wdennis
2018-02-07 14:27
May be an oppt?y to reach out to RHAT folks and pitch your product (I?m sure you?ve probably thought of that already :slightly_smiling_face: )

spector
2018-02-07 14:28
Yup, next time you see this go ahead and raise the hand and tell them all about Digital Rebar and to get with it!!!! We will of course reach out

vlowther
2018-02-07 15:17
@analbeard -- Those are expected at a rate of about 1 per boot, it is the nic firmware initiating a download to get the size and then aborting the transfer.

vlowther
2018-02-07 15:18
It then pulls it for real after that.

vlowther
2018-02-07 15:19
Downgrading that log message to info priority has been on my list of things to do.

vlowther
2018-02-07 15:21
As long as the systems proceed to PXE boot, you can disregard it for now.

wdennis
2018-02-07 16:01
Ok, I dutifully harangued the lead RHAT guy in the training about how Cobbler must die, and oh by the way, have you heard of RackN? :grin:

wdennis
2018-02-07 16:04
he said he doesn?t see a lot of Cobbler out on the consulting gigs; mostly RedHat Satellite (upstream = Foreman)

gbuehler
2018-02-07 16:54
has joined #community201802

shane
2018-02-07 17:15
@gbuehler $welcome

2018-02-07 17:15
Digital Rebar community welcome information is here > http://rebar.digital/community/welcome.html

skluss
2018-02-07 17:38
has joined #community201802

jschulthies
2018-02-07 17:39
has joined #community201802

wdennis
2018-02-07 19:52
@greg Any way to register DRP events when a node PXE's and pulls pxelinux.0, then the kernel & initrd, so we can see those in the Event Log?

vlowther
2018-02-07 20:19
Not at this time.

wdennis
2018-02-07 20:43
Roadmap?

wdennis
2018-02-07 20:49
Would be great to know when the actual node installer boot happens?

vlowther
2018-02-07 21:12
Open an issue, or else I will forget by the end of the day. :grinning:


wdennis
2018-02-08 03:33

2018-02-08 13:41
afternoon all! i'm still poking DRP - maybe a little further along the road now. I've done a fairly vanilla install, bound DRP to the interface I want to use (usign --static-ip), but I'm still unable to get any of the files in the tftpboot dir which obviously means the boot fails. I've tried connecting with a tftp client but that also fails with 'transfer timed out'

2018-02-08 13:41
any suggestion as to where i'm going wrong here?

greg
2018-02-08 13:58
Firewall rules?

2018-02-08 13:59
yup i just clocked that about two minutes ago

2018-02-08 14:00
palm has been vigorously applied to face

greg
2018-02-08 14:00
:grinning:

2018-02-08 14:01
someone should bust me back down to first line for that

2018-02-08 14:10
possibly a daft question, but would the serial console show anything when the victim has booted into the sledgehammer env? i can see it in the machine list so it's been successful, but there's nada on the console

greg
2018-02-08 14:11
It will depend upon your hardware / env. We don?t pass a serial console, so it is linux defaults.

greg
2018-02-08 14:11
For packet, we add profiles that set the `kernel-console` parameter

greg
2018-02-08 14:12
To handle this case.

greg
2018-02-08 14:12
You can set that globally or on a machine or a profile to a machine.

greg
2018-02-08 14:12
Checkout the parameter `kernel-console` in the UX.

2018-02-08 14:12
ok, that makes sense. it's not the end of the world, just nice to see some output to understand what's going on, especially whilst i'm just poking it. if we were to use it in production then it wouldn't matter

2018-02-08 14:12
ok, will do. thanks!

greg
2018-02-08 14:13
make sense. Also, if you setup the `access-keys` parameter (docs has some stuff on this), you can ssh in to the box as well.

greg
2018-02-08 14:13
@faq


greg
2018-02-08 14:14
in faq - 22.3


2018-02-08 14:15
ah yes, that looks handy. thanks Greg!

greg
2018-02-08 14:15
You can do the command in the doc to the `global` profile and it will be available to all machines all times.

2018-02-08 14:20
ah yes i see what you mean, i can see a key for 'galthaus@Gregs-MacBook-Pro.local' in the root-access-example profile ;)

greg
2018-02-08 14:21
well - you know. author privs and all

greg
2018-02-08 14:22
The main thing to remember is that the parameter has to be set on the machine (globally, or specifically, or by profile assignment) when the task `ssh-access` runs during the discover stage. That means for discovered things you have to have it globally set. Or set it after discovery and reboot.

2018-02-08 14:25
sure, that makes sense. I think i've probably got enough to get something going now

2018-02-08 14:25
thanks again!

2018-02-08 15:08
hmm, I've had a machine PXE booted for half an hour or so but i'm unable to progress to an install: `Can not change bootenv while in a stage unless forced. old: sledgehammer new ubuntu-16.04-install`

2018-02-08 15:08
the machine is currently sitting in the discover stage according to it's info

greg
2018-02-08 15:24
This means you have a task that didn?t complete I think.

greg
2018-02-08 15:25
Check the jobs area to see if there is a failed job. The machine has probably been marked not runnable as well.

lae
2018-02-09 07:48
```[lae@yuzu fireeye-content]$ drpcli profiles update global global.yaml Error: Failed to generate changed profiles:global object: invalid character '-' in numeric literal```

lae
2018-02-09 07:48
I'm getting this I guess after a recent update, did anything change regarding importing profiles from yaml?

lae
2018-02-09 07:49
kind of expected this to also work, but I guess it's not expecting stdin to be yaml anymore? ``` [lae@yuzu fireeye-content]$ drpcli profiles show global -F yaml > tmp.yml [lae@yuzu fireeye-content]$ drpcli profiles update global - < tmp.yml Error: Failed to generate changed profiles:global object: invalid character 'A' looking for beginning of value ```

2018-02-09 09:40
@lae I had that yesterday, I think it's mean to be JSON

lae
2018-02-09 09:42
yes, but yaml used to be allowed

lae
2018-02-09 09:43
I'm just reusing my existing workflow for updating the global profile, which seems to not support yaml anymore

lae
2018-02-09 09:44
(I just went ahead and exported/edited/reimported as json for my immediate need but that's more tedious than editing yaml and committing it to git)

2018-02-09 09:45
@greg (or anyone at RackN) - is there no way to view the UI than through the RackN website? The environment my DRP box will live in won't be internet accessible, and even if it were I can guarantee our security team would have a shit-fit if I suggested doing that

2018-02-09 09:46
@lae I have found some of the docs to be a little out of date and I had to fudge my way around it - editing profiles was oen of them

lae
2018-02-09 09:47
the rackn UI doesn't access your DRP instance over the internet, it uses JS in your browser to access it - so you just need to be able to access the DRP endpoint from your browser

2018-02-09 09:47
oh - i hadn't investigated it because I thought that was how it worked. that certainly improves things!

lae
2018-02-09 09:52
and uh, my point is that this seems to be an unexpected regression in one of the recent releases. Anyway, I just tried downgrading drpcli to 3.4.1 and 3.2.1 (from 3.6.0) and that works in 3.2.1 but not 3.4.1

zehicle
2018-02-09 14:12
@analbeard, yes. That's a commercial offering of the ux.

zehicle
2018-02-09 14:14
We call that "air gap" but @lae is right. The ux does not require firewall holes because it uses CORS multi site.

vlowther
2018-02-09 14:37
@lae broken yaml support is definitely a bug. Open an issue?

greg
2018-02-09 14:46
@lae I opened an issue on that with the workaround for now. I was in meetings and need to look at it

greg
2018-02-09 14:47
Well the profile failing on redirect is more than what I was seeing.

2018-02-10 19:28
Just reinstalled with force. When I go to setup subnets in UX I just get a spinning "Loading Interfaces". Any suggestions? I am a newbie at drp.

2018-02-10 19:29
Running v3.6.0

shane
2018-02-10 19:43
@MattyBoy4444 - are you sure your DRP Endpoint is accessible from your Laptop/Management machine - no Firewalls or IPTables rules blocking access? You can also check the process to make sure it's running on the Endpoint as well (`ps -ef | grep dr-provision`) ... also - any log output from the running instance if it is running ?

shane
2018-02-10 19:44
you need TCP Port 8092 access to the DRP Endpoint from the system you are running the Web Browser connection to the Portal

2018-02-10 20:15
@rackneng I can access the UX frontend from my management machine. I did check Chrome console and found a jquery warning

2018-02-10 20:15
@rackneng jquery.min.js:2 jQuery.Deferred exception: Cannot read property 'push' of undefined TypeError: Cannot read property 'push' of undefined at https://rackn.github.io/provision-ux/build.js:14099:48 at Array.map (<anonymous>) at https://rackn.github.io/provision-ux/build.js:14076:38 at l (https://rackn.github.io/provision-ux/vendor.js:93588:443) at uu (https://rackn.github.io/provision-ux/vendor.js:93649:252) at Function.On.flatMap (https://rackn.github.io/provision-ux/vendor.js:93681:101) at Object.<anonymous> (https://rackn.github.io/provision-ux/build.js:14073:36) at j (https://rackn.github.io/provision-ux/vendor.js:93579:29999) at k (https://rackn.github.io/provision-ux/vendor.js:93579:30313) undefined


greg
2018-02-10 20:25
You may want to try changing the base url to http://portal.rackn.io

2018-02-10 20:27
@rackneng the log inside ux is basically empty

graziee
2018-02-11 15:32
has joined #community201802

2018-02-11 17:09
@rackneng I am running tip BTW. Sure looks like a bug due to the following changeset: https://github.com/digitalrebar/provision/commit/00f5ac97b8fab08353f03d068eab96948b706581

allen.swackhamer
2018-02-11 18:29
has joined #community201802

greg
2018-02-11 20:30
@MattyBoy4444 - the image you sent us - shows us that you are running v3.6.0 and a test UX.

greg
2018-02-11 20:31
You could try to change: `http://rackn.github.io` to `http://portal.rackn.io` and see if it loads differently.

greg
2018-02-11 20:32
The commit reference you made wouldn?t make a ux hang, because it doesn?t change API output. It would change internal actions. It also makes less thing required.

2018-02-11 22:16
@rackneng Well I installed from this URL. https://github.com/digitalrebar/provision/releases/download/tip/dr-provision.zip. Also the error is in reference the "NextServer" missing, which is what that changeset removes. My 2 cents.

greg
2018-02-11 22:20
Hmm - okay The image you sent doesn?t align with that, but probably true.

2018-02-11 22:22
@rackneng If you dig into Build.js on that line, I think it references o.NextServer

greg
2018-02-11 22:22
Yeah - I?m looking at it. The UX is not handling the facct that NextServer can be unset.

greg
2018-02-11 22:28
Actually, it is just a UX bug.

2018-02-11 22:28
That is what I thought.

greg
2018-02-11 22:28
I still don;?t think you are using tip, but just a second.

greg
2018-02-11 22:34
@MattyBoy4444 - try the UX again,

greg
2018-02-11 22:35
Then we should try and figure out the version of drp you have.

2018-02-11 22:35
I was afraid you would say that. :) I just wiped and started installing stable. Hmmm... I could wipe again and start over. Not a big deal. I was just setting up a test platform.

2018-02-11 22:36
??

greg
2018-02-11 22:36
The bug I think that is happening and I tried to fix. Is only if you are using a stable DRP against the master UX.

greg
2018-02-11 22:36
That is what your image was showing.

greg
2018-02-11 22:37
It would be nice to see what was in your `Info and Preferences` page or `dr-provision --version`

greg
2018-02-11 22:37
If you used `install.sh` from tip, it will still grab stable.

2018-02-11 22:39
I am trying again. Probably take 15 min

2018-02-11 22:40
Basically, I rm /var/lib/dr-provision and then follow this again. http://provision.readthedocs.io/en/stable/doc/install.html

2018-02-11 22:42
I originally had stable on this box. I assumed following these instructions after the remove, would basically be like doing a fresh install.

greg
2018-02-11 22:42
I think so.

2018-02-11 22:43
I am following that same procedure now. I HOPE i didn't waste your time'

greg
2018-02-11 22:49
no

greg
2018-02-11 22:50
The big thing is that to get tip, you have to explicitly ask for it if you are using install.sh.

greg
2018-02-11 22:50
I needed to fix the UX bug.

2018-02-11 22:53
I may have downloaded the tip and then ran the following command to install: sudo ./install.sh --force install

2018-02-11 22:53
I didn't see the switch for the version. So now i just ran the following command: sudo ./install.sh --force=true --version=tip install

greg
2018-02-11 22:54
hmm - I think it should be --drp-version=tip

greg
2018-02-11 22:54
nvm - either works now.

greg
2018-02-11 22:54
well for tip install.sh

greg
2018-02-11 22:54
:slightly_smiling_face:

2018-02-11 23:09
@rackneng I can now setup a subnet!!

greg
2018-02-11 23:12
:slightly_smiling_face:

shane
2018-02-11 23:18
@greg - I added `--version` because I had documented it as ... `--version` ... but I left the `--drp-version` flag for backward compatibility ... :slightly_smiling_face:

shane
2018-02-11 23:18
@MattyBoy4444 - I'd also suggest switching the Doc version to `latest` - not the `stable` docs ...

2018-02-12 00:05
Thanks for all the help. It is up and running. Now, if I could get the damn Up Squared Intel UEFI board to net boot. I get the following error from tftp: TFTP: lpxelinux.0: transfer error: sending block 0: code=8, error: User aborted the transfer

2018-02-12 00:05
Any suggestions. ARG!!!!

greg
2018-02-12 01:48
firewall?

2018-02-12 03:09
Na. They are on same.subnet

2018-02-12 03:10
Connected to same switch

2018-02-12 03:11
I saw there was some uefi issues in January. Has these been resolved and are the changes in the tip?

greg
2018-02-12 03:12
They are in tip. Firewall on DRP endpoint

2018-02-12 03:22
It is a clean install of Ubuntu server 16.04 with No extras/lamp

greg
2018-02-12 03:32
Some in the community have had issues with iptables putting in tftp blocking rules.

greg
2018-02-12 03:33
While not your current issue, a future issue will be that you need to unset the bootfile in the subnet (not needed in tip), because lpxelinux.0 doesn?t support uefi.

lae
2018-02-12 13:17
https://github.com/digitalrebar/provision/pull/684 I'm guessing the "DRP freezes up issue" I'm seeing is related to this? lol

daniel.bernier
2018-02-12 13:32
hi anybody can explain why all the UX pages work perfectly EXCEPT for ?machines? which since yesterday stays at ?loading machines?

zehicle
2018-02-12 13:54
@daniel.bernier if you are on the github URL then you may have hit a bug w/ a new feature. Login to https://portal.rackn.io -> that version of the UX is more stable

zehicle
2018-02-12 13:55
if you know how to look at your browser's dev tools, it would be helpful to know which network call(s) are failing on your system

zehicle
2018-02-12 13:56
@daniel.bernier do your machine run's gohai to get inventory information? that could also be the issue. the new code shows machine inventory values on the machines page

greg
2018-02-12 14:50
Yes @lae

greg
2018-02-12 14:51
Likely. Working to cut 3.7 soon

daniel.bernier
2018-02-12 15:18
@zehicle thanks switched to http://portal.rackn.io and issue was cleared. As per Gohai, only runs as part of discovery yet ? haven?t played to much with it yet. Havin? a ball with workflows right now :smile:

lae
2018-02-12 15:29
is http://portal.rackn.io going to be set as the default for drp stable at least?

shane
2018-02-12 15:29
@lae yes

shane
2018-02-12 15:29
on v3.7.0 release that will be the default redirect and is the "production" UX endpoint

lae
2018-02-12 15:30
got it, that way it makes sense

lae
2018-02-12 15:30
and I guess tip would stay the same?

shane
2018-02-12 15:30
yep - a little bit of a chicken-and-egg issue getting the versioned UX Endpoints in place and ready to make the switch in the DRP side

greg
2018-02-12 15:31
tip will still point to portal. You can change portal to latest to get the better edge.

lae
2018-02-12 15:31
https://github.com/digitalrebar/provision/issues/617 also this is affecting me as well, downgraded drpcli to 3.2.1 and it successfully changes it on a DRP 3.6.0 server

shane
2018-02-12 15:32
going forward; the github UX Endpoint will always be the "most recent (master) ... and probably a lil bit unstable"

shane
2018-02-12 15:35
we'll control UX updates via the following flow: http://rackn.github.io - latest master; unstable http://latest.rackn.io - will be like DRP "tip" - generally stable latest features http://portal.rackn.io - production released version

zehicle
2018-02-12 21:05
has suspended the Gitter & IRC synchronization with this channel

mchill
2018-02-13 06:39
has joined #community201802

tsahiduek
2018-02-13 07:31
has joined #community201802

tsahiduek
2018-02-13 12:40
Hey, I?m new to digital rebar and I have a question regarding workflows. From what I understand different profiles can have different workflows for different Bare-metal installation types (please correct me if I?m wrong). The thing I don?t understand is how to determine which machine will ?pxe-boot? to the right profile? How do I make the connection between a machine that just booted-up snd the appropriate profile? Thanks

greg
2018-02-13 13:38
@tsahiduek - the profile containing the workflow you want needs to be added to the machine?s profile list.

greg
2018-02-13 13:39
That is the simple answer. The question is how and when to do that.

greg
2018-02-13 13:40
There are a lot methods for that. One is manually from the UX or CLI. Another is terraform for a ?grab and go? style of operations. Another is to write a stage/task set to classify the node as it goes through discovery. The workflow can be modified ?inflight? during the discovery workflow.

greg
2018-02-13 13:41
This where your use and goals for deployment and operation come in to help you make that decision.

tsahiduek
2018-02-13 14:08
I want to automate (not via the UI) the process of installing bare metal. I want to create profile for each ?type of installation? - for example: centos 7 for DB cassandra, ubuntu 16.04 for openstack etc? How do I identify the machine that just ?PXE booted? to appropriate profile? I hope I?m clear about what I?m trying to do?

shane
2018-02-13 14:15
@mchill $welcome

2018-02-13 14:15
Digital Rebar community welcome information is here > http://rebar.digital/community/welcome.html

shane
2018-02-13 14:17
@tsahiduek - at some point, you have to be able to classify your systems - if you can do that based on CPU/Memory/Disk - then you can use the Gohai Inventory components (except for disk) ... and you can write a Param back to Digital Rebar Provision (DRP) Endpoint that has that classification - then in later Stages in the Workflow - you can use that Param to "do something" specific with the classification

shane
2018-02-13 14:17
you can also write a Stage (that uses a Task and Template) to call out to a DCIM/Asset Management system of some sort and ask it what the system should be (maybe based on MAC address, Serial Number ... or something else)

shane
2018-02-13 14:18
again - you'd then tag your Machine with a Param with the classification info and adjust your workflow accordingly

shane
2018-02-13 14:19
if you use the Stage/Task/Template route - note that any queries made to your DCIM/Asset Management will be **from** the Machine being provisioned

shane
2018-02-13 14:20
in some environments, this will not work - as the provisioning networks do not have access based on security policy - in which case you might need to write a Plugin for the DRP Endpoint, to make the query on behalf of the Machine - so you can control the security aspects

tsahiduek
2018-02-13 14:21
Thanks for the detailed answer. I?ll try to go in with the ?DCIM way? Thanks :slightly_smiling_face:

greg
2018-02-13 15:42
@lae - hit me up when you can. I want to move all the partman directives into the part template. I think this will hit you the most.

greg
2018-02-13 15:42
Bug #53 in community content.

wdennis
2018-02-13 16:37
@greg ^^^ nice - would affect me as well (but I asked for it)

greg
2018-02-13 16:41
yeah - I figured. :slightly_smiling_face:

spector
2018-02-13 16:41
Community - I sent out a Monthly RackN Newsletter yesterday and if you did not receive and would like too, please ping me. I do this 1x a month and it is pretty short but highlights events we are attending and key themes; this month was Cobbler. I understand that Digital Rebar is an open source community and I am not going to market to you (I promise) but just wanted to make everyone aware of this newsletter in case you are interested. Return to your open source goodness?

lae
2018-02-13 17:27
@greg I was actually meaning to talk about that

lae
2018-02-13 17:27
(not the issue but about partman directives)


greg
2018-02-13 17:30
yikes ? oops

lae
2018-02-13 17:30
We're currently using our own debian-9 and ubuntu-16 stages with that change as well as a change specifying a repo mirror (since it doesn't look like the Repo stuff is in community-content - and I haven't had a chance to look into it)

lae
2018-02-13 17:30
Anyway, I remembered why I may not have submitted that PR

lae
2018-02-13 17:31
I wasn't sure if we should have, e.g., "part-seed-X" for d-i templates and "part-ks-X" for ks templates

lae
2018-02-13 17:32
I had named it "part-scheme-X" to reduce the number of files to maintain and because we can check for OS in the part-scheme - this is actually how I was doing part templates in cobbler

lae
2018-02-13 17:33
but, apart from a default template, in my experience it's not very common to use a single template for both centos/debian

greg
2018-02-13 17:33
I thought about it. You have to manage profiles either way. So I let it be a single variable. Part-scheme is a bug above

greg
2018-02-13 17:34
the `select-kickseed` parameter is used in the default bootenvs to override. If you have your own bootenvs, then it is less of an issue.

lae
2018-02-13 17:36
@lae uploaded a file: https://rackn.slack.com/files/U54E4SD4G/F98J9QR29/image.png and commented: actually never mind, turns out I didn't check for OS within the templates during our cobbler days, lol

greg
2018-02-13 17:36
:slightly_smiling_face:

lae
2018-02-13 17:37
want me to submit that PR?

greg
2018-02-13 17:38
already took it. :slightly_smiling_face:

greg
2018-02-13 17:38
tip content already updated

lae
2018-02-13 17:38
wew, kk

greg
2018-02-13 17:41
@lae - the net of this was meant for me to warn you that I?m going to move all the partman refs inside the scheme tmpl. This way people can deal with their own gpt or not. Lvm or not.

lae
2018-02-13 17:42
hm

lae
2018-02-13 17:42
moving *all* of `d-i partman` is going to introduce a lot of duplication

lae
2018-02-13 17:43
I can understand moving some of them that actually configure partitioning, but several of them are to avoid prompts

greg
2018-02-13 17:43
yeah the question is one of ordering.

lae
2018-02-13 17:43
ordering of the commands? it shouldn't matter I'm pretty sure

greg
2018-02-13 17:44
@greg uploaded a file: https://rackn.slack.com/files/U02DGQYK1/F98EC1UKW/-.yaml and commented: This is it right now.

greg
2018-02-13 17:44
It seems to have for others in the community.

wdennis
2018-02-13 17:44
@lae I'm emprically finding out that the partman directive odering does count... (it seems anyways)

greg
2018-02-13 17:45
The last four could live outside. They are common and always that value I think.

wdennis
2018-02-13 17:45
And for LVM vs. not, or md raid with LVM on top, directives differ

lae
2018-02-13 17:45
like, putting just the labels inside part-scheme causes issues?

wdennis
2018-02-13 17:45
No, the directives may need to change depending on what partitioning scheme is used

wdennis
2018-02-13 17:46
makes sense to have them all in one "container" (template)

wdennis
2018-02-13 17:46
instead of split between two

wdennis
2018-02-13 17:47
I use `select-kickseed` to template non-partitioning directives

wdennis
2018-02-13 17:48
that may differ among my builds

lae
2018-02-13 17:48
I'm just not a big fan of a lot of boilerplate, but I guess if people are actually having issues with the current layout, I can't really quite complain

lae
2018-02-13 17:48
(and I'm just slightly surprised I haven't run into any issues with the part-schemes I've written)

wdennis
2018-02-13 17:48
then `part-scheme` should have all of the partitioning in it, enabling "mix-n-match" between the two

greg
2018-02-13 17:49
I?m in favoring of localizing common stuff. I need to think about this some more.

greg
2018-02-13 17:49
well the problem becomes one of depth of nesting.

wdennis
2018-02-13 17:49
Does anyone know if you add d-i directives that aren't needed/call on, if it messes up the installer automation?

greg
2018-02-13 17:50
not sure

wdennis
2018-02-13 17:50
me either

lae
2018-02-13 17:50
I don't usually need several of the boilerplate in net-seed - specifically LVM, and stuff provisions fine

greg
2018-02-13 17:50
probably depends upon the directive and its use.

wdennis
2018-02-13 17:51
Also, was thinking of if all partitoning in a single template, then community could build up a library of partitoning templates that could be easily plugged into the default DRP-provided kickseed, or a customized one

wdennis
2018-02-13 17:52
(also need to think about/do the same for RedHat-family distros, using kickstart syntax)

greg
2018-02-13 17:53
need to think about this and let it cook a little. I?m not going to change anything at this instant. I think.

wdennis
2018-02-13 17:53
OK, fair enough

greg
2018-02-13 17:54
I think I can environ some ways to get everybody to where they want to go, but want it bake a little more.

wdennis
2018-02-13 17:55
I just had to fall back to Cobbler / Clonezilla imaging to do deploys, b/c DRP preseed templates were erroring out on platforms I need to install with specific partitioning requirements...

wdennis
2018-02-13 17:56
I was thinking it was the interaction between the `d-i partman*` directives in the net-seed.tmpl (or custom version thereof) and the directives I put in my custom `part-scheme`-called template

greg
2018-02-13 17:57
ok

wdennis
2018-02-13 17:58
It's basically not really a DRP-related problem (except of the `d-i partman*` directives split, if that's the issue) but more of a "how to do a specific partitioning recipe in preseed" problem

wdennis
2018-02-13 18:00
I just don't want to have to have pairs of `select-kickseed` and `part-scheme` templates that I need to keep track of

wdennis
2018-02-13 18:01
Hence the ask to combine all `d-i partman*` stuff into one template

gbuehler
2018-02-13 18:32
for a new deployment would the recommended path be to use the dockerized DRP?

shane
2018-02-13 18:34
@gbuehler - I hope you are not referring to the old Digital Rebar ver2 version? Digital Rebar Provision (DRP) ver3 is NOT containerized as distributed (but can easily be built in to a container)

shane
2018-02-13 18:38
We look forward to seeing you at the V011 meetup in 20 mins or so ... details: https://www.meetup.com/digitalrebar/events/247321385



lae
2018-02-13 20:03

lae
2018-02-13 20:05
@wdennis hold up - when you made custom `part-scheme` templates are you sure you weren't hitting this bug? https://github.com/digitalrebar/provision-content/pull/56/files

lae
2018-02-13 20:06
where the part-scheme wouldn't have been loaded if it wasn't named `part-seed-$scheme`

lae
2018-02-13 20:06
I had fixed it in a local template a long time ago

wdennis
2018-02-13 20:06
@lae Yeah, figured that one out...

shane
2018-02-13 20:08
- v3.7.0 is going to be cut in the next day or two ... if any of you have extra cycles to test `tip` - please do so - there are a LOT of changes, bug fixes, and enhancements ... we appreciate any additional testing and verification in different environments prior to cutting the v3.7.0 release - THANKS !!

gbuehler
2018-02-13 20:12
i think @greg already captured this, but pinning major versions in docker hub would be super cool

lae
2018-02-13 20:13
there are docker image releases?

greg
2018-02-13 20:13
strangely enough there kinda is.

gbuehler
2018-02-13 20:14
i mean, as long as you love living off `master` there are


greg
2018-02-13 20:14
but like @gbuehler mentions it is rebuilt when I move tip. So not quite master, but close.

lae
2018-02-13 20:14
oh

lae
2018-02-13 20:14
i've just been living through my ansible stuff lol

greg
2018-02-13 20:18
More thinking for me todo.

lae
2018-02-13 20:54
while I'm still awake

lae
2018-02-13 20:54
I have the following stage

lae
2018-02-13 20:54
``` [lae@yuzu fireeye-content]$ cat content/stages/labs-debian-9.yml --- Name: "labs-debian-9-install" Description: "Debian 9 install stage for FireEye Labs environment." BootEnv: "labs-debian-9-install" RunnerWait: true Tasks: - "ubuntu-drp-only-repos" - "enforce-public-key-authentication" - "default-user-access" Meta: icon: "download" color: "yellow" title: "FireEye Content" ```

lae
2018-02-13 20:55
I removed change-stage recently after parsing through some chat logs and community-content commit history, but I still have the issue where `drpcli processjobs` hangs after all tasks complete

lae
2018-02-13 20:56
i see all the completed jobs in the UI

lae
2018-02-13 20:57
but if I run `drpcli machines update $UUID '{ "Runnable": true }'` externally, it exits and finishes the install

greg
2018-02-13 20:58
Not sure why that frees it. Two things:

greg
2018-02-13 20:59
1 set runnerwait to false will cause the runner to exit when done with all tasks in the stage assuming no workflow changes stage on you

greg
2018-02-13 21:00
2 if this is part of a workflow, use the stop action instead of success for the last stage you want to run during the seed file

greg
2018-02-13 21:01
Stop in the workflow will cause the runner to exit on the stage change.

lae
2018-02-13 21:01
` labs-debian-9-install: "complete-nowait:Success"`

lae
2018-02-13 21:01
oh

lae
2018-02-13 21:01
I see

lae
2018-02-13 21:01
I had also tried setting `RunnerWait: false` previously but it hadn't helped (and noticed it was true in the community repo anyway)

greg
2018-02-13 21:02
Well that should have worked. I think.

greg
2018-02-13 21:03
Does complete-nowait have runnerwait true?

lae
2018-02-13 21:04
complete-nowait is in community repo and it has runnerwait false last i checked

lae
2018-02-13 21:04
(which is the point of nowait after all)

lae
2018-02-13 21:04
let me try again

lae
2018-02-13 21:06
while i'm at it guess I'll test some things in drpcli tip

lae
2018-02-13 21:48
yeah so setting RunnerWait: false in the stage itself had no effect, I needed to update change-stage/map to Stop

lae
2018-02-13 21:51
also my yaml issue and the change stage error both appear resolved for me on tip

shane
2018-02-13 21:51
woot woot !!

shane
2018-02-13 21:51
@lae ... go to sleep - you're making me tired just thinking about you being up still :slightly_smiling_face:

lae
2018-02-13 21:53
I'm like, not sleepy

shane
2018-02-13 21:53
Yay for energy drinks !!

lae
2018-02-13 21:53
:joy:

lae
2018-02-13 21:58
https://aur.archlinux.org/packages/drpcli-tip and for the arch-initiated users I guess I made a drpcli-tip PKGBUILD

markw
2018-02-14 16:06
has joined #community201802

shane
2018-02-14 19:17
@markw $welcome

2018-02-14 19:17
Digital Rebar community welcome information is here > http://rebar.digital/community/welcome.html

wdennis
2018-02-14 19:53
I updated the 'impi' plugin to latest (tip) on my v3.6.0 install, and it seems to have hung the server... I killed it and restarted (running isolated), and this is what I see:

wdennis
2018-02-14 19:53

shane
2018-02-14 19:53
yep - that will do it

shane
2018-02-14 19:54
`tip` plugins are `plugin-v2` style

shane
2018-02-14 19:54
v3.6.0 stable is `plugin-v1` style - radically different

wdennis
2018-02-14 19:54
Why did it let me update?

shane
2018-02-14 19:54
completely and utterly non-compatible

shane
2018-02-14 19:54
because

wdennis
2018-02-14 19:54
And I would know that how?

shane
2018-02-14 19:55
what UX endpoint were you using ?

wdennis
2018-02-14 19:55

wdennis
2018-02-14 19:55
Same as ever...

shane
2018-02-14 19:56
our new `stable` UX endpoint going forward, which you should use instead of that is: https://portal.rackn.io

wdennis
2018-02-14 19:57
I thought that's with the new v3.7 when released?

greg
2018-02-14 19:57
In this channel, I said NOT to do that.

shane
2018-02-14 19:57
the default redirect in v3.7.0 will switch to that

wdennis
2018-02-14 19:57
@greg must have missed that...

shane
2018-02-14 19:57
we're still working out all the kinks between Feature Flags, UX Endpoint Version, DRP Endpoint Version ... and applying appropriate guardrails on those things

greg
2018-02-14 19:57
anyway, it does. It will be resolved on v3.7.0 stable - ux, plugins, and all will have better trigger to prevent bad behaviour.

wdennis
2018-02-14 19:57
good to hear

wdennis
2018-02-14 19:59
No way to roll back? (Or if v3.7 in the next few days, I could just wait and upgrade...)

greg
2018-02-14 19:59
or move to tip drp

wdennis
2018-02-14 20:00
Can upgrade from current tip to future stable?

shane
2018-02-14 20:00
yep

wdennis
2018-02-14 20:01
I usually do `tools/install.sh --isolated --upgrade install` -- what do i need to add to get `tip`?

shane
2018-02-14 20:05
depends - do you have a copy of the `stable` or `tip` install.sh script ? (note: `tip` has `--version` flag - don't look at the usage, look at the `case` statement)

wdennis
2018-02-14 20:05
prolly `stable`

shane
2018-02-14 20:05
I fixed a bit of the version stuff in there - but it's only in `tip` - after v3.7.0 publishes, it'll be in `stable`

shane
2018-02-14 20:06
I'd suggest: ```curl -s get.rebar.digital/tip -o install.sh bash ./install.sh install --isolated --upgrade --force --version=tip ```

wdennis
2018-02-14 20:08
Cool, done

wdennis
2018-02-14 20:10
On `v3.6.0-tip-149-4d49d65825eaab25ce0e3bfde8871d3ee05337db`

wdennis
2018-02-14 20:11

wdennis
2018-02-14 20:27
In a machine, was `select-kickseed` a `string` before, but now an `object`?

greg
2018-02-14 20:27
it should always be a string.

wdennis
2018-02-14 22:10
@greg looks like it's an object now in the UX...

wdennis
2018-02-14 22:10

wdennis
2018-02-14 22:11
And on the last install, it did not seem to use my custom `necla-ubu-seed.tmpl`

greg
2018-02-14 22:14
Is the `select-kickseed` in the parameters list in the UX?

greg
2018-02-14 22:15
What version is the content? I think select-kickseed is in tip content as a parameter.

wdennis
2018-02-14 22:15
It is not.

wdennis
2018-02-14 22:16
drp-community-content is at v1.1.0 in "Content Packages"

greg
2018-02-14 22:16
That is really old.

wdennis
2018-02-14 22:16
Version inspections says: ```drp-community-content content Major Upgrade from v1.1.0 to v1.5.0```

wdennis
2018-02-14 22:17
However... I can't upgrade, the upgrade button is greyed in Content Packages

wdennis
2018-02-14 22:18
(a bug reported to @zehicle that I believe he said he has a fix for...)

greg
2018-02-14 22:18

wdennis
2018-02-14 22:19
aha

wdennis
2018-02-14 22:19
OK, fixed, thanks

wdennis
2018-02-14 22:20
I'll be glad to get back to stable when it's 3.7.x and use the stable UX

marcelo
2018-02-15 15:14
has joined #community201802

shane
2018-02-15 16:04
@marcelo $welcome

2018-02-15 16:04
Digital Rebar community welcome information is here > http://rebar.digital/community/welcome.html

marcelo
2018-02-16 01:46
Thank you @shane I can't wait to start automating baremetal builds.. looking to integrate it with Terraform and Ansible for configuration management

sevans
2018-02-16 21:22
has joined #community201802

zehicle
2018-02-19 05:43
@sevans $welcome !

2018-02-19 05:43
Digital Rebar community welcome information is here > http://rebar.digital/community/welcome.html

zehicle
2018-02-19 05:44
slackbot help

2018-02-19 05:44
Available Commands: FAQ, $FAQ, $faq, $KRIB, $krib, $meetup, $Meetup, $issue, $Issue, $issues, $Issues, $quickstart, $QuickStart, $welcome, $conduct, $code-of-conduct

marcelo
2018-02-19 06:54
Howdy all, question... If I want to use RackN DR Provision at a site with no internet connectivity how can I access the features which require a RackN login?

zehicle
2018-02-19 07:23
@marcelo Provision does NOT require connectivity to function - it does not connect the the internet. We (RackN) can help you adjust templates so the the O/S installs only use local resources too. The UX is a cross-origin application that uses your browser to connect between the end-point and our SaaS. In that way, all management is actually behind your firewall. DR Provision is not "going through" the public internet in any way. We also offer a license of the UX that also runs on-prem ("air gap") so connection is needed at all.

marcelo
2018-02-19 07:26
Ok Thanks for the clarification @zehicle .. let me run a few builds and simulations and see how we go..

zehicle
2018-02-19 16:12
we're going to do a short video explaining this - it's a common question and our approach is unique since it's based with an on-prem support mentality with SaaS to support management.

michael.harp
2018-02-19 17:05
has joined #community201802

abrinded
2018-02-19 17:46
has joined #community201802

chermack
2018-02-19 18:56
Michael, Andrew welcome aboard

detiber
2018-02-20 03:55
Just wanted to say great work on the UEFI support in the latest tip release, I am now able to boot two problematic machines using ipxe.efi that do not boot with the stable release!

shane
2018-02-20 03:56
Awesome!

vlowther
2018-02-20 17:07
@detiber Which machines are they? Got any more problematic ones?

detiber
2018-02-20 17:09
@vlowther A liva x mini-pc and an asus sabertooth x79 system where the ones that were giving me problems before. I have others that may be problematic, but they are also arm64, so I haven't started tackling that yet :slightly_smiling_face:

vlowther
2018-02-20 17:39
Well, arm64 will be fun in more ways than one. :slightly_smiling_face:

dave.parker
2018-02-20 19:31
has joined #community201802

dave.parker
2018-02-20 19:34
Hi folks. I have a few questions. I'm following the quick start, and using virtualbox guests for both the server and first client. I can boot sledgehammer and discover just fine, but when I try to switch bootenvs to install I get this error: ```Error: ValidationError: machines/0d31f21f-ab03-4fbb-9b19-ba3f445edadb: Can not change bootenv while in a stage unless forced. old: sledgehammer new ubuntu-16-04-install```

dave.parker
2018-02-20 19:34
No amount of forcing from the command line fixes this. If I go into the gui and edit the machine and select the Force checkbox, I can then change the bootenv from the command line, though.

dave.parker
2018-02-20 19:35
However, on the next boot, the machine boots sledgehammer and repeats discovery instead of doing the install as expected. It registers with a new UUID, which I assume is why it comes up as a brand new machine.

dave.parker
2018-02-20 19:35
How is the UUID determined?

shane
2018-02-20 19:39
@dave.parker what DRP Endpoint version are you using? (`drpcli info get | grep version`)

dave.parker
2018-02-20 19:41
"version": "v3.6.0-0-0e5ccf678a3e5b5fdb10f86261247cd28c858ac0"

shane
2018-02-20 19:41
also - make sure you set the Stage with the Install bootenv you'd like to install - do not switch the BootEnv itself directly ... if you are using Stages, you can not change the bootenv - as the Stage references an existing BootEnv

shane
2018-02-20 19:42
we have a LOT of fixes and enhancements in our current `tip` release - which is about to be released in the next 1 or 2 days at v3.7.0 - if this is a non-production scenario - I highly recommend upgrading to the `tip` release

dave.parker
2018-02-20 19:42
So I shouldn't be trying to change the bootenv directly?

shane
2018-02-20 19:42
not if you are using Stages, no

dave.parker
2018-02-20 19:42
Is the quickstart out of date then?

dave.parker
2018-02-20 19:42
Ok, I can upgrade to tip.

shane
2018-02-20 19:43
which version of QuickStart are you using? (`latest`, `stable`, etc) ?


dave.parker
2018-02-20 19:43
So, stable.

shane
2018-02-20 19:44
(upgrade to tip: `curl -s get.rebar.digital/tip | bash -s -- install --isolated --version=tip --upgrade --force` <-- assumes you are using "isolated" install mode, not production)

shane
2018-02-20 19:44
yes - please switch Doc to the `latest` version

shane
2018-02-20 19:44
you can use lower right floating selector for that

dave.parker
2018-02-20 19:44
Gotcha

dave.parker
2018-02-20 19:46
latest still has the same bootenvs command though. Hrm.

dave.parker
2018-02-20 19:46
Anyway, upgrading.

shane
2018-02-20 19:47
I'll run through the quickstart and validate - it probably needs to be updated to say Stages

dave.parker
2018-02-20 19:50
:thumbsup:

shane
2018-02-20 20:11
@dave.parker - yes, change the `drpcli machines bootenv ...` command to `drpcli machines stage ... ` - it is otherwise identical ... long story short ... with the `defaultStage` preference set, this means a machine will use the Stage system. Therefor, you have to use Stages to change Machine between bootenvs ... If you leave the `defaultStage` set to `none`, then the Stages won't be enabled (effectively) - and the use of `bootenv` is correct. I'll clean up the QuickStart documents around that this afternoon.

dave.parker
2018-02-20 20:13
Ahh ok

dave.parker
2018-02-20 20:13
Thank you!

dave.parker
2018-02-20 20:13
I will give that a try once I'm done reloading install ISOs

dave.parker
2018-02-20 20:27
Ok, so now I have a machine discovered, and I was able to set the stage to the ubuntu install, and everything looks good. But when I reboot the system and it tries to PXE boot, dr-provision crashes:

dave.parker
2018-02-20 20:27
```Tried to access unlocked resource tasks panic: Tried to access unlocked resource tasks goroutine 103 [running]: log.Panicf(0xe121f9, 0x24, 0xc423ffccf8, 0x1, 0x1) /home/travis/.gimme/versions/go1.9.linux.amd64/src/log/log.go:337 +0xda http://github.com/digitalrebar/provision/backend.(*DataTracker).lockEnts.func1(0xdf96b1, 0x5, 0xc423ee3b30) /home/travis/gopath/src/github.com/digitalrebar/provision/backend/dataTracker.go:574 +0x143 http://github.com/digitalrebar/provision/backend.(*RequestTracker).stores(0xc4240b43c0, 0xdf96b1, 0x5, 0xc423f4aa58) /home/travis/gopath/src/github.com/digitalrebar/provision/backend/requestTracker.go:128 +0x3e http://github.com/digitalrebar/provision/backend.(*RequestTracker).(github.com/digitalrebar/provision/backend.stores)-fm(0xdf96b1, 0x5, 0x0) /home/travis/gopath/src/github.com/digitalrebar/provision/backend/machines.go:421 +0x3e http://github.com/digitalrebar/provision/backend.(*Machine).Validate(0xc4240707e0) /home/travis/gopath/src/github.com/digitalrebar/provision/backend/machines.go:422 +0x321 http://github.com/digitalrebar/provision/backend.(*Machine).BeforeSave(0xc4240707e0, 0x12e11c0, 0xc4240707e0) /home/travis/gopath/src/github.com/digitalrebar/provision/backend/machines.go:534 +0xb4 http://github.com/digitalrebar/provision/vendor/github.com/digitalrebar/store.save(0x12e78e0, 0xc423e46d80, 0x12e11c0, 0xc4240707e0, 0x823d4b, 0xc420248b68, 0x1ad71e8) /home/travis/gopath/src/github.com/digitalrebar/provision/vendor/github.com/digitalrebar/store/keySaver.go:166 +0x1a4 http://github.com/digitalrebar/provision/vendor/github.com/digitalrebar/store.Save(0x12e78e0, 0xc423e46d80, 0x12e11c0, 0xc4240707e0, 0xc4240707e0, 0x1, 0xc424006510) /home/travis/gopath/src/github.com/digitalrebar/provision/vendor/github.com/digitalrebar/store/keySaver.go:188 +0x49 http://github.com/digitalrebar/provision/backend.(*RequestTracker).Save(0xc4240b43c0, 0x12df2c0, 0xc4240707e0, 0xc424047b60, 0x3, 0x3) /home/travis/gopath/src/github.com/digitalrebar/provision/backend/requestTracker.go:324 +0x1c4 http://github.com/digitalrebar/provision/midlayer.(*DhcpRequest).coalesceOptions.func1(0xc4240387a0) /home/travis/gopath/src/github.com/digitalrebar/provision/midlayer/dhcp.go:330 +0xb98 http://github.com/digitalrebar/provision/backend.(*RequestTracker).Do(0xc4240b43c0, 0xc423ffd6c8) /home/travis/gopath/src/github.com/digitalrebar/provision/backend/requestTracker.go:111 +0xd9 http://github.com/digitalrebar/provision/midlayer.(*DhcpRequest).coalesceOptions(0xc4238140c0, 0xc42405e300, 0xc42409fd10, 0x0) /home/travis/gopath/src/github.com/digitalrebar/provision/midlayer/dhcp.go:274 +0xd6d http://github.com/digitalrebar/provision/midlayer.(*DhcpRequest).buildDhcpOptions(0xc4238140c0, 0xc42405e300, 0xc42409fd10, 0x0, 0xc4240627fc, 0x4, 0x4) /home/travis/gopath/src/github.com/digitalrebar/provision/midlayer/dhcp.go:441 +0x98 http://github.com/digitalrebar/provision/midlayer.(*DhcpRequest).ServeDHCP(0xc4238140c0, 0xc42409e701, 0x4, 0xc42407e148, 0x1) /home/travis/gopath/src/github.com/digitalrebar/provision/midlayer/dhcp.go:779 +0xcb7 http://github.com/digitalrebar/provision/midlayer.(*DhcpRequest).Process(0xc4238140c0, 0x0, 0x0, 0xffffffffffffffff) /home/travis/gopath/src/github.com/digitalrebar/provision/midlayer/dhcp.go:836 +0x85f http://github.com/digitalrebar/provision/midlayer.(*DhcpRequest).Run(0xc4238140c0) /home/travis/gopath/src/github.com/digitalrebar/provision/midlayer/dhcp.go:864 +0x2b created by http://github.com/digitalrebar/provision/midlayer.(*DhcpHandler).Serve /home/travis/gopath/src/github.com/digitalrebar/provision/midlayer/dhcp.go:920 +0x2e5```

dave.parker
2018-02-20 20:28
I must have missed something.

shane
2018-02-20 20:29
@greg and or @vlowther will have to take a look... I'm in a mtg right now....

vlowther
2018-02-20 20:41
looking at it.

vlowther
2018-02-20 20:41
@dave.parker What version of dr-provision is this?

dave.parker
2018-02-20 20:42
"version": "v3.6.0-tip-186-ed32fbb1e324a3e033a55d131d9b067f7442f4d5"

dave.parker
2018-02-20 20:42
I deleted the machine, rediscovered, and then reset the stage to ubuntu-install, and now it's installing the OS as expected. Not sure what the problem was before...

vlowther
2018-02-20 20:43
hm.

vlowther
2018-02-20 20:44
that is in the codepath where the DHCP system wants to update the machine IP address

vlowther
2018-02-20 20:44
Does this machine have multiple nics it can boot off of connected to the same physical network?

dave.parker
2018-02-20 20:45
Nope.

vlowther
2018-02-20 20:47
ok

vlowther
2018-02-20 20:54
I will have a fix out for that issue shortly.

dave.parker
2018-02-20 20:58
Excellent

vlowther
2018-02-20 20:59

dave.parker
2018-02-20 21:11
I have another weird problem. When I do get the install to work, I can't log in. The params `provisioner-default-user` and `provisioner-default-password-hash` default to rocketskates/r0cketsk8ts correct?

dave.parker
2018-02-20 21:12
I tried overriding those in the global profile with my own values (with a hash I generated via mkpasswd) but that doesn't work either.

dave.parker
2018-02-20 21:19
I can boot the system off a regular ubuntu iso and go into rescue mode, get a shell, and see the username I trired to add is there, along with the hash in /etc/shadow. If I reset the password there I can then log in after rebooting it.

dave.parker
2018-02-20 21:19
I guess I'll try just grabbing this hash and putting it in the param

dave.parker
2018-02-20 21:37
Grr, stuck on that DHCP bug again. When that pull request gets approved and merged will I be able to grab the fix by reinstalling tip?

greg
2018-02-20 21:42
@dave.parker yes

greg
2018-02-20 21:42
Victor?s fix should be in tip in about 30 minutes

dave.parker
2018-02-20 21:42
Awesome, thank you.

dave.parker
2018-02-20 21:47
Oh hey, manually changing the IP of the machine to the one dhcp is trying to give it fixes the issue and it boots into the install now. Sweet.

vlowther
2018-02-20 21:48
hmmm... manually changing it whwere?

dave.parker
2018-02-20 21:49
In the GUI I went to machines, and then clicked on the machine name, edited it, and put in the IP that was under the "leases" tab for this machine. Which was different than the one already there.

dave.parker
2018-02-20 21:50
That seemed to have fixed it? I think I also cleared out all the leases first though.

vlowther
2018-02-20 21:51
ok

dave.parker
2018-02-20 21:52
Yeah, I cleared out all the leases, tried to boot the machine again, and it failed again, but a new lease had popped up in the GUI. So that's the one I grabbed.

vlowther
2018-02-20 21:52
ok

dave.parker
2018-02-20 21:52
I'm old school, I basically kept trying bigger hammers until one of them "fixed" it...

vlowther
2018-02-20 21:53
I generally don't recommend clearing out all the leases

vlowther
2018-02-20 21:53
not unless you are going to whack all the machine records as well.

dave.parker
2018-02-20 21:53
Ah ok.

dave.parker
2018-02-20 21:53
Good to know.

dave.parker
2018-02-20 21:54
I will not use that particular hammer again if I can help it. :smile:

vlowther
2018-02-20 21:54
ya, it is just like dropping the lease database from any other DHCP server.

vlowther
2018-02-20 21:57
We keep the option around for troubleshooting purposes

vlowther
2018-02-20 21:59
basically, the Address field on a machine is the one we expect it to PXE from

dave.parker
2018-02-20 21:59
Ok

dave.parker
2018-02-20 22:02
FYI, the hash I pulled from the shadow file after manually changing the password (then threw back in the `provisioner-default-password-hash` param) worked fine. Not sure what was wrong with the hash provided as a default or the one I generated with mkpasswd.

shane
2018-02-20 22:02
@dave.parker you can convert a Lease to a Reservation if you want the machine to retain a given IP addr over time

dave.parker
2018-02-20 22:03
Ok

shane
2018-02-20 22:04
we currently don't have a `drpcli` command to do this, but I wrote a dirty bash script which does - see: https://rackn.slack.com/files/U6QFVRJNB/F9AR6H56G/lease2res_sh_-_Lease_to_Reservation_conversion_script.sh

dave.parker
2018-02-20 22:21
Cool

shane
2018-02-20 22:22
You can change a Reservation ... well ... not really ... but you can convert the Lease to a new IP address Reservation from the DHCP assigned

shane
2018-02-20 22:24
however, that script doesn't add that ability - and it's not exactly a "supported" feature at the moment - there are some side effects of the Machine Object not getting updated correctly - I've filed a bug fo this internally (https://github.com/digitalrebar/provision/issues/737)

shane
2018-02-20 22:25
right now, you'd have to delete the Reservation, and re-create the reservation with a new IP address ... since the IP Address field is the index for the Reservation object

vlowther
2018-02-20 22:59
If the HardwareAddrs field on the Machine is populated with the MAC addresses of the nics on the machine, then things shoudl work as you expect them to.

shane
2018-02-20 23:00
deleting the Reservation and creating a new one w/ the MAC addr and new IP addr works

shane
2018-02-20 23:00
but the issue is the Machine Object "Address" field doesn't get updated - it reflects the original Lease address

dave.parker
2018-02-20 23:00
I just pulled the latest version of tip since I saw that pr got merged.

vlowther
2018-02-20 23:02
@shane that is becaise machine.Address is initially populated by Sledgehammer when the machine is created, and the codepath in the DHCP subsystem that would update it if HardwareAddrs on the machine was populated was broken until 30 mins ago.

vlowther
2018-02-20 23:03
or so. :slightly_smiling_face:

dave.parker
2018-02-20 23:27
The fix works for me.

dave.parker
2018-02-20 23:27
```dr-provision2018/02/20 23:26:06.870664 [34:14]dhcp [ warn]: /home/travis/gopath/src/github.com/digitalrebar/provision/midlayer/dhcp.go:328 [34:14]bcc675dc-e333-49cd-8c86-eb17419a783b: Updating machine 10.10.10.28 address from 10.10.10.21 to %!s(MISSING)```

dave.parker
2018-02-20 23:28
Although that message looks wrong. It updated the machine from .28 to .21, not from .21 to `%!s(MISSING)`

dave.parker
2018-02-20 23:28
:smile:

dave.parker
2018-02-20 23:28
Anyway, it booted and is reinstalling which is what I expect to see.

dave.parker
2018-02-20 23:28
Thanks for your help today. I appreciate it.

vlowther
2018-02-20 23:35
bah, format string typos. :slightly_smiling_face:

amit.handa
2018-02-21 14:49
has joined #community201802

spector
2018-02-21 14:57
Welcome Amit

dave.parker
2018-02-21 18:30
Hi folks. Another question. The docs talk about setting up the DHCP server on a subnet to just be a relay, but there's no examples of how to do that that I can find.

dave.parker
2018-02-21 18:32
I'm not even sure that's what I want though. So I guess let me state my use case. I have a network that already has a DHCP server running on it, and I can't start another one that will compete with it, and I can't create another network or vlan, so I have to just play nicely with what's there. I *can* have that server forward to another server, or if it's possible I can just do an iPXE boot directly to the provisioning server. But I'm not sure which would be better or how to set up the dr-provision DHCP server to not stomp on the other one.

vlowther
2018-02-21 18:44
hm

vlowther
2018-02-21 18:44
We operate normally as a target of a DHCP relay.

vlowther
2018-02-21 18:45
just create a subnet that covers the IP address the relay lives in.

vlowther
2018-02-21 18:45
and hten have the relay point to us.

vlowther
2018-02-21 18:49
The other options for coexisting with other DHCP infrastructure are to just point next-server in your current DHCP infrastructure to us (in which case you wouldn't create any subnets in Digital Rebar, and lety your current DHCP infrastructure do all the heavy lifting), or you can use us as a ProxyDHCP server.

dave.parker
2018-02-21 18:49
If I'm on the same subnet I keep getting messages about how there "might be another DHCP server on this network" (there is!) and I just get conflicts. I guess my question is how do I configure the subnet so it doesn't try to be an active DHCP server and answer all requests?

dave.parker
2018-02-21 18:49
Ahhhh

dave.parker
2018-02-21 18:49
Ok, I think that's the setup I want.

dave.parker
2018-02-21 18:50
So I don't configure a subnet at all and just have the existing DHCP server pass to me via next-server? I think that's what I want.

vlowther
2018-02-21 18:50
Yep.

dave.parker
2018-02-21 18:50
Ok let me give that a whirl.

dave.parker
2018-02-21 18:50
Thanks.

vlowther
2018-02-21 18:51
Going that rout you will need to configure next-server and bootfile appropriately for the nodes you will be booting

dave.parker
2018-02-21 18:52
Ok

vlowther
2018-02-21 18:55
for legacy BIOS nodes, you can just use lpxelinux.0

vlowther
2018-02-21 18:56
for UEFI systems, we use ipxe, which gets a little more complicated.

vlowther
2018-02-21 18:57
http://ipxe.org/howto/chainloading has the docs on setting up ipxe support.

vlowther
2018-02-21 18:58
anf http://ipxe.org/howto/dhcpd#pxe_chainloading has ISC DHCPD specific instructions

dave.parker
2018-02-21 18:59
Excellent

vlowther
2018-02-21 19:02
The 4 files we use for bootloading are: lpxelinux.0 <-- legacy BIOS support by default. No special config needed ipxe.pxe <-- legacy BIOS support using ipxe. You need to "break the loop" as described in the first link in the IPXE docs.

vlowther
2018-02-21 19:03
ipxe.efi <-- UEFI booting using ipxe. You will alkso need to break the loop.

vlowther
2018-02-21 19:04
default.ipxe <-- the filename dhcpd should send when ipxe is loaded

vlowther
2018-02-21 19:05
Our DHCP server handles figuring out which file to serve behind the scenes, but for others you need to configure them upfront.

dave.parker
2018-02-21 19:11
:thumbsup:

dave.parker
2018-02-21 19:12
So I don't need to do anything special to get UEFI machines to boot if I use the integrated DHCP server?

dave.parker
2018-02-21 19:12
That's sweet.

dave.parker
2018-02-21 19:12
I can use that config in our core sites, which will be good.

vlowther
2018-02-21 19:13
You can also run us in ProxyDHCP mode -- create subnets for the address ranges you want to use and set Proxy to true.

vlowther
2018-02-21 19:14
The tradeoff is that I have not tested to see if it works through a DHCP relay.

vlowther
2018-02-21 19:15
and you have to relay your DHCP traffic to us as well as your usual DHCP servers.

vlowther
2018-02-21 19:15
Oh, and we don't handle UEFI arm boxes (32 or 64 bit) yet. :slightly_smiling_face:

dave.parker
2018-02-21 19:16
That's not a problem for me thankfully.

amit.handa
2018-02-22 04:12
I am using 3.2.6 drp, trying to learn it by pxe-booting virtualbox vm on the laptop. I am receiving pxe-e32: tftp open timeout on server logs, I am getting [0:153]TFTP: lpxelinux.0: transfer error: read udp [::]:33657: i/o timeout I am able to do tftp <ip> $get lpxelinux.0 successfully. How to debug it further ? Thanks,

zehicle
2018-02-22 05:51
I'm assuming you mean v3.6. Did you set a --static-ip address? if so, what is it.

zehicle
2018-02-22 05:52
Also, did you set your vboxnet0 subnet?

zehicle
2018-02-22 05:53
any chance you put it in a container or have a firewall blocking traffic?

amit.handa
2018-02-22 07:30
hi

amit.handa
2018-02-22 07:30
managed to get it work

amit.handa
2018-02-22 07:31
I had specified incorrect subnet settings (next-server, specifically)

amit.handa
2018-02-22 07:31
I used wireshark to debug it

amit.handa
2018-02-22 07:31
sorry, I am learning the ropes of setting up kubernetes cluster via digital rebar

amit.handa
2018-02-22 07:32
yes, I mean v3.6

amit.handa
2018-02-22 09:13
Thanks for the information, Ideally, it should be there in the docs. currently, it needs improvement IMO :slightly_smiling_face:

shane
2018-02-22 13:44
@amit.handa - no worries ... in our next release (due out today, in fact), v3.7.0 - we have a new feature that should mean you do not need to specify the `next-server` in your subnet - it automatically inserts the value for you

amit.handa
2018-02-22 14:15
cool

amit.handa
2018-02-22 14:15
I have added drp as next-server for an existing dhcp deployment in our company.

amit.handa
2018-02-22 14:15
existing one is WDS

amit.handa
2018-02-22 14:16
I need to migrate the existing windows install images to drp

amit.handa
2018-02-22 14:16
hope it should be straight forward ?

amit.handa
2018-02-22 14:16
ll update

shane
2018-02-22 14:16
windows image provisioning is entirely possible w/ DRP, however, it's not available in the Open Community pieces - that is an advanced RackN functionality piece

shane
2018-02-22 14:17
we have done windows images for other customers, so we can do it, but it's not as straight forward ... because ... Windows ...

amit.handa
2018-02-22 14:17
yup

amit.handa
2018-02-22 14:17
but then the issue becomes how do I support both

amit.handa
2018-02-22 14:18
if it is feasible at all with community version

dave.parker
2018-02-22 15:44
Could someone give me a brief explanation of what it means to make a machine "runnable" or what the runner is/does? I'm kind of confused on that.

dave.parker
2018-02-22 15:45
If the "runner" is waiting can you assign a stage to a machine and have it immediately start doing its thing?

shane
2018-02-22 15:45
Sure! The runner is simply an agent used during the install process to enable the job queue and tasks to be executed. The Runner is ... actually ... just the `drpcli` binary put in to a special mode to listen for jobs to execute during the Stage transitions.

dave.parker
2018-02-22 15:45
Ahh

dave.parker
2018-02-22 15:46
So when the host is sitting at the OS login prompt after the sledgehammer boot, the runner is waiting for further instructions basically?

shane
2018-02-22 15:47
The Runner (drpcli) runs in the Sledgehammer (discovery) stages, and executed work. By default and by design, it "dissolves" and does not remain resident after install. However, for larger and more complex full lifecycle management solutions - we can leave the Runner in place as a long-lived service (ala `systemd`, etc.) that enables deeper integration for lifecycle management and other enforcement activities, if you should choose.

shane
2018-02-22 15:47
Yes - generally speaking, as long as you have not run a Stage/Task that disables the runner ... :slightly_smiling_face:

shane
2018-02-22 15:49
you can check for it by simply doing `ps -ef | grep drpcli | grep -v grep` on a target Machine

dave.parker
2018-02-22 15:49
Ok cool.

dave.parker
2018-02-22 15:50
Thank you.

shane
2018-02-22 15:52
an example stage that disable the Runner is `complete-nowait`

shane
2018-02-22 15:52
inversely, the stage `complete` marks the Machine as done, but leaves the Runner ... ahem ... running

dave.parker
2018-02-22 15:52
Hehe

dave.parker
2018-02-22 15:54
It's starting to make sense now. I'm starting to wrap my head around everything.

zehicle
2018-02-22 16:12
@amit.handa Good work getting your first machines booted!! Just to clarify: There is a single code base for Digital Rebar Provision (DRP), which includes the provisioning service and the client. DRP is APLv2 licensed. RackN offers commercial support of that code and pushes patches and features back into the open (community) repos. We also offer a significant amount of RackN advanced content, plugins, and functionality for the DRP ecosystem that are sold commercially and, in many cases, offered without charge to the community.

amit.handa
2018-02-22 16:24
Thanks. I am pretty happy myself. I was unsure about DRP since I couldnt get in touch with its community. Now I have :slightly_smiling_face: ll definitely go through the DRP as well as RackN components.

mohd.mehdim
2018-02-22 20:18
has joined #community201802

spector
2018-02-22 21:57
Welcome Mohammed Mehdi

mohd.mehdim
2018-02-22 22:59
Thanks Spector

spector
2018-02-22 23:00
There is a FAQ, I think this will bring it up for you $Welcome

2018-02-22 23:00
Digital Rebar welcome information is here > http://rebar.digital/community/welcome.html

mohd.mehdim
2018-02-22 23:03
Yeah I have been doing some reading about digital rebar for the last one week. Trying to get this up and running in virtualbox.

shane
2018-02-22 23:04
VirtualBox is a bit of a pain - because it tries to interfere w/ your DHCP services, and you may have some other issues if you're on a Mac

mohd.mehdim
2018-02-22 23:04
yeah I am currently running into dhcp issues on my mac


mohd.mehdim
2018-02-22 23:08
Yeah trying to use hostonly network on both dr and client but for some reason its getting dhcp from somewhere else. Disabled virtualbox dhcp along with docker daemon.

shane
2018-02-22 23:09
for my MAC - I did this: 1. setup a single VM as my DRP Endpoint, w/ 2 NIC a) 1st Bridged to my WiFi (or your local LAN) b) 2nd as Host-Only (using vboxnet0) - this way, my Mac can participate in connecting to the VMs (Machines) directly too 2. set up my VMs with only 1 NIC, connected to Host-Only (vboxnet0) 3. on the DRP Endpoint, turn on packet forwarding (routing), and add IPTables NAT rules 4. disable DHCP on vboxnet0 - and then KILL the DHCP server that doesn't stop when you disable

shane
2018-02-22 23:09
when you disable it - it does NOT stop the DHCP service - you have to kill it, after disabling it

mohd.mehdim
2018-02-22 23:09
aha..need to kill the dhcp service...probably thats it, otherwise my setup is exactly the same as yours

shane
2018-02-22 23:12
I also added an IP addr to my Mac vboxnet0 network in the VirtualBox management tool - static IP assignment

mohd.mehdim
2018-02-22 23:16
ok so it gets an ip from dr but getting error ```Exec format error```

mohd.mehdim
2018-02-22 23:17
looks like some configuration issue with sledgehammer

mohd.mehdim
2018-02-22 23:18

shane
2018-02-22 23:22
hmm - not sure off the top of my head - I just upgraded my DRP version to the latest v3.7.0 (incidentally - you were the FIRST to download it - mere seconds after it released ... )

shane
2018-02-22 23:22
I was able to boot a VM smoothly

shane
2018-02-22 23:23
did you use the `--static-ip` flag when you started DRP ?

mohd.mehdim
2018-02-22 23:23
yeah

mohd.mehdim
2018-02-22 23:23
oh this is brand new :sunglasses:

shane
2018-02-22 23:28
can you please copy-n-paste the process listing/options for your running DRP Endpoint ? (`ps -ef | grep dr-provision | grep -v grep`)

mohd.mehdim
2018-02-22 23:32
```root 1510 1442 0 17:58 pts/0 00:00:00 sudo ./dr-provision --static-ip=192.168.99.201 --base-root=/root/drp-data --local-content= --default-content= root 1512 1510 0 17:58 pts/0 00:00:01 ./dr-provision --static-ip=192.168.99.201 --base-root=/root/drp-data --local-content= --default-content=```

shane
2018-02-22 23:33
did you create a `subnet` for 192.168.99.0/24 ?

mohd.mehdim
2018-02-22 23:34
yes

shane
2018-02-22 23:34
can you please provide the output of `drpcli subnets show <NAME _OF_SUBNET>` ?

shane
2018-02-22 23:34
we have some new changes that make adding a subnet not necessary - but I'd like to inspect what you set there

shane
2018-02-22 23:35
@shane uploaded a file: https://rackn.slack.com/files/U6QFVRJNB/F9CSHS532/screen_shot_2018-02-22_at_15.35.03.png and commented: this is my vbox VM booting up against a brand new v3.7.0 DRP endpoint

shane
2018-02-22 23:40
Ok - I replicated the issue - but only be deleting my Subnet - and I get the same errors you see when I do that

zehicle
2018-02-22 23:41
@mohd.mehdim which UX URL are you using?


shane
2018-02-22 23:45
:slightly_smiling_face: I can't access your DRP Endpoint via the Portal - I have to have direct access to the 192.168.99.201 IP address for that to work

mohd.mehdim
2018-02-22 23:45
```{ "ActiveEnd": "192.168.99.254", "ActiveLeaseTime": 60, "ActiveStart": "192.168.99.250", "Available": true, "Description": "", "Enabled": true, "Errors": [], "Meta": {}, "Name": "local_subnet", "NextServer": "", "OnlyReservations": false, "Options": [ { "Code": 1, "Value": "255.255.255.0" }, { "Code": 3, "Value": "192.168.99.1" }, { "Code": 6, "Value": "8.8.8.8" }, { "Code": 15, "Value": "http://example.com" }, { "Code": 28, "Value": "192.168.99.255" } ], "Pickers": [ "hint", "nextFree", "mostExpired" ], "Proxy": false, "ReadOnly": false, "ReservedLeaseTime": 7200, "Strategy": "MAC", "Subnet": "192.168.99.0/24", "Unmanaged": false, "Validated": true }```

mohd.mehdim
2018-02-22 23:46
yeah that ip is on hostonly network

shane
2018-02-22 23:46
that's a security mechanism - DRP Endpoint NEVER talks to the Portal directly - it only talks to the management workstation (aka your laptop) ... and provides a "passthrough" connection between the Portal and the Endpoint via the single-page React application that's running in your browser

mohd.mehdim
2018-02-22 23:48
So, the above is indeed the UX URL right?

shane
2018-02-22 23:48
yes - that's the Production version of the UX (our Stable Portal)

shane
2018-02-22 23:50
Ok - you need to edit the Subnet you created, add add the Param for Option Code 67, and set it to the value "lpxelinux.0"

shane
2018-02-22 23:50
`drpcli subnets set local_subnet option 67 to "lpxelinux.0"`

shane
2018-02-22 23:51
the DHCP changes in v3.7.0 release have made it necessary to now add that - I'll update the quickstart doc right now

mohd.mehdim
2018-02-22 23:53
cool..let me try that

mohd.mehdim
2018-02-22 23:55
Works :+1:

shane
2018-02-22 23:55
woot! woot!

mohd.mehdim
2018-02-22 23:58
So, can we use static names for machine names and use it to assign bootenv?

shane
2018-02-23 00:01
yep - just Edit the machine and change the name

shane
2018-02-23 00:01
you can do that in the UX, or via `drpcli` - see $FAQ for a FAQ note on that


mohd.mehdim
2018-02-23 00:02
I am more of a cli guy and will automate this provisioning using ansible

shane
2018-02-23 00:03
the FAQ has the CLI option there


shane
2018-02-23 00:04
We have some limited support for Ansible playbooks - we demonstrated it using the Kubernetes Kubespray Ansible playbook


shane
2018-02-23 00:04
note: it's been a while since we exercised that code - so it may be a bit crusty

mohd.mehdim
2018-02-23 00:07
cool?let me look into it?thanks

mohd.mehdim
2018-02-23 00:08
Another question, I see that DR does has the capability of doing hardware raid. Does it support any vendor or specific vendors?

shane
2018-02-23 00:09
we do - but that is a RackN commercial piece - if you're interested in that we should discuss that in context of the Trial you're running

greg
2018-02-23 01:46
- The release is out!

greg
2018-02-23 01:47
DRP stable is now v3.7.0. Content packages are v1.6.0. Plugins are now at v2.0.0. UX is now at v1.0.0.

greg
2018-02-23 01:47
If you update to stable, please immediately update your plugins to their v2.0.0 counterparts.

greg
2018-02-23 01:47
Updating default content will require a sledgehammer update.

greg
2018-02-23 01:48

greg
2018-02-23 01:49
UX release notes:


zehicle
2018-02-23 02:33
:nerd_face: great news. This is a big release w plugins, lots of bug fixes, dhcp and other updates. Well done!!

dave.parker
2018-02-23 18:32
Huh.

dave.parker
2018-02-23 18:32
I just installed the new stable, and when I try to do `drpcli bootenvs uploadiso sledgehammer` I get `Error: GET: bootenvs/sledgehammer: Not Found`

dave.parker
2018-02-23 18:34
Oh, this is probably why:

dave.parker
2018-02-23 18:34
```Installing Version stable of Digital Rebar Provision Community Content Failed to dowload content. Failed to download sha of content. sha256sum: drp-community-content.yaml: No such file or directory drp-community-content.yaml: FAILED open or read sha256sum: WARNING: 1 listed file could not be read```

dave.parker
2018-02-23 18:37
Going into the GUI under "Info & Preferences" and clicking the "Content" link under System Wizard and manually transferring community-content and community-contrib fixed it.

greg
2018-02-23 18:39
how did you install?

greg
2018-02-23 18:39
I found this and I?m fixing it shortly, I think.

greg
2018-02-23 18:39
Actually, it is in tip/tools/install.sh

greg
2018-02-23 18:40
I?ll crank a 3.7.1 with a couple of fixes here shortly.

dave.parker
2018-02-23 18:40
`curl -fsSL get.rebar.digital/stable | bash -s -- --isolated install`

greg
2018-02-23 18:41
hmm - when did you do the install?

greg
2018-02-23 18:42
just probably means now. :slightly_smiling_face:

dave.parker
2018-02-23 18:42
About ten minutes ago?

greg
2018-02-23 18:42
okay - thinking . It worked for me.

greg
2018-02-23 18:42
oh - I wonder if the tree was gyrating because of what I was doing. maybe.

greg
2018-02-23 18:43
Your recovery method was and is sound though.

dave.parker
2018-02-23 18:43
Cool

dave.parker
2018-02-23 20:14
Reinstalled tip just a bit ago and had the problem again. Don't know why it doesn't work for me.

greg
2018-02-23 20:15
hmmm - okay

greg
2018-02-23 20:16
@dave.parker - can do it two steps for me.

greg
2018-02-23 20:16
`curl -fsSL get.rebar.digital/stable > install.sh`

greg
2018-02-23 20:16
`chmod +x install.sh`

greg
2018-02-23 20:16
edit to add `set -x` near the top.

greg
2018-02-23 20:16
`./install.sh --isolated install`

dave.parker
2018-02-23 20:17
Sure I can try that.

greg
2018-02-23 20:17
Send me the output

greg
2018-02-23 20:17
please

dave.parker
2018-02-23 20:20

greg
2018-02-23 20:20
it seemed to work for you that time.

dave.parker
2018-02-23 20:20
Huh

dave.parker
2018-02-23 20:21
Yeah, sure did. I can grab sledgehammer just fine.

dave.parker
2018-02-23 20:21
Maybe it's some weird vagrant thing. Although I've got it to work in vagrant before.

greg
2018-02-23 20:21
It could be a networking path issue at times. :neutral_face:

greg
2018-02-23 20:21
hmmm

dave.parker
2018-02-23 20:22
I did manually try to reinstall once (not through vagrant up or vagrant provision, just going on the box and running the curl/install command) and that failed at that time.

dave.parker
2018-02-23 20:22
Well, let me try again.

greg
2018-02-23 20:23
I?m trying to help, but not sure what is going on. The thing I fixed this morning was if you specified `--version=v3.7.0`. It would fail for certain. You aren?t doing that so it is some other problem.

dave.parker
2018-02-23 20:23
Ok.

greg
2018-02-23 20:24
Stable shouldn?t be moving so the files should be there.

greg
2018-02-23 20:24
Sometimes tip can fail this way if you catch it during an update. The files are changing, but this isn?t that either.

dave.parker
2018-02-23 20:27
It certainly doesn't help that my laptop keeps crashing.

dave.parker
2018-02-23 20:27
Ok I'm trying again with stable through vagrant.

dave.parker
2018-02-23 20:27
See if that works now.

dave.parker
2018-02-23 20:33
Huh, nope. Still doesn't work.

dave.parker
2018-02-23 20:33
That's really weird.

greg
2018-02-23 20:33
yeah - does the direct curl work? `curl -sfL -o drp-community-content.sha256 https://github.com/digitalrebar/provision-content/releases/download/stable/drp-community-content.sha256`

dave.parker
2018-02-23 20:36
It does

dave.parker
2018-02-23 20:43
It works when I run from the command line though.

dave.parker
2018-02-23 20:43
Something is screwy with the way it installs during vagrant provisioning.

dave.parker
2018-02-23 20:51
Ok I don't get this. Because I see this in the install: ```master: Installing Version stable of Digital Rebar Provision Community Content master: drp-community-content.yaml: OK```

dave.parker
2018-02-23 20:51
But it still fails to find sledgehammer?

greg
2018-02-23 21:01
how are you running the dr-provision?

dave.parker
2018-02-23 21:07
A shell script that the vagrant provisioner runs.

dave.parker
2018-02-23 21:07
```#! /bin/bash mkdir dr-prov chown vagrant:vagrant dr-prov cd dr-prov curl -fsSL get.rebar.digital/tip | bash -s -- --isolated install nohup sudo ./dr-provision --static-ip=192.168.50.10 --base-root=/home/ubuntu/dr-prov/drp-data --local-content="" --default-content="" & &> /dev/null sleep 10 chown -R vagrant:vagrant . ./drpcli bootenvs uploadiso sledgehammer ./drpcli bootenvs uploadiso ubuntu-16.04-install ./drpcli subnets create - < /tmp/subnet.json ./drpcli bootenvs create - < /tmp/ubuntu1404.json ./drpcli prefs set unknownBootEnv discovery defaultBootEnv sledgehammer defaultStage discover```

dave.parker
2018-02-23 21:08
I just added the chown stuff. The first one seemed to fix the problem of the files not downloading at all, but it still doesn't seem to load properly.

dave.parker
2018-02-23 21:09
```master: Installing Version stable of Digital Rebar Provision Community Content master: drp-community-content.yaml: OK master: # Run the following commands to start up dr-provision in a local isolated way. master: # The server will store information and serve files from the drp-data directory. master: sudo ./dr-provision --static-ip=10.0.2.15 --base-root=/home/vagrant/dr-prov/drp-data --local-content="" --default-content="" & master: master: # Once dr-provision is started, these commands will install the isos for the community defaults master: ./drpcli bootenvs uploadiso ubuntu-16.04-install master: ./drpcli bootenvs uploadiso centos-7-install master: ./drpcli bootenvs uploadiso sledgehammer master: dr-provision2018/02/23 20:57:55.488386 Version: v3.7.0-0-246bbac639d47f8302fdfd4642646aeb498f9d0c master: dr-provision2018/02/23 20:57:55.492666 Extracting Default Assets master: dr-provision2018/02/23 20:57:56.081949 Starting TFTP server master: dr-provision2018/02/23 20:57:56.082273 Starting static file server master: dr-provision2018/02/23 20:57:56.082411 Starting DHCP server master: dr-provision2018/02/23 20:57:56.084521 Starting PXE/BINL server master: dr-provision2018/02/23 20:57:56.084848 Starting API server master: dr-provision2018/02/23 20:58:05.548566 [2:1]frontend [audit]: /home/travis/gopath/src/github.com/digitalrebar/provision/frontend/frontend.go:642 master: [2:1]Authenticated rocketskates - users token rocketskates - 127.0.0.1 master: Error: GET: bootenvs/sledgehammer: Not Found master: dr-provision2018/02/23 20:58:05.639968 [6:2]frontend [audit]: /home/travis/gopath/src/github.com/digitalrebar/provision/frontend/frontend.go:642 master: [6:2]Authenticated rocketskates - users token rocketskates - 127.0.0.1 master: Error: GET: bootenvs/ubuntu-16.04-install: Not Found```

dave.parker
2018-02-23 21:10
I'm going to try running the script manually from the command line in the vagrant box.

dave.parker
2018-02-23 21:12
It fails that way too. But the manual curl install worked. So I guess I'll go step by step until I find out what's not working...

greg
2018-02-23 21:16
this seems like you aren?t finding the content in the drp-data directory.

greg
2018-02-23 21:17
check - `/home/vagrant/dr-prov/drp-data/saas-content` for files

greg
2018-02-23 21:17
@dave.parker - more info ^

dave.parker
2018-02-23 21:17
Hrm ok

dave.parker
2018-02-23 21:20
```vagrant@dr-prov:~/dr-prov/drp-data/saas-content$ ls -la total 88 drwxrwxr-x 2 vagrant vagrant 4096 Feb 23 21:16 . drwxrwxr-x 9 vagrant vagrant 4096 Feb 23 21:16 .. -rw-rw-r-- 1 vagrant vagrant 79008 Feb 23 21:16 default.yaml```

dave.parker
2018-02-23 21:28
Huh, now it's not working with the curl command manually either.

dave.parker
2018-02-23 21:28
I'm stumped then.

dave.parker
2018-02-23 21:28
I'm going to try a non-vagrant machine.

zehicle
2018-02-23 21:29
@dave.parker if you are trying to build a system w/ DRP content staged, you can do it by populating the directory structure directly instead of using the APIs

dave.parker
2018-02-23 21:59
It seems to be something with vagrant. It's working fine installed on a hand-built host.

greg
2018-02-23 22:00
whew! I guess. :disappointed:

amit.handa
2018-02-24 07:16
install.sh is taking ages to download dr-provision.zip :disappointed:

amit.handa
2018-02-24 07:16
40KBPS max speed

amit.handa
2018-02-24 07:16
unable to upgrade

amit.handa
2018-02-24 07:17
any ideas on speeding up

greg
2018-02-24 14:37
Hmm. Not sure. It is in an s3 bucket.

zehicle
2018-02-24 18:18
FWIW - I've tried get Vagrant working on multiple iterations of Rebar with limited success, but I'd been doing it on Linux desktops.

zehicle
2018-02-24 19:50
Greg and I recorded a video set about creating and bundling content. The example content is on http://github.com/digitalrebar/colordemo and the videos posted: * Creating Content: https://youtu.be/79Y-3IOguZk * Bundling: https://youtu.be/JUyzFNkLyZU

michael.harp
2018-02-26 16:52
http://provision.readthedocs.io/en/latest/doc/quickstart.html#install-your-first-machine Step-4 has typo, stages->stage `drpcli machines stage <UUID> ubuntu-16.04-install`

greg
2018-02-26 16:59
Thanks - fixing now.

florent.wagener
2018-02-26 18:31
is there any release notes for the 3.7.0 version ? I have trouble finding them...


florent.wagener
2018-02-26 18:40
ill give it a try this afternoon.

dave.parker
2018-02-26 19:04
3.7 has been pretty great to me so far.

greg
2018-02-26 19:56
I pushed a new release v3.7.1 - https://github.com/digitalrebar/provision/releases/tag/v3.7.1 Most this is doc updates, a bug fix, and some Mac OSX and virtualbox ease of use stuff. Content updates as well, but small too. Some new icon and color update. A couple of helper templates.

greg
2018-02-26 19:57
Also, docker containers are updated and versioned now as well. stable, latest, and v3.7.1 are out there.

rakeshrhcss
2018-02-27 11:08
has joined #community201802

wdennis
2018-02-27 16:12
@shane or anyone else - what is correct cmd line to upgrade from `v3.6.0-tip` to 3.7 stable?


shane
2018-02-27 16:16
(sorry I pasted `tip` doc - use the `latest`)

shane
2018-02-27 16:16
we put version to version upgrade notes there

shane
2018-02-27 16:16
if there is nothing, then you should assume there are no special requirements for your upgrade path

shane
2018-02-27 16:17
for your case ... "it depends" ... on what version of `3.6.0-tip` you were at

shane
2018-02-27 16:17
but the 3.6.0 to 3.7.0 notes will give you good results - eg update your plugins

wdennis
2018-02-27 16:19
Seeing this right after upgrade and start - but before upgrading plugins: ```dr-provision2018/02/27 11:56:01.273119 Version: v3.7.1-0-b441dd1450c98be5317025c89668f85985eb65d8 dr-provision2018/02/27 11:56:01.273263 Extracting Default Assets dr-provision2018/02/27 11:56:02.246097 [0:1]frontend [ info]: /home/travis/gopath/src/github.com/digitalrebar/provision/frontend/frontend.go:512 [0:1]Running Local UI from /home/dradmin/drp/drp-data/ux dr-provision2018/02/27 11:56:02.430453 [0:2]plugin [error]: /home/travis/gopath/src/github.com/digitalrebar/provision/midlayer/controller.go:523 [0:2]Unpack for ipmi failed: exit status 1 dr-provision2018/02/27 11:56:02.430578 [0:3]plugin [error]: /home/travis/gopath/src/github.com/digitalrebar/provision/midlayer/controller.go:524 [0:3]Error: unknown command "unpack" for "ipmi" Run 'ipmi --help' for usage. dr-provision2018/02/27 11:56:03.448196 Starting TFTP server dr-provision2018/02/27 11:56:03.449430 Starting static file server dr-provision2018/02/27 11:56:03.449786 Starting API server```

wdennis
2018-02-27 16:19
The `ipmi` failure is due to old plugin?

shane
2018-02-27 16:19
yep - you had plugins v1

shane
2018-02-27 16:20
you'll need to follow the 3.6.0 to 3.7.0 notes on updating your plugins to the new v2 goodness

wdennis
2018-02-27 16:20
No, just came from v2 plugins w/ 2.6.0-tip actually

wdennis
2018-02-27 16:20
2->3

shane
2018-02-27 16:20
(yep)

wdennis
2018-02-27 16:21
OK, did ipmi plugin upgrade, let's restest start...

wdennis
2018-02-27 16:21
Yup, clean start now

wdennis
2018-02-27 16:22
Need new sledgehammer, correct?

shane
2018-02-27 16:22
yes

shane
2018-02-27 16:24
not sure if you noticed - but you can use the version inspector with colorized (pretty-print) and Diff capability in the UX

shane
2018-02-27 16:24
you can see exactly what's going to be changed before performing an upgrade of Content

wdennis
2018-02-27 16:25
Is there an example in docs for downloading boot isos?

shane
2018-02-27 16:26
$quickstart has it

2018-02-27 16:26

wdennis
2018-02-27 16:27
cool, thx

shane
2018-02-27 16:27
no prob

shane
2018-02-27 16:28
I believe ... but may be lying to you ... that you can now do it through the "Boot ISOs" menu in the UX - I haven't tried that path yet

wdennis
2018-02-27 16:28
It downloads ISO to host running UX

shane
2018-02-27 16:28
ah

shane
2018-02-27 16:29
actually - I think the Upload action pushes ISO from your UX hosted management workstation

wdennis
2018-02-27 16:29
The `drpcli bootenvs uploadiso` does the explode etc right?

shane
2018-02-27 16:29
so if you have the ISO local - then you can push it to your endpoint from workstation

shane
2018-02-27 16:29
yep

wdennis
2018-02-27 16:29
OK

shane
2018-02-27 16:30
From the Boot Environments menu - you can select a BootEnv, which will have the source of the ISO/tarball - and download to your workstation - then use the Boot ISOs Upload command

shane
2018-02-27 16:30
or - as you are doing - use the CLI uploadiso helper

shane
2018-02-27 16:32
yep - the 2-step UX procedure works too

wdennis
2018-02-27 16:32
OK, everything looks good...

shane
2018-02-27 16:32
excellent !

wdennis
2018-02-27 16:33
Trying a 5-node reinstall, let's see what happens

shane
2018-02-27 16:33
it'll magically work !

wdennis
2018-02-27 16:33
(bye-bye old KRIB cluster :cry:)

shane
2018-02-27 16:34
there's some new feedback stuff in the new KRIB update ... which gives you visual change cues

shane
2018-02-27 16:34
@zehicle is excited about it - I'm going to give it a try and see how it looks today

wdennis
2018-02-27 16:34
Yeah, have to install a test Rancher cluster today, but then want to go back to stock k8s

shane
2018-02-27 16:35
did you get rancher working via DRP ?

wdennis
2018-02-27 16:35
No, not RancherOS - just Rancher-controlled infra nodes

shane
2018-02-27 16:35
ah

wdennis
2018-02-27 16:35
Kicking tires on their k8s installer

shane
2018-02-27 17:36
we hope to see you all in a short bit (11 am PST) for our v012 meetup. Meetup link: https://www.meetup.com/digitalrebar/events/247773442/

amontalban
2018-02-27 18:08
Hey guys, anyone had to use full disk encryption on Ubuntu setting the encryption key in preseed? Anyway to automate random key generation with DigitalRebar or interact with Vault for it?

shane
2018-02-27 18:10
hi @amontalban - we haven't specifically done any FDE w/ DRP - nor specific Vault integration ... however, it should be pretty easy to author/change content to use the Vault command line to interact with your Vault store

amontalban
2018-02-27 18:10
Yeah, I think I will go that route

amontalban
2018-02-27 18:10
Thanks!

shane
2018-02-27 18:12
also `openssl` is installed by default in the Sledgehammer image - so you can use `openssl rand ...` to generate a number, which can be used also in a Stage

shane
2018-02-27 18:13
(eg `openssl rand -hex 100` to generate a 100 character random string)

amontalban
2018-02-27 18:14
Great thank you!

shane
2018-02-27 18:15
you can also use `drpcli` to store a Param on the machine w/ the randomly generated number, which can subsequently be used in your Stage(s) for the seed value

dave.parker
2018-02-27 18:59
Did anybody ever figure out what bizarre thing Vagrant is doing that doesn't play nice with dr-provision? I know @zehicle said something about having tried it with limited success.

dave.parker
2018-02-27 19:00
I gave up on it but would still love to get it working. It'd be ideal for me to be able to pass a kind of playground around for people to get familiar with.

greg
2018-02-27 19:04
virtualbox had issues with lpxelinux.

greg
2018-02-27 19:04
that has been fixed in 3.7.1

greg
2018-02-27 19:04
I don?t remember the other issues in vagrant.

shane
2018-02-27 19:05
@dave.parker we were going to discuss that in the meetup that is starting RIGHT NOW

dave.parker
2018-02-27 19:11
Oh, I thought that was hours ago. Time zones are hard. Either way I can't jump on right now. :disappointed:

spector
2018-02-27 19:21
We record the meetups, will be online in an a few hours from now

dave.parker
2018-02-27 20:01
Oh cool.

spector
2018-02-27 20:48
@dave.parker http://bit.ly/2BSVRsq -> almost done processing on YouTubve

dave.parker
2018-02-27 21:07
:thumbsup:

florent.wagener
2018-02-27 21:41
does sledgehammer support python3.x ? If not is this something on the roadmap ?

wdennis
2018-02-27 21:42
Hi team - what would a stage-map look like that has `prep-install` as an step before an OS install?

wdennis
2018-02-27 21:43
(hoping that `prep-install` fixes install prob's with pre-existing disks that were used for LVM)

vlowther
2018-02-27 21:43
@florent.wagener Not out of the box -- 2.7 is present, IIRC -- it is the default for centos7 still.

florent.wagener
2018-02-27 21:43
thanks @vlowther

vlowther
2018-02-27 21:44
@wdennis discover -> prep-install -> foo-install -> local

wdennis
2018-02-27 21:46
@vlowther There will be a reboot after `prep-install` I reckon?

vlowther
2018-02-27 21:47
nope.

vlowther
2018-02-27 21:47
that is, the task won't reboot the system.

vlowther
2018-02-27 21:48
and the runner will automatically reboot into the right bootenv for the next stage.

wdennis
2018-02-27 21:49
So, this should work? ``` "change-stage/map": { "discover": "prep-install:Success", "prep-install": "ubuntu-16.04-install:Success", "ssh-access": "complete-nowait:Reboot", "ubuntu-16.04-install": "ssh-access:Success" }, ```

greg
2018-02-27 21:51
``` "change-stage/map": { "discover": "prep-install:Success", "prep-install": "ubuntu-16.04-install:Reboot", "ssh-access": "complete-nowait:Stop", "ubuntu-16.04-install": "ssh-access:Success" }, ```

greg
2018-02-27 21:51
Reboot after changing to the install stage.

wdennis
2018-02-27 21:51
ah

greg
2018-02-27 21:51
Stop the task process when the post-install stages are done.

wdennis
2018-02-27 21:54
OK, got it... ``` "change-stage/map": { "discover": "prep-install:Success", "prep-install": "ubuntu-16.04-install:Reboot", "ssh-access": "complete-nowait:Stop", "ubuntu-16.04-install": "ssh-access:Success" } ```

greg
2018-02-27 21:56
also you don?t need the ssh-access steps. It is a built-in task to the ubuntu-16.04-install

clint
2018-02-27 22:01
has joined #community201802

wdennis
2018-02-27 22:15
Hmmm, prep-install not blanking the install disk - still getting this when trying to reuse prior-installed disks:

wdennis
2018-02-27 22:16
I can see from the jobs log that the tasks ran before the Ubuntu install task...


wdennis
2018-02-27 22:18
This is sample job output... ```Log for Job: 36f0b70e-055a-40c8-a288-0a224eb94343 Starting task erase-hard-disks-for-os-install on bc00245d-48e8-487a-9b5c-e59eb2b62f8d Starting command ./erase-hard-disks-for-os-install-erase-disks Command running PARTIAL MODE. Incomplete logical volumes will be processed. Reading all physical volumes. This may take a while... Found volume group "testnode01" using metadata type lvm2 Logical volume testnode01/root contains a filesystem in use. PV /dev/sda3 VG testnode01 lvm2 [1.82 TiB / 0 free] Total: 1 [1.82 TiB] / in use: 1 [1.82 TiB] / in no VG: 0 [0 ] PV /dev/sda3 belongs to Volume Group testnode01 so please use vgreduce first. (If you are certain you need pvremove, then confirm by using --force twice.) mdadm: Unrecognised md component device - /dev/sda3 2048+0 records in 2048+0 records out 1048576 bytes (1.0 MB, 1.0 MiB) copied, 0.0144598 s, 72.5 MB/s 2048+0 records in 2048+0 records out 1048576 bytes (1.0 MB, 1.0 MiB) copied, 0.0566154 s, 18.5 MB/s mdadm: Unrecognised md component device - /dev/sda2 2048+0 records in 2048+0 records out 1048576 bytes (1.0 MB, 1.0 MiB) copied, 0.0237597 s, 44.1 MB/s 2048+0 records in 2048+0 records out 1048576 bytes (1.0 MB, 1.0 MiB) copied, 0.0618786 s, 16.9 MB/s mdadm: Unrecognised md component device - /dev/sda1 1024+0 records in 1024+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0163394 s, 32.1 MB/s mdadm: Unrecognised md component device - /dev/sda 2048+0 records in 2048+0 records out 1048576 bytes (1.0 MB, 1.0 MiB) copied, 0.0104949 s, 99.9 MB/s 2048+0 records in 2048+0 records out 1048576 bytes (1.0 MB, 1.0 MiB) copied, 0.0346949 s, 30.2 MB/s mdadm: Couldn't open /dev/sr0 for write - not zeroing dd: failed to open '/dev/sr0': No medium found dd: failed to open '/dev/sr0': No medium found Command exited with status 0 Action erase-disks finished Task erase-hard-disks-for-os-install finished Updated job 36f0b70e-055a-40c8-a288-0a224eb94343 to finished ```

vlowther
2018-02-27 22:28
Guess we need the --force --force --really-i-mean-it flag. :confused:

dave.parker
2018-02-27 22:43
--run --go --get-to-the-choppa

zehicle
2018-02-27 23:32
hello @clint $welcome

2018-02-27 23:32
Digital Rebar welcome information is here > http://rebar.digital/community/welcome.html

clint
2018-02-27 23:32
Thanks!

amit.handa
2018-02-28 11:29
quick ques: I have a local sledgehammer tarball (from s3) which I want to deploy to another drp-instance (due to version upgrade).

amit.handa
2018-02-28 11:29
so I copy the tarball to dest instance.

amit.handa
2018-02-28 11:33
get the bootenv for 'sledgehammer'

amit.handa
2018-02-28 11:33
update the isoUrl param to point to local disk path for the tarball

amit.handa
2018-02-28 11:33
and do bootenvs update sledgehammer - < updatedbootenv.list

amit.handa
2018-02-28 11:34
I get "Error: PATCH: discovery"

amit.handa
2018-02-28 11:34
no logs on the server side as well

amit.handa
2018-02-28 11:34
had run the drp server with trace log-level

amit.handa
2018-02-28 11:35
am I doing it correctly ?

amit.handa
2018-02-28 11:35
thanks

greg
2018-02-28 13:54
Login to the portal and update the content package for the community.

greg
2018-02-28 13:55
Then from the cli run the uploadiso command from the QuickStart. The community video has Shane talking about this some yesterday @amit.handa

wdennis
2018-02-28 16:19
Using the default DRP-provided preseed partitioning map, getting a failure as so:

wdennis
2018-02-28 16:23
(Had wiped the target disk beforehand with `dd if=/dev/zero of=/dev/sda bs=1024M` )


wdennis
2018-02-28 16:25
Here is the partitioning recipe that is being used: ``` #Partitioning Scheme d-i partman-auto/disk string /dev/sda d-i grub-installer/choose_bootdev select /dev/sda d-i grub-installer/bootdev string /dev/sda d-i partman-auto/method string lvm d-i partman-auto-lvm/guided_size string max d-i partman-auto-lvm/new_vg_name string testnode02 d-i partman-auto/choose_recipe select custom_lvm d-i partman-auto/expert_recipe string \ custom_lvm:: \ 500 50 1024 free $iflabel{ gpt } $reusemethod{ } method{ efi } format{ } . \ 128 50 256 ext2 $defaultignore{ } method{ format } format{ } use_filesystem{ } filesystem{ ext2 } mountpoint{ /boot } . \ 10240 20 10240 ext4 $lvmok{ } mountpoint{ / } lv_name{ root } in_vg{ testnode02 } method{ format } format{ } use_filesystem{ } filesystem{ ext4 } . \ 50% 20 100% linux-swap $lvmok{ } lv_name{ swap } in_vg{ testnode02 } method{ swap } format{ } . d-i grub-installer/only_debian boolean true d-i partman/confirm_write_new_label boolean true d-i partman/choose_partition select finish d-i partman/confirm boolean true d-i partman/confirm_nooverwrite boolean true ```

greg
2018-02-28 16:26
Is that the default scheme file as well?

wdennis
2018-02-28 16:27
I believe so, but let me dbl-check...

wdennis
2018-02-28 16:30
part-scheme-default.tmpl:``` {{if .ParamExists "operating-system-disk" -}} d-i partman-auto/disk string /dev/{{.Param "operating-system-disk"}} d-i grub-installer/choose_bootdev select /dev/{{.Param "operating-system-disk"}} d-i grub-installer/bootdev string /dev/{{.Param "operating-system-disk"}} {{else -}} d-i partman-auto/disk string /dev/sda d-i grub-installer/choose_bootdev select /dev/sda d-i grub-installer/bootdev string /dev/sda {{end -}} d-i partman-auto/method string lvm d-i partman-auto-lvm/guided_size string max d-i partman-auto-lvm/new_vg_name string {{.Machine.ShortName}} d-i partman-auto/choose_recipe select custom_lvm d-i partman-auto/expert_recipe string \ custom_lvm:: \ 500 50 1024 free $iflabel{ gpt } $reusemethod{ } method{ efi } format{ } . \ 128 50 256 ext2 $defaultignore{ } method{ format } format{ } use_filesystem{ } filesystem{ ext2 } mountpoint{ /boot } . \ 10240 20 10240 ext4 $lvmok{ } mountpoint{ / } lv_name{ root } in_vg{ {{.Machine.ShortName}} } method{ format } format{ } use_filesystem{ } filesystem{ ext4 } . \ 50% 20 100% linux-swap $lvmok{ } lv_name{ swap } in_vg{ {{.Machine.ShortName}} } method{ swap } format{ } . d-i grub-installer/only_debian boolean true ```

wdennis
2018-02-28 16:30
So, yes

greg
2018-02-28 16:30
okay - I?ll try it here in a might.

wdennis
2018-02-28 16:31
These are reinstalls on previously-used disks; that's why I did the wipe, as they had LVM prior

wdennis
2018-02-28 16:31
So not they should be "blank" disks

wdennis
2018-02-28 16:31
not --> now

ghabian
2018-02-28 18:20
has joined #community201802

wdennis
2018-02-28 22:51
@greg You try an Ubuntu install yet?

greg
2018-02-28 22:59
Fighting other fires.

greg
2018-02-28 22:59
Will get to it tonight after church and soccer practice.

spector
2018-02-28 23:00
hello @ghabian $welcome

2018-02-28 23:00
Digital Rebar welcome information is here > http://rebar.digital/community/welcome.html

wdennis
2018-02-28 23:00
@greg ACK

ghabian
2018-03-01 01:27
Thanks for the welcome!