This is great progress toward making unikernels runtime-agnostic.
However, I am still skeptical of the idea that unikernels will ever be production friendly. The biggest deficiency, by the very definition of a unikernel, is the complete lack of debugging tools available. There is no top, perf, etc. in a unikernel. In Docker, I can still exec into a running container to debug and investigate. Unikernel issues lend themselves to a "just restart it" rather than a "let's debug and fix it" mindset.
I've run a k8s environment with Kata Containers for a while, and as time has gone on, I've found that Solaris Zones (SmartOS) are probably the most scalable way to achieve process/VM isolation for applications.
You would debug a unikernel just like you can debug the OS kernel. Either it brings its own internal "remote debugging API", or the hypervisor helps you with that and provides interactive tracing/debugging features, plus dumps with CPU and memory state that you load into a local debugger.
Kernel debugging technology has existed for a long time now, and I don't think it's far-fetched to see public clouds exposing those abilities in a secure manner, and unikernels including their own tools for remote debugging. Any kernel can be debugged, and unikernels are no exception.
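For a concrete sketch of the hypervisor-assisted path: QEMU already exposes a GDB stub, so a unikernel VM can be debugged much like any kernel. The image and symbol file names here are placeholders:

```shell
# Boot the unikernel under QEMU with a GDB stub:
#   -s  listen for GDB connections on tcp::1234
#   -S  freeze the vCPU at startup until a debugger attaches
qemu-system-x86_64 -kernel my-unikernel.img -s -S

# From another terminal, attach with the symbols from the build:
gdb my-unikernel.elf \
    -ex 'target remote :1234' \
    -ex 'break main' \
    -ex 'continue'
```

The same stub also works for poking at a wedged VM after the fact, and the QEMU monitor's `dump-guest-memory` command covers the crash-dump-into-a-local-debugger case.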
Then the question becomes, should all application developers become kernel developers/debuggers? Wouldn't that impose a significant barrier to adoption?
Probably (hopefully) no more than having a standard library forces you to be a "standard library debugger". That is to say, a little bit, but you can easily ignore the parts you don't care about.
htop and such don't make sense inside a single-process system; logging usually gets sent to remote syslog/ELK/etc., and many APM solutions work out of the box
one of the biggest misconceptions about unikernels is that they are somehow a slimmed-down Linux - it's easier to think of them as individual programs provisioned as VMs - would you ssh into a process? why?
all 'debugging' tools work perfectly fine w/ unikernels, although I'd draw a fine line between real application-level debugging, which should never happen in production, and ops tooling
> I'd draw a fine line between real application-level debugging, which should never happen in production, and ops tooling
Sorry, but that's just plain wrong. One very common reason people debug in production is that performance debugging may give very different results on dev, staging, and production systems. That's why people want tooling that gives them the most accurate application-level debugging information possible without impeding the behavior of the system. It's a very common problem in large-scale systems.
I think that can be solved by good tracing and logging facilities, both of which are already available in Linux (eBPF, ftrace, tracepoints, perf events...). You may have to customize or polish them further, though, and provide more user-friendly interfaces/frontends for them.
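For instance, two of the stock facilities mentioned above can be pointed at a live process without restarting anything (command sketches; bpftrace requires root, and the PID is a placeholder):

```shell
# Sample on-CPU call stacks of a running process for 10s with perf events:
perf record -g -p 12345 -- sleep 10
perf report

# Trace file opens system-wide with eBPF via bpftrace:
bpftrace -e 'tracepoint:syscalls:sys_enter_openat {
    printf("%s %s\n", comm, str(args->filename));
}'
```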
Maybe. But KGDB and friends do exist in Linux too, and (AFAIK) not everyone likes using those; some just like throwing a few printks here and there. That is not universal, obviously :) My point is that yes, unikernels may not be as easy to debug at this point as regular usermode code, but it is something that can be worked around, and I think if people begin to use them en masse, frameworks and solutions will emerge.
Oh god, I wish. It's more common than you probably think.
You have a lot of options approximating this with thicker runtimes like Java/JVM and Erlang/BEAM, or with interpreted, REPL'd languages like Python or Lisps. And they're incredibly useful.
Pulling up a REPL in a running process, like a JS console in the browser, is basically the same thing.
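A minimal sketch of that idea in Python's stdlib (the `App` object and counter are made up for illustration; `code.interact` would give you the actual interactive prompt):

```python
import code

# Hypothetical state inside a long-running process.
class App:
    def __init__(self):
        self.requests_served = 42

app = App()

# An interpreter running *inside* the process, with live objects in scope.
# Feeding it a line stands in for what you'd type at the in-process prompt;
# code.interact(local={"app": app}) would open the real interactive REPL.
interp = code.InteractiveInterpreter(locals={"app": app})
interp.runsource("result = app.requests_served * 2")
print(interp.locals["result"])  # -> 84
```

The JVM's JMX/attach API and Erlang's `erl -remsh` are the thicker-runtime versions of the same primitive.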
Sounds like adding the concept of debug builds and opt in instrumentation of release builds for unikernels would mostly address your concerns. I think it’s fair to say they’re not ready yet, but never seems unlikely to me.
Yes, how do you know EXACTLY what code/service/diagnostics/communication library you'll need in the future?
Answer: you don't. You don't know what corner you're painting yourself into.
The Linux kernel, outside of its excessive number of drivers, is full of code that is useful for computers, VMs, and containers in lots of different situations: normal operation, compromised, under load, in distress, etc.
I doubt you could even make a list ahead of time of the utilities you'd need for getting stats on the container's state. Network? Disk? Processes? Memory?
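To make the open-endedness concrete: on Linux, most of those stats tools are thin readers over /proc, which is exactly the machinery a stripped-down image gives up. A sketch (Linux-only; key names per proc(5)):

```python
def read_meminfo(path="/proc/meminfo"):
    """Parse 'Key:   12345 kB' lines from /proc/meminfo into a dict of ints."""
    stats = {}
    with open(path) as f:
        for line in f:
            key, _, rest = line.partition(":")
            fields = rest.split()
            if fields:
                stats[key.strip()] = int(fields[0])
    return stats

# free(1) in a few lines; iostat/netstat are similar reads of
# /proc/diskstats and /proc/net/dev.
mem = read_meminfo()
print(f"{mem['MemTotal']} kB total, {mem['MemAvailable']} kB available")
```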
Linux has obviously scaled from very constrained computers (386/486s with a couple of megs of RAM were the PC state of the art when Linux was initially developed) up to the current supercomputers and large VMs on AWS and other vendors.
A lot of the size bloat in Linux for container images is the drivers. For containers and VMs, one really doesn't need all the driver variants, because a VM/container should just be presented with a limited virtual hardware interface. Then you could greatly reduce the driver portion of the monolithic kernel.
Once you get rid of that, Linux should probably concern itself with a couple of "power of 1000" kernels:
kilohertz/kilobytes of RAM (maybe not even bother with this), aka the 80s computer. Since this is an 8- or 16-bit machine, Linux may not be practically back-scalable to this mode of computing, but I don't know Linux history well enough.
megahertz/megabytes of RAM, aka the 90s computer, aka the 32-bit era.
gigahertz/gigabytes+ (2000s to modern), aka the 64-bit era.
Terascale is basically served by 64-bit kernels, AFAIK.
Your typical container will basically fit into one of these profiles, I would guess. But the basic Linux/Unix model should work for every one of them... because it has, since the 70s, on machines from the PDP-7 on up.
So IMO, container-focused derivatives of Linux should concentrate on tailoring to these levels.
Container applications should basically be targeted at one of the levels.
Granted, maybe the factor-of-1000 jump is a bit big. Economically there is a big difference between 1, 10, and 100 MB and gigabyte memory spaces in particular, and in what you can cram into a machine or pod. But the KB image should be able to overlap with the low-end MB one (although that takes ugly segmented pointers and other memory-extension hacks), and the MB image can DEFINITELY encroach on the low-end GB image.