Hacker News new | ask | show | jobs
by gavinray 2023 days ago
I'm a big fan conceptually of Nomad, and I love the streams that the Nomad team does to talk about development and answer questions.

My only comments would be:

I wish there was more content available (maybe on HashiCorp Learn Nomad) for working with single-instance Nomad clusters.

And a demo of realworld dev-to-prod workflow. Something like "Okay here's a local Docker Compose setup with Postgres, a backend API, and a frontend web app, and here's the workflow for getting it into production with Nomad."

1 comments

Thanks that's great feedback and you're not alone in that desire.

We do "small" users a pretty big disservice by effectively dismissing single-server deployments and jumping straight to "real" production deployments (3+ servers distinct from the nodes that run workloads).

We have people who use Nomad locally as a systemd alternative. We have people who use single-scheduler-multiple-nodes clusters at home or at work, and it works quite well! If the scheduler crashes, work continues to run uninterrupted, so there's little reason not to start small and scale up.

The problem is largely that it's tricky to separate out and accurately address these various personas. People looking for a systemd alternative are obviously highly technical and will likely figure everything out through experimentation. However, "small" cluster users need to be carefully educated on the differences between an HA cluster (3+ schedulers) and a single scheduler cluster.

Not only that, but we would need to automate testing for each of these recommendations to ensure changes don't break an official recommendation.

For single node or small cluster deployments, one issue with k8s is the significant CPU and RAM usage even on a fully idle system with no payload. How is Nomad in that regard? Are the controllers also implemented as a control loop that is constantly polling and diffing reported and desired state?
Granted this is anecdotal, but I found Nomad to have much more reasonable resource usage compared to K3s on my pi cluster
Thanks. Seems like it’s worth looking into it then. Still curious about the reasons why it’s more efficient.
Nomad was designed for efficiency (linear scalability) and performance from the very beginning. One of major aspects of this is that it is monolithic. The single nomad binary contains:

- The CLI for interacting with Nomad clusters via their HTTP API

- The HTTP API

- The "server" agent which contains the scheduler, state store, and Raft implementation

- The "client" agent which runs workloads and has builtin task drivers for docker, plain processes, and more.

Not only is there no dependency on an external state store like etcd, but there's no pluggable abstraction for state storage. The tradeoff is that all cluster state must fit in memory. Pi clusters should use far less than 100mb of memory while something like C2M (6,100 nodes, 2 million containers) used just under 100gb of memory. Memory use scales linearly with cluster size (nodes+containers).

Since Raft and the scheduler are colocated within a single server agent process, many operations can be performed in the replicated FSM as a simple function call instead of if Raft was an external process requiring networking overhead to interact with.

I'm not very familiar with k8s internals, so I'm afraid I can't offer a very detailed direct comparison.

Thanks for that detailed explanation. The fact that all main functions live in a single process probably helps a lot with CPU and RAM usage, compared to k8s where etcd, API server, scheduler and controller manager live in independent processes, which means serialization/deserialization and less optimization opportunities.
Yeah, I mean also from a financial/business standpoint I totally get it.

The incentive is to target content efforts towards enterprise and large-scale orgs. Because who cares if the indie dev or small startup is running your stack, let's be real.

I’d be happy to know more about the single node use case. I considered Nomad in the past for this, but it seemed optimized for a cluster, and I ended up using Docker Swarm instead.
Same for me, i did stick to docker-compose. A production-grade single server deployment would be a real interesting case.
The big problem for me with Docker Compose is the lack of zero downtime / seamless deployments. First it stops the containers running the old version, then it starts new containers running new versions, instead of doing the reverse.