fermyon/installer

Traefik did not start running local

itowlson opened this issue · 2 comments

I installed all the bits and ran the local installer.

It stalled on deploying Traefik.

I stopped it and tried again in a fresh shell and got rm: cannot remove './data': Permission denied.

When I ran it under sudo, the Traefik deployment was successful, but the installer then printed "shutting down services", and did not print the expected URLs.

Environment:

  • WSL
  • Ubuntu 18.04 WHICH IS NOT A REALLY EARLY VERSION STOP CALLING IT THAT YOU FRIGHTFUL CADS
  • Nomad and Consul installed using apt
  • Traefik and Bindle downloaded under home directory and put on path via export (not in bash profile)
  • Spin built locally and put on path via export

Output running as normal user (note Ctrl+C on antepenultimate line - this was after waiting several minutes):

16:19 $ ./start.sh
Starting consul...
Starting nomad...
Waiting for nomad...
Starting traefik job...
==> 2022-06-14T16:19:23+12:00: Monitoring evaluation "2e71b961"
    2022-06-14T16:19:23+12:00: Evaluation triggered by job "traefik"
==> 2022-06-14T16:19:24+12:00: Monitoring evaluation "2e71b961"
    2022-06-14T16:19:24+12:00: Evaluation within deployment: "8eed3446"
    2022-06-14T16:19:24+12:00: Evaluation status changed: "pending" -> "complete"
==> 2022-06-14T16:19:24+12:00: Evaluation "2e71b961" finished with status "complete" but failed to place all allocations:
    2022-06-14T16:19:24+12:00: Task Group "traefik" (failed to place 1 allocation):
      * Constraint "${attr.consul.version} semver >= 1.7.0": 1 nodes excluded by filter
    2022-06-14T16:19:24+12:00: Evaluation "b4c7f5fc" waiting for additional capacity to place remainder
==> 2022-06-14T16:19:24+12:00: Monitoring deployment "8eed3446"
  ⠹ Deployment "8eed3446" in progress...

    2022-06-14T16:31:12+12:00
    ID          = 8eed3446
    Job ID      = traefik
    Job Version = 0
    Status      = running
    Description = Deployment is running

    Deployed
    Task Group  Desired  Placed  Healthy  Unhealthy  Progress Deadline
    traefik     1        0       0        0          N/A^C
Shutting down services
./start.sh: line 19: kill: (8638) - No such process

Output under sudo (did not hit Ctrl+C until after shutdown message):

16:49 # ./start.sh
Starting consul...
Starting nomad...
Waiting for nomad...
Starting traefik job...
==> 2022-06-14T16:49:50+12:00: Monitoring evaluation "42a6ad40"
    2022-06-14T16:49:50+12:00: Evaluation triggered by job "traefik"
==> 2022-06-14T16:49:51+12:00: Monitoring evaluation "42a6ad40"
    2022-06-14T16:49:51+12:00: Evaluation within deployment: "7b367706"
    2022-06-14T16:49:51+12:00: Evaluation status changed: "pending" -> "complete"
==> 2022-06-14T16:49:51+12:00: Evaluation "42a6ad40" finished with status "complete" but failed to place all allocations:
    2022-06-14T16:49:51+12:00: Task Group "traefik" (failed to place 1 allocation):
      * Constraint "${attr.consul.version} semver >= 1.7.0": 1 nodes excluded by filter
    2022-06-14T16:49:51+12:00: Evaluation "19459823" waiting for additional capacity to place remainder
==> 2022-06-14T16:49:51+12:00: Monitoring deployment "7b367706"
  ✓ Deployment "7b367706" successful

    2022-06-14T16:50:10+12:00
    ID          = 7b367706
    Job ID      = traefik
    Job Version = 0
    Status      = successful
    Description = Deployment completed successfully

    Deployed
    Task Group  Desired  Placed  Healthy  Unhealthy  Progress Deadline
    traefik     1        1       1        0          2022-06-14T17:00:09+12:00

Shutting down services

@itowlson Can you check if this is resolved on main?

main now prompts me for a sudo password, and it places traefik but that goes to unhealthy. It keeps placing it but it keeps going unhealthy (lost the trace but the first time it was up to 3 placed/3 unhealthy before I killed it).

✔ ~/github/fermyon-installer/local [main|✔]
08:41 $ ./start.sh
[sudo] password for ivan:
Starting consul...
Waiting for consul...
Starting nomad...
Waiting for nomad...
Starting traefik job...
==> 2022-06-15T08:41:56+12:00: Monitoring evaluation "9e8f02d9"
    2022-06-15T08:41:56+12:00: Evaluation triggered by job "traefik"
    2022-06-15T08:41:56+12:00: Allocation "93756bd5" created: node "34ccff0c", group "traefik"
==> 2022-06-15T08:41:57+12:00: Monitoring evaluation "9e8f02d9"
    2022-06-15T08:41:57+12:00: Evaluation within deployment: "f8cb4836"
    2022-06-15T08:41:57+12:00: Evaluation status changed: "pending" -> "complete"
==> 2022-06-15T08:41:57+12:00: Evaluation "9e8f02d9" finished with status "complete"
==> 2022-06-15T08:41:57+12:00: Monitoring deployment "f8cb4836"
  ⠧ Deployment "f8cb4836" in progress...

    2022-06-15T08:42:32+12:00
    ID          = f8cb4836
    Job ID      = traefik
    Job Version = 0
    Status      = running
    Description = Deployment is running

    Deployed
    Task Group  Desired  Placed  Healthy  Unhealthy  Progress Deadline
    traefik     1        1       0        1          2022-06-15T08:51:55+12:00^C
Shutting down services
./start.sh: line 17: kill: (30552) - Operation not permitted