kamal

Author	SHA1	Message	Date
dhh	62cdf31ae2	Fix tests	2023-09-16 11:01:16 -07:00
dhh	3ae855ef28	Explain method better	2023-09-16 09:53:03 -07:00
Donal McBreen	3c12d1799c	Copy all files into asset volume Adding -T to the copy command ensures that the files are copied at the same level into the target directory whether it exists or not. That allows us to drop the `/*` which was not picking up hidden files. Fixes: https://github.com/basecamp/kamal/issues/465	2023-09-15 08:07:48 +01:00
Donal McBreen	60835d13a8	Merge pull request #444 from rience/custom-healthcheck-log-lines-count Configurable Number of Lines in Healthcheck Log Output	2023-09-13 08:57:00 +01:00
Krzysztof Adamski	892cf0e66b	Configurable Log Lines Number in Healthcheck Log Output	2023-09-12 21:06:36 +02:00
Krzysztof Adamski	8ddc484ce6	Configurable Lines Number in Healthcheck Log Output	2023-09-12 21:04:18 +02:00
Donal McBreen	fb0aeec27e	Escape the newline in the inspect query	2023-09-12 19:10:39 +01:00
Donal McBreen	7b42daa9fb	Merge pull request #457 from basecamp/remove-dangling-image-filter Remove the `dangling=true` filter	2023-09-12 11:21:50 +01:00
Donal McBreen	2c5ab054db	Remove the `dangling=true` filter This has been removed from Docker Engine 24 and `docker image prune` only deletes dangling images anyway. Fixes https://github.com/basecamp/kamal/issues/410	2023-09-12 11:09:26 +01:00
Donal McBreen	66291a2aea	Validate the build image Kamal needs images to have the service label so it can track them for pruning. Images built by Kamal will have the label, but externally built ones may not. Without it, images will build up over time. The worst case is an outage if all the hosts' disks fill up at the same time. We'll add a check for the label and halt if it is not there.	2023-09-12 10:45:01 +01:00
Donal McBreen	00cb7d99d8	Merge pull request #449 from basecamp/asset-path Asset paths	2023-09-12 08:26:07 +01:00
Donal McBreen	718776eb72	Prune healthcheck containers If a deployment is interrupted it could leave stale healthcheck containers around that prevent dependent images from being pruned.	2023-09-11 14:36:25 +01:00
Donal McBreen	0b439362da	Asset paths During deployments both the old and new containers will be active for a small period of time. There also may be lagging requests for older CSS and JS after the deployment. This can lead to 404s if a request for old assets hits a new container or vice versa. This PR makes sure that both sets of assets are available throughout the deployment from before the new version of the app is booted. This can be configured by setting the asset path: ```yaml asset_path: "/rails/public/assets" ``` The process is: 1. We extract the assets out of the container, with docker run, docker cp, docker stop. Docker run sets the container command to "sleep" so this needs to be available in the container. 2. We create an asset volume directory on the host for the new version of the app and copy the assets in there. 3. If there is a previous deployment we also copy the new assets into its asset volume and copy the older assets into the new asset volume. 4. We start the new container mapping the asset volume over the top of the container's asset path. This means that both the old and new versions have replaced the asset path with a volume containing both sets of assets and should be able to serve any request during the deployment. The older assets will continue to be available until the next deployment.	2023-09-11 12:18:18 +01:00
Donal McBreen	cd02510d0f	Output one mount per line The go template was concatenating all the mounts into one line. It happened to work because the mount we are interested in was always first. Fix it to output one mount per line instead.	2023-09-07 15:20:50 +01:00
Donal McBreen	8a41d15b69	Zero downtime deployment with cord file Currently, when replacing a container, we: 1. Boot the new container 2. Wait for it to become healthy 3. Stop the old container Traefik will send requests to the old container until it notices that it is unhealthy. But it may have stopped serving requests before that point which can result in errors. To get around that, the new boot process is: 1. Create a directory with a single file on the host 2. Boot the new container, mounting the cord file into /tmp and including a check for the file in the docker healthcheck 3. Wait for it to become healthy 4. Delete the healthcheck file ("cut the cord") for the old container 5. Wait for it to become unhealthy and give Traefik a couple of seconds to notice 6. Stop the old container The extra steps ensure that Traefik stops sending requests before the old container is shut down.	2023-09-06 14:35:30 +01:00
Donal McBreen	94bf090657	Copy env files to remote hosts Setting env variables in the docker arguments requires having them on the deploy host. Instead we'll add two new commands `kamal env push` and `kamal env delete` which will manage copying the environment as .env files to the remote host. Docker will pick up the file with `--env-file <path-to-file>`. Env files will be stored under `<kamal run directory>/env`. Running `kamal env push` will create env files for each role and accessory, and traefik if required. `kamal envify` has been updated to also push the env files. By avoiding using `kamal envify` and creating the local and remote secrets manually, you can now avoid accessing secrets needed for the docker runtime environment locally. You will still need build secrets. One thing to note: Docker doesn't parse the environment variables in the env file, and one result of this is that you can't specify multi-line values - see https://github.com/moby/moby/issues/12997. We may need to look at docker config or docker secrets longer term to get around this. Hat tip to @kevinmcconnell - this was all his idea.	2023-09-06 14:33:13 +01:00
Donal McBreen	adc7173cf2	Merge pull request #437 from basecamp/kamal-run-directory Configurable Kamal directory	2023-09-06 14:31:07 +01:00
Krzysztof Adamski	bbcc90e4d1	Configurable Healthcheck Expose Port	2023-09-05 10:53:32 +02:00
Donal McBreen	787688ea08	kamal -> .kamal	2023-08-28 17:13:52 +01:00
Donal McBreen	bcfa1d83e8	Configurable Kamal directory To avoid polluting the default SSH directory with lots of Kamal config, we'll default to putting them in a `kamal` subdirectory. But also make the directory configurable with the `run_directory` key, so for example you can set it as `/var/run/kamal/`. The directory is created during bootstrap or before any command that will need to access a file.	2023-08-28 16:32:18 +01:00
Trevor Vallender	c2ec04f8c1	Allow Traefik to run without publishing port Adds the `publish` option which, if set to false, does not pass `--publish` to `docker run` when starting Traefik. This is useful when running Traefik behind a reverse proxy, for example.	2023-08-24 10:52:10 +01:00
David Heinemeier Hansson	c4a203e648	Rename to Kamal	2023-08-22 08:24:31 -07:00
Donal McBreen	4dd8208290	Extract versions that contains dashes The version extraction assumed that the version is everything after the last `-` in the container name. This doesn't work if you deploy a non-MRSK generated version that contains a `-`. To fix we'll generate the non version prefix and strip it off. In some places for this to work we need to make sure to pass the role through. Fixes: https://github.com/mrsked/mrsk/issues/402	2023-08-08 14:16:32 +01:00
Bruno Prieto	cbd180205d	Include role options when executing commands	2023-07-24 17:45:24 +02:00
Igor Alexandrov	e6ca270537	Include service name to lock details	2023-07-15 21:50:39 +04:00
David Heinemeier Hansson	08d8790851	Merge pull request #337 from igor-alexandrov/feature/cache Support for Docker multistage build cache	2023-06-20 11:38:46 +02:00
Igor Alexandrov	aa28ee0f3e	Introduce Native::Cached builder	2023-06-18 22:45:04 +04:00
Matt Robinson	21b13bf8d3	Add support for proxy_command to run_over_ssh	2023-06-16 08:22:10 -03:00
Igor Alexandrov	d9b3fac17a	Added ability to override default Traefik command line arguments	2023-06-15 15:41:20 +04:00
David Heinemeier Hansson	5a25f073f7	Merge pull request #320 from jsoref/spelling Spelling	2023-05-31 17:59:18 +02:00
David Heinemeier Hansson	c8f521c0e8	Merge pull request #323 from basecamp/prefix-docker-host-with-real-host Prefix container hostname with the underlying one	2023-05-31 17:58:55 +02:00
Donal McBreen	28d6a131a9	Prefix container hostname with the underlying one To make it easier to identify where a docker container is running, prefix its hostname with the underlying one from the host. Docker chooses a 12 character random hex string by default, so we'll keep that as the suffix.	2023-05-31 16:22:25 +01:00
Donal McBreen	079d9538bb	Improve image pruning robustness If you have different images with the same git SHA, on the second deploy the tag is moved and the first image becomes untagged. It may however still be attached to an existing container. To handle this: 1. Initially prune dangling images - this will remove any untagged images that are not attached to an existing image 2. Then filter out the untagged images when deleting tagged images - any that remain will be attached to a container. The second issue is that `docker container ls -a --format '{{.Image}}'` will sometimes return the image id rather than a tag. This means that the image doesn't get filtered out when we grep to remove the active images. To fix that we'll grep against both the image id and repo:tag.	2023-05-31 10:17:52 +01:00
Josh Soref	8e94c21729	spelling: with Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com>	2023-05-29 20:46:34 -04:00
Donal McBreen	ff7a1e6726	Prune unused images correctly dangling=true doesn't prune any images, as we are not creating dangling images. Using --all should remove unused images, but it considers the Git SHA tag on the latest image to be unused (presumably because there are two tags, the SHA and latest, and the running container is only considered to be using "latest"). As a result it deletes the tag, which means that we can't roll back to that SHA later. It's a bit more complicated to only remove images that are not referenced by any containers. First we find the tags we want to keep from the containers (running and stopped). Then we append the latest tag to that list. Then we get a full list of image tags and remove those tags from that list (using `grep -v -w`). Finally we pass the tags to `docker rmi`. That either deletes the tag if there are other references to the image or both the tag and the image if it is the only one.	2023-05-25 17:16:46 +01:00
Donal McBreen	cedb8d900f	Stop containers with restarting status When stopping the old container we need to also look for ones with a restarting status.	2023-05-25 12:10:26 +01:00
Donal McBreen	3b695ae127	Add service_version and add running hook message	2023-05-23 13:56:19 +01:00
Donal McBreen	9fd184dc32	Add post-deploy and post-rollback hooks These replace the custom audit_broadcast_cmd code. An additional env variable MRSK_RUNTIME is passed to them. The audit broadcast after booting an accessory has been removed.	2023-05-23 13:56:16 +01:00
Donal McBreen	910f14e9c0	Add configuration for hooks_path	2023-05-23 13:55:04 +01:00
Donal McBreen	58c1096a90	MRSK hooks Adds hooks to MRSK. Currently just two hooks, pre-build and post-push. We could break the build and push into two separate commands if we found the need for post-build and/or pre-push hooks. Hooks are stored in `.mrsk/hooks`. Running `mrsk init` will now create that folder and add sample hook scripts. Hooks returning non-zero exit codes will abort the current command. Further potential work here: - We could replace the audit broadcast command with a post-deploy/post-rollback hook or similar - Maybe provide pre-command/post-command hooks that run after every mrsk invocation - Also look for hooks in `~/.mrsk/hooks`	2023-05-23 13:55:04 +01:00
River He	44b83151e3	Allow to inject environment variables to traefik	2023-05-10 03:18:26 +00:00
David Heinemeier Hansson	aafaee7ac8	Merge pull request #223 from basecamp/customizable-audit-broadcast Allow customizing audit broadcast with env	2023-05-05 14:30:04 +02:00
Donal McBreen	326711a3e0	Fix aggressive prune breaking rollback In the image prune command --all overrides --dangling=true. This removes the git SHA image tag for the latest image, which prevented us from rolling back to it. I've updated the integration test to now test deploy, redeploy and rollback.	2023-05-05 12:13:14 +01:00
Kevin McConnell	82be521e66	Merge branch 'main' into customizable-audit-broadcast * main: Fix staging label bug Fix typo Capture container health log when unhealthy Bump version for 0.12.0	2023-05-05 11:40:29 +01:00
Jberczel	0e19ead37c	Capture container health log when unhealthy	2023-05-03 15:03:05 -04:00
Jeremy Daer	048aecf352	Audit details (#1) Audit details * Audit logs and broadcasts accept `details` whose values are included as log tags and MRSK_* env vars passed to the broadcast command * Commands may return execution options to the CLI in their args list * Introduce `mrsk broadcast` helper for sending audit broadcasts * Report UTC time, not local time, in audit logs. Standardize on ISO 8601 format	2023-05-02 11:42:05 -07:00
David Heinemeier Hansson	88a7413b3e	Merge branch 'main' into pr/223 * main: Don't run actions twice on PRs Further distinguish dependency verification Naming Reveal configured dockerfile path Style Distinguish from server dependencies Distinguish from local dependency verification Improve clarity and intent Style Style Style Add local dependencies check Bootstrap: use multi-platform installer	2023-05-02 14:44:16 +02:00
David Heinemeier Hansson	9cc73fed9a	Merge branch 'main' into pr/223 * main: Simplify domain language to just "boot" and unscoped config keys Retain a fixed number of containers when pruning Don't assume rolling back in message Check all hosts before rolling back Ensure Traefik service name is consistent Extend traefik delay by 1 second Include traefik access logs Check if we are still getting a 404 Also dump load balancer logs Dump traefik logs when app not booted Fix missing for apt-get Report on container health after failure Fix the integration test healthcheck Allow percentage-based rolling deployments Move `group_limit` & `group_wait` under `boot` Limit rolling deployment to boot operation Allow performing boot & start operations in groups	2023-05-02 14:43:17 +02:00
David Heinemeier Hansson	b7877c59b4	Merge branch 'main' into docker-readiness	2023-05-02 14:30:35 +02:00
David Heinemeier Hansson	35b5b317af	Merge branch 'main' into pr/205 * main: Simplify domain language to just "boot" and unscoped config keys Retain a fixed number of containers when pruning Don't assume rolling back in message Check all hosts before rolling back Ensure Traefik service name is consistent Extend traefik delay by 1 second Include traefik access logs Check if we are still getting a 404 Also dump load balancer logs Dump traefik logs when app not booted Fix missing for apt-get Report on container health after failure Fix the integration test healthcheck Allow percentage-based rolling deployments Move `group_limit` & `group_wait` under `boot` Limit rolling deployment to boot operation Allow performing boot & start operations in groups	2023-05-02 14:29:06 +02:00
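
The tag selection described in commit ff7a1e6726 (keep every tag referenced by a running or stopped container, plus the latest tag, and remove the rest) can be sketched as a simple set difference. This is only an illustrative sketch, not Kamal's implementation, which the commit message says is done with shell commands (`grep -v -w` and `docker rmi`). The function name `tags_to_remove` and the sample tags are hypothetical.

```python
def tags_to_remove(all_tags, container_tags, latest_tag):
    """Return the image tags that no container references, in original order.

    all_tags:       every image tag present on the host (hypothetical input)
    container_tags: tags used by running and stopped containers
    latest_tag:     the "latest" tag, which is always kept
    """
    keep = set(container_tags)
    keep.add(latest_tag)
    # Exact matching, comparable in spirit to the whole-word grep in the commit.
    return [tag for tag in all_tags if tag not in keep]


# Hypothetical sample data for illustration only.
all_tags = ["app:latest", "app:abc123", "app:def456", "app:0ld999"]
container_tags = ["app:abc123", "app:def456"]
print(tags_to_remove(all_tags, container_tags, "app:latest"))
```

Only the tags returned here would be passed on for removal, so a SHA tag still referenced by a stopped container stays available for rollback.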


188 Commits