I've been using Claude Code for our internal GitLab pipelines for a while now and wanted to write up what the workflow actually looks like in practice. The service below is one of our APIs, and the pipeline is a fairly standard build-scan-push setup. Nothing exotic, but enough surface area to show where the tool earns its keep and where it doesn't.
Pipeline rewrites on our older services tend to start from roughly the same place: a Dockerfile that works on someone's laptop, a Jenkinsfile or half-broken .gitlab-ci.yml that nobody really trusts, and the perennial desire to stop babysitting the thing through every run. For a container pipeline of this shape our tooling choices don't move around much anymore. Rootless BuildKit for the build (no daemon, no privileged runner, plays well with our fleet), Trivy for scanning, Skopeo when we want to retag without pulling the image back through a runner, and GitLab's rules: and needs: to keep the DAG honest.
The API has been around for a while. Its current build is a Jenkinsfile with a single stage: docker build then docker push, and that's the whole file. Nothing scans the image. Nothing caches layers. Nothing runs in parallel. We're moving the repo onto our self-managed GitLab instance and this is the moment to finally add the things we've been meaning to add for a year: scanning with a hard fail on HIGH and CRITICAL, proper build caching, and a retag step so :latest only ever points at something that has actually passed scan.
After cloning, I start Claude Code in the repo root:
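(Nothing special about the invocation itself; the repo directory name is a placeholder.)

```shell
cd service-api   # placeholder for wherever the repo was cloned
claude
```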
The trick at this point is to not immediately ask it to generate anything. You want it to read the repo first. Pipelines written blind hit something awkward on the first real run, usually one of the small things that live in a repo for a reason nobody remembers. A .dockerignore that excludes vendor/ so the build context ends up without dependencies. A Makefile injecting flags that the CI build needs to match. A cache mount in the Dockerfile that the runner's default setup doesn't know what to do with. When you write the pipeline yourself, those details are invisible because they're already in your head. Claude Code isn't in your head, so it needs a minute to go look.
My first prompt is pretty loose: build the image with rootless BuildKit, scan it with Trivy, and fail the pipeline on anything HIGH or CRITICAL.
It reads both files and comes back with something like this:
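(What follows is a reconstruction from the description rather than the verbatim output; job names and registry paths are illustrative. Both problems discussed below are visible in it: the push happens inside the build job, and Trivy has no database cache.)

```yaml
stages:
  - build
  - scan

variables:
  IMAGE: $CI_REGISTRY_IMAGE:$CI_COMMIT_SHORT_SHA

build:
  stage: build
  image:
    name: moby/buildkit:rootless
    entrypoint: [""]
  variables:
    BUILDKITD_FLAGS: --oci-worker-no-process-sandbox
  script:
    # registry auth (~/.docker/config.json) omitted for brevity
    - buildctl-daemonless.sh build
        --frontend dockerfile.v0
        --local context=. --local dockerfile=.
        --output type=image,name=$IMAGE,push=true

scan:
  stage: scan
  image:
    name: aquasec/trivy:latest
    entrypoint: [""]
  variables:
    TRIVY_USERNAME: $CI_REGISTRY_USER
    TRIVY_PASSWORD: $CI_REGISTRY_PASSWORD
  script:
    - trivy image --exit-code 1 --severity HIGH,CRITICAL $IMAGE
```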
This runs and uses the right tools. There are a couple of things wrong with it though, neither of them disasters.
The first is that the push happens before the scan, so the image is already sitting in the registry under its real tag by the time Trivy gets around to looking at it. If the scan fails, you've got a vulnerable image reachable in the registry for the minute or so it takes for the pipeline to fail out and somebody to go clean up. The fix is a staging tag and a separate promote stage, which Claude Code is perfectly happy to write if you ask for it but doesn't volunteer.
The second is no Trivy DB cache. Every pipeline pulls the full vulnerability database from scratch, which adds a minute or two to every run and sometimes runs into rate limits on the DB mirror at the worst possible time. Also easy to fix.
Neither of these is surprising. The first pass Claude Code gives me on a pipeline almost always looks like this: usable, but with a couple of obvious things to tighten up. The fixes are about shape, not design, which is honestly the situation you want to be in.
My follow-up asks for two changes: push to a staging tag with a separate promote stage, and cache the Trivy database between runs.
What comes back:
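(Again a reconstruction, not verbatim output; the image pins and the auth variable, $DOCKER_AUTH_JSON as a file-type CI/CD variable, are illustrative.)

```yaml
stages: [build, scan, promote]

variables:
  STAGING_IMAGE: $CI_REGISTRY_IMAGE/staging:$CI_COMMIT_SHORT_SHA
  RELEASE_IMAGE: $CI_REGISTRY_IMAGE:$CI_COMMIT_SHORT_SHA
  CACHE_REF: $CI_REGISTRY_IMAGE/cache:buildkit

build:
  stage: build
  image:
    name: moby/buildkit:rootless
    entrypoint: [""]
  variables:
    BUILDKITD_FLAGS: --oci-worker-no-process-sandbox
  before_script:
    - mkdir -p ~/.docker
    - cp "$DOCKER_AUTH_JSON" ~/.docker/config.json
  script:
    - buildctl-daemonless.sh build
        --frontend dockerfile.v0
        --local context=. --local dockerfile=.
        --import-cache type=registry,ref=$CACHE_REF
        --export-cache type=registry,ref=$CACHE_REF,mode=max
        --output type=image,name=$STAGING_IMAGE,push=true

.trivy:
  stage: scan
  image:
    name: aquasec/trivy:0.50.1   # pinned; the exact version is illustrative
    entrypoint: [""]
  needs: ["build"]
  variables:
    TRIVY_CACHE_DIR: .trivycache
    TRIVY_USERNAME: $CI_REGISTRY_USER
    TRIVY_PASSWORD: $CI_REGISTRY_PASSWORD
  cache:
    key: trivy-db
    paths: [.trivycache]

scan:report:      # informational pass, never blocks
  extends: .trivy
  script:
    - trivy image --exit-code 0 --severity UNKNOWN,LOW,MEDIUM $STAGING_IMAGE

scan:gate:        # hard gate
  extends: .trivy
  script:
    - trivy image --exit-code 1 --severity HIGH,CRITICAL $STAGING_IMAGE

promote:
  stage: promote
  image:
    name: quay.io/skopeo/stable:latest
    entrypoint: [""]
  needs: ["scan:gate"]
  script:
    - skopeo login -u "$CI_REGISTRY_USER" -p "$CI_REGISTRY_PASSWORD" "$CI_REGISTRY"
    - skopeo copy docker://$STAGING_IMAGE docker://$RELEASE_IMAGE
    - skopeo copy docker://$STAGING_IMAGE docker://$CI_REGISTRY_IMAGE:latest
  rules:
    - if: '$CI_COMMIT_BRANCH == $CI_DEFAULT_BRANCH'
    - if: '$CI_PIPELINE_SOURCE == "merge_request_event"'
      when: manual
```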
A couple of things about this revision are worth noting.
The --import-cache and --export-cache flags weren't in my prompt. They tell BuildKit to push its intermediate cache layers up to a dedicated cache ref under the project namespace, and on this service it saves roughly ninety seconds on a warm build. Scoping the cache under the project is the convention I'd have picked anyway, because it gets cleaned up when the project does.
Claude Code also added needs: between the stages, turning what would have been a strict sequence into a proper DAG. I didn't ask for that either, but I'm glad it's there, because the minute you start playing with rules: on individual jobs the DAG behavior starts to matter.
The scan stage got split into two passes, one that reports MEDIUM and below without failing the job and one that hard-fails on HIGH and CRITICAL. This is a pattern I like a lot, because it lets developers see what's coming at them without blocking their work today, and Claude Code reached for it on its own.
There is one thing I'd push back on. The promote job is set to when: manual on MR pipelines, which means any developer can manually promote an unscanned image to the real tag on their branch. On some of our services we actually want that as an integration-testing escape hatch, so that a colleague can pull a known-good image for cross-service testing. On others it's just a policy hole. It's a one-line change either way.
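On the services where it is a policy hole, the change is just dropping the manual MR rule so that only default-branch pipelines can promote (a sketch, assuming a rules:-based promote job):

```yaml
promote:
  rules:
    - if: '$CI_COMMIT_BRANCH == $CI_DEFAULT_BRANCH'
    # the `when: manual` rule for merge_request_event is removed,
    # so branch pipelines can never promote an unscanned image
```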
Here is the Jenkinsfile we're replacing:
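(A representative sketch rather than the verbatim file; the hostname, image path, and credentials ID are placeholders. The shape, a single build-and-push stage and nothing else, is the point.)

```groovy
pipeline {
  agent any
  stages {
    stage('Build and Push') {
      steps {
        withCredentials([usernamePassword(credentialsId: 'nexus-creds',
                                          usernameVariable: 'NEXUS_USER',
                                          passwordVariable: 'NEXUS_PASS')]) {
          sh '''
            docker login -u $NEXUS_USER -p $NEXUS_PASS nexus.example.com
            docker build -t nexus.example.com/platform/service-api:$BUILD_NUMBER .
            docker push nexus.example.com/platform/service-api:$BUILD_NUMBER
          '''
        }
      }
    }
  }
}
```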
My prompt is a straight translation request: take this Jenkinsfile and give me the equivalent .gitlab-ci.yml.
The shape of what comes back is right. BUILD_NUMBER becomes CI_PIPELINE_IID, which is GitLab's closest equivalent. The Jenkins withCredentials block becomes a pair of masked CI/CD variables. The Nexus destination gets hoisted into a REGISTRY_HOST variable so tag construction stays readable.
Where I usually have to step in on these translations is at the spots where Jenkins and GitLab don't share assumptions. Jenkins agents are long-lived and almost always have a Docker daemon available. Our self-managed GitLab runners don't, which is the whole reason rootless BuildKit is in the pipeline to begin with. If you don't tell Claude Code this, you'll sometimes see it translate docker build into a docker:dind service block. That works on GitLab.com shared runners. It falls over on our fleet. Easiest fix is to put the runner constraint in a CLAUDE.md so you aren't repeating it every session.
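The CLAUDE.md entry doing that work is only a few lines of prose; the wording here is illustrative:

```markdown
## CI constraints
- Runners are self-managed: no Docker daemon, no privileged mode, no docker:dind.
- All image builds go through rootless BuildKit (moby/buildkit:rootless
  with buildctl-daemonless.sh), never `docker build`.
- Pin every tool image to a version tag, never :latest.
```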
Jenkins post blocks are the other one that catches people. post { failure { ... } } runs at pipeline scope and can read the outcome of any earlier stage. GitLab CI doesn't have anything that does exactly that. The right way to translate it is usually a dedicated notification job at the end of the DAG using when: on_failure. Claude Code will sometimes go a different route and put the notification in an after_script:, which only sees the status of its own job and misses everything upstream. Worth a second look on any Jenkins migration.
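The working translation is a terminal job that fires when anything upstream failed; the webhook variable is a placeholder for whatever notification endpoint a team uses:

```yaml
notify:failure:
  stage: .post              # built-in final stage, runs after the rest of the DAG
  image: curlimages/curl:latest
  when: on_failure          # runs only if an earlier job in the pipeline failed
  script:
    - curl -sf -X POST --data "pipeline $CI_PIPELINE_URL failed" "$NOTIFY_WEBHOOK_URL"
```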
One of the engineers on the service had never written a GitLab pipeline before this one. Strong developer, fine with Docker, but CI config had always been somebody else's problem. The usual pattern on a team like ours is that one person writes the pipeline, everyone else copies bits of it into their own services, and nobody's totally sure later which parts actually mattered and which ones were just along for the ride.
With Claude Code she wrote the pipeline herself, pausing to ask what needs: actually does, why BuildKit wants both --import-cache and --export-cache when it feels like one should be enough, what happens if you don't pin the Trivy image. The file she shipped was basically what one of the platform engineers would have written, with the difference that she understood every line of it. Three weeks later, when the Trivy tag moved and the cache key needed a version bump, she just handled it, without paging the platform team.
Review hasn't gone away, but it's doing a different job now. Catching syntax mistakes and spotting obviously-wrong shapes happens faster because there aren't as many of them. What takes longer is the judgment stuff: cache key scope, how permissive the rules: should be, whether retag-on-promote actually matches what this particular service needs. That's the part Claude Code is worst at, because it turns on context that isn't in the repo.
The workflow I've settled into for a pipeline this size is: open Claude Code in the repo, let it read what's there, describe what I want loosely, then review the first pass for structural stuff (push-before-scan, missing caches, old syntax) and iterate from there rather than starting over. Pin image versions before I commit. Run the thing in a real branch rather than in my head. Once it works, move any runner constraints or team conventions into a CLAUDE.md so the next person on the service doesn't have to rediscover them from scratch.
For a service of this size, going from an empty .gitlab-ci.yml to something I'd merge takes me under an hour. A platform engineer writing it from scratch takes about the same. The difference isn't really speed. It's who ends up writing and owning the file. The service team does, instead of the platform team. And the small things that usually take a second or third pass to catch, like cache key scope or DAG structure or how the scan is split, tend to be there in the first draft.