-
Notifications
You must be signed in to change notification settings - Fork 14.7k
KIP-1028: AK 3.8.0 Docker Official Image Assets #16768
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -18,58 +18,54 @@ | |
|
||
FROM eclipse-temurin:21-jre-alpine AS build-jsa | ||
|
||
USER root | ||
|
||
# Get Kafka from https://archive.apache.org/dist/kafka, url passed as env var, for version 3.7.0 | ||
ENV kafka_url https://archive.apache.org/dist/kafka/3.7.0/kafka_2.13-3.7.0.tgz | ||
# Get Kafka from https://archive.apache.org/dist/kafka, url passed as env var, for version 3.8.0 | ||
ENV kafka_url https://archive.apache.org/dist/kafka/3.8.0/kafka_2.13-3.8.0.tgz | ||
|
||
COPY jsa_launch /etc/kafka/docker/jsa_launch | ||
|
||
RUN set -eux ; \ | ||
apk update ; \ | ||
apk upgrade ; \ | ||
apk add --no-cache wget gcompat gpg gpg-agent procps bash; \ | ||
mkdir opt/kafka; \ | ||
wget -nv -O kafka.tgz "$kafka_url"; \ | ||
wget -nv -O kafka.tgz.asc "$kafka_url.asc"; \ | ||
export GNUPGHOME="$(mktemp -d)"; \ | ||
gpg --batch --keyserver hkp://keys.openpgp.org --recv-keys CF9500821E9557AEB04E026C05EEA67F87749E61 || \ | ||
gpg --batch --keyserver keyserver.ubuntu.com --recv-keys CF9500821E9557AEB04E026C05EEA67F87749E61 ; \ | ||
gpg --batch --verify kafka.tgz.asc kafka.tgz; \ | ||
gpgconf --kill all; \ | ||
rm -rf "$GNUPGHOME" kafka.tgz.asc; \ | ||
mkdir opt/kafka; \ | ||
tar xfz kafka.tgz -C /opt/kafka --strip-components 1; \ | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. It's odd to mix "traditional" flags and Unix/GNU-style flags, and I'd recommend avoiding mixing them (https://manpages.debian.org/bookworm/tar/tar.1.en.html) |
||
wget -nv -O KEYS https://downloads.apache.org/kafka/KEYS; \ | ||
gpg --import KEYS; \ | ||
gpg --batch --verify kafka.tgz.asc kafka.tgz | ||
|
||
# Generate jsa files using dynamic CDS for kafka server start command and kafka storage format command | ||
RUN /etc/kafka/docker/jsa_launch | ||
# Generate jsa files using dynamic CDS for kafka server start command and kafka storage format command | ||
/etc/kafka/docker/jsa_launch | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I'm concerned about this script; it appears to start a server in the background, does not track the PID of that started server, waits for a file to exist, and then exits, and hopes that the file existing means the server has shut down successfully and cleanly, so there could absolutely be a race here where the file gets created, gets filled up partway and thus exists, the script exits, and the server is killed before it finishes writing the file. Is this actually necessary? Does it dramatically improve something like performance, startup time, etc? Is it an artifact that could be shipped with the Kafka releases instead? (This script seems to be the primary justification for the multi-stage build, and I'm sorry but I'm not seeing it. 🙈) |
||
|
||
|
||
FROM eclipse-temurin:21-jre-alpine | ||
|
||
# exposed ports | ||
EXPOSE 9092 | ||
|
||
USER root | ||
|
||
# Get Kafka from https://archive.apache.org/dist/kafka, url passed as env var, for version 3.7.0 | ||
ENV kafka_url https://archive.apache.org/dist/kafka/3.7.0/kafka_2.13-3.7.0.tgz | ||
ENV build_date 2024-06-11 | ||
# Get Kafka from https://archive.apache.org/dist/kafka, url passed as env var, for version 3.8.0 | ||
ENV kafka_url https://archive.apache.org/dist/kafka/3.8.0/kafka_2.13-3.8.0.tgz | ||
ENV build_date 2024-08-13 | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. This is not going to be accurate -- this image will be rebuilt any time the base image updates, so this and the associated |
||
|
||
|
||
LABEL org.label-schema.name="kafka" \ | ||
org.label-schema.description="Apache Kafka" \ | ||
org.label-schema.build-date="${build_date}" \ | ||
org.label-schema.vcs-url="https://github.com/apache/kafka" \ | ||
LABEL org.opencontainers.image.title="kafka" \ | ||
org.opencontainers.image.description="Apache Kafka" \ | ||
org.opencontainers.image.created="${build_date}" \ | ||
org.opencontainers.image.source="https://github.com/apache/kafka" \ | ||
maintainer="Apache Kafka" | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. See docker-library/official-images#3540, especially docker-library/official-images#3540 (comment):
(ie, |
||
|
||
RUN set -eux ; \ | ||
apk update ; \ | ||
apk upgrade ; \ | ||
apk add --no-cache wget gcompat gpg gpg-agent procps bash; \ | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Again, why |
||
mkdir opt/kafka; \ | ||
wget -nv -O kafka.tgz "$kafka_url"; \ | ||
wget -nv -O kafka.tgz.asc "$kafka_url.asc"; \ | ||
tar xfz kafka.tgz -C /opt/kafka --strip-components 1; \ | ||
wget -nv -O KEYS https://downloads.apache.org/kafka/KEYS; \ | ||
gpg --import KEYS; \ | ||
export GNUPGHOME="$(mktemp -d)"; \ | ||
gpg --batch --keyserver hkp://keys.openpgp.org --recv-keys CF9500821E9557AEB04E026C05EEA67F87749E61 || \ | ||
gpg --batch --keyserver keyserver.ubuntu.com --recv-keys CF9500821E9557AEB04E026C05EEA67F87749E61 ; \ | ||
gpg --batch --verify kafka.tgz.asc kafka.tgz; \ | ||
gpgconf --kill all; \ | ||
rm -rf "$GNUPGHOME" kafka.tgz.asc; \ | ||
mkdir opt/kafka; \ | ||
tar xfz kafka.tgz -C /opt/kafka --strip-components 1; \ | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Using a single stage would pretty dramatically decrease the amount of duplication here. |
||
mkdir -p /var/lib/kafka/data /etc/kafka/secrets; \ | ||
mkdir -p /etc/kafka/docker /usr/logs /mnt/shared/config; \ | ||
adduser -h /home/appuser -D --shell /bin/bash appuser; \ | ||
|
@@ -79,9 +75,8 @@ RUN set -eux ; \ | |
cp /opt/kafka/config/log4j.properties /etc/kafka/docker/log4j.properties; \ | ||
cp /opt/kafka/config/tools-log4j.properties /etc/kafka/docker/tools-log4j.properties; \ | ||
cp /opt/kafka/config/kraft/server.properties /etc/kafka/docker/server.properties; \ | ||
rm kafka.tgz kafka.tgz.asc KEYS; \ | ||
apk del wget gpg gpg-agent; \ | ||
apk cache clean; | ||
rm kafka.tgz; \ | ||
apk del wget gpg gpg-agent; | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. As a non-blocking suggestion, is there some simple command that could be run here to verify the installation? One we often use is |
||
|
||
COPY --from=build-jsa kafka.jsa /opt/kafka/kafka.jsa | ||
COPY --from=build-jsa storage.jsa /opt/kafka/storage.jsa | ||
|
@@ -92,4 +87,4 @@ USER appuser | |
|
||
VOLUME ["/etc/kafka/secrets", "/var/lib/kafka/data", "/mnt/shared/config"] | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Are all of these paths intended to be user-specified / persistent storage? These are all spearately paths that Kafka will write to at runtime in the default configuration and that users will have a bad time if they don't save in some persistent place/way? |
||
|
||
CMD ["/etc/kafka/docker/run"] | ||
CMD ["/etc/kafka/docker/run"] | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I'm definitely still struggling to understand the necessity of so much Docker-image-custom behavior -- what problem is being solved here that's unique to running Kafka inside a container? I can't think of very many things that would be useful for containers that wouldn't also make the life of users in things like high-constrained systemd units better, such that any logic here should probably live in proper upstream scripts / code instead. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
gcompat
is a strange inclusion here -- what in this process needsglibc
compatibility?