Upgrades to Spark 3.4/JRE 17 and fixes all high/critical CVEs #226
Conversation
Signed-off-by: Adrian Cole <[email protected]>
Trivy is clean now, so even if we don't release until 8.14 final, at least after merge we can apply the same security settings as in other repos.
Signed-off-by: Adrian Cole <[email protected]>
elastic/elasticsearch-hadoop#2187 unlocked this (releasing in elasticsearch-hadoop 8.14, but not sure when).
other notes below
@@ -150,6 +150,9 @@ public CassandraDependenciesJob build() {
      df.setTimeZone(TimeZone.getTimeZone("UTC"));
      this.dateStamp = df.format(new Date(builder.day));
      this.conf = new SparkConf(true).setMaster(builder.sparkMaster).setAppName(getClass().getName());
      if (builder.sparkMaster.startsWith("local[")) {
        conf.set("spark.driver.bindAddress", "127.0.0.1");
This is a Spark 3.4 thing.
Basically, Spark tries to detect the bind address from the hostname, which isn't needed for local mode anyway.
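The guard in the diff keys off the Spark master URL's `local[` prefix. A minimal standalone sketch of that check (the `isLocalMaster` helper name is hypothetical, not part of the project's API):

```java
// Sketch: decide whether to pin the driver bind address to loopback.
// Spark local masters look like "local[4]" or "local[*]"; cluster masters
// look like "spark://host:7077". This mirrors the PR's startsWith check.
public class BindAddressCheck {
  static boolean isLocalMaster(String sparkMaster) {
    return sparkMaster.startsWith("local[");
  }

  public static void main(String[] args) {
    System.out.println(isLocalMaster("local[*]"));          // local mode
    System.out.println(isLocalMaster("spark://host:7077")); // cluster mode
  }
}
```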
Not sure it is worth looking into, but InetAddress.getLocalHost().getHostAddress() may be a more reliable option (e.g. if the host uses IPv6 only).
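For reference, a self-contained sketch of that suggestion. What it returns (loopback vs. a routable address, IPv4 vs. IPv6) depends on the host's name resolution setup, so treat the output as environment-specific; the fallback to `127.0.0.1` is an assumption for illustration:

```java
import java.net.InetAddress;
import java.net.UnknownHostException;

// Sketch of the reviewer's suggestion: resolve the local host's address
// instead of hard-coding 127.0.0.1. On an IPv6-only host this can yield
// an IPv6 literal, which a hard-coded IPv4 loopback would not cover.
public class LocalBindAddress {
  static String resolveBindAddress() {
    try {
      return InetAddress.getLocalHost().getHostAddress();
    } catch (UnknownHostException e) {
      // Fall back to IPv4 loopback when the hostname cannot be resolved.
      return "127.0.0.1";
    }
  }

  public static void main(String[] args) {
    System.out.println(resolveBindAddress());
  }
}
```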
        .values()
        .map(DEPENDENCY_LINK_JSON);
    JavaRDD<Map<String, Object>> links;
    try (JavaSparkContext sc = new JavaSparkContext(conf)) {
This is just polish: we can use try-with-resources with some of the drivers.
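JavaSparkContext implements AutoCloseable, which is what lets the diff wrap it in try-with-resources. A dependency-free sketch of the same idiom, using a hypothetical stand-in class so it compiles without Spark on the classpath:

```java
// Sketch of the try-with-resources idiom used in the diff. JavaSparkContext
// is replaced by a hypothetical FakeContext so this runs without Spark.
public class TryWithResourcesDemo {
  static final StringBuilder LOG = new StringBuilder();

  // Stand-in for JavaSparkContext: any AutoCloseable works with the idiom.
  static class FakeContext implements AutoCloseable {
    FakeContext() { LOG.append("open;"); }
    void run() { LOG.append("run;"); }
    @Override public void close() { LOG.append("close;"); }
  }

  public static void main(String[] args) {
    // close() is called automatically when the block exits, even on exception,
    // so the context is always stopped without an explicit finally block.
    try (FakeContext sc = new FakeContext()) {
      sc.run();
    }
    System.out.println(LOG); // open;run;close;
  }
}
```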
@@ -0,0 +1,14 @@
# Set everything to be logged to the console
This is also a Spark 3.4 thing (a Log4j 2 config, not a log4j 1.2 one).
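For context, Spark 3.x ships a Log4j 2 properties template (conf/log4j2.properties.template) rather than the old log4j 1.2 syntax. A minimal console config in that style looks roughly like this sketch (not the PR's exact file):

```properties
# Minimal Log4j 2 properties config in the style Spark 3.4 expects;
# log4j 1.2's "log4j.rootLogger=..." syntax no longer applies.
rootLogger.level = info
rootLogger.appenderRef.stdout.ref = console

appender.console.type = Console
appender.console.name = console
appender.console.target = SYSTEM_ERR
appender.console.layout.type = PatternLayout
appender.console.layout.pattern = %d{yy/MM/dd HH:mm:ss} %p %c{1}: %m%n
```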
@@ -187,6 +224,53 @@
        <type>pom</type>
        <scope>import</scope>
      </dependency>

      <!-- CVE fix versions -->
Keeping the build free of CVEs will be difficult, but at least for once it is clean ;)
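The pattern behind the "CVE fix versions" section is Maven's dependencyManagement pinning: declaring a fixed version so vulnerable transitive dependencies are overridden project-wide. A generic sketch of the shape; the groupId, artifactId, and version below are placeholders, not the PR's actual pins:

```xml
<!-- Sketch: pin a transitive dependency to a CVE-fixed version via
     dependencyManagement. Coordinates here are hypothetical placeholders. -->
<dependencyManagement>
  <dependencies>
    <dependency>
      <groupId>com.example</groupId>
      <artifactId>example-lib</artifactId>
      <version>1.2.3</version> <!-- hypothetical CVE-fix version -->
    </dependency>
  </dependencies>
</dependencyManagement>
```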
some big wins from merging this include:
Oh yeah, I spent so much time doing this I forgot why: I was trying to polish this up prior to adding dependencies to Helm. There was a point where I thought maybe we would need to rewrite the entire thing (like in Beam) to solve the revlock. I'm glad it didn't get that far.
openzipkin/zipkin#3763 covers the zipkin changes we can now do.
Signed-off-by: Adrian Cole <[email protected]>
Signed-off-by: Adrian Cole <[email protected]>
exec java ${JAVA_OPTS} -Djava.io.tmpdir=/tmp -cp classes zipkin2.dependencies.ZipkinDependenciesJob $@
# Spark 3.4 module config from:
# https://github.com/apache/spark/blob/branch-3.4/launcher/src/main/java/org/apache/spark/launcher/JavaModuleOptions.java#L29
exec java ${JAVA_OPTS} -Djava.io.tmpdir=/tmp \
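Spark's JavaModuleOptions lists the JPMS flags Spark needs on JRE 17, since it reflects into JDK internals that are closed by default. A sketch of what the expanded exec line ends up looking like; this is a representative subset, and the authoritative flag list is the JavaModuleOptions.java file linked above:

```shell
# Sketch of the expanded launcher line for Spark 3.4 on JRE 17.
# Representative --add-opens subset; copy the full list from
# JavaModuleOptions.java rather than relying on this excerpt.
exec java ${JAVA_OPTS} -Djava.io.tmpdir=/tmp \
  -XX:+IgnoreUnrecognizedVMOptions \
  --add-opens=java.base/java.lang=ALL-UNNAMED \
  --add-opens=java.base/java.nio=ALL-UNNAMED \
  --add-opens=java.base/java.util=ALL-UNNAMED \
  --add-opens=java.base/sun.nio.ch=ALL-UNNAMED \
  -cp classes zipkin2.dependencies.ZipkinDependenciesJob "$@"
```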
👍
Thanks for the look, folks!
I manually tested this on all three storage types in Docker as well (using zipkin's docker/examples instructions).