
Virtual cluster wide topic id cache #11

Conversation


@SamBarker SamBarker commented Jul 19, 2023

Fixes: #2

    // TODO revisit error handling
    try {
        final CompletableFuture<String> topicNameFuture = topicUuidToNameCache.getTopicName(originalUuid);
        return topicNameFuture != null ? topicNameFuture.get(5, TimeUnit.SECONDS) : null;


This blocking in the Netty thread seems risky. If the future isn't completed, then connections on this event loop will slow to a crawl.

Maybe it's better to redundantly request the metadata while handling the request, or respond with an error message if the future isn't complete.

SamBarker (Member Author) replied:

Yeah, that's a fair call re blocking.

Should maybe switch to getNow and just error if it returns null.
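
For illustration, a minimal sketch of that getNow variant (the surrounding handler and the choice of UnknownTopicIdException are assumptions, not code from this PR):

    final CompletableFuture<String> topicNameFuture = topicUuidToNameCache.getTopicName(originalUuid);
    final String topicName = topicNameFuture != null ? topicNameFuture.getNow(null) : null;
    if (topicName == null) {
        // Resolution has not finished (or was never started): fail fast instead of
        // blocking the Netty event loop waiting on the metadata response.
        throw new UnknownTopicIdException("Topic name for " + originalUuid + " is not resolved yet");
    }
    return topicName;

Note that getNow(null) rethrows (wrapped in a CompletionException) if the future completed exceptionally, so metadata failures would still surface to the caller.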

@robobario robobario commented Jul 20, 2023:


One property I think we'll want is that the metadata fetch is re-attempted pretty quickly in case the initial one failed. I guess that's part of //TODO revisit error handling: maybe the futures should be removed from the cache, or the cache entry could be an object with a timestamp on it as well, so callers could decide to retry.
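
One possible shape for the first option (a sketch against the AsyncCache used in this PR; where exactly to hook it in, and the local names uuid and pending, are assumptions):

    // After kicking off the metadata fetch, evict futures that fail or resolve to no name,
    // so the next lookup for that uuid triggers a fresh metadata request rather than
    // reusing a stale failure.
    pending.whenComplete((topicName, throwable) -> {
        if (throwable != null || topicName == null) {
            // Remove only if this exact future is still cached, to avoid racing a newer fetch.
            topicNamesById.asMap().remove(uuid, pending);
        }
    });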

    private final AsyncCache<Uuid, String> topicNamesById;

    public TopicIdCache() {
        this(Caffeine.newBuilder().expireAfterAccess(Duration.ofMinutes(10)).buildAsync());


Any reason for expiring? Is this to keep only relevant/used mappings cached?

SamBarker (Member Author) replied:

Yeah. Can't say I gave it too much thought. I wanted to avoid the data going stale/leaking in case the proxy missed a metadata update which deleted a topic.


I suppose if a Kafka use case used short-lived topics, then this would be a concern.

SamBarker (Member Author) replied:

I think it should probably be a bounded cache as well, given we are going to have one per VirtualCluster.
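
A sketch of what the bounded version could look like (the maximumSize value and the constructor shape are illustrative assumptions, not something agreed in this PR):

    import java.time.Duration;
    import org.apache.kafka.common.Uuid;
    import com.github.benmanes.caffeine.cache.AsyncCache;
    import com.github.benmanes.caffeine.cache.Caffeine;

    public class TopicIdCache {
        private final AsyncCache<Uuid, String> topicNamesById;

        public TopicIdCache() {
            this(Caffeine.newBuilder()
                    .expireAfterAccess(Duration.ofMinutes(10))
                    .maximumSize(1_000) // bound memory, since there is one cache per VirtualCluster
                    .buildAsync());
        }

        TopicIdCache(AsyncCache<Uuid, String> topicNamesById) {
            this.topicNamesById = topicNamesById;
        }
    }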

@SamBarker SamBarker force-pushed the virtual_cluster_wide_toppicID_cache branch from 3bab12f to e130c3a on July 20, 2023 05:03
@@ -22,4 +29,9 @@ public TopicEncryptionConfig(@JsonProperty(value = IN_MEMORY_POLICY_REPOSITORY_P
    public PolicyRepository getPolicyRepository() {
        return inMemoryPolicyRepository.getPolicyRepository();
    }

    public TopicIdCache getTopicUuidToNameCache() {
        return virtualClusterToTopicUUIDToTopicNameCache.computeIfAbsent("VIRTUAL_CLUSTER_ID", (key) -> new TopicIdCache());

Is the intent here that the config will be given access to the VirtualCluster name, or its UID?

SamBarker (Member Author) replied:

Haha, you spotted my deliberate fudge. I'm currently working on https://github.com/sambarker/kroxylicious/tree/name_that_cluster. My current suspicion is it will need to be name-based, as we are leaning towards relaxed restrictions on clusterIDs.
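
A rough sketch of where that could land (a fragment of the config class from the diff above; taking the virtual cluster name as a parameter, and the ConcurrentHashMap field type, are assumptions about the linked branch rather than merged code):

    private final Map<String, TopicIdCache> virtualClusterToTopicUUIDToTopicNameCache = new ConcurrentHashMap<>();

    public TopicIdCache getTopicUuidToNameCache(String virtualClusterName) {
        // One cache per virtual cluster, keyed by its name rather than the
        // hard-coded "VIRTUAL_CLUSTER_ID" placeholder in the current diff.
        return virtualClusterToTopicUUIDToTopicNameCache.computeIfAbsent(virtualClusterName, key -> new TopicIdCache());
    }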

    final MetadataRequest metadataRequest = builder.build(builder.latestAllowedVersion());
    topicIdsToResolve.forEach(uuid -> topicNamesById.put(uuid, new CompletableFuture<>()));
    context.<MetadataResponseData> sendRequest(metadataRequest.version(), metadataRequest.data())
            .whenComplete((metadataResponseData, throwable) -> {

What path is followed when topicId is not found?

@SamBarker SamBarker (Member Author) replied Jul 20, 2023:

None yet 😁, as I haven't spun up a real cluster to work out what that would look like (this is the sort of reason it's still a draft PR).

I suspect it will need to fail the future or even just complete it with null and let it get re-queried.
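
For reference, one way the completion handler could cover both of those options (failing the futures on a transport error, completing unknown ids with null so they get re-queried). The response accessors are the standard MetadataResponseData ones, but the overall wiring is an assumption rather than this PR's implementation:

    context.<MetadataResponseData> sendRequest(metadataRequest.version(), metadataRequest.data())
            .whenComplete((metadataResponseData, throwable) -> {
                if (throwable != null) {
                    // Request failure: fail every pending future so callers can react and retry.
                    topicIdsToResolve.forEach(uuid -> {
                        final CompletableFuture<String> pending = topicNamesById.getIfPresent(uuid);
                        if (pending != null) {
                            pending.completeExceptionally(throwable);
                        }
                    });
                    return;
                }
                metadataResponseData.topics().forEach(topic -> {
                    final CompletableFuture<String> pending = topicNamesById.getIfPresent(topic.topicId());
                    if (pending != null) {
                        // An unknown topic id comes back with an error code and no usable name;
                        // completing with null leaves it to the caller to treat the id as
                        // unresolved and re-query on a later request.
                        pending.complete(topic.errorCode() == Errors.NONE.code() ? topic.name() : null);
                    }
                });
            });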

@sonarcloud sonarcloud bot commented Jul 26, 2023:

Kudos, SonarCloud Quality Gate passed! 0 Bugs, 0 Vulnerabilities, 0 Security Hotspots, 3 Code Smells, no coverage information, 0.0% duplication.

@SamBarker SamBarker closed this Jan 12, 2024

Successfully merging this pull request may close these issues.

Use a shared Topic UUID to Name cache