
Feature/instrument process node #1

Open
wants to merge 56 commits into master
Conversation

boxsterman

No description provided.

Contributor

@dpsoft dpsoft left a comment


@boxsterman LGTM, I've left a simple comment. @ivantopo WDYT?

@boxsterman
Author

@dpsoft Finally got around to addressing your feedback ... too much other work :(


@ivantopo ivantopo left a comment


Hey @boxsterman, I finally started looking into this PR. Solid work so far 🎉

I've only looked at the producer/consumer instrumentation so far, since I'm still wrapping my head around the streams part. Hopefully we can start polishing the producer/consumer side now, and I'll get back to you as soon as possible regarding the streams side 😄

Besides the minor comments on some methods, there are two other bigger things I would like to address:

Regular vs Delayed Spans

Besides the follow strategy, we should also have a setting to determine whether the generated Span should be a "regular" Span or a "delayed" Span. The idea behind delayed Spans is that they can capture additional information about processing in asynchronous scenarios, particularly how long an operation was waiting before actually being processed. I think it would go like this:

  • When delayed Spans are enabled, we could count the "wait time" as the difference between the time a message was produced and the time the "poll" method started, and the actual processing time for that Span will be the time it takes for "poll" to complete.
  • When delayed Spans are not used, the only time reflected will be the processing time, which should be equal to the time taken for the "poll" function to run (btw, this is not the case at the moment, please see the relevant comments).
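To make the two modes concrete, here is a minimal sketch of the timing model in plain Scala (java.time only; RecordTiming and its method names are assumptions for illustration, not the actual Kamon delayed-span API):

```scala
import java.time.{Duration, Instant}

// Hypothetical timing model for a single consumed record.
// produceTime: when the producer sent the record
// pollStart / pollEnd: when the consumer's poll() began and returned
final case class RecordTiming(produceTime: Instant, pollStart: Instant, pollEnd: Instant) {

  // Delayed Span: the wait time is produce -> poll start ...
  def waitTime: Duration = Duration.between(produceTime, pollStart)

  // ... and the processing time is the duration of the poll itself.
  def processingTime: Duration = Duration.between(pollStart, pollEnd)

  // Regular Span: only the processing (poll) time is reflected.
  def regularSpanDuration: Duration = processingTime
}
```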

Continuing Tracing on the Consumer Side

Once a record has been consumed, the user will want to continue doing some processing of that record, and that processing should continue the same trace started/joined by the consumer. Currently I don't see any clear way to do that, and I think there should be one. One idea that comes to mind is to inject the generated Spans into the ConsumerRecord objects via instrumentation and provide a utility class for users to extract that Span and use it as the parent of their own Spans.

What do you think about the above?

build.sbt Outdated
val scalaExtension = "io.kamon" %% "kanela-scala-extension" % "0.0.10"
val kamonCore = "io.kamon" %% "kamon-core" % "2.0.0"
val kamonTestkit = "io.kamon" %% "kamon-testkit" % "2.0.0"
val instrumentationCommon = "io.kamon" %% "kamon-instrumentation-common" % "2.0.0"

val kafkaClient = "org.apache.kafka" % "kafka-clients" % "0.11.0.0"


Is there any particular reason for not using the latest kafka clients/streams versions?

Author


Migrated to Kafka 2.3.0.

The signature of the instrumented method in the Kafka client has changed. How shall we address compatibility with older Kafka versions (pre-2.3.0)? Shall I create a dedicated module depending on Kafka 2.0.0?


Author


If we kept it all in the kamon-kafka project and used such property-based instrumentation, wouldn't that cause issues with eviction of (older) Kafka libraries on the side of the users of such a kamon-kafka lib?
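One way to sidestep the eviction concern, sketched here as a hypothetical sbt layout: one module per supported client line, with kafka-clients marked as provided so it never competes with the version the user already has on their classpath:

```scala
// Hypothetical module layout, not the actual build definition.
lazy val `kamon-kafka-clients` = (project in file("kamon-kafka-clients"))
  .settings(libraryDependencies +=
    "org.apache.kafka" % "kafka-clients" % "2.3.0" % "provided")

// Optional companion module targeting the pre-2.3.0 method signatures.
lazy val `kamon-kafka-clients-legacy` = (project in file("kamon-kafka-clients-legacy"))
  .settings(libraryDependencies +=
    "org.apache.kafka" % "kafka-clients" % "2.0.0" % "provided")
```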

ContextSerializationHelper.fromByteArray(h.value())
}.getOrElse(Context.Empty)

val span = consumerSpansForTopic.getOrElseUpdate(topic, {

Hey, I was wondering about poll-span cardinality here.
This way it will produce one span per consumed topic within a poll. Is this intentional, and what's the reasoning behind it?

Seems it could go two ways:

  • One span per poll, in which case record-specific tags (partition, key, offset) don't make sense. Any further streaming spans should link to the poll span, due to the potentially large number of child spans (the default max.poll.records is 500).
  • One span per polled record, which can be used as a root for all streaming-stage spans. This is aligned with the producer-side instrumentation, which tracks individual record send()s and ignores batching.

Author


Hi @mladens
thanks a lot for your feedback and for pointing out this span inconsistency!

I've refactored that part and now the following spans are created:

  • one span for the actual poll operation: using its "own" (existing) traceId context, this allows tracing individual poll operations and how many records they actually polled.
  • one span for each record polled: using the record's send span as parent (or as a linked span if follow-strategy = false), this enables tracing the business-related flow of the application.

This way it also becomes quite visible how Kafka consumer settings like autoCommit can influence the consumer.

Here are two examples on how the instrumentation now looks like:

Simple publish/subscribe with two messages (autoCommit=true)
client-autoCommit

Simple publish/subscribe with two messages (autoCommit=false)
client-noAutoCommit

  • solid edges: parent/child spans
  • dashed edges: linked spans

In the second example, the second record was not committed fast enough, so it popped up in the next poll operation. It will not be forwarded to the application since it is still "in flight" on the client side.
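The span structure described in this comment can be sketched as a simplified model (all names here are hypothetical; the real instrumentation builds Kamon Spans instead of these case classes):

```scala
object PollSpans {
  sealed trait Relation
  case object Child extends Relation // solid edge: parent/child span
  case object Link  extends Relation // dashed edge: linked span

  final case class SpanModel(operation: String, traceId: String, relation: Option[Relation])

  // One span for the poll itself (on its own trace), plus one span per polled
  // record: a child of the producer's send span when follow-strategy is on,
  // otherwise a span on the poll's trace that only links back to the producer.
  def spansForPoll(pollTraceId: String, producerTraceIds: List[String], followStrategy: Boolean): List[SpanModel] = {
    val pollSpan = SpanModel("poll", pollTraceId, None)
    val recordSpans = producerTraceIds.map { producerTraceId =>
      if (followStrategy) SpanModel("consumed-record", producerTraceId, Some(Child))
      else                SpanModel("consumed-record", pollTraceId, Some(Link))
    }
    pollSpan :: recordSpans
  }
}
```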

@boxsterman
Author

boxsterman commented Oct 15, 2019

Hi @ivantopo

I've added support for accessing the (completed) span of a consumer record in order to create further child spans with it. For that I've created a new mixin that only carries the span, not the whole context ... to avoid dealing with changes/updates to the context itself.

This is how the simple example looks now:
client-hasSpan

What do you think? I'll now continue by adding a "real" context mixin to the stream to properly keep that context.

* to make the span available as parent for downstream operations
*/
onSubTypesOf("org.apache.kafka.clients.consumer.ConsumerRecord")
.mixin(classOf[HasSpan.Mixin])


This should be HasContext instead of HasSpan, please see the comments on the HasSpan class.


import scala.util.Try

trait HasSpan {


Since the Context might have additional information, we cannot hide it from the users; instead of mixing in the Span we should mix in the Context that has the Span (it should be the deserialized Context with the new Span inside).
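A rough sketch of the difference, using simplified stand-ins for Kamon's Span and Context types (these are not the real ones): mixing in the Context keeps the tags and other entries available, and the Span stays one call away.

```scala
// Simplified stand-ins for the real Kamon types.
final case class Span(id: String)
final case class Context(span: Span, tags: Map[String, String])

// Hypothetical mixin: the agent would add this to ConsumerRecord at load time.
trait HasContext {
  @volatile private var _context: Context = Context(Span("empty"), Map.empty)
  def setContext(ctx: Context): Unit = _context = ctx
  def context: Context = _context
  // Convenience accessor: nothing is lost by carrying the whole Context.
  def span: Span = _context.span
}
```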


object SpanExtractionSupport {

implicit class SpanOps[K, V](cr: ConsumerRecord[K, V]) {


A few comments on this one:

  • Is there a situation in which users themselves will want to write a Context into the ConsumerRecord? Maybe I'm missing something here.
  • We need to add a method to get the Context, but we can also have one to get the Span, just because it is convenient. In that method please return just the Span, not an Option[Span]; if there is no Span in the Context, Kamon will return Span.Empty instead, and the same goes for the Context. We use these empty objects instead of returning Options of things because it provides much more consistent usage across Java/Scala/Kotlin.
  • I would recommend moving this to the Client companion object and breaking it down into two parts: methods on the companion object itself, like extractContext(consumerRecord)/extractSpan(consumerRecord), and a Syntax implicit class that does what this class does. This allows Java users to access these functions while still keeping the cool stuff for the Scala folks 😄
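The split suggested in the last bullet might look roughly like this, with a stand-in Record type instead of the real ConsumerRecord (all names here are hypothetical):

```scala
// Stand-in for a ConsumerRecord carrying a mixed-in context.
final case class Record(ctx: Map[String, String])

object Client {
  // Plain methods on the companion object, callable from Java:
  def extractContext(record: Record): Map[String, String] = record.ctx
  def extractSpan(record: Record): String =
    record.ctx.getOrElse("span", "Span.Empty") // an "empty" value, never an Option

  // Scala-only sugar layered on top of the same methods:
  implicit class Syntax(val record: Record) extends AnyVal {
    def context: Map[String, String] = Client.extractContext(record)
    def span: String = Client.extractSpan(record)
  }
}
```

With the implicit in scope, Scala users can write record.span, while Java users call Client.extractSpan(record) directly.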

@@ -4,7 +4,10 @@

kamon {
kafka {


For consistency with the rest of the modules this should be under kamon.instrumentation.kafka. BTW, same goes with the code itself! It should be under the kamon.instrumentation.kafka package.
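i.e. something along these lines (a sketch of the suggested namespace; the keys shown are placeholders):

```
kamon {
  instrumentation {
    kafka {
      # existing settings from kamon.kafka would move here, e.g.:
      # follow-strategy = true
    }
  }
}
```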

@ivantopo

@boxsterman the record should get the Context mixed in, not just the Span because there might be more interesting data in the Context that also needs to be used by the client, like Context Tags or some other Context Entries besides the Span. I just made some comments on the relevant files.

@boxsterman
Author

@ivantopo Yes, I've noticed that the HasSpan idea was not my best and somehow part of my learning curve :)
Therefore I'm only using HasContext in the newer parts ... and will replace HasSpan accordingly.

@mladens

mladens commented Dec 4, 2019

Hi @boxsterman, what are your plans regarding this PR? Do you have any more ideas/features you want to see in there? If there's anything I can do to help get this closer to release, let me know.
