Improve ByteBufPrimitiveCodec readBytes #1617

chibenwa · 2022-11-08T10:26:29Z

-> Remove a needless call to ByteBuff::hasArray

-> Removes needless slicing

This method is always used on sliced ByteBuf (as the first 4 bytes old the overall size) so the payload will never match the underlying buffer's array thus defeats the purpose of the optimization and end up costing CPU cycles. On a typical application this represent ~0.4% of the CPU

Directly read the source ByteBuf while checking for content length. Reader index are similarily moved. On a typical application this takes ~0.88% of CPU

absurdfarce · 2022-11-15T22:30:20Z

core/src/main/java/com/datastax/oss/driver/internal/core/protocol/ByteBufPrimitiveCodec.java

-      // Move the readerIndex just so we consistently consume the input
-      buffer.readerIndex(buffer.writerIndex());
-      return buffer.array();
-    }


@chibenwa I'm not sure I understand. Is there a reason why we don't want to use the underlying byte array directly if we can get access to it?

Because the conditions for reusing the array are never fullfilled.

The buffer layout is 4 bytes size, byyyyyyyyyyyyytes so we would expect the underlying array to match (best case) this layout.

However reading the size means that the leading 4 bytes are read first, and then the payload never matches the underlying array.

Checking if this optimization is doable is impactful enough to be removing it.

Clearer?

Apologies @chibenwa, I've been out for the holiday here... just getting back to this.

Yeah, your explanation makes sense, thanks for the additional context. I'm reluctant to give up the re-use of the backing array, though, so I put together an alternate implementation which preserves that feature. What do you think of something in that vein?

The alternate proposal never went anywhere so we're moving forward with the original proposal from @chibenwa

chibenwa · 2022-12-02T02:55:47Z

I added some tests to better specify the expected behaviour of readBytes and removed the needless test.

chibenwa added 3 commits November 8, 2022 17:08

ByteBufPrimitiveCodec: avoid slicing in readBytes

489eae1

Directly read the source ByteBuf while checking for content length. Reader index are similarily moved. On a typical application this takes ~0.88% of CPU

Merge branch '4.x' into read-bytes

3d3fe81

absurdfarce reviewed Nov 15, 2022

View reviewed changes

Merge branch '4.x' into read-bytes

fdd54b5

absurdfarce mentioned this pull request Dec 1, 2022

Alternate idea to PR 1617 #1621

Closed

chibenwa added 2 commits December 2, 2022 09:54

Test IndexOutOfBound and remove uneeded check

4e96a87

ByteBufPrimitiveCodecTest: more tests

bd7cfc5

absurdfarce merged commit 4d6e2e7 into apache:4.x Aug 21, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve ByteBufPrimitiveCodec readBytes #1617

Improve ByteBufPrimitiveCodec readBytes #1617

chibenwa commented Nov 8, 2022

absurdfarce Nov 15, 2022

chibenwa Nov 16, 2022

absurdfarce Dec 1, 2022

absurdfarce Aug 21, 2023

chibenwa commented Dec 2, 2022

Improve ByteBufPrimitiveCodec readBytes #1617

Improve ByteBufPrimitiveCodec readBytes #1617

Conversation

chibenwa commented Nov 8, 2022

absurdfarce Nov 15, 2022

Choose a reason for hiding this comment

chibenwa Nov 16, 2022

Choose a reason for hiding this comment

absurdfarce Dec 1, 2022

Choose a reason for hiding this comment

absurdfarce Aug 21, 2023

Choose a reason for hiding this comment

chibenwa commented Dec 2, 2022