
[#2173] feat(remote merge): support netty for remote merge. #2202

Open · wants to merge 1 commit into master
Conversation

zhengchenyu
Collaborator

What changes were proposed in this pull request?

Support Netty for remote merge. Use direct ByteBuf in place of byte[] when Netty is enabled, and optimize the code structure to avoid memory leaks.
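The core idea of the change is moving shuffle payloads into direct (off-heap) buffers. A minimal dependency-free sketch of that idea, using java.nio.ByteBuffer in place of Netty's ByteBuf (the PR itself uses ByteBuf; the class and method names here are illustrative, not Uniffle code):

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;

// Sketch: routing shuffle bytes through a direct (off-heap) buffer instead
// of a heap byte[]. Illustrative only; the PR uses Netty's ByteBuf, which
// additionally requires an explicit release() to avoid off-heap leaks --
// hence the code restructuring around buffer lifecycle mentioned above.
public class DirectBufferSketch {

  // Copy a payload via a direct buffer, as a Netty transport would.
  public static byte[] copyThroughDirect(byte[] payload) {
    ByteBuffer direct = ByteBuffer.allocateDirect(payload.length);
    direct.put(payload);   // write into off-heap memory
    direct.flip();         // switch the buffer to read mode
    byte[] out = new byte[direct.remaining()];
    direct.get(out);       // read back for the consumer
    return out;
  }

  public static void main(String[] args) {
    byte[] in = "record".getBytes(StandardCharsets.UTF_8);
    byte[] out = copyThroughDirect(in);
    System.out.println(new String(out, StandardCharsets.UTF_8)); // prints "record"
  }
}
```

With a real ByteBuf the allocation would come from a pooled allocator and the buffer must be released on every code path, which is why leak-safe structure matters as much as the buffer type.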

Why are the changes needed?

Fix: #2173

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Unit tests, integration tests, and a real job in a cluster.

@zhengchenyu zhengchenyu marked this pull request as draft October 18, 2024 13:01

github-actions bot commented Oct 18, 2024

Test Results

2 926 files ±0 · 2 926 suites ±0 · 6h 13m 57s ⏱️ +3m 21s
1 088 tests +39 · 1 086 ✅ +39 · 2 💤 ±0 · 0 ❌ ±0
13 630 runs +585 · 13 600 ✅ +585 · 30 💤 ±0 · 0 ❌ ±0

Results for commit 3a68303. ± Comparison against base commit 43323bb.

This pull request removes 20 and adds 59 tests. Note that renamed tests count towards both.
org.apache.uniffle.common.records.RecordsReaderWriterTest ‑ testWriteAndReadRecordFile1{String, File}[1]
org.apache.uniffle.common.records.RecordsReaderWriterTest ‑ testWriteAndReadRecordFile1{String, File}[2]
org.apache.uniffle.common.records.RecordsReaderWriterTest ‑ testWriteAndReadRecordFile2{String, File}[1]
org.apache.uniffle.common.records.RecordsReaderWriterTest ‑ testWriteAndReadRecordFile2{String, File}[2]
org.apache.uniffle.common.records.RecordsReaderWriterTest ‑ testWriteAndReadRecordFile3{String, File}[1]
org.apache.uniffle.common.records.RecordsReaderWriterTest ‑ testWriteAndReadRecordFile3{String, File}[2]
org.apache.uniffle.common.records.RecordsReaderWriterTest ‑ testWriteAndReadRecordFile4{String, File}[1]
org.apache.uniffle.common.records.RecordsReaderWriterTest ‑ testWriteAndReadRecordFile4{String, File}[2]
org.apache.uniffle.common.serializer.PartialInputStreamTest ‑ testReadFileInputStream
org.apache.uniffle.common.serializer.PartialInputStreamTest ‑ testReadMemroyInputStream
…
org.apache.uniffle.common.merger.MergerTest ‑ testMergeSegmentToFile{String, File}[2]
org.apache.uniffle.common.merger.MergerTest ‑ testMergeSegmentToFile{String, File}[3]
org.apache.uniffle.common.merger.MergerTest ‑ testMergeSegmentToFile{String, File}[4]
org.apache.uniffle.common.netty.protocol.NettyProtocolTest ‑ testGetSortedShuffleDataRequest
org.apache.uniffle.common.netty.protocol.NettyProtocolTest ‑ testGetSortedShuffleDataResponse
org.apache.uniffle.common.records.RecordsReaderWriterTest ‑ testWriteAndReadRecordFileUseDirect{String, File}[1]
org.apache.uniffle.common.records.RecordsReaderWriterTest ‑ testWriteAndReadRecordFileUseDirect{String, File}[2]
org.apache.uniffle.common.records.RecordsReaderWriterTest ‑ testWriteAndReadRecordFile{String, File}[1]
org.apache.uniffle.common.records.RecordsReaderWriterTest ‑ testWriteAndReadRecordFile{String, File}[2]
org.apache.uniffle.common.records.RecordsReaderWriterTest ‑ testWriteAndReadRecordFile{String, File}[3]
…


@zhengchenyu zhengchenyu deleted the issue-2173 branch October 21, 2024 06:52
@zhengchenyu zhengchenyu reopened this Oct 21, 2024
@zhengchenyu zhengchenyu marked this pull request as ready for review October 21, 2024 07:07
@zhengchenyu
Collaborator Author

@jerqi Can you please review this PR?

if (raw) {
  return new RawWritableSerializationStream(this, output);
}
if (shared) {
Contributor

Why do we use a shared serialization stream?

Collaborator Author

On the client side, the parsed record will be used by reduce, so we need a deep-copy instance: every record has its own buffer. If we used a shared buffer, errors would occur.

But on the server side, we write the record to the merged block immediately after parsing it, so there is no need for each record to have a separate memory copy. For a segment/block we can use only two shared buffers, which saves a lot of memory.

BTW, although this PR is about Netty, a lot of the work is actually about saving memory.
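The deep-copy vs. shared-buffer trade-off described above can be sketched as follows (illustrative names only, not the actual Uniffle classes):

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Sketch of the two strategies: "deep" gives every record its own buffer
// (client side, records are retained by reduce), while "shared" reuses one
// buffer and is only safe when each record is consumed immediately after
// parsing (server-side merge). Illustrative code, not Uniffle's.
public class RecordBufferSketch {

  // Deep copy: records can be retained safely; one allocation per record.
  public static List<byte[]> readDeep(byte[][] segment) {
    List<byte[]> records = new ArrayList<>();
    for (byte[] r : segment) {
      records.add(Arrays.copyOf(r, r.length)); // private buffer per record
    }
    return records;
  }

  // Shared buffer: each record is written through (here: lengths summed)
  // right after parsing, so a single reusable buffer suffices.
  public static int mergeWithSharedBuffer(byte[][] segment) {
    byte[] shared = new byte[0];
    int mergedBytes = 0;
    for (byte[] r : segment) {
      if (shared.length < r.length) {
        shared = new byte[r.length];       // grow the one shared buffer
      }
      System.arraycopy(r, 0, shared, 0, r.length);
      mergedBytes += r.length;             // "write to merged block" at once
    }
    return mergedBytes;
  }
}
```

If a caller retained a reference into the shared buffer across iterations, the next record would overwrite it, which is exactly the client-side error the deep copy avoids.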

Contributor

The name confuses me: SharedSerializationStream sounds like it is operated by multiple threads, which would require a lock.

Collaborator Author

@zhengchenyu zhengchenyu Oct 22, 2024

A SharedSerializationStream corresponds to one block. It is not accessed by multiple threads; it only runs under the merge thread for one partition. Here, 'shared' means it uses a shared buffer to merge.

Contributor

Could we give a better name for it?

Collaborator Author

> Could we give a better name for it?

In fact, I used 'shallow' as the name before, but changed it to 'shared'.
Compared with RawWritableDeserializationStream, SharedRawWritableDeserializationStream stores records in a shared buffer, while RawWritableDeserializationStream allocates a new buffer for each record.
If a change is needed, we could rename RawWritableDeserializationStream to DeepRawWritableDeserializationStream and SharedRawWritableDeserializationStream to ShallowRawWritableDeserializationStream. How about that?

Contributor

BufferDeserializationStream and PartitionDeserializationStream may be better names.

Collaborator Author

We have three streams:

  • WritableSerializationStream
  • RawWritableDeserializationStream
  • SharedRawWritableDeserializationStream

WritableSerializationStream is used to parse bytes into Java objects, mainly on the reduce side of Tez and Spark.

RawWritableDeserializationStream copies bytes directly without doing any actual deserialization. It is mainly used on the reduce side of MR, because MR requires the raw interface.

SharedRawWritableDeserializationStream is similar to RawWritableDeserializationStream, but applies some memory optimizations. It is mainly used for server-side merge, where no deserialization is needed.

The Raw prefix means that bytes are copied directly without unnecessary serialization, so I think it should not be dropped.

Given that, I think the WritableSerializationStream and RawWritableDeserializationStream names can stay unchanged, and SharedRawWritableDeserializationStream can be renamed to BufferRawDeserializationStream. How about this?
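The division of labor among the three streams, and the raw/shared selection logic from the reviewed diff excerpt, can be sketched like this (class names follow the discussion; the bodies are placeholders, not the actual Uniffle implementations):

```java
// Sketch of how the three streams divide responsibilities. Names mirror the
// discussion above; role() strings and the select() helper are illustrative.
public class StreamSelectionSketch {

  public interface SerializationStream { String role(); }

  // Parses bytes into Java objects; reduce side of Spark/Tez.
  public static class WritableSerializationStream implements SerializationStream {
    public String role() { return "deserialize-to-objects"; }
  }

  // Raw byte copy, one buffer per record; reduce side of MR (raw interface).
  public static class RawWritableDeserializationStream implements SerializationStream {
    public String role() { return "raw-copy-per-record-buffer"; }
  }

  // Raw byte copy through a shared buffer; server-side merge.
  public static class SharedRawWritableDeserializationStream implements SerializationStream {
    public String role() { return "raw-copy-shared-buffer"; }
  }

  // Simplified mirror of the if (raw) ... if (shared) selection in the diff.
  public static SerializationStream select(boolean raw, boolean shared) {
    if (raw) {
      if (shared) {
        return new SharedRawWritableDeserializationStream();
      }
      return new RawWritableDeserializationStream();
    }
    return new WritableSerializationStream();
  }
}
```

Under the proposed rename, only the third class would become BufferRawDeserializationStream; the selection logic is unchanged.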

Successfully merging this pull request may close these issues.

[FEATURE] [remote merge] support netty for remote merge.