KAFKA-16738: Returns BaseRecords instead of MemoryRecords #15935

luozhenyu · 2024-05-13T15:16:39Z

Related to https://issues.apache.org/jira/browse/KAFKA-16738

We can write a record that is a subtype of BaseRecords, but we cannot read one back as a subtype of BaseRecords. If we change the return type of Readable#readRecords from MemoryRecords to BaseRecords, implementations can override readRecords and return any subtype of BaseRecords.

We know that MemoryRecords is based on the JDK's ByteBuffer. We are developing a netty-based project (kroxylicious) and want to create a subtype of BaseRecords that works like MemoryRecords but is backed by netty's ByteBuf.
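
To illustrate what the wider return type would allow, here is a minimal sketch built on simplified stand-ins rather than the real Kafka or netty classes. The `BaseRecords`, `MemoryRecords`, and `Readable` types below only mimic the shape of their Kafka namesakes, `ArrayBackedRecords` and `CustomReadable` are hypothetical names, and a plain `byte[]` stands in for netty's ByteBuf.

```java
import java.nio.ByteBuffer;
import java.util.Arrays;

// Simplified stand-ins for illustration only; NOT the real org.apache.kafka classes.
interface BaseRecords {
    int sizeInBytes();
}

final class MemoryRecords implements BaseRecords {
    private final ByteBuffer buffer;
    MemoryRecords(ByteBuffer buffer) { this.buffer = buffer; }
    @Override public int sizeInBytes() { return buffer.remaining(); }
}

// Hypothetical subtype; the byte[] is a placeholder for a netty ByteBuf.
final class ArrayBackedRecords implements BaseRecords {
    private final byte[] bytes;
    ArrayBackedRecords(byte[] bytes) { this.bytes = bytes; }
    @Override public int sizeInBytes() { return bytes.length; }
}

interface Readable {
    ByteBuffer readByteBuffer(int length);

    // With the proposed change, the declared return type is BaseRecords.
    default BaseRecords readRecords(int length) {
        return new MemoryRecords(readByteBuffer(length));
    }
}

// Hypothetical reader that produces its own records type.
final class CustomReadable implements Readable {
    private final byte[] source;
    private int position;

    CustomReadable(byte[] source) { this.source = source; }

    @Override public ByteBuffer readByteBuffer(int length) {
        ByteBuffer slice = ByteBuffer.wrap(source, position, length).slice();
        position += length;
        return slice;
    }

    // Covariant override: legal only because the interface now declares BaseRecords.
    // Against a MemoryRecords return type this method would not compile.
    @Override public ArrayBackedRecords readRecords(int length) {
        byte[] copy = Arrays.copyOfRange(source, position, position + length);
        position += length;
        return new ArrayBackedRecords(copy);
    }
}
```

Callers that hold a plain `Readable` still receive a `BaseRecords`, so any code that needs `MemoryRecords` specifically would have to check or cast, which is the concern raised in the review below.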

Committer Checklist (excluded from commit message)

Verify design and implementation
Verify test coverage and CI build status
Verify documentation (including upgrade notes)

chia7712 · 2024-05-13T16:30:48Z

clients/src/main/java/org/apache/kafka/common/protocol/Readable.java

@@ -53,7 +54,7 @@ default List<RawTaggedField> readUnknownTaggedField(List<RawTaggedField> unknown
        return unknowns;
    }

-    default MemoryRecords readRecords(int length) {
+    default BaseRecords readRecords(int length) {


In Kafka serialization, we assume the implementation of BaseRecords is MemoryRecords. For example:

    @Override
    public PartitionProduceData duplicate() {
        PartitionProduceData _duplicate = new PartitionProduceData();
        _duplicate.index = index;
        if (records == null) {
            _duplicate.records = null;
        } else {
            _duplicate.records = MemoryRecords.readableRecords(((MemoryRecords) records).buffer().duplicate());
        }
        return _duplicate;
    }

Hence, I'm not sure how this change works if you introduce a non-MemoryRecords impl.

Hard-coding the duplication of MemoryRecords is not elegant, so I added a duplicate method to BaseRecords. What do you think?

    @Override
    public PartitionProduceData duplicate() {
        PartitionProduceData _duplicate = new PartitionProduceData();
        _duplicate.index = index;
        if (records == null) {
            _duplicate.records = null;
        } else {
            _duplicate.records = records.duplicate();
        }
        return _duplicate;
    }
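
The difference between the two generated `duplicate()` variants can be sketched with simplified stand-ins. None of the types below are the real Kafka classes: `BaseRecords.duplicate()` models the method proposed in this thread, `ArrayRecords` is a hypothetical non-ByteBuffer implementation, and `PartitionData` is a cut-down placeholder for the generated message class.

```java
import java.nio.ByteBuffer;

// Simplified stand-in; duplicate() models the method proposed in this PR.
interface BaseRecords {
    int sizeInBytes();
    BaseRecords duplicate();
}

final class MemoryRecords implements BaseRecords {
    private final ByteBuffer buffer;
    MemoryRecords(ByteBuffer buffer) { this.buffer = buffer; }
    @Override public int sizeInBytes() { return buffer.remaining(); }
    // Shares the underlying bytes but has an independent position and limit.
    @Override public MemoryRecords duplicate() { return new MemoryRecords(buffer.duplicate()); }
}

// Hypothetical implementation that is not backed by a ByteBuffer.
final class ArrayRecords implements BaseRecords {
    private final byte[] bytes;
    ArrayRecords(byte[] bytes) { this.bytes = bytes; }
    @Override public int sizeInBytes() { return bytes.length; }
    // Shares the same array, mirroring the shallow copy above.
    @Override public ArrayRecords duplicate() { return new ArrayRecords(bytes); }
}

// Cut-down placeholder for a generated message class.
final class PartitionData {
    int index;
    BaseRecords records;

    // Mirrors the existing generated code: assumes MemoryRecords and casts.
    PartitionData duplicateWithCast() {
        PartitionData copy = new PartitionData();
        copy.index = index;
        copy.records = records == null ? null : ((MemoryRecords) records).duplicate();
        return copy;
    }

    // Mirrors the proposed generated code: delegates to the interface method.
    PartitionData duplicate() {
        PartitionData copy = new PartitionData();
        copy.index = index;
        copy.records = records == null ? null : records.duplicate();
        return copy;
    }
}
```

With an `ArrayRecords` instance in `records`, `duplicateWithCast()` fails with a `ClassCastException`, while `duplicate()` works for any implementation. The sketch says nothing about other places in the code base that may also assume `MemoryRecords`.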

Could you file a KIP so that we can see the whole picture? MemoryRecords is used throughout the code base, so we need to be careful about changing that.

chia7712 reviewed May 13, 2024

luozhenyu force-pushed the read-records branch from 4f67367 to 2880cca · May 14, 2024 07:08

KAFKA-16738: Returns BaseRecords instead of MemoryRecords

f526723

luozhenyu force-pushed the read-records branch from 2880cca to f526723 · May 14, 2024 07:16
