Add ByteDecoder for opensearch_api source to support Kafka buffer by divakarsingh · Pull Request #6879 · opensearch-project/data-prepper

divakarsingh · 2026-05-21T06:59:27Z

Description
The opensearch_api source did not work with the Kafka buffer because no ByteDecoder was registered. When
buffer.isByteBuffer() returns true, raw bytes are written to the buffer, but the consumer side had no way to reconstruct events from the NDJSON bulk
format.

This adds OpenSearchBulkByteDecoder which parses NDJSON bulk format back into Data Prepper events with correct metadata
attributes.
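
For readers unfamiliar with the NDJSON bulk format, here is a minimal sketch of the line-pairing idea: each action line is followed by a document line, except for delete, which has no document. This is not the code from this PR. BulkNdjsonSketch, BulkItem and parseAll are hypothetical names, the sketch takes a String instead of the InputStream used by the real parse method, and it pulls the action name out with a regular expression rather than a JSON parser, purely to stay dependency-free.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class BulkNdjsonSketch {

    /** Hypothetical holder for one bulk operation; not a Data Prepper type. */
    public static final class BulkItem {
        public final String action;        // first key of the action line, e.g. "index" or "delete"
        public final String actionLine;    // raw JSON of the action/metadata line
        public final String documentLine;  // raw JSON of the document, or null for delete

        BulkItem(String action, String actionLine, String documentLine) {
            this.action = action;
            this.actionLine = actionLine;
            this.documentLine = documentLine;
        }
    }

    // Captures the first key of an action line such as {"index":{"_index":"logs"}}.
    private static final Pattern ACTION_KEY = Pattern.compile("^\\s*\\{\\s*\"(\\w+)\"\\s*:");

    /** Pairs each action line with its document line and returns the operations in order. */
    public static List<BulkItem> parseAll(String body) {
        List<BulkItem> items = new ArrayList<>();
        try (BufferedReader reader = new BufferedReader(new StringReader(body))) {
            String line;
            while ((line = reader.readLine()) != null) {
                if (line.trim().isEmpty()) {
                    continue; // tolerate blank lines between operations
                }
                Matcher matcher = ACTION_KEY.matcher(line);
                if (!matcher.find()) {
                    throw new IllegalArgumentException("Expected a bulk action line, got: " + line);
                }
                String action = matcher.group(1);
                String document = null;
                if (!"delete".equals(action)) {
                    // Every action except delete is followed by one document line.
                    document = reader.readLine();
                    if (document == null) {
                        throw new IllegalArgumentException("Missing document line after: " + line);
                    }
                }
                items.add(new BulkItem(action, line, document));
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return items;
    }

    public static void main(String[] args) {
        String body = "{\"index\":{\"_index\":\"logs\",\"_id\":\"1\"}}\n"
                + "{\"message\":\"hello\"}\n"
                + "{\"delete\":{\"_index\":\"logs\",\"_id\":\"2\"}}\n";
        for (BulkItem item : parseAll(body)) {
            System.out.println(item.action + " -> " + item.documentLine);
        }
    }
}
```

A real decoder would presumably parse both lines as JSON and copy fields such as the index name into event metadata; the sketch keeps them as raw strings so the pairing logic stays visible.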

Issues Resolved
Resolves #6876

Check List

New functionality includes testing
Commits are signed with DCO (Signed-off-by)

github-actions · 2026-05-21T22:58:55Z

✅ License Header Check Passed

All newly added files have proper license headers. Great work! 🎉

dlvenable

Thank you @divakarsingh for this contribution! I have a few comments on the code and the approach.

dlvenable · 2026-06-05T20:15:59Z

+    public void parse(InputStream inputStream, Instant timeReceived, Consumer<Record<Event>> eventConsumer) throws IOException {
+        final BufferedReader reader = new BufferedReader(new InputStreamReader(inputStream, StandardCharsets.UTF_8));
+        String line;
+        while ((line = reader.readLine()) != null) {


This code should be the same as the existing code in OpenSearchAPIService for handling the response for non-byte buffers. The current structure of this class looks reusable. I think we might be able to refactor OpenSearchAPIService to use this in processBulkRequest.

divakarsingh · Jun 9, 2026

@dlvenable Refactored OpenSearchAPIService.processBulkRequest to delegate to OpenSearchBulkByteDecoder for the non-byte-buffer path, so there's a single copy of the parsing logic. Let me know if you had a different structure in mind.
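
The delegation discussed in this thread can be illustrated roughly as follows. This is a sketch of the idea only, not the refactored OpenSearchAPIService: Decoder, SketchBuffer, handleRequest and consume are hypothetical stand-ins, and the toy decoder simply emits one item per non-empty line. The point is that the byte-buffer path (decode on the consumer side) and the non-byte-buffer path (decode in the request handler) go through the same decoder, so both produce identical items.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;

public class SharedDecoderSketch {

    /** Hypothetical stand-in for a byte decoder: turns a raw payload into decoded items. */
    public interface Decoder {
        void decode(byte[] payload, Consumer<String> consumer);
    }

    /** Hypothetical stand-in for a buffer that stores either raw bytes or decoded items. */
    public static final class SketchBuffer {
        public final boolean byteBuffer;
        public final List<byte[]> rawWrites = new ArrayList<>();
        public final List<String> items = new ArrayList<>();

        public SketchBuffer(boolean byteBuffer) {
            this.byteBuffer = byteBuffer;
        }
    }

    /** Toy decoder used for illustration: one item per non-empty line. */
    public static final Decoder LINE_DECODER = (payload, consumer) -> {
        String text = new String(payload, StandardCharsets.UTF_8);
        for (String line : text.split("\n")) {
            if (!line.trim().isEmpty()) {
                consumer.accept(line);
            }
        }
    };

    /** Request path: write raw bytes to a byte buffer, otherwise decode immediately. */
    public static void handleRequest(byte[] payload, SketchBuffer buffer, Decoder decoder) {
        if (buffer.byteBuffer) {
            buffer.rawWrites.add(payload); // decoding is deferred to the consumer side
        } else {
            decoder.decode(payload, buffer.items::add);
        }
    }

    /** Consumer path: decode any raw writes with the same decoder, then return all items. */
    public static List<String> consume(SketchBuffer buffer, Decoder decoder) {
        List<String> out = new ArrayList<>(buffer.items);
        for (byte[] raw : buffer.rawWrites) {
            decoder.decode(raw, out::add);
        }
        return out;
    }
}
```

Because both paths call the same decode method, a fix to the parsing logic applies to both buffer types at once, which appears to be the motivation behind the review comment.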

The opensearch_api source did not work with Kafka buffer because no ByteDecoder was registered. When buffer.isByteBuffer()=true, raw bytes are written but the consumer side had no way to reconstruct events from the NDJSON bulk format.

This adds OpenSearchBulkByteDecoder which parses NDJSON bulk format back into Data Prepper events with correct metadata attributes. OpenSearchAPIService now delegates to the decoder for the non-byte-buffer path as well, eliminating duplicate parsing logic.

Resolves opensearch-project#6876

Signed-off-by: Divakar Pratap Singh <divakar.p.singh@gmail.com>

divakarsingh requested review from KarstenSchnitter, Zhangxunmt, dinujoh, divbok, dlvenable, graytaylor0, kkondaka, oeyh, san81, sb2k16, srikanthjg and srikanthpadakanti as code owners May 21, 2026 06:59

divakarsingh force-pushed the fix/opensearch-api-kafka-buffer branch from 6fbd388 to 220f714 Compare May 21, 2026 07:56

divakarsingh force-pushed the fix/opensearch-api-kafka-buffer branch from 220f714 to 2a982d8 Compare May 22, 2026 02:49

dlvenable requested changes Jun 5, 2026

divakarsingh force-pushed the fix/opensearch-api-kafka-buffer branch from 2a982d8 to 504f703 Compare June 9, 2026 11:45

divakarsingh wants to merge 1 commit into opensearch-project:main from divakarsingh:fix/opensearch-api-kafka-buffer
