Turn BytesContext into FromTensorContext #721

scotts · 2025-06-11T03:34:54Z

Fixes #720.

Reading from a raw void* is problematic because we have no way of claiming ownership of the memory it points to. This PR makes sure that reading from bytes only happens through a tensor. The tensor is effectively a smart pointer, so holding it ensures that the data stays alive for as long as we read from it. We now have both a tensor-reading context and a tensor-writing context, and this PR refactors the two to share code, which we thought we might eventually need to do.

Without these C++-level changes, the test added by this PR fails.
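To make the shared-code idea in the description concrete, here is a minimal Python sketch. It is not the torchcodec implementation: `BufferState`, `seek_state`, `FromBufferContext` and `ToBufferContext` are hypothetical names invented for illustration, and a `bytearray` stands in for the tensor. The point is only that a reading context and a writing context can share one state object and one seek routine.

```python
import io
from dataclasses import dataclass, field


@dataclass
class BufferState:
    """Hypothetical shared state: the buffer plus the current position."""
    data: bytearray = field(default_factory=bytearray)
    pos: int = 0


def seek_state(state: BufferState, offset: int, whence: int = io.SEEK_SET) -> int:
    """Seek logic shared by the reading and the writing context."""
    if whence == io.SEEK_SET:
        new_pos = offset
    elif whence == io.SEEK_CUR:
        new_pos = state.pos + offset
    elif whence == io.SEEK_END:
        new_pos = len(state.data) + offset
    else:
        raise ValueError(f"unsupported whence: {whence}")
    if new_pos < 0:
        raise ValueError("cannot seek before the start of the buffer")
    state.pos = new_pos
    return new_pos


class FromBufferContext:
    """Reading context: holds a reference to the buffer it reads from."""

    def __init__(self, data: bytearray):
        self.state = BufferState(data)

    def read(self, size: int) -> bytes:
        start = self.state.pos
        chunk = bytes(self.state.data[start:start + size])
        self.state.pos += len(chunk)
        return chunk

    def seek(self, offset: int, whence: int = io.SEEK_SET) -> int:
        return seek_state(self.state, offset, whence)


class ToBufferContext:
    """Writing context: grows its own buffer as data is written."""

    def __init__(self):
        self.state = BufferState()

    def write(self, chunk: bytes) -> int:
        start = self.state.pos
        end = start + len(chunk)
        if end > len(self.state.data):
            # Zero-fill up to the new end before overwriting the slice.
            self.state.data.extend(bytes(end - len(self.state.data)))
        self.state.data[start:end] = chunk
        self.state.pos = end
        return len(chunk)

    def seek(self, offset: int, whence: int = io.SEEK_SET) -> int:
        return seek_state(self.state, offset, whence)
```

Both contexts differ only in their read or write method; the position bookkeeping lives in one place, which is the kind of sharing the description refers to.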

src/torchcodec/_core/AVIOBytesContext.h

src/torchcodec/_core/AVIOTensorContext.h

NicolasHug

Thanks for the fix!!

NicolasHug · 2025-06-11T16:13:43Z

test/test_decoders.py

+    def test_create_bytes_ownership(self):
+        # Note that the bytes object we use to instantiate the decoder does not


Suggested change

-    def test_create_bytes_ownership(self):
-        # Note that the bytes object we use to instantiate the decoder does not
+    def test_create_bytes_ownership(self):
+        # Non-regression test for https://github.com/pytorch/torchcodec/issues/720
+        # Note that the bytes object we use to instantiate the decoder does not

test/test_decoders.py

NicolasHug · 2025-06-11T16:19:45Z

src/torchcodec/_core/custom_ops.cpp


  SingleStreamDecoder::SeekMode realSeek = SingleStreamDecoder::SeekMode::exact;
  if (seek_mode.has_value()) {
    realSeek = seekModeFromString(seek_mode.value());
  }

-  auto contextHolder = std::make_unique<AVIOBytesContext>(data, length);
+  auto contextHolder = std::make_unique<AVIOFromTensorContext>(video_tensor);


Can you confirm my understanding that the crux of the fix is this line, where we now pass in the tensor, which is ref-counted, instead of the raw underlying data, which wasn't ref-counted and could therefore be freed while we were still reading from it?

Yes, that is correct!
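The difference discussed in this thread can be sketched in plain Python. This is an analogy under stated assumptions, not torchcodec code: `Buffer` is a hypothetical stand-in for the ref-counted tensor, `BorrowingReader` models holding only a raw pointer by using a weak reference, and `OwningReader` models holding the tensor itself. In C++ the borrowing case would be a dangling pointer with undefined behavior; here it is made observable as an exception.

```python
import gc
import weakref


class Buffer:
    """Hypothetical stand-in for a ref-counted tensor that owns its bytes."""

    def __init__(self, payload: bytes):
        self.payload = payload


class BorrowingReader:
    """Models a raw pointer: it does not keep the buffer alive."""

    def __init__(self, buffer: Buffer):
        self._buffer = weakref.ref(buffer)

    def read(self) -> bytes:
        buffer = self._buffer()
        if buffer is None:
            raise RuntimeError("buffer was freed before it was read")
        return buffer.payload


class OwningReader:
    """Models holding the tensor: the strong reference keeps the data alive."""

    def __init__(self, buffer: Buffer):
        self._buffer = buffer

    def read(self) -> bytes:
        return self._buffer.payload


def make_reader(reader_cls, payload: bytes):
    """Build a reader, then drop the caller's only reference to the buffer."""
    buffer = Buffer(payload)
    reader = reader_cls(buffer)
    del buffer
    gc.collect()  # not needed on CPython, but makes the intent explicit
    return reader
```

With this sketch, a reader built via `make_reader(OwningReader, ...)` can still read its payload, while one built via `make_reader(BorrowingReader, ...)` raises `RuntimeError` on `read()`.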

src/torchcodec/_core/AVIOTensorContext.h

src/torchcodec/_core/AVIOTensorContext.cpp

scotts added 2 commits June 10, 2025 13:10

Add test for bytes constructor and ownership

fe61d91

Turn BytesContext into FromTensorContext

709686e

facebook-github-bot added the CLA Signed label Jun 11, 2025

scotts commented Jun 11, 2025

src/torchcodec/_core/AVIOBytesContext.h (outdated, resolved)

Refactor file names

8217a8b

scotts commented Jun 11, 2025

src/torchcodec/_core/AVIOTensorContext.h (resolved)

scotts added 5 commits June 10, 2025 20:50

Fix C++ test

087b3be

Lint

2635230

Actually fix C++ test

10a5ddc

Note about other test failures

9581ae5

Remove stray comment

e8c7439

scotts marked this pull request as ready for review June 11, 2025 14:28

NicolasHug approved these changes Jun 11, 2025

scotts added 2 commits June 11, 2025 12:17

Apply review changes

dba46e5

Merge branch 'main' of github.com:pytorch/torchcodec into own_bytes

e1516b5

scotts merged commit f4a351c into pytorch:main Jun 11, 2025
46 of 51 checks passed

scotts deleted the own_bytes branch June 11, 2025 20:51
