9.4 memoryview: Zero-Copy Buffer Protocol

The memoryview object provides a zero-copy, non-owning view into the internal buffer of objects that support the Python Buffer Protocol. This is a critical tool for high-performance computing, I/O operations, and interfacing with C extensions, as it allows you to access and manipulate the raw bytes of an object without creating an intermediate copy. This protocol is supported by fundamental types like bytes, bytearray, array.array, and many third-party library objects (e.g., numpy arrays).

The Core Concept: Zero-Copy Views

The primary advantage of a memoryview is its avoidance of data duplication. Consider a scenario where you have a large bytearray and need to process a slice of it. Using standard slicing large_bytearray[1000:2000] creates a new bytes object, copying 1000 bytes into a new memory allocation. For large data sets, this is inefficient. A memoryview solves this by creating a “window” or a view onto the original data. The view object itself is small and lightweight, containing metadata like a pointer to the original memory, the length of the view, and the stride (the number of bytes between successive elements). Any modification made through a view of a mutable object (like a bytearray) is reflected instantly in the original underlying buffer.

# Creating a mutable bytearray
data = bytearray(b'ABCDEFGH')
print(f"Original: {data}")  # Original: bytearray(b'ABCDEFGH')

# Creating a memoryview
mv = memoryview(data)

# Modifying through the view. This does NOT create a copy.
mv[2:4] = b'XY'
print(f"After view modification: {data}")  # After view modification: bytearray(b'ABXYEFGH')

# Slicing the view creates another view, not a copy.
sub_view = mv[1:5]
sub_view[0] = 90  # ASCII 'Z'
print(f"After sub-view modification: {data}")  # After sub-view modification: bytearray(b'AZXYEFGH')

Understanding the Buffer Protocol and memoryview Creation

The Buffer Protocol is a C-level API that objects can implement to expose their raw memory. When you call memoryview(obj), Python checks if obj supports this protocol. If it does, the memoryview constructor simply records the necessary information to access obj’s memory. If it does not, a TypeError is raised. The created view can be in one of two modes, dictated by the original object’s mutability: read-only (for bytes) or writable (for bytearray).

# Read-only view from bytes
immutable_data = b'Hello'
read_only_view = memoryview(immutable_data)
try:
    read_only_view[0] = 74  # Try to change 'H' to 'J'
except TypeError as e:
    print(f"Error: {e}")  # Error: cannot modify read-only memory

# Writable view from bytearray
mutable_data = bytearray(b'World')
writable_view = memoryview(mutable_data)
writable_view[0] = 87  # Change 'W' to 'w' (ASCII 87)
print(mutable_data)  # bytearray(b'world')

Multi-Dimensional Interpretation with ’ndim’ and ‘shape’

A powerful feature of memoryview is its ability to interpret a one-dimensional buffer as a multi-dimensional array. This is controlled by the format property. For example, an int format like 'i' (4-byte integer) allows the view to reinterpret a flat memory buffer as an array of integers. The ndim attribute reveals the number of dimensions, and shape reveals the size of each dimension. This is immensely useful for efficiently working with data from files or networks that have a known, structured layout.

# Create a buffer of 12 bytes (3 integers * 4 bytes each)
buffer = bytearray([0x01, 0x00, 0x00, 0x00,  # int 1
                    0x02, 0x00, 0x00, 0x00,  # int 2
                    0x03, 0x00, 0x00, 0x00]) # int 3

# Create a memoryview interpreting the data as integers
mv = memoryview(buffer).cast('i')  # 'i' is the format code for signed int

print(f"Number of dimensions: {mv.ndim}")  # 1
print(f"Shape: {mv.shape}")                # (3,)
print(f"Stride: {mv.strides}")             # (4,) - bytes between elements
print(f"First element: {mv[0]}")           # 1
print(f"Tolist: {mv.tolist()}")            # [1, 2, 3]

# We can cast again to a 2D array (3 rows, 1 column)
mv_2d = mv.cast('i', (3, 1))
print(f"2D Shape: {mv_2d.shape}")          # (3, 1)
print(f"Element at [2][0]: {mv_2d[2][0]}") # 3

Casting and Reinterpreting Memory

The .cast() method is used to reinterpret the underlying memory into a new format or shape without copying the data. It effectively changes the “lens” through which the data is viewed. This operation is only valid if the product of the itemsizes and shapes results in the same total buffer length. It’s a low-level operation, so the programmer must be cautious about platform-specific details like byte order (endianness). The format property follows the same syntax as the struct module.

Common Pitfalls and Best Practices

Lifetime Management: A memoryview does not own its data; it only references it. If the original object is garbage collected while a view still exists, accessing the view will cause undefined behavior (often a program crash). Always ensure the base object’s lifetime exceeds that of any views created from it.
Read-Only Buffers: Attempting to write to a view created from a read-only object (like bytes) will raise a TypeError. Check the .readonly attribute (True or False) if your code needs to handle both cases.
Format and Platform Dependence: The interpretation of data through a memoryview is dependent on the platform’s byte order and data type sizes. For portable code, explicitly handle endianness using format characters like '>i' (big-endian) or '<i' (little-endian) rather than relying on the native 'i'.
Tobytes() Copies: The .tobytes() method and tolist() method do create a copy of the data. Use them when you need a true, independent bytes object or Python list, but be aware of the performance cost for large buffers.
Use with array.array for Homogeneous Data: Combining memoryview with array.array is a highly efficient way to work with large sequences of homogeneous numeric types, offering performance much closer to C or NumPy without the external dependency.