Dunder | mikePietsch.com

28.12 Hashing: hash and Its Relationship to eq

In Python, the __hash__ method is a fundamental part of the language’s data model, enabling an object to be used as a key in a dictionary or as a member of a set. These data structures rely on a hash table implementation, which requires a fast, efficient way to compute a unique integer representation—a hash value—for each object. This hash value acts as a rough guide to where the object’s data is stored, allowing for near-constant time (O(1)) average complexity for lookups, insertions, and deletions.

28.11 Object Creation: new and init

In Python, object construction is a two-step process handled by the __new__ and __init__ methods. Understanding their distinct roles and interplay is fundamental to mastering the Python data model. The __new__ method is responsible for creating a new instance, while the __init__ method is responsible for initializing it. This separation provides powerful control over object instantiation, enabling patterns like immutable objects, singletons, and subclassing built-in types. The Role of new The __new__ method is a static method (though it doesn’t require the @staticmethod decorator) that takes the class of which an instance is requested as its first argument (cls). Its job is to allocate memory for the object and return a new instance. This instance is then passed as the self argument to __init__. If __new__ does not return an instance of cls (or a subclass), then the __init__ method of the new object will not be invoked. This is the mechanism that allows __new__ to control whether initialization occurs.

28.10 Attribute Access: getattr, getattribute, setattr, delattr

Attribute access in Python is a fundamental concept controlled by a set of special methods known as the attribute access dunder methods. These methods—__getattr__, __getattribute__, __setattr__, and __delattr__—form the core of the Python Data Model’s mechanism for customizing how objects interact with the dot (.) operator. Understanding their interplay, invocation order, and potential pitfalls is crucial for advanced Python programming, enabling the creation of dynamic, flexible, and robust classes. The Default Behavior and setattr By default, when you assign an attribute like obj.x = 10, Python stores the value 10 in the object’s instance dictionary, obj.__dict__['x']. This default behavior is implemented by the object.__setattr__ method. When you override this method, you intercept all attribute assignment attempts. Therefore, it is absolutely critical to avoid using the dot notation for assignment within your custom __setattr__ method, as it would cause infinite recursion. Instead, you must use the super() method or directly manipulate the object’s __dict__.

28.9 Context Manager Protocol: enter and exit

The context manager protocol, defined by the __enter__ and __exit__ methods, is a cornerstone of Python’s resource management paradigm. It provides a clean, predictable mechanism for allocating and releasing resources precisely when needed, famously used with the with statement. This protocol embodies the RAII (Resource Acquisition Is Initialization) idiom, ensuring that even if an error occurs within the block, the cleanup code in __exit__ is always executed, preventing resource leaks.

28.8 Callable Objects: call

The __call__ method is a powerful and distinctive feature of Python’s data model, transforming class instances from static data containers into dynamic, function-like objects. When a class implements __call__, its instances become callable, meaning they can be invoked using parentheses () and, optionally, arguments. This blurs the line between functions and objects, enabling a programming paradigm known as “function objects” or “functors.” At its core, this mechanism leverages the fact that everything in Python is an object, including functions. A regular function object has a __call__ method defined in its class (built-in, but conceptually true). When you define your own __call__, you are essentially giving your custom objects the same core behavior as function objects. The interpreter, upon encountering my_instance(), translates it to my_instance.__call__(). This allows instances to maintain state between calls, a capability that traditional functions, without using closures or nonlocal variables, lack.

28.7 Container Protocol: len, getitem, setitem, delitem, contains

The container protocol in Python allows objects to emulate the behavior of built-in container types like lists, dictionaries, and sets. By implementing a specific set of dunder methods, you can create custom classes that support operations such as indexing, slicing, membership testing, and determining length. This protocol is fundamental to making your objects “Pythonic”—they work intuitively with the language’s built-in functions and idioms. Implementing len for Size Reporting The __len__ method should return a non-negative integer representing the number of items in the container. This method is called by the built-in len() function. A key requirement is that the length must be a non-negative integer; returning a negative number will raise a ValueError.

28.6 Unary Operators: neg, pos, abs

The Purpose of Unary Operators in the Data Model Unary operators are operations that act upon a single operand to produce a new value. In Python, the three primary unary operators are implemented through the __neg__, __pos__, and __abs__ special methods. These methods are integral to the Python data model, which defines the interfaces that allow your custom objects to interact seamlessly with the language’s core syntax. By implementing these methods, you empower your objects to respond to the - (negation), + (unary plus), and abs() built-in function calls, respectively. This integration is a cornerstone of writing Pythonic code; objects that behave like built-in types make APIs more intuitive and code more maintainable.

28.5 Comparison: eq, ne, lt, le, gt, ge

In Python, comparison operators (==, !=, <, <=, >, >=) are not hardcoded behaviors of the language itself. Instead, they are syntactic sugar that trigger the invocation of specific special methods, often called “rich comparison methods,” which form a core part of the Python Data Model. This design allows developers to define how objects of their custom classes should be compared, imbuing them with intuitive, expected behavior. The six methods are __eq__ (equal, ==), __ne__ (not equal, !=), __lt__ (less than, <), __le__ (less than or equal, <=), __gt__ (greater than, >), and __ge__ (greater than or equal, >=).

28.4 In-Place Operators: iadd, isub

In-place operators, often called “augmented assignment” operators, provide a concise syntax for modifying an object and reassigning the result to the same variable (e.g., x += 1). Under the hood, these operations are powered by special dunder methods like __iadd__ and __isub__. Their implementation is crucial for creating efficient, intuitive, and predictable mutable objects. The Purpose and Advantage of In-Place Methods The primary advantage of implementing in-place methods is performance, especially for large mutable objects. Consider a += operation on a list. If __iadd__ is implemented, it can modify the list in-place, avoiding the creation of an entirely new list object. If __iadd__ is not implemented, Python falls back to __add__, which does create a new object. The subsequent assignment is then just a reference change.

28.3 Reflected Operators: radd, rsub

When Python encounters an expression like x + y, it first attempts to call x.__add__(y). If x’s class does not implement __add__, or if the method returns the special NotImplemented value, Python does not immediately give up. Instead, it checks if y’s class implements the corresponding “reflected” or “right-hand” method, in this case, __radd__ (right-add). This fallback mechanism is a cornerstone of the Python data model, designed to allow for flexible and intuitive operations between objects of different types, even when one of them wasn’t originally designed to work with the other.

28.2 Arithmetic Operators: add, sub, mul, truediv

The Role of Arithmetic Dunder Methods Arithmetic dunder methods are the mechanism by which Python’s data model allows objects to respond to standard operators like +, -, *, and /. When you write a + b, the Python interpreter effectively translates this into a method call: a.__add__(b). This design provides a consistent interface for operator overloading, enabling your custom objects to behave like built-in types. The primary purpose of these methods is not merely to add functionality but to integrate your objects seamlessly into the Python language itself, allowing them to participate in expressions and follow established idioms.

28.1 The Data Model: How Python Calls Special Methods

At the heart of Python’s elegance and consistency lies the Data Model, a formal interface that defines how objects interact with the language’s core constructs. This model is not enforced by a compiler but is instead a set of conventions—a promise your objects make to the interpreter. When you write len(my_obj), Python doesn’t inherently know how to get the length. Instead, it translates this operation into a method call on your object: my_obj.__len__(). This translation is the fundamental mechanism of the Data Model. It’s how Python achieves polymorphism; any object, regardless of its type, can participate in common operations like length calculation, iteration, or arithmetic, simply by implementing the corresponding “dunder” (double underscore) methods. This approach allows built-in functions and operators to work seamlessly with both built-in types and user-defined classes, creating a unified and expressive programming environment.