Distinguishing between Pydantic Models with same fields

Question

I'm using Pydantic to define hierarchical data in which there are models with identical attributes. However, when I save and load these models, Pydantic can no longer distinguish which model was used and picks the first one in the field type annotation. I understand that this is expected behavior based on the documentation. However, the class type information is important

Accepted Answer

As correctly noted in the comments, without storing additional information models cannot be distinguished when parsing.As of today (pydantic v1.8.2), the most canonical way to distinguish models when parsing in a Union (in case of ambiguity) is to explicitly add a type specifier Literal. It will look like this:from abc import ABCfrom pydantic import BaseModelfrom typing import Union, Literalclass Data(BaseModel, ABC):    """ base class for a Member """    number: floatclass DataA(Data):    """ A type of Data"""    tag: Literal['A'] = 'A'class DataB(Data):    """ Another type of Data """    tag: Literal['B'] = 'B'class Container(BaseModel):    """ container holds a subclass of Data """    data: Union[DataA, DataB]# initialize container with DataBdata = DataB(number=1.0)container = Container(data=data)# export container to string and load new container from stringstring = container.json()new_container = Container.parse_raw(string)# look at type of container.dataprint(type(new_container.data).__name__)# >>> DataBThis method can be automated, but you can use it at your own responsibility, since it breaks static typing and uses objects that may change in future versions:from pydantic.fields import ModelFieldclass Data(BaseModel, ABC):    """ base class for a Member """    number: float    def __init_subclass__(cls, **kwargs):        name = 'tag'        value = cls.__name__        annotation = Literal[value]        tag_field = ModelField.infer(name=name, value=value, annotation=annotation, class_validators=None, config=cls.__config__)        cls.__fields__[name] = tag_field        cls.__annotations__[name] = annotationclass DataA(Data):    """ A type of Data"""    passclass DataB(Data):    """ Another type of Data """    pass

Advertisement

Answer