Changing schema of avro file when writing to it in append mode

Question

390 Views Asked by egriffiths At 27 July 2025 at 13:21

I'm looking for a way to modify the schema of an Avro file in Python. In the following example, which uses the fastavro package, I first write out some initial records with a corresponding schema:

from fastavro import writer, parse_schema

schema = {
    'name': 'test',
    'type': 'record',
    'fields': [
        {'name': 'id', 'type': 'int'},
        {'name': 'val', 'type': 'long'},
    ],
}
# 'val' is declared as 'long' in the schema, so the sample values are integers
records = [
    {'id': 1, 'val': 0},
    {'id': 2, 'val': 3},
]
with open('test.avro', 'wb') as f:
    writer(f, parse_schema(schema), records)

Uh-oh, I've got some more records, but they contain None values. I'd like to append these records to the Avro file and modify my schema accordingly:

# Integer values again, to match the 'long' type; the second record carries the None
more_records = [
    {'id': 3, 'val': 1},
    {'id': 2, 'val': None},
]
schema['fields'][1]['type'] = ['long', 'null']

with open('test.avro', 'a+b') as f:
    writer(f, parse_schema(schema), more_records)

Instead of overwriting the schema, this results in an error:

ValueError: Provided schema {'type': 'record', 'name': 'test', 'fields': [{'name': 'id', 'type': 'int'}, {'name': 'val', 'type': ['long', 'null']}], '__fastavro_parsed': True, '__named_schemas': {'test': {'type': 'record', 'name': 'test', 'fields': [{'name': 'id', 'type': 'int'}, {'name': 'val', 'type': ['long', 'null']}]}}} does not match file writer_schema {'type': 'record', 'name': 'test', 'fields': [{'name': 'id', 'type': 'int'}, {'name': 'val', 'type': 'long'}], '__fastavro_parsed': True, '__named_schemas': {'test': {'type': 'record', 'name': 'test', 'fields': [{'name': 'id', 'type': 'int'}, {'name': 'val', 'type': 'long'}]}}}

Is there a workaround for this? The fastavro docs for this suggest it's not possible, but I'm hoping someone knows of a way!

Cheers

**Scott** · Answer 1

Scott On 11 November 2022 at 15:47

The append API in fastavro does not currently support this. You could open an issue in the fastavro repository to discuss whether something like this makes sense.
