I have two Avro schemas, one contains several union fields, and the union type is ["null", "string"]
. The other schema does not have any union fields.
And I have POJO classes representing the mentioned two schemas. POJOs were generated by the avro-tools-1.11.0.jar
.
I followed the below approach to transform the JSON into an Avro object(the one that does not contain any union fields)
Decoder decoder = DecoderFactory.get().jsonDecoder(EtmKey.getClassSchema(), "The JSON Input!");
SpecificDatumReader<EtmKey> reader = new SpecificDatumReader<>(EtmKey.getClassSchema());
EtmKey etmKeyDatum = reader.read(null, decoder);
System.out.println("EtmKey topic: " + etmKeyDatum.toString());
EtmKey is the Avro schema representation class.
Using the above code, I was able to successfully generate the Avro object without having any issues. The library I used is org.apache.avro
.
But the same library cannot be used to generate the Avro object when it has union fields. It throws Exception in thread "main" org.apache.avro.AvroTypeException: Expected start-union. Got VALUE_STRING
. Tried manually converting the field values like this "field_5": {"string": "0MI8C..."}
according to the answer in here but no luck. Manually converting is not an option anyway.
Example JSON payload,
{
"field_1": "Apple",
"field_2": "123",
"field_3": "001-123",
"field_4": "TR501",
"field_5": "0MI8...",
"field_6": "0010y...",
"field_7": "2022-12-02T22:21:19.000+0000",
"field_8": "john.doe",
"field_9": "005E00."
}
Please note that JSON payload field names are the same as the ones in the Avro class.
Here, I would like to get some insight on how to generate an Avro Object that has Union Fields using JSON input. The solution should be solid and it is required to use an official plugin/dependency or reputed library like Jackson, Gson. Appreciate any resources/code examples and suggestions.
Solved the issue by implementing the following code,
Call the method and assign the output to the required class(entity).
Etm
is an Avro POJO class generated using avro-tools