We have a Mapreduce job created to inject data into BigQuery. There is not much of filtering function in our job so we'd like to make it map-only job to make it faster and more efficient.
However, the java class "com.google.gson.JsonObject" accepted by BigQuery doesn't implement the Writable interface needed for hadoop Mapper interface. JsonObject is also final and we can not extend it ...
Any suggestions on how we get around the issue?
Thanks,
You should be able to use the BigQuery connector for Hadoop (see https://cloud.google.com/hadoop/bigquery-connector) which provides an implementation of the Hadoop OutputFormat class.