Is it possible to create Kafka topics through StreamSets Data Collector (SDC)?


I am using the StreamSets Data Collector (SDC) web UI to build a pipeline that transfers data from my local system to Kafka through a Kafka producer. However, I first have to manually create the topic in which I want to store my data. Is it possible to create the topic from StreamSets instead?

Best answer

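By default, Kafka brokers are configured to auto-create topics (auto.create.topics.enable defaults to true), so the topic is created the first time the producer writes to it. You can set num.partitions in the broker config to change the number of partitions for auto-created topics from its default value of 1.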
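
For illustration, the relevant broker settings might look like the fragment below. This is a sketch that assumes the broker reads its configuration from the usual server.properties file; the value 3 is only an example.

```
# server.properties (broker configuration)

# Create a topic automatically on first use (this is the default).
auto.create.topics.enable=true

# Partition count for auto-created topics (default is 1; 3 is an example value).
num.partitions=3
```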

If you want to create the topic from the pipeline itself, you could trigger a Shell executor from the pipeline start event and have it create the topic.
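
As a rough sketch of what the Shell executor could run, the script below wraps Kafka's kafka-topics.sh tool. The function names, the file name create_topic.py, the localhost:9092 address, and the topic name are all hypothetical, and the script assumes kafka-topics.sh is on the PATH of the SDC host. The --bootstrap-server flag needs a reasonably recent Kafka (2.2 or later); older releases take --zookeeper instead.

```python
import subprocess
import sys


def build_create_topic_command(topic, bootstrap_server="localhost:9092",
                               partitions=1, replication_factor=1,
                               kafka_topics_bin="kafka-topics.sh"):
    """Return the argument list for creating a topic with Kafka's CLI tool."""
    return [
        kafka_topics_bin,
        "--create",
        "--if-not-exists",  # do not fail if the topic already exists
        "--bootstrap-server", bootstrap_server,
        "--topic", topic,
        "--partitions", str(partitions),
        "--replication-factor", str(replication_factor),
    ]


def create_topic(topic, **kwargs):
    """Run the command; raises CalledProcessError on a non-zero exit code."""
    subprocess.run(build_create_topic_command(topic, **kwargs), check=True)


# Only act when a topic name is passed on the command line.
if __name__ == "__main__" and len(sys.argv) > 1:
    create_topic(sys.argv[1])
```

The Shell executor script would then be a single line such as `python3 create_topic.py my-topic`. Because of --if-not-exists, restarting the pipeline should not fail when the topic is already there.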

Note that it is not possible to create the topic from a stage event, since these fire after data is sent to destinations, and you would want the topic to exist before the pipeline starts sending data.