I would like to generate something like a data flow diagram for an ETL process.
What I want to achieve is to have (the minimal example) two tables - source and destination - both tables with their attributes and between them a process, that transforms data from the first to the second table. Within this process I want to display the relationship to input fields, output fields and description of the formula/expression.
Of course, then for real world scenario I need multiple source tables for one deatination and the whole chain, where the data flows from input table to the final destination one through multiple tables.
I tried to generate something in graphviz, but if it is only a bit more complicated than completely simple, I can't force it to keep the expected layout and it's just a mess.
Here is some mspaint mockup for what layout I am after. But there is more tables (~10), around the same amount of processes, but hundreds of attributes and derivation functions.
Very fiddly, but also very scriptable. See the Graphviz documentation as needed, esp. https://graphviz.org/doc/info/shapes.html
Giving:
