Using Smooks (1.4
) to read CSVs and transform them into a Person
POJO.
The CSV consists of comma-separated records on each line, and each record has pipe-delimited fields:
Smith|John|45|male|Johnny|JSmith|JSmizzle,
Smith|Jane|43|female|Janey
etc. Thus each line represents a different person to create. First, a POJO:
public class Person
{
private String lastName;
private String firstName;
private int age;
private boolean isMale;
private List<String> aliases;
}
My problem is with the List
of aliases. Here are the vital parts from my XML configuration:
<reader class="org.milyn.csv.CSVReader">
<param name="fields">lastName,fristName,age,gender,aliases</param>
<param name="separator">|</param>
<param name="strict">false</param>
</reader>
<core:filterSettings type="SAX"/>
<jb:bean beanId="person" class="net.me.myproject.app.Person" createOnElement="csv-set/csv-record/">
<jb:value property="lastName" data="csv-set/csv-record/lastName"/>
<jb:value property="firstName" data="csv-set/csv-record/firstName"/>
<jb:value property="isMale" data="csv-set/csv-record/gender"/>
<jb:value property="age" data="csv-set/csv-record/age"/>
<jb:wiring property="aliases" beanRefId="aliases"/>
</jb:bean>
<jb:bean beanId="aliases" class="java.util.ArrayList" createOnElement="???">
<jb:wiring beanRefId="alias"/>
</jb:bean>
<jb:bean beanId="alias" class="java.util.String" createOnElement="???">
???
</jb:bean>
So where I'm choking is in getting the createOnElement
configured correctly for the aliases
ArrayList as well as each alias
String. Thanks in advance to anybody who can nudge me in the right direction!
First of all, your
CSVReader
's "fields" attribute will be a hodge-podge of all fields contained inside the CSV file, irregardless of which POJO, list or type they map back to. Thus, some fields will bePerson
properties, and some fields will be aliases that actually belong to the separatealiases
bean, which is of typejava.util.ArrayList<String>
.Your job is to tell Smooks how to map each field to the respective bean/list/component/type/etc, which means telling it what to do when it encounters each field.
Smooks does not support this sort of "dynamic" field binding, where you can have 0+ CSV fields map back to an
ArrayList
, that itself will either be empty or populated. You must enumerate each field in theCSVReader
, which means having anArrayList
of aliases that has a fixed size.Ergo, you must decide on a maximum number of aliases that can be associated with each
Person
, and account for them in the fields list:This means that each CSV record must have a credible value for your 3 aliases. I would recommend having an "ignore" value, such as "
%%%IGNORE%%%
" so that your app logic can no to remove an list items that contain that value (after Smooks has finished performing the transformation that is).You might also want to check out Smooks's built-in
$ignore$
token, which might already do this or something like it.The last piece before we can tie everything together in a complete code example is to simply accept the sad fact that Smooks either does not (or does not publicly document) any ability to use
List<String>
in this kind of example. You must convert your POJO to use either to aList<StringBuffer>
or aList<StringBuilder>
for aliases, so that we may take advntage of a Smooks-JavaBeanvalue
element attribute calledsetterMethod
.Altogether now:
So every time we start parsing a new
csv-record
, we create both aperson
bean (as your initial code example shows perfectly) as well as analiases
bean. Then, over the course of parsing this record, we will findPerson
properties as well asalias1
throughalias3
. ThealiasN
fields get stored into thealiases
bean, meanwhile the otherPerson
field gets stored into theperson
bean. Finally, Smooks knows to "wire" theperson
andaliases
beans together to create a JavaPerson
object.