I have a Python list embedded in a cell of a CSV file. I am trying to coerce the elements into a data table in R, but I am stuck on one particular vector that contains text. strsplit() works fine on the numeric values when splitting on ",", but any comma embedded in the text makes that vector longer than the others. A reproducible example is below. Thank you for any help you can provide!
x <- c("['SPOSORSHIP FOR CONVENTION']", "['GENERAL CONTRIBUTION', 'GENERAL CONTRIBUTION']",
"['WOMEN & POPULATION']", "['PROGRAM SUPPORT', 'PROGRAM SUPPORT']",
"['MULTIPLE GRANTS FOR MULTIPLE PURPOSES']", "['IMPROVING NATIONAL PARKS']",
"['general operating support']", "['Civic Engagement', 'Animal Welfare', 'Religion']",
"['RESEARCH SUBAWARD']", "['OPERATIONAL SUPPORT', 'OPERATIONAL SUPPORT']",
"['PROMOTE FILM INDUSTRY']", "['TO SUPPORT PUBLIC AFFAIRS PROGRAMS', 'TO SUPPORT PUBLIC AFFAIRS PROGRAMS', 'TO SUPPORT PUBLIC AFFAIRS PROGRAMS', 'TO SUPPORT PUBLIC AFFAIRS PROGRAMS', 'TO SUPPORT PUBLIC AFFAIRS PROGRAMS', 'TO SUPPORT PUBLIC AFFAIRS PROGRAMS']",
"['10TH ANNUAL GREAT LAKES RESTORATION CONFERENCE AND PETER WEGE TRIBUTE LUNCHEON']",
"['Conservation', 'Conservation']", "['FOR GENERAL OPERATING SUPPORT']"
)
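Perhaps this will help. I first remove the leading [' and the trailing '] from each string, and then split on ', ' (quote, comma, space, quote) rather than on a bare comma, so a comma inside an item is left intact.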
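A minimal sketch of that approach in base R. The vector `x_demo` is a hypothetical stand-in for `x`; its second element contains an embedded comma and is made up for illustration, not taken from your data. This assumes every item is wrapped in single quotes, as in your example.

```r
# Hypothetical sample in the same format as x; the second element has an
# embedded comma inside one item (invented here for illustration).
x_demo <- c("['GENERAL CONTRIBUTION', 'GENERAL CONTRIBUTION']",
            "['SUPPORT FOR ARTS, EDUCATION', 'GENERAL']",
            "['RESEARCH SUBAWARD']")

# Step 1: strip the leading [' and the trailing '] from each string.
stripped <- gsub("^\\['|'\\]$", "", x_demo)

# Step 2: split on the literal delimiter ', ' (fixed = TRUE disables regex),
# so a bare comma inside an item does not trigger a split.
parts <- strsplit(stripped, "', '", fixed = TRUE)

# Optional: reshape to long format, one row per item, keeping the source row.
long <- data.frame(
  row     = rep(seq_along(parts), lengths(parts)),
  purpose = unlist(parts),
  stringsAsFactors = FALSE
)
```

Here `parts` is a list with one character vector per original cell, and `long` records which cell each item came from. If some items are wrapped in double quotes instead (Python does this when the text itself contains an apostrophe), the patterns above would need to be adjusted.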