Need to extract data from the text ( this is just a sample)
text <- c(" 9 A 1427107 -",
" 99 (B) 3997915 -",
" 999 (SOCIO) 7161315 -",
" 9999 @M 4035115 -",
" 99999 01 Z 2136481035115 8,621"
)
so far I tried but could not create pattern for all columns
as.numeric(gsub("([0-9]+).*$", "\\1",text))
I want my data frame out put looks like
row_names Text ID Amount
9 A 1427107 -
99 (B) 3997915 -
999 (SOCIO) 7161315 -
9999 @M 4035115 -
99999 01 Z 2136481035115 8,621
Row_names are all the numbers, "Text" contains numbers and text ID column contains numbers from 7 to 13 digits, Amount is either a "-" or numbers with thousands (,)
We can use
read.table
to read the data into adata.frame
Or using
extract