I have a yearly survey that I do some validations on. I only want to do the validations for the current year (max year in the column surveyYear). The survey comes every year so I want to have it automatised instead of writing the current year. So when the survey for 2019 comes I do not need to change the code.
Is it possible to filter surveyYear in the set statement to only take the largest number in the column? I have tried different combinations but do not sucseed. And if it is not possible, how could I solve it otherwise
I would also like to do it in a datastep and not in proc as the validation code comes in the want datastep after the set statement.
data have;
input surveyYear id score varChar$10. ;
datalines;
2016 1 10 Yes
2016 2 6 Yes
2016 3 8 Yes
2016 4 . No
2017 5 6 No
2017 6 5 No
2017 7 12 IU
2017 8 3 IU
2017 9 2 IU
2017 10 15 99999
2018 11 0 .
2018 12 . No
2018 13 10 Yes
2018 14 8 No
2018 15 11 No
2018 16 3 IU
;
run;
data want;
set have (where=(surveyYear > 2017));
/* I would like to have something that states the max surveyYear instead */
/* (where=(surveyYear = max(surveyYear))) */
run;
To filter for the max you can first get hte max and then apply a filter.
Then use it in your filter.