I have to translate someones SAS code into R as my company is moving away from SAS and was wondering if there were a comparable R function to proc means? For reference, proc means produces (or at least what I need it to produce) is the number of observations, the mean, min, max, and standard deviation. It is also important that this be able to be done by groups as well as weighted. The summary function in R produces this output but I don't think you can do it weighted. I would prefer an answer that uses base R but if that doesn't exist, a package would be okay.
Example:
df
Temp V1 Weight
1 Hi 1 8
2 Low 2 3
3 Hi 3 9
4 Low 4 9
I need to produce various weighted summary statistics (median,min,max,sum,mean etc) of V1 by group as follows:
Group Min Max Mean Sum
Hi 1 3 2.06 35
Low 2 4 3.5 42
Generally,
aggregate
in base R can serve as counterpart to SAS'sproc means
. Typically, this method runs a single simple function (that takes one input) on a single numeric column. Below is the formula version of the method:But
aggregate
can be extended for multiple columns and multiple functions:For weighted statistics,
aggregate
does not have a built-in facility, so a bit more nuance is needed where you compute the statistics before aggregation. R'save
allows calculation across groups:Once you have such columns, you can aggregate on
min
andmax
to return grouping needs:Online Demo
See Also: