i have a matrix in r with 128 rows (scenarios) and 6 columns (results for each scenario). i want to determine the pareto optimal scenarios among the 128 possibilities. the results of column 1,5 and 6 should be low. the results of column 2,3 and 4 should be high. i normalized the results for common scales. result_matrix is the matrix of simulated reuslts. i tried this code:
# Example matrix with values between 0 and 1 (not the simulated results I am analysing)
num_rows <- 128
num_cols <- 6
results_matrix <- matrix(runif(num_rows * num_cols), nrow = num_rows, ncol = num_cols)
# Initialize pareto as all points
pareto <- 1:nrow(results_matrix)
# Loop through the points and check for dominance
for (i in 1:nrow(results_matrix)) {
for (n in 1:nrow(results_matrix)) {
if (i != n) {
# Check for dominance
if (all(results_matrix[i, c(2,3,4)] >= results_matrix[n, c(2,3,4)]) &&
all(results_matrix[i, c(1,5,6)] <= results_matrix[n, c(1,5,6)])) {
pareto[i] <- NA
break # Break the inner loop when dominance is found
}
}
}
}
# Print the indices of points on the Pareto front
print(pareto)
the output would be a vector with the pareto otimum indices. however,the results are not reasonable. e.g., one pareto option has the highest value in column one, although this should be low.
If I apply the rPref
package, I get the pareto solution with the maximum or minimum values for each column, but I want also the solutions on the pareto front:
p <- low(column1, results_df)*high(results_df$column2)* high(results_df$column3)*high(results_df$column4)* low(results_df$column5)*low(results_df$column6)
peval(p)
The inner if is checking that row
i
dominates rown
. It should be the other way around.We can also simplify things by flipping the signs of columns 1, 5, and 6. This will allow us to compare the entire row all at once.
Or a fully vectorized version:
Timing with
microbenchmark
(and check that the results are equivalent withcheck = "equal"
):