I have a RDD which is similar to,
CELL-ID | COUNT
--------------
abcd 10
DEF 20
ghi 15
I need to get an RDD with
CELL-ID-1 | CELL-ID-2 | PRODUCT
--------------
abcd DEF 200
abcd ghi 150
DEF abcd 200
DEF ghi 300
...
....
How can this be done ? I've tied to use cartesian product but couldn't get the output
val result = orginalRDD.cartesian(orginalRDD).collect {
case ((t1: _,Int), (t2: _,Int)) if t1 != t2 => t1 * t2
}
You can either make
t1
andt2
represent the tuples (entire "records"):Or, you can do the same but use the pattern-matching to break them up further:
Your solution looks like an attempt to do both at once...