Top-N using Python, MapReduce

25 Views Asked by At

I'm using Python to write a MapReduce algorithm to find the top-10 viewed movies. I used two mappers and two reducers. The first mapper emit (movid, 1) and the first reducer outputs movid, frequency of occurrence (call it counter). I want to find the top-10 movie while keeping the code distributed. The attached image is my output (movid , count).

enter image description here

0

There are 0 best solutions below