For a field that is functioning as counter, ie, value will be changed over the time, and will be used to return ordered entities (will sort against this field for the filtered entities), should we build index for this field?
For a field that is functioning as counter, ie, value will be changed over the time, and will be used to return ordered entities (will sort against this field for the filtered entities), should we build index for this field?
It's not entirely clear, but I think the question is about whether the cons of creating an index on a frequently updated field will outweigh the benefits of fast querying and sorting on that field. You also imply that your query will filter on a different field, and then you want to sort on this field. Feel free to elaborate on your exact use case.
What I think you want is something like this:
And then an index like so:
And then a query like so:
Which uses the btree:
Now, it really might depend on how many results you need to get back. If it's only a few results, you could probably sort on the field without an index, and that would improve the update performance. I think I'll actually do a few tests since I'm curious. I'll update in a bit.
update I wrote this benchmark to show the difference between sorting on an index and not, and updating a count field on an index, and not. Full code here: https://gist.github.com/1696041
It inserts 700K and 7M docs (to get some variety), separated into 7 "filters". Then it randomly picks a doc to increment the count of 1M times. The 1M docs per filter are too big to sort without a limit, so the only way to show how that piece works is to put a limit in.
The conclusion is as expected. It takes longer (almost twice as long in this test--but twice as long is still pretty fast) to update the count field when there's an index on it. But it's much faster to query against. You have to decide which is more important to you.
The output is here (running on my macbook pro w/ SSD):