Caching HTML Pages

209 Views Asked by At

I wanted a mechanism to cache/index the html pages for a website as it is. So, that if a requests comes for that page again, I can simply fetch that page from cache or index and send it. Is it possible with Solr? Because Solr always seems to store display (text) part of the html and not the entire HTML Dom. Is there any other open-source technology which can help me do that with fast performance?

1

There are 1 best solutions below

0
On

There's a lot of missing detail in this question, but the answer to your question about Solr in particular is yes, Solr can store raw HTML just fine. Normally you use a field with stored="true" to be able to retrieve the original HTML, and indexed="true" along with a HTML filter to keep only the text searchable.