I have looked at how to get a list of all tokens from Solr/Lucene index? but Lucene 8.6.1 doesn't seem to offer IndexReader.terms(). Has it been moved or replaced? Is there an easier way than this answer?
How to get a list of all tokens from Lucene 8.6.1 index?
669 Views Asked by PSK At
1
There are 1 best solutions below
Related Questions in JAVA
- I need the BIRT.war that is compatible with Java 17 and Tomcat 10
- Creating global Class holder
- No method found for class java.lang.String in Kafka
- Issue edit a jtable with a pictures
- getting error when trying to launch kotlin jar file that use supabase "java.lang.NoClassDefFoundError"
- Does the && (logical AND) operator have a higher precedence than || (logical OR) operator in Java?
- Mixed color rendering in a JTable
- HTTPS configuration in Spring Boot, server returning timeout
- How to use Layout to create textfields which dont increase in size?
- Function for making the code wait in javafx
- How to create beans of the same class for multiple template parameters in Spring
- How could you print a specific String from an array with the values of an array from a double array on the same line, using iteration to print all?
- org.telegram.telegrambots.meta.exceptions.TelegramApiException: Bot token and username can't be empty
- Accessing Secret Variables in Classic Pipelines through Java app in Azure DevOps
- Postgres && statement Error in Mybatis Mapper?
Related Questions in PYTHON
- How to store a date/time in sqlite (or something similar to a date)
- Instagrapi recently showing HTTPError and UnknownError
- How to Retrieve Data from an MySQL Database and Display it in a GUI?
- How to create a regular expression to partition a string that terminates in either ": 45" or ",", without the ": "
- Python Geopandas unable to convert latitude longitude to points
- Influence of Unused FFN on Model Accuracy in PyTorch
- Seeking Python Libraries for Removing Extraneous Characters and Spaces in Text
- Writes to child subprocess.Popen.stdin don't work from within process group?
- Conda has two different python binarys (python and python3) with the same version for a single environment. Why?
- Problem with add new attribute in table with BOTO3 on python
- Can't install packages in python conda environment
- Setting diagonal of a matrix to zero
- List of numbers converted to list of strings to iterate over it. But receiving TypeError messages
- Basic Python Question: Shortening If Statements
- Python and regex, can't understand why some words are left out of the match
Related Questions in SEARCH
- How to create a regular expression to partition a string that terminates in either ": 45" or ",", without the ": "
- Hospital route finding ai project
- tryin to write a function that searches for SSN in a dict, and if that SSN is found, to retrieve all the data associated with that SSN
- How the search filter from search bar works in mern?
- Angular application loading weirdly when I add "/" at the end of URL
- Elastic python to extract last 1hr tracing
- How to detect if two sentences are simmilar, not in meaning, but in syllables/words?
- I need to have a look at all my private pine scripts and filter the scripts for certain words in TRADINGVIEW
- What is correct URL? {'quandl_error': {'code': 'QECx01', 'could not recognize URL: /api/v3/databases/WIKI/search. Please check URL and try again.'}
- Solr 9 punctuation issue
- Autocomplete search filter not working for dynamically added input fields in angular
- How to correct call API search request with debounce?
- Search in GDrive only the first 5 topics
- How do I use sp/pnp sp.search to find all Associated sites when querying a hub site Id
- How to apply custom analyzers on a field in Vespa schema
Related Questions in LUCENE
- How to update Cassandra Lucene index with a new column? rebuild or update index?
- How to glue (merge) files Lucene?
- Apache Lucene performance estimation
- Lucene DocValues.Source deprecated
- Solr score diff in doc list and Explain score
- How do I reload the index before searching in Hibernate Lucene
- Using Lucene 9.10.0 MemoryIndex in Java to ingest and search IntField and use rangequery
- How can i use a builtin analyzer in my entity with Hibernate Search
- Atlas Search Index Build Fail
- how to use hiberanate search 7.1.0 analyzer settin in spring boot 3
- Suggester template Search issue ElasticSearch
- I'm using hibernate text based search and indexing. I want to search common rows between indexed tables using Lucene query
- Merging Solr index stored in HDFS not working
- Can't find document at lucene index with no delimeter in phrase
- How do I get the list of the full indexed terms in an ElasticSearch index?
Related Questions in PYLUCENE
- how do I resolve pylucene installation error for java1.8 when using jcc. I am getting fata error
- solr string field getting no results
- query parser failed when AND is used in query
- Retrieving terms for a document in pylucene
- what is a Factory in Lucene
- failed importing ICUFoldingFilter while using pylucene
- Problem chaining tokenizer with filters with PythonAnalyzer in PyLucene
- pylucene fuzzy search not return anything even with the same search term
- Lucene Search based on edit-distance on entire text rather than individual tokens
- Efficiently match texts contained in a query text
- pylucence cannot find a word that was presented in the text which indexed earlier
- Lucene query: TermQuery doesn't work but QueryParser works
- pylucene - ModuleNotFoundError: No module named 'org'
- PyLucene install: "make" not working and "jvm.dll could not be found"
- Problem in Ping or SSH connect to docker container
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular # Hahtags
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Some History
You asked: I'm just wondering if
IndexReader.terms()has moved or been replaced by an alternative.The Lucene v3 method
IndexReader.terms()was moved toAtomicReaderin Lucene v4. This was documented in the v4 alpha release notes.(Bear in mind that Lucene v4 was released way back in 2012.)
The method in
AtomicReaderin v4 takes a field name.As the v4 release notes state:
The key part there is "per term within a single field". So from that point onward there was no longer a single API call to retrieve all terms from an index.
This approach has carried through to later releases - except that the
AtomicReaderandAtomicReaderContextclasses were renamed toLeafReaderandLeafReaderContextin Lucene v 5.0.0. See Lucene-5569.Recent Releases
That leaves us with the ability to access lists of terms - but only on a per-field basis:
The following code is based on the latest release of Lucene (8.7.0), but should also hold true for the version you mention (8.6.1) - with the example using Java:
The above example assumes an index as follows:
If you need to enumerate field names, the code in this question may provide a starting point.
Final Note
I guess you can also access terms on a per document basis, instead of a per field basis, as mentioned in the comments. I have not tried this.