I was wondering whether it is possible to use OpenGrok to index PPT, XLS, DOC, and similar formats. Would I have to program this myself, or is there already a plugin or method for doing this?
Using OpenGrok to index Microsoft Office documents etc
368 Views Asked by CoD At
There are 2 best solutions below
Richard Ludwig
There is an open OpenGrok issue for this: https://github.com/oracle/opengrok/issues/492. However, it has been waiting on a plugin interface since 2013.
There is currently no dedicated analyzer to extract data from these types of documents. However, it should be possible to implement one based on the Java libraries listed in "Read Microsoft Word Documents into Plain Text (DOC, DOCX) in Java" (e.g. Apache POI or Apache Tika).
Feel free to file a new issue at https://github.com/oracle/opengrok/issues.
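To give a rough idea of the text-extraction step such an analyzer would need, here is a minimal sketch that uses only the JDK rather than Apache POI or Apache Tika. It relies on the fact that a .docx file is a ZIP archive whose main body lives in `word/document.xml`. The class `DocxTextSketch` and its methods are hypothetical names made up for this illustration. The sketch does not handle the legacy binary DOC/XLS/PPT formats (a library such as POI or Tika would be needed for those), and it does not show how to hook the result into OpenGrok's analyzer framework.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

// Hypothetical helper, not part of OpenGrok, Apache POI, or Apache Tika.
public class DocxTextSketch {

    // Returns the visible text of word/document.xml inside a .docx (ZIP) stream,
    // or an empty string if the archive has no such entry.
    static String extractText(InputStream docx) throws IOException {
        try (ZipInputStream zip = new ZipInputStream(docx)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                if ("word/document.xml".equals(entry.getName())) {
                    // readAllBytes() stops at the end of the current ZIP entry.
                    String xml = new String(zip.readAllBytes(), StandardCharsets.UTF_8);
                    return stripMarkup(xml);
                }
            }
        }
        return "";
    }

    // Crude tag stripper: one line per paragraph, tags removed, basic entities decoded.
    static String stripMarkup(String xml) {
        return xml.replace("</w:p>", "\n")
                  .replaceAll("<[^>]+>", "")
                  .replace("&lt;", "<")
                  .replace("&gt;", ">")
                  .replace("&quot;", "\"")
                  .replace("&apos;", "'")
                  .replace("&amp;", "&")   // decoded last to avoid double unescaping
                  .trim();
    }

    // Builds a tiny in-memory archive for the demo. It holds only word/document.xml,
    // so it is enough for this sketch but is not a complete, valid .docx file.
    static byte[] buildSampleDocx(String... paragraphs) throws IOException {
        StringBuilder xml = new StringBuilder(
            "<w:document xmlns:w=\"http://schemas.openxmlformats.org/wordprocessingml/2006/main\"><w:body>");
        for (String p : paragraphs) {
            xml.append("<w:p><w:r><w:t>").append(p).append("</w:t></w:r></w:p>");
        }
        xml.append("</w:body></w:document>");
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try (ZipOutputStream zip = new ZipOutputStream(out)) {
            zip.putNextEntry(new ZipEntry("word/document.xml"));
            zip.write(xml.toString().getBytes(StandardCharsets.UTF_8));
            zip.closeEntry();
        }
        return out.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        byte[] docx = buildSampleDocx("OpenGrok indexes source code.", "Office files need extra work.");
        System.out.println(extractText(new ByteArrayInputStream(docx)));
    }
}
```

The regex-based stripping is deliberately simplistic. A real analyzer would more likely delegate to POI or Tika, which also cover spreadsheets, presentations, and the older binary formats.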