Besides using Cloudera Navigator, how can I find the less frequently accessed files in HDFS?
How to find less frequently accessed files in HDFS
518 Views Asked by Dipperman At
I assume that you are looking for the time a file was last accessed (open, read, etc.), because the further in the past that time lies, the less frequently the file is likely to be accessed.
Whereas in Linux you can do this quite simply via ls -l -someMoreOptions, in HDFS more work is necessary.
Maybe you could monitor the /hdfs-audit.log for cmd=open entries of the files in question. Or you could implement a small function that reads FileStatus.getAccessTime(), as mentioned under "Is there anyway to get last access time of HDFS files?" or "How to get last access time of any files in HDFS?" in the Cloudera Community.

In other words, it will be necessary to create a small program which scans all the files, reads out their properties, and orders them. With that you will be able to find files which have not been accessed for a long time.
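
To illustrate the audit-log idea, here is a minimal Python sketch, not a definitive implementation. It assumes each audit line consists of tab-separated key=value fields that include cmd=... and src=..., which is how the NameNode audit log commonly looks, but check the layout on your cluster. The function names count_opens and least_accessed, and the list of known paths, are hypothetical and only serve the example.

```python
from collections import Counter


def count_opens(lines):
    """Count cmd=open events per src path in HDFS audit log lines.

    Assumption: fields are tab-separated key=value pairs, e.g.
    "... allowed=true<TAB>ugi=...<TAB>cmd=open<TAB>src=/path<TAB>...".
    """
    counts = Counter()
    for line in lines:
        fields = {}
        for part in line.rstrip("\n").split("\t"):
            if "=" in part:
                key, value = part.split("=", 1)
                fields[key] = value
        if fields.get("cmd") == "open" and "src" in fields:
            counts[fields["src"]] += 1
    return counts


def least_accessed(counts, known_paths, n=10):
    """Return the n paths with the fewest recorded opens.

    Paths that never appear in the log count as zero opens.
    Ties are broken alphabetically to keep the result stable.
    """
    ranked = sorted(known_paths, key=lambda p: (counts.get(p, 0), p))
    return ranked[:n]
```

In practice you would feed count_opens the lines of the audit log file and take the list of known paths from a recursive listing of the directory you care about. Note that the audit log only covers the period it was retained for, and that the access time returned by FileStatus.getAccessTime() is only as precise as the dfs.namenode.accesstime.precision setting allows (a value of 0 disables access time tracking).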