I have a Hadoop table with thousands of records, and a microservice that runs as multiple instances. Each instance connects to this table and should be able to read from it concurrently. I want to process the data in parallel so that no two service instances process the same records. If processing fails, the instance should be able to report the failure back to the table so that the affected records can be handed to another instance. In other words, I want something like an object lock system: one instance fetches a few records from the table by taking a lock on them. Which storage layers support this requirement? Impala + Hadoop, or something else?
How to read hadoop tables concurrently from multiple instances of a service?
13 Views Asked by Amit Sharma At