While Apache tez reuses containers,what is the process takes place. Can anyone explain me clearly?
How container reuse works in Apache Tez? While reusing what is the data stored in shared location?
673 Views Asked by Ramprakash At
1
There are 1 best solutions below
Related Questions in APACHE
- .htaccess redirect 403 error files to 404 error document
- RestApi server code is not workinng
- Convert Apache VirtualHost to nginx Server Block for Dynamic Subdomains
- Looking the Method that MANUALLY INSTALL PHP on OSX Yosemite
- Premature end of script on VPS
- Rasterization with Javascript looks different on Apache server
- Vagrant - Ansible error installing Apache
- Can't use subdomain in Chrome using Apache (XAMPP)
- Django webapp (on an Apache2 server) hangs indefintely when importing nltk in views.py
- Redirect keystone app to sub directory using htaccess
- How can I integrate Solr5.1.0 with Nutch1.10
- Disconnect Client connected to cgi application
- Solr ping taking time during full import
- How to redirect an incoming request to specific serverName to different server in apache2?
- What is the correct way to link Django Flatpages?
Related Questions in CONTAINERS
- Positioning child at bottom of parent with scroll
- Where are docker images and containers stored when we use it with Windows?
- c++ container very efficient at adding elements to the end
- Set-like alternative for yaml files
- Julia: Enforce constraints on objects in a container?
- Control snapping on resize in C# scrollable container
- Creating a vertically draggable container in Adobe Flex 3.5
- How to set the read ACL on container in open stack swift such that allow Read for all users and deny for one user
- Which STL container(s)/algorithm(s) could I use to solve this?
- Persistent error coming from libusb0
- Make responsive text relative to responsive image?
- Initializing database in the base image of a running container
- Is there unique-value / set container in java-script
- Can we change width of container in css using Bootstrap3
- Use of container docker as a proxy for CF app to get public IP
Related Questions in TEZ
- When does HIVE (not) use WHERE clause on partition as predicate filter
- How can I add additional libraries to Tez to support a different HDFS backend?
- Add Filesystem to Hive with Tez execution engine
- Using Hive to distribute over Reducers?
- Hadoop cluster. Large number of TCP retransmissions
- Hive / Tez job won't start
- Why is hive join taking too long?
- Tez vertex error shown while execution of query in hive
- Executing query with Hive, Tez and offset
- Hive execute "insert into ... values ..." very slow
- Failed Vertex re-running
- Hive complex data type querying
- Hive query does not start the map phase
- Record too large for in-memory buffer. Error when working with Hive's ORC tables via TEZ
- Are Apache Phoenix JARs causing error in Tez?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Please read Hortonworks (most significant Tez contributor) docs here: https://hortonworks.com/blog/re-using-containers-in-apache-tez/
Each vertex in Tez specifies parameters, which are used when launching containers. These include the requested resources (memory, CPU etc), YARN LocalResources, the environment, and the command line options for tasks belonging to this Vertex. When a container is first launched, it is launched for a specific task and uses the parameters specified for the task (or vertex) – this then becomes the container’s signature. An already running container is considered to be compatible for another task when the running container’s signature is a superset of what the task requires.
The Tez scheduler works with several parameters to take decisions on task assignments – task-locality requirements, compatibility of containers as described above, total available resources on the cluster, and the priority of pending task requests.
When a task completes, and the container running the task becomes available for re-use – a task may not be assigned to it immediately – as tasks may not exist, for which the data is local to the container’s node. The Tez scheduler first makes an attempt to find a task for which the data would be local for the container. If no such task exists, the scheduler holds on to the container for a specific time, before actually allocating any pending tasks to this container.
Each Tez JVM (or container) contains an object cache, which can be used to share data between different tasks running within the same container. This is a simple Key-Object store, with different levels of visibility/retention. Objects can be cached for use within tasks belonging to the same Vertex, for all tasks within a DAG, and for tasks running across a Tez Session.