I am new to Pentaho kettle and have a requirement where we want to unzip a set of files and the paths to those files are there in a table. I am wondering how to go about it.
Unzip a list of files whose path has to be read from a table in Pentaho kettle
1.9k Views Asked by Sushant At
1
There are 1 best solutions below
Related Questions in PENTAHO
- Git Not In Sync with Local Branch
- How to write a blob file to local filesystem using PDI?
- Where I download the Pentaho Community Edition, link breadk on sourceforge
- Pentaho PDI || Windows Current User
- Slowly Changing Dimension Type 2
- Json is sending Überlegung character convert in pentaho different format
- Amazon S3 multi part upload error using Pentaho 9.3
- Multiline parsing in Fluent-Bit for pentaho jobs on kubernetes
- Set Variable In a Pentaho Loop
- Get row number from stream to perform concat
- pass parameters to database connection in pentaho using jenkins freestyle job
- Troubleshooting 'No such file or directory' Error for Network Paths in Pentaho Server Job Execution
- Pentoho How to check ,If at least one row is LDAP is 0 with same user mutiple times
- Problems returning csv data, pentaho reading failure
- Pentaho Unable to access Remote Excel file
Related Questions in ETL
- dbt Incremental Model Issue with Snowflake Autoincrement Column
- Ibis vs. Spark for big data processing against an analytics datawarehouse with a DataFrame API?
- How to copy XML files in a folder F1 based on whether its content is present on folder F2 (disregarding file names)
- Can we orchestrate Matillion Data Loader in Matillion Designer?
- Reading Unstructured Text from the entire file in Azure Data Factory
- Write rows on destination even when an error occurs?
- What is the difference between Data Ingestion and ETL?
- SSIS remove $ format from csv
- Generate data flow graph for ETL process
- Meta Data driven ADF pipeline to ingestion data from multiple sources
- How to push data from multiple sources/integrations for a single destination in stitch ETL Tool
- Pentaho PDI || Windows Current User
- MATILLION API Query Profile
- Joining Data Frame & SQL Server table directly and update table
- Extract composite unique key from GoHighLevel API with Python {{ contact.utm_source }}
Related Questions in KETTLE
- Transform XML with meta columns to csv in PDI
- Get row number from stream to perform concat
- How to add condition in Pentaho Spoon
- Pentaho: Multiple excel in folder to a excel
- Issue with Channel Log Tables in Pentaho Data Integration V7: Works in Manual Execution but Not in Nightly Batch
- How to loop through date variables in PDI?
- How to delete rows which were deleted in source table (Pentaho data integration)
- Unable to read variable value from kettle.properties file
- Pentaho Kettle PDI - Use ETL Metadata injection with data validator
- Bad accents and "ñ" in my postgres database
- Pentaho kettle embedded transformation error handling in JAVA application
- Pentaho metastore location when connecting to repository
- Kettle's spoon.bat can't open
- How to add new column to stream in Pentaho DI, without blocking the stream, new column value is not constant
- Run Pentaho Job or transformation on Carte Server via command line
Related Questions in PENTAHO-DESIGN-STUDIO
- Json is sending Überlegung character convert in pentaho different format
- Pentoho How to check ,If at least one row is LDAP is 0 with same user mutiple times
- How to define the loop xpath in Pentaho
- Getting data from a database and outputting it in another DB dynamically with Pentaho
- How to grab value from DB table and use that value to replace another field in Pentaho
- Pentaho job failing only when running through control-m
- Pentaho Report Designer Database connection with variable on preview
- Pentaho transformation swithc/sace doesnt work as suposed to be
- Looking for updates in Pentaho7.x for log 4j vulnerability and it's procedure
- Pentaho- Can we import a sub report/ report into a report in Pentaho?
- Pentaho- How do I view the backend sql of reports?
- Pentaho PDI on Ubuntu 18
- How to get data from excel and pass excel values as parameters in SQL query on pentaho etl jobs
- Pentaho-IVYDC and IVYBC Plugin
- Pentaho spoon error: "Connection refused. Check that the hostname and port are correct and that the postmaster is accepting TCP/IP connections."
Related Questions in PENTAHO-CDE
- How to define the loop xpath in Pentaho
- Execute statement in pentaho cde sql over sqlJndi
- unable to write file to database
- i have a problem in pentaho bi server dynamic row
- Does Pentaho allow the "Filter rows" step to load a list of values?
- How to change default path in pentaho-server 9.0.0.0.423?
- Pentaho cde & Report-designer source
- Pentaho mondrian cube main sql query
- Using Pentaho CDE check component data as file name for export
- Looking for updates in Pentaho7.x for log 4j vulnerability and it's procedure
- Pentaho CDE Dashbord: how to export to excel multiple tables into one sheet with button click
- How input data in line chart in Pentaho?
- Checking multiple table existence in Pentaho
- How I can read the file on the everyday basis in pentaho in this format? "tcms_oneed_YYYYMMDD.txt"
- Pentaho - Spoon Decimal from Text File Input
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular # Hahtags
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
This should be your main Job:
First transformation connects to your database and extract the paths, after this another Job is called (Unzip) which extracts those files. I'll be more clear, the next is the transformation called "Table input":
Use the "Table input" step to connect to your database. When you open it you have to create a new connecion and then put your query in the canvas. (make a query which extract values just from the interested column, not every columns). The step "Copy rows to result" gives the values form the database to the next job.
The following is the job "Unzip":
This job receives the values from the previous transformation and pass tose to the "Unzip file" job entry.
Things to know:
1) In the main job double click on the Unzip job icon, go to "advanced" and specify "Copy previous result to parameters" and "Execute for every input row". Of course in the Job specification you have to specify the path of this job.
2) Also double click on the Unzip job icon, go to parameters and put a parameter named as the value which you extract from the database:
3) Enter in the sub-Job (Unzip in my case) and right click, then go to "Job settings" and then to "parameters". Now put the same parameter name as before:
4) Remember to set the destination folder of the files and the receving parameters in the "Unzip files" job entry: