I am facing an internship and they asked me to learn how to use talend ETL. I did it, not so difficult. One of the extra-tasks that have been assigned to me is to verify how much of the operations I set on the design workspace is executed in java and what is done through the use of queries. I've set up a simple Join using the TMap component and I monitored the SQLdatabase through the use of SQL Profiler. the result is that only the essential create/drop and the select/insert of the table is done via sql while every other thing like the actual join is made "Java" side. As long as it is an simple operation like join, wouldn't it be convenient to execute it through a query without having to bother java to perform it? For those who also know SAP, in terms of performance is there so much difference between Talend and SAP?
How much of Talend functionality is translated in SQL-Query and how much in Java?
287 Views Asked by Signori Andrea At
1
There are 1 best solutions below
Related Questions in ETL
- dbt Incremental Model Issue with Snowflake Autoincrement Column
- Ibis vs. Spark for big data processing against an analytics datawarehouse with a DataFrame API?
- How to copy XML files in a folder F1 based on whether its content is present on folder F2 (disregarding file names)
- Can we orchestrate Matillion Data Loader in Matillion Designer?
- Reading Unstructured Text from the entire file in Azure Data Factory
- Write rows on destination even when an error occurs?
- What is the difference between Data Ingestion and ETL?
- SSIS remove $ format from csv
- Generate data flow graph for ETL process
- Meta Data driven ADF pipeline to ingestion data from multiple sources
- How to push data from multiple sources/integrations for a single destination in stitch ETL Tool
- Pentaho PDI || Windows Current User
- MATILLION API Query Profile
- Joining Data Frame & SQL Server table directly and update table
- Extract composite unique key from GoHighLevel API with Python {{ contact.utm_source }}
Related Questions in TALEND
- what are .item, .properties and .screenshot? How to read them?
- How to push data from multiple sources/integrations for a single destination in stitch ETL Tool
- why talend data transfer slow intermittently?
- Talend Lookup: Retrieving Client IDs Based on Client Names
- Talend execution order in Subjob with tLoop
- Run databricks notebook from talend
- How to check snowflake jar version in talend studio
- Talend Connectivity with Share Point
- Error Import Jobs Talend Data Integration 8
- Talend OS - activating Log4j in components prints to STDOUT
- Talend open studio : tFileInoutXml
- NoSQL Connections node in TALEND not found
- How to solve Talend Job Failing Due to Missing Files
- I have a problem with my talent unable to solve it
- I couldn't deploy my web-service in Talend Runtime Container
Related Questions in DATA-INTEGRATION
- How do I correct this error in Oracle shown here?
- Building dimensional model from multiple sources
- Unable to Retrieve Data from Bullhorn to Power BI
- Suggestions for a tools or any other method that allows extracting data from mongoDB and then move it to another environment of same product?
- How can I import a EU INSPIRE conform WFS in QGIS?
- Talend Api : How to extract the Engine Logs from talend cloud
- About pricing Azure Synapse
- How to ignore records that are matched already in lookup table and able to fetch the next matched records in Talend
- How to batch delete records from a airtable base using google apps script?
- Talend application scenarios: is it correct to have logical operators in the first term of GAV mapping?
- How to apply data governance to API integration
- transform json to table with get multiple parent in every child element with pentaho
- Handling Unstructured Data from Excel in SSIS
- How to Unescape a character in snowflake during data ingestion from CSV to Snowflake table
- SAS SQL Pass-Through Facility does not work as expected for Postgres database
Related Questions in TALEND-MDM
- Compression Discrepancy in BMEcat XML Files Generated via Talend vs. eprocat
- How to download file from Oracle ucm using web service in Talend job
- Convert data from single CSV file to multiple JSON files using Talend
- Convert CSV to JSON using Talend Open Studio for Data Integration
- Sorting on similar values using talend
- How we can get nvarchar column value from SQL Server in talend
- Talend: How to remove a specific XML tag from output when the source file is a .txt file instead of xml
- How to get metadata from Talend Data Management Platform?
- talend format yyyy-MM-dd'T'HH:mm:ss.SSSz to yyyy-mm-dd HH:mm:ss
- Load the data from MongoDB (tMongoDbInput) of recent rows which are not loaded yet
- Dynamic Schema/Type -table column name in camelcase
- Is there any way to import executable JAR file in Talend?
- Any solutions on splitting output file?
- Generate specific key for specific code in Talend
- How to upload the range of files into database in talend
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular # Hahtags
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Only operations in tDB components (create,select,insert, etc) are actually done through SQL. All operations done in other talend components (tMap, tFilter, aggregate, etc) are done through java. Indeed you'll have better performances doing operations SQL-side. You then have to find the right balance between an "all-in-sql" type of job and an "all-java" one. (it could be harder for a talend developer to debug operations if all the sql part is done through a unique query inside a single component...).
You could definitely have your joins inside a tDBInput component, and output the result in a single output flow. You can also check ELT* components : they let you use SQL-engine instead of java-engine to perform all operations (join,aggregate,filter) while using a talend interface.