I am facing an internship and they asked me to learn how to use talend ETL. I did it, not so difficult. One of the extra-tasks that have been assigned to me is to verify how much of the operations I set on the design workspace is executed in java and what is done through the use of queries. I've set up a simple Join using the TMap component and I monitored the SQLdatabase through the use of SQL Profiler. the result is that only the essential create/drop and the select/insert of the table is done via sql while every other thing like the actual join is made "Java" side. As long as it is an simple operation like join, wouldn't it be convenient to execute it through a query without having to bother java to perform it? For those who also know SAP, in terms of performance is there so much difference between Talend and SAP?
How much of Talend functionality is translated in SQL-Query and how much in Java?
284 Views Asked by Signori Andrea At
1
There are 1 best solutions below
Related Questions in ETL
- Monolithic ETL to distributed/scalable solution and OLAP cube to Elasticsearch/Solr
- How to use component javascript in the Pentahoo Data Integration
- SSIS ETL parallel extraction from a AS400 file
- ETL Hangs - SQL Server in EC2 Machine + SSIS + AWS RDS SQL Server
- Pull Text file to SQL server 2008 table
- SqlAlchemy get all strings (don't cast to boolean or datetime)
- Best / simplest way to transfer data from one Oracle database to another
- Using blank-line delimited records and colon-separated fields in awk
- SSIS dynamic columns validation
- Is it possible to pass parameter inside With Clause in SQL Server SSIS Job?
- Easiest way to import a simple csv file to a graph with OrientDB ETL
- forwarding data from one source to another in real time
- SSIS Variable Scope Issues
- OrientDB ETL with self joined mysql table
- loop row by row from an excel file map to variable
Related Questions in TALEND
- Talend Open Studio for Big Data
- How to extract data from web api with Talend Open Studio
- Send Json Request Using tRestClient With Nested Object in talend
- Get column name ( Meta Data ) Talend
- How to concatenate two column in talend tMap
- Condition on lookup Talend
- Put File Mask on tFileList Component dynamically
- import Data from Excel to MongoDB in Talend
- Talend Open Studio: maven to add librairies
- Unauthorized error in Talend REST Client
- Build option is not working in Talend Job
- Handle tS3Connection failure in Talend
- Talend > NullPointerException on tFileUnarchive
- Talend TRestClient: geocoding and combination of both flows (rows) afterwards
- Connection failure. You must change the jdbc7.jar at org.talende
Related Questions in DATA-INTEGRATION
- Workday: Submit_Customer_Invoice returning an error - The task submitted is not authorized
- Installing Silk Workbench on windows 10?
- How to integrate tabular data into GraphDB automatically?
- Format was not found error when running an imported EG on DI
- SAS DI Error 22-232 in ROW_NUMBER () OVER (PARTITION BY construction
- Data retrieval and search accross multiple services
- Unable to load the source file details into mysql database using TALEND tool
- Getting result from SQL Server Stored Procedure
- How can I integrate data on regular bases between 2 different MySQL Servers?
- How to configure Apache Flume to fetch data from Twitter for specific period?
- Extraction of data from Siebel Data base to Dat file and staging table
- Data loading is slow while using "Insert/Update" step in pentaho
- Can not override Talend job context parameters when launching from the command-line
- What is the best object mapper for merging DTOs from many sources?
- Unable to complete Oracle example for ODI Flat File to Flat File Export
Related Questions in TALEND-MDM
- Modelling Magento in Talend MDM
- import Data from Excel to MongoDB in Talend
- Unauthorized error in Talend REST Client
- Build option is not working in Talend Job
- Unable to call Talend Open Studio tContextLoad in query
- How do I set the order of execution of my already existing subjobs in Talend?
- How to read data values from different rows from an Excel file in Talend?
- How to read and write to the same excel file in Talend in the same subjob?
- What component can be used to duplicate every row of an excel file using Talend?
- How to append columns to other columns in Talend?
- How to download file from Oracle ucm using web service in Talend job
- Compression Discrepancy in BMEcat XML Files Generated via Talend vs. eprocat
- Issue Installing Talend 7.3, No response
- Migrate excel data to postgres database
- What caused Collibra to transition away from Collibra Connect?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Only operations in tDB components (create,select,insert, etc) are actually done through SQL. All operations done in other talend components (tMap, tFilter, aggregate, etc) are done through java. Indeed you'll have better performances doing operations SQL-side. You then have to find the right balance between an "all-in-sql" type of job and an "all-java" one. (it could be harder for a talend developer to debug operations if all the sql part is done through a unique query inside a single component...).
You could definitely have your joins inside a tDBInput component, and output the result in a single output flow. You can also check ELT* components : they let you use SQL-engine instead of java-engine to perform all operations (join,aggregate,filter) while using a talend interface.