As I stated in previous posts I've inherited a warehouse based on SQL Server framework.
In my continuous review of existing components and the practices adopted by the predecessors I found something that caught my attention and surprised me: there are several places where physical tables are being used as temporal tables to manipulate large amounts of data. First reaction was to think this practice is very expensive for the DBMS but would like to have more feedback on that.
Some annotations on this topic:
- Physical tables being created/droped within SPs ( tables called "TMP_TableName")
- Tables mostly used to manipulate large amount of data
- Mentioned SPs calls several times during daily night processing
Questions:
- Does this practice brings up any advantage to the processing routines that I'm not aware of?
- Are there best practices about this?
- My plan is to update the code to use #temp tables to improve the performance. Comments on that?.
- Should I consider to use variable tables? I read the performance is bad when treating with big data.
I appreciate any feedback based on your knowlege/experience you may want to share with all the stackoverflowers.
Thanks in advance,
You can get performance improvements by dumping specific data into tables and creating specific indexes etc. i.e. if the same large but limited dataset is used multiple times, or if you have a calculated column which requires an index. Often this happens behind the scenes by SQL anyway though, whether through caching/spooling etc.
Personally I would leave the table intact so I could debug afterwards. If you were to do that you would need code to clear the table beforehand at each run
There is a serious downside with the current approach: If two process run the stored proc at the same time the data will 'collide'
It's unlikely there will be any change in performance unless your DB is on a slow disk and your tempdb is on a fast disk, and you have insufficient RAM so it has to use the disk all the time.
This will fix the problem with the SP being run at the same time with multiple processes though.
Basically no.....fully covered in the prior post.
In summary convert to temp (#) tables unless you have some kind of need to debug the process