Query Optimisation left outer join 7 tables - SQL Server - sql-server

Need help for faster execution and optimisation for the below.
Find the execution plan in the 2 images attached..
Data in all tables are as follows:
company - 25 rows
warranty_period - 878 rows
company_claimtype - 216 rows
claim_type - 25 rows
customer - 15980 rows
auto_handle - 25 rows
auto_handle_condition - 31 rows
PART_REQUIRED_RETURN - 4521 rows
The following query took almost 12 mins to execute and it contains only left outer join of all the above mentioned tables:
select
company0_.ID as ID1_9_0_,
warrantype1_.ID as ID1_36_1_,
partsrequi2_.ID as ID1_20_2_,
claimtype4_.ID as ID1_8_3_,
autohandle5_.ID as ID1_4_4_,
autohandle6_.ID as ID1_5_5_,
customer7_.ID as ID1_14_6_,
company0_.CREATED as CREATED2_9_0_,
company0_.UPDATED as UPDATED3_9_0_,
company0_.AUTO_REJECT_USER as AUTO_REJ4_9_0_,
company0_.CLAIM_CREDIT_ORDER_HISTORY as CLAIM_CR5_9_0_,
company0_.CLAIM_ORDER_HISTORY as CLAIM_OR6_9_0_,
company0_.COMPANY_CODE as COMPANY_7_9_0_,
company0_.COMPANY_NAME as COMPANY_8_9_0_,
company0_.CONNECTED_TO_ERP as CONNECTE9_9_0_,
company0_.DEFAULT_CUSTOMER_ID as DEFAULT16_9_0_,
company0_.DATE_FORMAT as DATE_FO10_9_0_,
company0_.DISPLAY_WORK_LOCATION as DISPLAY11_9_0_,
company0_.DISTANCE_UNIT as DISTANC12_9_0_,
company0_.ENABLE_SHIPPING_LABEL as ENABLE_13_9_0_,
company0_.HANDLE_CREDIT_INVOICES as HANDLE_14_9_0_,
company0_.RETURNS_PARTS_DAYS as RETURNS15_9_0_,
warrantype1_.CREATED as CREATED2_36_1_,
warrantype1_.UPDATED as UPDATED3_36_1_,
warrantype1_.COMPANY_ID as COMPANY_8_36_1_,
warrantype1_.PERIOD_CONSUMER as PERIOD_C4_36_1_,
warrantype1_.PERIOD_PROFESSIONAL as PERIOD_P5_36_1_,
warrantype1_.PRODUCT_GROUP as PRODUCT_6_36_1_,
warrantype1_.PRODUCT_NO as PRODUCT_7_36_1_,
warrantype1_.COMPANY_ID as COMPANY_8_36_0__,
warrantype1_.ID as ID1_36_0__,
partsrequi2_.CREATED as CREATED2_20_2_,
partsrequi2_.UPDATED as UPDATED3_20_2_,
partsrequi2_.COMPANY_ID as COMPANY_5_20_2_,
partsrequi2_.PRODUCT_NO as PRODUCT_4_20_2_,
partsrequi2_.COMPANY_ID as COMPANY_5_20_1__,
partsrequi2_.ID as ID1_20_1__,
claimtype4_.CREATED as CREATED2_8_3_,
claimtype4_.UPDATED as UPDATED3_8_3_,
claimtype4_.CATEGORY as CATEGORY4_8_3_,
claimtype4_.CLAIM_TYPE as CLAIM_TY5_8_3_,
claimtype4_.SORT_ORDER as SORT_ORD6_8_3_,
claimtype4_.TEXT_KEY as TEXT_KEY7_8_3_,
claimtypes3_.COMPANY_ID as COMPANY_2_10_2__,
claimtypes3_.CLAIMTYPE_ID as CLAIMTYP1_10_2__,
autohandle5_.CREATED as CREATED2_4_4_,
autohandle5_.UPDATED as UPDATED3_4_4_,
autohandle5_.[ACTION] as ACTION4_4_4_,
autohandle5_.[ACTIVE] as ACTIVE5_4_4_,
autohandle5_.[ASSIGN] as ASSIGN6_4_4_,
autohandle5_.[COMPANY_ID] as COMPANY_8_4_4_,
autohandle5_.[NAME] as NAME7_4_4_,
autohandle5_.[COMPANY_ID] as COMPANY_8_4_3__,
autohandle5_.ID as ID1_4_3__,
autohandle6_.CREATED as CREATED2_5_5_,
autohandle6_.UPDATED as UPDATED3_5_5_,
autohandle6_.AUTO_HANDLE_ID as AUTO_HAN7_5_5_,
autohandle6_.[OPERATOR] as OPERATOR4_5_5_,
autohandle6_.[TYPE] as TYPE5_5_5_,
autohandle6_.[VALUE] as VALUE6_5_5_,
autohandle6_.AUTO_HANDLE_ID as AUTO_HAN7_5_4__,
autohandle6_.ID as ID1_5_4__,
customer7_.CREATED as CREATED2_14_6_,
customer7_.UPDATED as UPDATED3_14_6_,
customer7_.COMPANY_ADDRESS as COMPANY_4_14_6_,
customer7_.STREET as STREET5_14_6_,
customer7_.ZIPCODE as ZIPCODE6_14_6_,
customer7_.COMPANY_ID as COMPANY_9_14_6_,
customer7_.CUSTOMER_NO as CUSTOMER7_14_6_,
customer7_.NAME as NAME8_14_6_
from
COMPANY company0_
left outer join
WARRANTY_PERIOD warrantype1_
on company0_.ID=warrantype1_.COMPANY_ID
left outer join
PART_REQUIRED_RETURN partsrequi2_
on company0_.ID=partsrequi2_.COMPANY_ID
left outer join
COMPANY_CLAIMTYPE claimtypes3_
on company0_.ID=claimtypes3_.COMPANY_ID
left outer join
CLAIM_TYPE claimtype4_
on claimtypes3_.CLAIMTYPE_ID=claimtype4_.ID
left outer join
AUTO_HANDLE autohandle5_
on company0_.ID=autohandle5_.[COMPANY_ID]
left outer join
AUTO_HANDLE_CONDITION autohandle6_
on autohandle5_.ID=autohandle6_.AUTO_HANDLE_ID
left outer join
CUSTOMER customer7_
on company0_.DEFAULT_CUSTOMER_ID=customer7_.ID

What I find particularly strange is that you get about 585k records from PART_REQUIRED_RETURN and then tag those along but when the server adds the AUTO_HANDLE data things suddenly multiply to 2.1M records. Are you sure your join takes into account the entire PK? As your query looks right now I would assume that ID is the PK of AUTO_HANDLE, however the numbers seem to contradict this. Something similar happens when adding the AUTO_HANDLE_CONDITION data.
In the end you are reading 3.3M rows, or about 4.5Gb of data. I don't think you can expect to bring this down to seconds.

Related

Slow query in SQL Server with LEFT OUTER JOIN

I have the query below. When I use LEFT OUTER JOIN my query is very slow. It took more than 1 minute for the query to complete. Is there a way I can improve query speed while still using LEFT OUTER JOIN?
SELECT RISK.RA_HAZ_ID,
RISK.RA_CTRL_ID,
RISK.RA_HAZ_HAZARD_NAME,
RISK.RA_CTRL_CONTROL_NAME,
RISK.RA_CTRL_CONTROL_ID,
RISK.RA_IS_ESCALATED,
RISK.RA_IS_CONSEQUENCE,
RISK.RA_CTRL_OUTCOME,
RISK.RA_HAZ_IS_ALARP,
RISK.RA_HAZ_CONSEQUENCE,
RISK.RA_STEP_ORDER,
RISK.RA_STEP_NAME,
RISK.RA_HAZ_TYPE_NAME,
RISK.RA_CTRL_IS_REQUIRED,
RISK.RA_CTRL_IS_PRE_SELECTED,
RISK.RA_CTRL_IS_SELECTED,
RISK.RA_HAZ_SORT_ORDER,
RISK.RA_HAZ_COMMENT,
RISK.RA_CONTROL_COMMENT,
RISK.INIT_RISK,
RISK.RES_RISK,
RISK.RA_MATRIX_ID,
RISK.RES_RISK_LEVEL,
RISK.INIT_RISK_LEVEL,
MAX(RISK.RA_CTRL_ASSIGNED_FULL_NAME) AS RA_CTRL_ASSIGNED_FULL_NAME
FROM PRINT_RISK_ASSESSMENT_DETAILS RISK
LEFT OUTER JOIN PRINT_JSA_REFERENCE JSA ON JSA.OWNING_ACTIVITY_ID = RISK.OWNING_ACTIVITY_ID
WHERE JSA.ACTIVITY_ID = 10977 OR RISK.OWNING_ACTIVITY_ID = 10977
GROUP BY RISK.RA_HAZ_ID,
RISK.RA_CTRL_ID,
RISK.RA_HAZ_HAZARD_NAME,
RISK.RA_CTRL_CONTROL_NAME,
RISK.RA_CTRL_CONTROL_ID,
RISK.RA_IS_ESCALATED,
RISK.RA_IS_CONSEQUENCE,
RISK.RA_CTRL_OUTCOME,
RISK.RA_HAZ_IS_ALARP,
RISK.RA_HAZ_CONSEQUENCE,
RISK.RA_STEP_ORDER,
RISK.RA_STEP_NAME,
RISK.RA_HAZ_TYPE_NAME,
RISK.RA_CTRL_IS_REQUIRED,
RISK.RA_CTRL_IS_PRE_SELECTED,
RISK.RA_CTRL_IS_SELECTED,
RISK.RA_HAZ_SORT_ORDER,
RISK.RA_HAZ_COMMENT,
RISK.RA_CONTROL_COMMENT,
RISK.INIT_RISK,
RISK.RES_RISK,
RISK.RA_MATRIX_ID,
RISK.RES_RISK_LEVEL,
RISK.INIT_RISK_LEVEL
ORDER BY RISK.RA_STEP_ORDER,
RISK.RA_HAZ_SORT_ORDER,
RISK.RA_HAZ_HAZARD_NAME,
RISK.RA_HAZ_TYPE_NAME DESC,
RISK.RA_CTRL_IS_REQUIRED DESC,
RISK.RA_CTRL_IS_PRE_SELECTED DESC,
RISK.RA_CTRL_CONTROL_NAME
I want to improve query speed while still using LEFT OUTER JOIN
You can first see if you have indexes over important fields JSA.ACTIVITY_ID, JSA.OWNING_ACTIVITY_ID and RISK.OWNING_ACTIVITY_ID. If not, consider creating it.
Since you are grouping over the entiry table, if RISK has an important amount of row, it will be slow.

Does SQL Server CASE expression not do short circuit?

Appropriate if somebody could help us to optimize below query
As per execution plane it seems the else part sub query is always executing, irrespective of conditions.
Won't CASE be short circuited? Why is it executing, even though it is not necessary?
IF OBJECT_ID('#Calculation') IS NOT NULL
DROP TABLE #Calculation;
SELECT
Result.IdDeckungsbeitrag,
Result.wert AS Wert,
REPLACE(#Formula, '<#PackagingCosts> ', Result.wert) AS Kalkulation
INTO
#Calculation
FROM
(SELECT
deck.IdDeckungsbeitrag,
CASE
WHEN lp.ID_VERPACKUNG_2 IS NULL
AND lp.ID_VERPACKUNG_3 IS NULL
THEN vg.VERPACKUNGSKOSTEN_PRO_EINHEIT * deck.Menge
ELSE
(
SELECT SUM(temp.me) * vg.VERPACKUNGSKOSTEN_PRO_EINHEIT
FROM
(
SELECT SUM(gv.MENGE) AS me
FROM dbo.KUNDENRECHNUNG_POSITION krp
LEFT JOIN dbo.LIEFERSCHEIN_POSITION lp
ON lp.ID_LIEFERSCHEIN_POSITION = krp.ID_LIEFERSCHEIN_POSITION
LEFT JOIN dbo.GEBINDE_VERLADEN gv
ON gv.ID_LIEFERSCHEIN_POSITION = lp.ID_LIEFERSCHEIN_POSITION
LEFT JOIN dbo.MATERIAL_BESTAND mb
ON mb.ID_MATERIAL_BESTAND = gv.ID_MATERIAL_BESTAND
LEFT JOIN dbo.MATERIAL_GEBINDE mg
ON mg.ID_MATERIAL_BESTAND = mg.ID_MATERIAL_BESTAND
WHERE mg.CHARGE_NUMMER = deck.Charge
GROUP BY mg.ID_VERPACKUNG
) temp
)
END AS wert
FROM #DeckungsbeitragCalculationPositions_TVP deck
LEFT JOIN dbo.KUNDENRECHNUNG_POSITION krp
ON krp.ID_KUNDENRECHNUNG_POSITION = deck.IdDeckungsbeitrag
LEFT JOIN dbo.LIEFERSCHEIN_POSITION lp
ON lp.ID_LIEFERSCHEIN_POSITION = krp.ID_LIEFERSCHEIN_POSITION
LEFT JOIN dbo.VERPACKUNG vg
ON vg.ID_VERPACKUNG = lp.ID_VERPACKUNG_1
WHERE deck.IdMandant = #Id_Mandant
) Result;
Don't think of it as short-circuit or not. That's a procedural code mindset rather than a set-based mindset.
In SQL, the actual "execution" happens in the realm of table or index scans and seeks, hash matches, sorts, and the like. Pull data from different tables into a working set that will eventually produce the desired relational result.
Not seeing any real or projected execution plan, in this case (no pun intended), I suspect the query optimizer decided it was most efficient to first produce this result set in memory:
SELECT mg.ID_VERPACKUNG, mg.CHARGE_NUMMER, SUM(gv.MENGE) AS me
FROM dbo.KUNDENRECHNUNG_POSITION krp
LEFT JOIN dbo.LIEFERSCHEIN_POSITION lp
ON lp.ID_LIEFERSCHEIN_POSITION = krp.ID_LIEFERSCHEIN_POSITION
LEFT JOIN dbo.GEBINDE_VERLADEN gv
ON gv.ID_LIEFERSCHEIN_POSITION = lp.ID_LIEFERSCHEIN_POSITION
LEFT JOIN dbo.MATERIAL_BESTAND mb
ON mb.ID_MATERIAL_BESTAND = gv.ID_MATERIAL_BESTAND
LEFT JOIN dbo.MATERIAL_GEBINDE mg
ON mg.ID_MATERIAL_BESTAND = mg.ID_MATERIAL_BESTAND
GROUP BY mg.ID_VERPACKUNG, mg.CHARGE_NUMMER
This is the subquery from the ELSE clause, minus the WHERE clause conditions and with additional info added to the SELECT to make the match more effective. If the query optimizer can't be confident of meeting the WHEN clause a high percentage of the time, it might believe producing this larger ELSE set once to match against as needed is more efficient. Put another way, if it thinks it will have to run that subquery a lot anyway, it may try to pre-load it for all possible data.
We don't know enough about your database to suggest real solutions, but indexing around the ID_VERPACKUNG_2, ID_VERPACKUNG_3, and CHARGE_NUMMER fields might help. You might also be able to use a CTE, temp table, or table variable to help Sql Server to a better job of caching this data just once.
If the conditions of where are not met then the else will become active. If this is always the case then you would expect the following to return no rows:
select *
FROM #DeckungsbeitragCalculationPositions_TVP deck
LEFT JOIN dbo.KUNDENRECHNUNG_POSITION krp ON krp.ID_KUNDENRECHNUNG_POSITION = deck.IdDeckungsbeitrag
LEFT JOIN dbo.LIEFERSCHEIN_POSITION lp ON lp.ID_LIEFERSCHEIN_POSITION = krp.ID_LIEFERSCHEIN_POSITION
WHERE lp.ID_VERPACKUNG_2 IS NULL
AND lp.ID_VERPACKUNG_3 IS NULL

ibm 1 sql creating - need to add 2 tables

I have following view which is working but not sure how to add 2 tables to join.
This table is adres1 and it will join on the IDENT# and IDSFX# to table
prodta.adres1 called adent# and adsfx#, there I need a col. ads15.
then i also need to get the ship to, row in this adres1. this we get first from the order table, prodta. oeord1 in col. odgrc#. This grc# is 11 pos and is combined 8 and 3 of the ent and suf. these 2 represent the ship to record and looking in same table adres1 (we do have many logical views on them if it's easier, like adres15) we can get col. ADSTTC for the ship to state.
Not sure if can included these 2 new parts to the current view created code below. Please ask if something not clear, it's an old system and somewhat developed convoluted.
CREATE VIEW Prolib.SHPWEIGHTP AS SELECT
T01.IDORD#,
T01.IDDOCD,
T01.IDPRT#,
t01.idsfx#,
T01.IDSHP#,
T01.IDNTU$,
T01.IDENT#,
(T01.IDNTU$ * T01.IDSHP#) AS LINTOT,
T02.IAPTWT,
T02.IARCC3,
T02.IAPRLC,
T03.PHVIAC,
T03.PHORD#,
PHSFX#,
T01.IDORDT,
T01.IDHCD3
FROM PRODTA.OEINDLID T01
INNER JOIN PRODTA.ICPRTMIA T02 ON T01.IDPRT# = T02.IAPRT#
INNER JOIN
(SELECT DISTINCT
PHORD#,
PHSFX#,
PHVIAC,
PHWGHT
FROM proccdta.pshippf) AS T03 ON t01.idord# = T03.phord#
WHERE T01.IDHCD3 IN ('MDL','TRP')
I'm not exactly clear on what you're asking, and it looks like some of the column-names are missing from your description, but this should get you pretty close:
CREATE VIEW Prolib.SHPWEIGHTP AS
SELECT T01.IDORD#,
T01.IDDOCD,
T01.IDPRT#,
t01.idsfx#,
T01.IDSHP#,
T01.IDNTU$,
T01.IDENT#,
( T01.IDNTU$ * T01.IDSHP# ) AS LINTOT ,
T02.IAPTWT,
T02.IARCC3,
T02.IAPRLC,
T03.PHVIAC,
T03.PHORD#,PHSFX#,
T01.IDORDT,
T01.IDHCD3,
t04.ads15
FROM PRODTA.OEINDLID T01
INNER JOIN PRODTA.ICPRTMIA T02
ON T01.IDPRT# = T02.IAPRT#
INNER JOIN (SELECT DISTINCT
PHORD#,
PHSFX#,
PHVIAC,
PHWGHT
FROM proccdta.pshippf) AS T03
ON t01.idord# = T03.phord#
JOIN prodta.adres1 as t04
on t04.adent# = t01.adent#
and t04.adsfx# = t01.adsfx#
JOIN prodta.oeord1 t05
on t05.odgrc# = T01.IDENT# || T01.SUFFIX
WHERE T01.IDHCD3 IN ('MDL','TRP')
Let me know if you need more details.
HTH !

SQL Server LEFT JOIN

This query has been keeping me busy for the last couple of days. I tried to rewrite it with different ideas but I keep having the same problem. To simplify the problem I put part of my query in a view, this view returns 23 records. Using a left join I would like to add fields coming from the table tblDatPositionsCalc to these 23 records. As you can see I have an additional condition on the tblDatPositionsCalc in order to only consider the most recent records. With this condition it would return 21 records. The join should be on two fields together colAccount and colId.
I simply want the query to return the 23 records from the view and where possible have the information from tblDatPositionsCalc. There is actually only 2 records in the view without corresponding id and account in tblDatPositionsCalc, that means out of the 23 records only 2 will have missing values in the fields coming from the table tblDatPositionsCalc.
The problem with my query is that it only returns the 21 records from tblDatPositionsCalc. I don't understand why. I tried to move the condition on date in just after the JOIN condition but that did not help.
SELECT TOP (100) PERCENT
dbo.vwCurrPos.Account,
dbo.vwCurrPos.Id,
dbo.vwCurrPos.TickerBB,
dbo.vwCurrPos.colEquityCode,
dbo.vwCurrPos.colType,
dbo.vwCurrPos.colCcy,
dbo.vwCurrPos.colRegion,
dbo.vwCurrPos.colExchange,
dbo.vwCurrPos.[Instr Type],
dbo.vwCurrPos.colMinLastDay,
dbo.vwCurrPos.colTimeShift,
dbo.vwCurrPos.Strike,
dbo.vwCurrPos.colMultiplier,
dbo.vwCurrPos.colBetaVol,
dbo.vwCurrPos.colBetaEq,
dbo.vwCurrPos.colBetaFloor,
dbo.vwCurrPos.colBetaCurv,
dbo.vwCurrPos.colUndlVol,
dbo.vwCurrPos.colUndlEq,
dbo.vwCurrPos.colUndlFut,
tblDatPositionsCalc_1.colLots,
dbo.vwCurrPos.[Open Positions],
dbo.vwCurrPos.colListMatShift,
dbo.vwCurrPos.colStartTime,
tblDatPositionsCalc_1.colPrice,
tblDatPositionsCalc_1.colMktPrice,
dbo.vwCurrPos.colProduct,
dbo.vwCurrPos.colCalendar,
CAST(dbo.vwCurrPos.colExpiry AS DATETIME) AS colExpiry,
dbo.vwCurrPos.colEndTime,
CAST(tblDatPositionsCalc_1.colDate AS datetime) AS colDate,
dbo.vwCurrPos.colFund,
dbo.vwCurrPos.colExchangeTT,
dbo.vwCurrPos.colUserTag
FROM dbo.vwCurrPos
LEFT OUTER JOIN dbo.tblDatPositionsCalc AS tblDatPositionsCalc_1
ON tblDatPositionsCalc_1.colId = dbo.vwCurrPos.Id
AND tblDatPositionsCalc_1.colAccount = dbo.vwCurrPos.Account
WHERE (tblDatPositionsCalc_1.colDate =
(SELECT MAX(colDate) AS Expr1 FROM dbo.tblDatPositionsCalc))
ORDER BY
dbo.vwCurrPos.Account,
dbo.vwCurrPos.Id,
dbo.vwCurrPos.colEquityCode,
dbo.vwCurrPos.colRegion
Any idea what might cause the problem?
(Option 1) DrCopyPaste is right so your from clause would look like:
...
FROM dbo.vwCurrPos
LEFT OUTER JOIN dbo.tblDatPositionsCalc AS tblDatPositionsCalc_1
ON tblDatPositionsCalc_1.colId = dbo.vwCurrPos.Id
AND tblDatPositionsCalc_1.colAccount = dbo.vwCurrPos.Account
and (tblDatPositionsCalc_1.colDate =
(SELECT MAX(colDate) AS Expr1 FROM dbo.tblDatPositionsCalc))
...
reason: the where clause restriction of left joined to column = some expression with fail to return for "null = something" so the row will be removed.
(Option 2) As oppose to pushing code in to additional views where it is harder to maintain you can nest sql select statements;
select
X.x1,X.x2,
Y.*
from X
left join
(select Z.z1 as y1, Z.z2 as y2, Z.z3 as y3
from Z
where Z.z1 = (select max(Z.z1) from Z)
) as Y
on x.x1 = Y.y1 and X.x2 = Y.y2
The advantage here is you check each nested sub query a move out quickly. Although if you still building up more logic check out common table expressions (CTE's) http://msdn.microsoft.com/en-us/library/ms175972.aspx

How to separate content from my query into 3 distinct columns

I have got the following query, which puts the data that I need (text in the 245, 260 and 300 tags of my table called "bib") into a single column (called "Title"). Instead I would like to have this data separated into three distinct columns, one for each tag. Any suggestions would be very welcome
SELECT DISTINCT isbnEX_inverted.isbn, ISNULL(bib.text, CONVERT(varchar(255), bib_longtext.longtext)) AS Title, top_circ_summary.ranking, bib.tag, bib.bib#
FROM bib INNER JOIN
item ON bib.bib# = item.bib# INNER JOIN
isbnEX_inverted ON bib.bib# = isbnEX_inverted.bib# INNER JOIN
top_circ_summary ON item.bib# = top_circ_summary.bib# LEFT OUTER JOIN
bib_longtext ON bib.bib# = bib_longtext.bib# AND bib.tag = bib_longtext.tag
WHERE (isbnEX_inverted.isbn LIKE '%978__________%') AND (top_circ_summary.collection_group = 'jfic') AND (bib.tag in ('245', '520', '300'))
order by top_circ_summary.ranking
Here's a sample of the (tab separated) output
isbn Title ranking tag bib# 9780143307334 a217 p. :bill. ;c21
cm. 1 300 962366 9780143307334 aDiary of a wimpy kid :bthe third
wheel /cby Jeff Kinney. 1 245 962366 9780143307334 aTrying to find
a partner for the Valentine's Day dance, Greg finds solace in the fact
that his best friend Rowley also doesn't have a date, but an
unexpected twist might turn his night around. 1 520 962366

Resources