Learn performance optimization skills from TPCH tests - Q8
I Query Requirement
Q8 queries the change in market share of a given part type in a certain area of a country over the past two years.
Q8 is characterized by query operations with grouping, sorting, aggregation and sub-query operations. The main query of the sub-query has no other query objects. The sub-query is relatively simple in structure and a multi-table join query itself.
II Oracle Execution
The query SQL written in Oracle is as follows:
select /*+ parallel(n) */
o_year,
sum(case
when nation = 'CHINA' then volume
else 0
end)/ sum(volume) as mkt_share
from
(
select
extract(year from o_orderdate) as o_year,
l_extendedprice * (1 - l_discount) as volume,
n2.n_name as nation
from
part,
supplier,
lineitem,
orders,
customer,
nation n1,
nation n2,
region
where
p_partkey = l_partkey
and s_suppkey = l_suppkey
and l_orderkey = o_orderkey
and o_custkey = c_custkey
and c_nationkey = n1.n_nationkey
and n1.n_regionkey = r_regionkey
and r_name = 'ASIA'
and s_nationkey = n2.n_nationkey
and o_orderdate between date '1995-01-01' and date '1996-12-31'
and p_type = 'STANDARD POLISHED BRASS'
) all_nations
group by
o_year
order by
o_year;
Where /*+ parallel(n) */ is the parallel query syntax of Oracle, and n is the parallel number.
Script execution time, Unit: seconds
Number of parallel |
1 |
2 |
4 |
8 |
12 |
Oracle |
472 |
362 |
277 |
216 |
192 |
III SPL Optimization
The optimization principle of JOIN between orders and lineitem primary-sub tables here is similar to that in Q3.
The SPL script is as follows:
A |
|
1 |
=now() |
2 |
1995-01-01 |
3 |
1996-12-31 |
4 |
>nation="CHINA" |
5 |
>name="ASIA" |
6 |
>type="STANDARD POLISHED BRASS" |
7 |
=file("region.btx").import@b().select(R_NAME==name).derive@o().keys@i(R_REGIONKEY) |
8 |
=file("nation.btx").import@b().select(N_NAME==nation).switch@i(N_REGIONKEY,A7).derive@o().keys@i(N_NATIONKEY) |
9 |
=file("nation.btx").import@b().switch@i(N_REGIONKEY,A7).derive@o().keys@i(N_NATIONKEY) |
10 |
=file("supplier.ctx").open().cursor@m(S_SUPPKEY;A8.find(S_NATIONKEY)).fetch().keys@im(S_SUPPKEY) |
11 |
=file("part.ctx").open().cursor@m(P_PARTKEY;P_TYPE==type).fetch().keys@im(P_PARTKEY) |
12 |
=file("customer.ctx").open().cursor@m(C_CUSTKEY;A9.find(C_NATIONKEY)).fetch().keys@im(C_CUSTKEY) |
13 |
=file("orders.ctx").open().cursor@m(O_ORDERKEY,O_ORDERDATE;O_ORDERDATE>=A2 && O_ORDERDATE <=A3 && A12.find(O_CUSTKEY)) |
14 |
=file("lineitem.ctx").open().news(A13,L_SUPPKEY,L_EXTENDEDPRICE,L_DISCOUNT,O_ORDERDATE;A11.find(L_PARTKEY)) |
15 |
=A14.switch(L_SUPPKEY,A10) |
16 |
=A15.run(L_EXTENDEDPRICE*=(1-L_DISCOUNT)) |
17 |
=A16.groups( year(O_ORDERDATE):o_year;sum(if(L_SUPPKEY,L_EXTENDEDPRICE,0)):s1,sum(L_EXTENDEDPRICE):s2) |
18 |
=A17.new(o_year,s1/s2:mkt_share) |
19 |
return interval@ms(A1,now()) |
The nation table has been used twice as a foreign key table, being generated in A8 and A9 respectively.
Script execution time, Unit: seconds
Number of parallel |
1 |
2 |
4 |
8 |
12 |
Oracle |
472 |
362 |
277 |
216 |
192 |
SPL composite table |
315 |
162 |
92 |
46 |
37 |
SPL Official Website 👉 https://www.scudata.com
SPL Feedback and Help 👉 https://www.reddit.com/r/esProcSPL
SPL Learning Material 👉 https://c.scudata.com
SPL Source Code and Package 👉 https://github.com/SPLWare/esProc
Discord 👉 https://discord.gg/2bkGwqTj
Youtube 👉 https://www.youtube.com/@esProc_SPL
Chinese Version