为什么CTE（递归）未并行化（MAXDOP = 8）？

桑博远

2023-03-14

问题内容：

我们有相当大的计算机100GB +内存和8+内核。服务器范围的MAXDOP = 8。

T_SEQ_FF rowcount = 61692209, size = 2991152 KB

UPD 1： 表T_SEQ_FF具有两个索引：

1) create index idx_1 on T_SEQ_FF (first_num)
2) create index idx_2 on T_SEQ_FF (second_num)

表格T_SEQ_FF中的first_num，second_num pairs有num个，应在cte之后提供一个序列：

;with first_entity as ( 
    select first_num from  T_SEQ_FF a  where not exists (select 1 from  T_SEQ_FF b  where a.first_num = b.second_num) 
) ,
cte as ( 
select a.first_num, a.second_num, a.first_num as first_key, 1 as sequence_count 
from  T_SEQ_FF a  inner join first_entity b on a.first_num = b.first_num 
union all 
select a.first_num, a.second_num, cte.first_key, cte.sequence_count + 1 
from  T_SEQ_FF a  
inner join cte on a.first_num = cte.second_num 
) 
select * 
from cte 
option (maxrecursion 0);

但是，当我运行此查询时，我只会看到没有并行的串行查询计划。如果我从上述查询中删除 CTE的第二部分：

union all 
    select a.first_num, a.second_num, cte.first_key, cte.sequence_count + 1 
    from  T_SEQ_FF a  
    inner join cte on a.first_num = cte.second_num

然后我可以看到使用Repartition和Gather Streams使查询计划成为 并行化 。

因此，我可以总结一下，这是因为 recurisve CTE的SQL Server处理此查询时，不使用并行。

我相信，在拥有大量免费资源的大型计算机上，并行性应有助于更快地完成查询。

现在，它运行约40-50分钟。

您能否建议如何使用尽可能多的资源来更快地完成查询？

CTE是唯一的选择，因为我们需要从first_num - second_num成对中填充序列，并且这些序列可以是任何长度。

问题答案：

我会尝试重写CTE以删除以下步骤之一，即

;cte as ( 
select a.first_num, a.second_num, a.first_num as first_key, 1 as sequence_count 
from  T_SEQ_FF a  where not exists (select 1 from  T_SEQ_FF b  where a.first_num = b.second_num) 
union all 
select a.first_num, a.second_num, cte.first_key, cte.sequence_count + 1 
from  T_SEQ_FF a  
inner join cte on a.first_num = cte.second_num 
) 
select * 
from cte 
option (maxrecursion 0);

如果只有一个根元素，最好将其作为变量传递到查询中，以便查询优化器可以使用该值。

另一尝试是更改查询以获取没有子查询的根元素，即，second_num为null或first_num = second_num。

为什么CTE（递归）未并行化（MAXDOP = 8）？

相关阅读

相关文章

相关问答

相关工具

相关文档