MySQL的排序和分組

原創

2021-05-15 03:33

order by和group by這兩個要十分注意，因爲一不小心就會產生文件內排序，即file sort，這個性能是十分差的。下面來看具體的案例分析。

首先建表：

create table `tblA`(
  `id` int not null primary key auto_increment comment '主鍵',
  `age` int not null comment '年齡',
  `birth` timestamp not null comment '生日'
) ;
insert into tblA(age, birth) values(22, now());
insert into tblA(age, birth) values(23, now());
insert into tblA(age, birth) values(24, now());

create index idx_age_birth on tblA(age, birth);

1. order by：

看看下面語句的執行計劃：

explain select * from tblA where age > 20 order by age;
explain select * from tblA where age > 20 order by age,birth;

這兩個個毫無疑問，可以用到索引。

再來看看這個：

explain select * from tblA where age > 20 order by birth;

顯然我們可以看到這裏產生了filesort，爲什麼呢？因爲age是範圍，且order by的直接是二樓，帶頭大哥沒了，所以索引失效了。

那這樣呢？

explain select * from tblA where age > 20 order by birth, age;
explain select * from tblA where age > 20 order by age, birth;

第一個還是不行，因爲範圍後失效，且order by是從birth二樓開始的。第二個可以用到索引，不會產生filesort，是因爲，雖然前面的age是範圍，但是order by的又是從age開始，帶頭大哥在。

上面這些都好理解，看看這個：

explain select * from tblA  order by age desc, birth asc;

奇了怪了，帶頭大哥在，也沒有範圍，爲啥就出現了filesort了呢？

這是因爲age是降序，birth又是升序，一升一降，就會導致索引用不上，就會產生filesort了。如果把兩個都改成desc或者asc，那就沒問題了。

注意：

MySQL的filesort有兩種策略，
MySQL4.1之前，叫雙路排序。
就是會進行兩次磁盤I/O操作。讀取行指針和order by的列，
對它們排序，然後掃描排好序的表，再從磁盤中取出數據來。

4.1之後的版本，叫單路排序，只進行一次I/O。
先將數據從磁盤讀到內存中，然後在內存中排序。
但是，如果內存，即sort_buffer_size不夠大，性能反而不如雙路排序。

order by優化小總結：

儘量避免select *;
嘗試增大sort_buffer_size，不管用哪種算法，增大這個都可以提高效率；
嘗試增大max_length_for_sort_data，增大這個，會增加用改進算法的概率。

2. group by：

group by 其實和order by一樣，也是先排序，不過多了一個分組，也遵從最佳左前綴原則。要注意的一點是，where優於having，能用where時就不要用having。

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

MySQL的排序和分組

SQL優化-20231016

MySQL窗口函數的妙用

緩存一致性問題一、什麼是一致性問題二、使用緩存存在的問題三、緩存一致性問題的解決方案

手把手教你搭建ELK

java面試題 --- MySQL②

java面試題 --- Spring③

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結