目录
一、复合查询
1.1 多表查询
实际开发中往往数据来自不同的表,所以需要多表查询,但是可以将多张表做笛卡尔积后的表当做是一张表,也就是单表查询。
我们用一个简单的公司管理系统,有三张表EMP、DEPT、SALGRADE来演示如何进行多表查询。
语法:from 表1, 表2 ...
这样其实就是对多张表进行笛卡尔积,假设是两张表:
- 第一张表的第一行和第二张表的所有行组合。
- 第一张表的第二行和第二张表的所有行组合。…
这样所有的组合,并不都是有意义的,所有一般都是根据条件筛选的。
在多表查询中,有可能多张表有相同的字段,为了便于区分,可以使用表名.字段名
表示某一张表的某一个字段。
mysql> select dept.dname, emp.ename, emp.sal from emp,dept where dept.deptno = emp.deptno; +------------+--------+---------+ | dname | ename | sal | +------------+--------+---------+ | RESEARCH | SMITH | 800.00 | | SALES | ALLEN | 1600.00 | | SALES | WARD | 1250.00 | | RESEARCH | JONES | 2975.00 | | SALES | MARTIN | 1250.00 | | SALES | BLAKE | 2850.00 | | ACCOUNTING | CLARK | 2450.00 | | RESEARCH | SCOTT | 3000.00 | | ACCOUNTING | KING | 5000.00 | | SALES | TURNER | 1500.00 | | RESEARCH | ADAMS | 1100.00 | | SALES | JAMES | 950.00 | | RESEARCH | FORD | 3000.00 | | ACCOUNTING | MILLER | 1300.00 | +------------+--------+---------+ 14 rows in set (0.00 sec)
1.2 自连接
自连接是对一张表的查询,但是使用多张表的查询方式,自己做笛卡尔积。
例如:
mysql> show create table emp \G *************************** 1. row *************************** Table: emp Create Table: CREATE TABLE `emp` ( `empno` int(6) unsigned zerofill NOT NULL COMMENT '雇员编号', `ename` varchar(10) DEFAULT NULL COMMENT '雇员姓名', `job` varchar(9) DEFAULT NULL COMMENT '雇员职位', `mgr` int(4) unsigned zerofill DEFAULT NULL COMMENT '雇员领导编号', `hiredate` datetime DEFAULT NULL COMMENT '雇佣时间', `sal` decimal(7,2) DEFAULT NULL COMMENT '工资月薪', `comm` decimal(7,2) DEFAULT NULL COMMENT '奖金', `deptno` int(2) unsigned zerofill DEFAULT NULL COMMENT '部门编号' ) ENGINE=InnoDB DEFAULT CHARSET=utf8 1 row in set (0.00 sec)
- 显示员工FORD的上级领导的编号和姓名(mgr是员工领导的编号)。
- 这是员工表,领导也是属于员工,所以无法只用一条简单查找语句对一张表查找两次,这就需要用到复合查询。
- 子查询 — 即嵌套查询,使用两条查询语句
mysql> select empno, ename from emp where empno = (select mgr from emp where ename = 'FORD'); +--------+-------+ | empno | ename | +--------+-------+ | 007566 | JONES | +--------+-------+ 1 row in set (0.00 sec
使用自连接的两张表查询,这里需要对表使用别名
mysql> select leader.empno, leader.ename from emp as worker, emp leader where worker.mgr = leader.empno and worker.ename='FORD'; +--------+-------+ | empno | ename | +--------+-------+ | 007566 | JONES | +--------+-------+ 1 row in set (0.00 sec)
1.3 子查询
子查询是指嵌入在其他sql语句中的select语句,也叫嵌套查询。
1.3.1 单行子查询
返回一行记录的子查询
- 显示SMITH同一部门的员工
mysql> select deptno, ename from emp where deptno = (select deptno from emp where ename = 'SMITH'); +--------+-------+ | deptno | ename | +--------+-------+ | 20 | SMITH | | 20 | JONES | | 20 | SCOTT | | 20 | ADAMS | | 20 | FORD | +--------+-------+ 5 rows in set (0.00 sec)
1.3.2 多行子查询
返回多行记录的子查询,仍然是一列(一个字段)。
in关键字(属于查询出来的多行中的一行,无法用于比较):查询和10号部门的工作岗位相同的雇员的名字,岗位,工资,部门号,但是不包含10自己的
先拆分,找到10号部门的岗位:
mysql> select distinct job from emp where deptno = 10; +-----------+ | job | +-----------+ | MANAGER | | PRESIDENT | | CLERK | +-----------+ 3 rows in set (0.00 sec) -- 然后再复合 mysql> select ename, job, sal, deptno from emp where job in (select distinct job from emp where deptno = 10) and deptno <=> 10; +--------+-----------+---------+--------+ | ename | job | sal | deptno | +--------+-----------+---------+--------+ | CLARK | MANAGER | 2450.00 | 10 | | KING | PRESIDENT | 5000.00 | 10 | | MILLER | CLERK | 1300.00 | 10 | +--------+-----------+---------+--------+ 3 rows in set (0.00 sec)
all关键字(与查询结果的所有行比较为真的):显示工资比部门30的所有员工的工资高的员工的姓名、工资和部门号
同样可以拆分
-- 先找到部门号为30的所有工资 mysql> select sal from emp where deptno = 30; +---------+ | sal | +---------+ | 1600.00 | | 1250.00 | | 1250.00 | | 2850.00 | | 1500.00 | | 950.00 | +---------+ 6 rows in set (0.00 sec) -- 判断 > 所有行 mysql> select ename, sal, deptno from emp where sal > all(select sal from emp where deptno = 30); +-------+---------+--------+ | ename | sal | deptno | +-------+---------+--------+ | JONES | 2975.00 | 20 | | SCOTT | 3000.00 | 20 | | KING | 5000.00 | 10 | | FORD | 3000.00 | 20 | +-------+---------+--------+ 4 rows in set (0.00 sec)
any关键字(与查询结果的任意一行比较为真的):显示工资比部门30的任意员工的工资高的员工的姓名、工资和部门号(包含自己部门的员工) some是any的别称很少使用。
mysql> select ename, sal, deptno from emp where sal > any (select sal from emp where deptno=30); +--------+---------+--------+ | ename | sal | deptno | +--------+---------+--------+ | ALLEN | 1600.00 | 30 | | WARD | 1250.00 | 30 | | JONES | 2975.00 | 20 | | MARTIN | 1250.00 | 30 | | BLAKE | 2850.00 | 30 | | CLARK | 2450.00 | 10 | | SCOTT | 3000.00 | 20 | | KING | 5000.00 | 10 | | TURNER | 1500.00 | 30 | | ADAMS | 1100.00 | 20 | | FORD | 3000.00 | 20 | | MILLER | 1300.00 | 10 | +--------+---------+--------+ 12 rows in set (0.00 sec)
1.3.3 多列子查询
单行子查询是指子查询只返回单列,单行数据
多行子查询是指返回单列多行数据,都是针对单列而言的,而多列子查询则是指查询返回多个列数据的子查询语句
其实很简单,多列也可以比较,和单列一样。
案例:查询和SMITH的部门和岗位完全相同的所有雇员,不含SMITH本人
mysql> select ename, deptno, job from emp where (job, deptno) = (select job, deptno from emp where ename = 'SMITH') and ename <=> 'SMITH'; +-------+--------+-------+ | ename | deptno | job | +-------+--------+-------+ | SMITH | 20 | CLERK | +-------+--------+-------+ 1 row in set (0.00 sec)
1.3.4 在from子句中使用子查询
子查询语句出现在from子句中。这里要用到数据查询的技巧,把一个子查询当做一个临时表使用。
显示每个高于自己部门平均工资的员工的姓名、部门、工资、平均工资
-- 先找到每个部门的平均工资 mysql> select deptno, avg(sal) from emp group by deptno; +--------+-------------+ | deptno | avg(sal) | +--------+-------------+ | 10 | 2916.666667 | | 20 | 2175.000000 | | 30 | 1566.666667 | +--------+-------------+ 3 rows in set (0.00 sec) -- 然后再用这张表和原本的表做笛卡尔积,多表查询 mysql> select emp.ename, emp.deptno, emp.sal, format(dept_avg,2) from emp , (select deptno , avg(sal) dept_avg from emp group by deptno) tmp wheree tmp.deptno = emp.deptno and emp.sal > dept_avg; +-------+--------+---------+--------------------+ | ename | deptno | sal | format(dept_avg,2) | +-------+--------+---------+--------------------+ | ALLEN | 30 | 1600.00 | 1,566.67 | | JONES | 20 | 2975.00 | 2,175.00 | | BLAKE | 30 | 2850.00 | 1,566.67 | | SCOTT | 20 | 3000.00 | 2,175.00 | | KING | 10 | 5000.00 | 2,916.67 | | FORD | 20 | 3000.00 | 2,175.00 | +-------+--------+---------+--------------------+ 6 rows in set (0.00 sec)
显示每个部门的信息(部门名,编号,地址)和人员数量
对于有未知的,聚合的信息(人员数量),先将它求出来
-- 首先拿到相关表的信息 mysql> select * from dept limit 3; +--------+------------+----------+ | deptno | dname | loc | +--------+------------+----------+ | 10 | ACCOUNTING | NEW YORK | | 20 | RESEARCH | DALLAS | | 30 | SALES | CHICAGO | +--------+------------+----------+ 3 rows in set (0.00 sec) mysql> select * from emp limit 3; +--------+-------+----------+------+---------------------+---------+--------+--------+ | empno | ename | job | mgr | hiredate | sal | comm | deptno | +--------+-------+----------+------+---------------------+---------+--------+--------+ | 007369 | SMITH | CLERK | 7902 | 1980-12-17 00:00:00 | 800.00 | NULL | 20 | | 007499 | ALLEN | SALESMAN | 7698 | 1981-02-20 00:00:00 | 1600.00 | 300.00 | 30 | | 007521 | WARD | SALESMAN | 7698 | 1981-02-22 00:00:00 | 1250.00 | 500.00 | 30 | +--------+-------+----------+------+---------------------+---------+--------+--------+ 3 rows in set (0.01 sec)
暴力使用多表查询
注意:和聚合函数一起显示的字段,需要被分组,因为聚合函数对于某一个字段是不清楚的,比如count(*),它是统计总数,并不对应某一个deptno,ename这些字段
mysql> select dept.dname, dept.deptno, dept.loc, count(*) from dept, emp where dept.deptno = emp.deptno group by deptno; ERROR 1055 (42000): Expression #1 of SELECT list is not in GROUP BY clause and contains nonaggregated column 'scott.dept.dname' which is not functionally dependent on columns in GROUP BY clause; this is incompatible with sql_mode=only_full_group_by ---- 注意:和聚合函数一起显示的字段,需要被分组,因为聚合函数对于某一个字段是不清楚的,比如count(*),它是统计总数,并不对应某一个deptno,ename这些字段 mysql> select dept.dname, dept.deptno, dept.loc, count(*) from dept, emp where dept.deptno = emp.deptno group by deptno, dept.dname, dept.loc; +------------+--------+----------+----------+ | dname | deptno | loc | count(*) | +------------+--------+----------+----------+ | ACCOUNTING | 10 | NEW YORK | 3 | | RESEARCH | 20 | DALLAS | 5 | | SALES | 30 | CHICAGO | 6 | +------------+--------+----------+----------+ 3 rows in set (0.00 sec)
使用子查询
-- 先子查询出每个部门的人员数量 mysql> select deptno, count(*) from emp group by deptno; +--------+----------+ | deptno | count(*) | +--------+----------+ | 10 | 3 | | 20 | 5 | | 30 | 6 | +--------+----------+ 3 rows in set (0.00 sec) -- 再和部门表组合,根据条件筛选。 mysql> select dept.dname, dept.deptno, dept.loc, dept_cnt from dept, (select deptno, count(*) dept_cnt from emp group by deptno) cnt_table where cnt_table.deptno = dept.deptno; +------------+--------+----------+----------+ | dname | deptno | loc | dept_cnt | +------------+--------+----------+----------+ | ACCOUNTING | 10 | NEW YORK | 3 | | RESEARCH | 20 | DALLAS | 5 | | SALES | 30 | CHICAGO | 6 | +------------+--------+----------+----------+ 3 rows in set (0.00 sec)
1.4 合并查询
在实际应用中,为了合并多个select的执行结果,可以使用集合操作符 union,union all
1.4.1 union
该操作符用于取得两个结果集的并集。当使用该操作符时,会自动去掉结果集中的重复行。
将工资大于2500或职位是MANAGER的人找出来
岗位是MANAGER的必定有工资大于2500的,这里自动去重了
mysql> select ename, sal, job from emp where sal>2500 union select ename, sal, job from emp where job='MANAGER'; +-------+---------+-----------+ | ename | sal | job | +-------+---------+-----------+ | JONES | 2975.00 | MANAGER | | BLAKE | 2850.00 | MANAGER | | SCOTT | 3000.00 | ANALYST | | KING | 5000.00 | PRESIDENT | | FORD | 3000.00 | ANALYST | | CLARK | 2450.00 | MANAGER | +-------+---------+-----------+ 6 rows in set (0.00 sec)
1.4.2 union all
该操作符用于取得两个结果集的并集。当使用该操作符时,不会去掉结果集中的重复行。
mysql> select ename, sal, job from emp where sal>2500 union all select ename, sal, job from emp where job='MANAGER'; +-------+---------+-----------+ | ename | sal | job | +-------+---------+-----------+ | JONES | 2975.00 | MANAGER | | BLAKE | 2850.00 | MANAGER | | SCOTT | 3000.00 | ANALYST | | KING | 5000.00 | PRESIDENT | | FORD | 3000.00 | ANALYST | | JONES | 2975.00 | MANAGER | | BLAKE | 2850.00 | MANAGER | | CLARK | 2450.00 | MANAGER | +-------+---------+-----------+ 8 rows in set (0.00 sec)
二、内外连接
2.1 内连接
内连接inner join
实际上就是利用where子句对两种表形成的笛卡儿积进行筛选,我们前面学习的查询都是内连接,也是在开发过程中使用的最多的连接查询。
select 字段 from 表1 inner join 表2 on 连接条件 and 其他条件;
前面使用的都是内连接。
on后面是两张表的连接条件,后可以跟where条件语句查询表。
-- 用前面的写法 select ename, dname from emp, dept where emp.deptno=dept.deptno and ename='SMITH'; -- 用标准的内连接写法 select ename, dname from emp inner join dept on emp.deptno=dept.deptno and ename='SMITH';
两张表连接,并不是所有行都是有效的,所有使用连接条件更加明确。
2.2 外连接
外连接分为左外连接和右外连接
2.2.1 左外连接
如果联合查询,左侧的表完全显示我们就说是左外连接。
什么叫完全显示呢?
就是两张表通过连接条件连接,但是左侧的表有不满足条件的也要全部显示,右侧的表显示为NULL。
语法:
select 字段名 from 表名1 left join 表名2 on 连接条件
创建两张表
mysql> select * from exam; +------+-------+ | id | grade | +------+-------+ | 1 | 56 | | 2 | 76 | | 11 | 8 | +------+-------+ 3 rows in set (0.00 sec) mysql> select * from stu; +------+------+ | id | name | +------+------+ | 1 | jack | | 2 | tom | | 3 | kity | | 4 | nono | +------+------+ 4 rows in set (0.00 sec)
使用内连接
mysql> select * from stu inner join exam on exam.id = stu.id; +------+------+------+-------+ | id | name | id | grade | +------+------+------+-------+ | 1 | jack | 1 | 56 | | 2 | tom | 2 | 76 | +------+------+------+-------+ 2 rows in set (0.00 sec)
使用stu为左表的左外连接
mysql> select * from stu left join exam on exam.id = stu.id; +------+------+------+-------+ | id | name | id | grade | +------+------+------+-------+ | 1 | jack | 1 | 56 | | 2 | tom | 2 | 76 | | 3 | kity | NULL | NULL | | 4 | nono | NULL | NULL | +------+------+------+-------+ 4 rows in set (0.00 sec)
使用exam为左表的左外连接
mysql> select * from exam left join stu on exam.id = stu.id; +------+-------+------+------+ | id | grade | id | name | +------+-------+------+------+ | 1 | 56 | 1 | jack | | 2 | 76 | 2 | tom | | 11 | 8 | NULL | NULL | +------+-------+------+------+ 3 rows in set (0.00 sec)
2.2.1 右外连接
和左外连接对称,左侧不满足右侧的填充为NULL。
select 字段 from 表名1 right join 表名2 on 连接条件;
mysql> select * from exam right join stu on exam.id = stu.id; +------+-------+------+------+ | id | grade | id | name | +------+-------+------+------+ | 1 | 56 | 1 | jack | | 2 | 76 | 2 | tom | | NULL | NULL | 3 | kity | | NULL | NULL | 4 | nono | +------+-------+------+------+ 4 rows in set (0.00 sec)
列出部门名称和这些部门的员工信息,同时列出没有员工的部门
显然要以部门表为主
mysql> select dept.deptno, dept.dname, emp.ename, emp.job from dept left join emp on emp.deptno = dept.deptno; +--------+------------+--------+-----------+ | deptno | dname | ename | job | +--------+------------+--------+-----------+ | 20 | RESEARCH | SMITH | CLERK | | 30 | SALES | ALLEN | SALESMAN | | 30 | SALES | WARD | SALESMAN | | 20 | RESEARCH | JONES | MANAGER | | 30 | SALES | MARTIN | SALESMAN | | 30 | SALES | BLAKE | MANAGER | | 10 | ACCOUNTING | CLARK | MANAGER | | 20 | RESEARCH | SCOTT | ANALYST | | 10 | ACCOUNTING | KING | PRESIDENT | | 30 | SALES | TURNER | SALESMAN | | 20 | RESEARCH | ADAMS | CLERK | | 30 | SALES | JAMES | CLERK | | 20 | RESEARCH | FORD | ANALYST | | 10 | ACCOUNTING | MILLER | CLERK | | 40 | OPERATIONS | NULL | NULL | +--------+------------+--------+-----------+ 15 rows in set (0.00 sec)
左外连接是以左表为主
右外连接是以右表为主