MySQL隱式類(lèi)型的轉(zhuǎn)換陷阱和規(guī)則
投稿:daisy
前言
相信大家都知道隱式類(lèi)型轉(zhuǎn)換有無(wú)法命中索引的風(fēng)險(xiǎn),在高并發(fā)、大數(shù)據(jù)量的情況下,命不中索引帶來(lái)的后果非常嚴(yán)重。將數(shù)據(jù)庫(kù)拖死,繼而整個(gè)系統(tǒng)崩潰,對(duì)于大規(guī)模系統(tǒng)損失慘重。所以下面通過(guò)本文來(lái)好好學(xué)習(xí)下MySQL隱式類(lèi)型的轉(zhuǎn)換陷阱和規(guī)則。
1. 隱式類(lèi)型轉(zhuǎn)換實(shí)例
今天生產(chǎn)庫(kù)上突然出現(xiàn)MySQL線(xiàn)程數(shù)告警,IOPS很高,實(shí)例會(huì)話(huà)里面出現(xiàn)許多類(lèi)似下面的sql:(修改了相關(guān)字段和值)
SELECT f_col3_id,f_qq1_id FROM d_dbname.t_tb1 WHERE f_col1_id=1226391 and f_col2_id=1244378 and f_qq1_id in (12345,23456,34567,45678,56789,67890,78901,89012,90123,901231,901232,901233)
用 explain 看了下掃描行數(shù)和索引選擇情況:
mysql>explain SELECT f_col3_id,f_qq1_id FROM d_dbname.t_tb1 WHERE f_col1_id=1226391 and f_col2_id=1244378 and f_qq1_id in (12345,23456,34567,45678,56789,67890,78901,89012,90123,901231,901232,901233); +------+---------------+---------+--------+--------------------------------+---------------+------------+--------+--------+------------------------------------+ | id | select_type | table | type | possible_keys | key | key_len | ref | rows | Extra | +------+---------------+---------+--------+--------------------------------+---------------+------------+--------+--------+------------------------------------+ | 1 | SIMPLE | t_tb1 | ref | uid_type_frid,idx_corpid_qq1id | uid_type_frid | 8 | const | 1386 | Using index condition; Using where | +------+---------------+---------+--------+--------------------------------+---------------+------------+--------+--------+------------------------------------+ 共返回 1 行記錄,花費(fèi) 11.52 ms.
t_tb1 表上有個(gè)索引uid_type_frid(f_col2_id,f_type)
、idx_corp_id_qq1id(f_col1_id,f_qq1_id)
,而且如果選擇后者時(shí),f_qq1_id
的過(guò)濾效果應(yīng)該很佳,但卻選擇了前者。當(dāng)使用 hint use index(idx_corp_id_qq1id)
時(shí):
mysql>explain extended SELECT f_col3_id,f_qq1_id FROM d_dbname.t_tb1 use index(idx_corpid_qq1id) WHERE f_col1_id=1226391 and f_col2_id=1244378 and f_qq1_id in (12345,23456,34567,45678,56789,67890,78901,89012,90123,901231,901232,901233); +------+---------------+--------+--------+---------------------+------------------+------------+----------+-------------+------------------------------------+ | id | select_type | table | type | possible_keys | key | key_len | ref | rows | Extra | +------+---------------+--------+--------+---------------------+------------------+------------+----------+-------------+------------------------------------+ | 1 | SIMPLE | t_tb1 | ref | idx_corpid_qq1id | idx_corpid_qq1id | 8 | const | 2375752 | Using index condition; Using where | +---- -+---------------+--------+--------+---------------------+------------------+------------+----------+-------------+------------------------------------+ 共返回 1 行記錄,花費(fèi) 17.48 ms. mysql>show warnings; +-----------------+----------------+-----------------------------------------------------------------------------------------------------------------------+ | Level | Code | Message | +-----------------+----------------+-----------------------------------------------------------------------------------------------------------------------+ | Warning | 1739 | Cannot use range access on index 'idx_corpid_qq1id' due to type or collation conversion on field 'f_qq1_id' | | Note | 1003 | /* select#1 */ select `d_dbname`.`t_tb1`.`f_col3_id` AS `f_col3_id`,`d_dbname`.`t_tb1`.`f_qq1_id` AS `f_qq1_id` from `d_dbname`.`t_tb1` USE INDEX (`idx_corpid_qq1id`) where | | | | ((`d_dbname`.`t_tb1`.`f_col2_id` = 1244378) and (`d_dbname`.`t_tb1`.`f_col1_id` = 1226391) and (`d_dbname`.`t_tb1`.`f_qq1_id` in | | | | (12345,23456,34567,45678,56789,67890,78901,89012,90123,901231,901232,901233))) | +-----------------+----------------+-----------------------------------------------------------------------------------------------------------------------+ 共返回 2 行記錄,花費(fèi) 10.81 ms.
rows列達(dá)到200w行,但問(wèn)題也發(fā)現(xiàn)了:select_type
應(yīng)該是 range
才對(duì),key_len
看出來(lái)只用到了idx_corpid_qq1id
索引的第一列。上面explain使用了 extended
,所以show warnings;可以很明確的看到 f_qq1_id
出現(xiàn)了隱式類(lèi)型轉(zhuǎn)換:f_qq1_id
是varchar
,而后面的比較值是整型。
解決該問(wèn)題就是避免出現(xiàn)隱式類(lèi)型轉(zhuǎn)換(implicit type conversion)帶來(lái)的不可控:把f_qq1_id in
的內(nèi)容寫(xiě)成字符串:
mysql>explain SELECT f_col3_id,f_qq1_id FROM d_dbname.t_tb1 WHERE f_col1_id=1226391 and f_col2_id=1244378 and f_qq1_id in ('12345','23456','34567','45678','56789','67890','78901','89012','90123','901231'); +-------+---------------+--------+---------+--------------------------------+------------------+-------------+---------+---------+------------------------------------+ | id | select_type | table | type | possible_keys | key | key_len | ref | rows | Extra | +-------+---------------+--------+---------+--------------------------------+------------------+-------------+---------+---------+------------------------------------+ | 1 | SIMPLE | t_tb1 | range | uid_type_frid,idx_corpid_qq1id | idx_corpid_qq1id | 70 | | 40 | Using index condition; Using where | +-------+---------------+--------+---------+--------------------------------+------------------+-------------+---------+---------+------------------------------------+ 共返回 1 行記錄,花費(fèi) 12.41 ms.
掃描行數(shù)從1386減少為40。
類(lèi)似的還出現(xiàn)過(guò)一例:
SELECT count(0) FROM d_dbname.t_tb2 where f_col1_id= '1931231' AND f_phone in(098890); | Warning | 1292 | Truncated incorrect DOUBLE value: '1512-98464356'
優(yōu)化后直接從掃描rows 100w行降為1。
借這個(gè)機(jī)會(huì),系統(tǒng)的來(lái)看一下mysql中的隱式類(lèi)型轉(zhuǎn)換。
2. mysql隱式轉(zhuǎn)換規(guī)則
2.1 規(guī)則
下面來(lái)分析一下隱式轉(zhuǎn)換的規(guī)則:
a. 兩個(gè)參數(shù)至少有一個(gè)是 NULL
時(shí),比較的結(jié)果也是 NULL
,例外是使用 <=> 對(duì)兩個(gè) NULL
做比較時(shí)會(huì)返回 1,這兩種情況都不需要做類(lèi)型轉(zhuǎn)換
b. 兩個(gè)參數(shù)都是字符串,會(huì)按照字符串來(lái)比較,不做類(lèi)型轉(zhuǎn)換
c. 兩個(gè)參數(shù)都是整數(shù),按照整數(shù)來(lái)比較,不做類(lèi)型轉(zhuǎn)換
d. 十六進(jìn)制的值和非數(shù)字做比較時(shí),會(huì)被當(dāng)做二進(jìn)制串
e. 有一個(gè)參數(shù)是 TIMESTAMP
或 DATETIME
,并且另外一個(gè)參數(shù)是常量,常量會(huì)被轉(zhuǎn)換為 timestamp
f. 有一個(gè)參數(shù)是 decimal
類(lèi)型,如果另外一個(gè)參數(shù)是 decimal
或者整數(shù),會(huì)將整數(shù)轉(zhuǎn)換為 decimal
后進(jìn)行比較,如果另外一個(gè)參數(shù)是浮點(diǎn)數(shù),則會(huì)把 decimal
轉(zhuǎn)換為浮點(diǎn)數(shù)進(jìn)行比較
g. 所有其他情況下,兩個(gè)參數(shù)都會(huì)被轉(zhuǎn)換為浮點(diǎn)數(shù)再進(jìn)行比較
mysql> select 11 + '11', 11 + 'aa', 'a1' + 'bb', 11 + '0.01a'; +-----------+-----------+-------------+--------------+ | 11 + '11' | 11 + 'aa' | 'a1' + 'bb' | 11 + '0.01a' | +-----------+-----------+-------------+--------------+ | 22 | 11 | 0 | 11.01 | +-----------+-----------+-------------+--------------+ 1 row in set, 4 warnings (0.00 sec) mysql> show warnings; +---------+------+-------------------------------------------+ | Level | Code | Message | +---------+------+-------------------------------------------+ | Warning | 1292 | Truncated incorrect DOUBLE value: 'aa' | | Warning | 1292 | Truncated incorrect DOUBLE value: 'a1' | | Warning | 1292 | Truncated incorrect DOUBLE value: 'bb' | | Warning | 1292 | Truncated incorrect DOUBLE value: '0.01a' | +---------+------+-------------------------------------------+ 4 rows in set (0.00 sec) mysql> select '11a' = 11, '11.0' = 11, '11.0' = '11', NULL = 1; +------------+-------------+---------------+----------+ | '11a' = 11 | '11.0' = 11 | '11.0' = '11' | NULL = 1 | +------------+-------------+---------------+----------+ | 1 | 1 | 0 | NULL | +------------+-------------+---------------+----------+ 1 row in set, 1 warning (0.01 sec)
上面可以看出11 + 'aa',由于操作符兩邊的類(lèi)型不一樣且符合第g條,aa要被轉(zhuǎn)換成浮點(diǎn)型小數(shù),然而轉(zhuǎn)換失?。ㄗ帜副唤?cái)啵?,可以認(rèn)為轉(zhuǎn)成了 0,整數(shù)11被轉(zhuǎn)成浮點(diǎn)型還是它自己,所以11 + 'aa' = 11。
0.01a轉(zhuǎn)成double
型也是被截?cái)喑?.01,所以11 + '0.01a' = 11.01。
等式比較也說(shuō)明了這一點(diǎn),'11a'和'11.0'轉(zhuǎn)換后都等于 11,這也正是文章開(kāi)頭實(shí)例為什么沒(méi)走索引的原因: varchar
型的f_qq1_id
,轉(zhuǎn)換成浮點(diǎn)型比較時(shí),等于 12345 的情況有無(wú)數(shù)種如12345a、12345.b等待,MySQL優(yōu)化器無(wú)法確定索引是否更有效,所以選擇了其它方案。
但并不是只要出現(xiàn)隱式類(lèi)型轉(zhuǎn)換,就會(huì)引起上面類(lèi)似的性能問(wèn)題,最終是要看轉(zhuǎn)換后能否有效選擇索引。像f_id = '654321'
、f_mtime between '2016-05-01 00:00:00'
and '2016-05-04 23:59:59'
就不會(huì)影響索引選擇,因?yàn)榍罢遞_id是整型,即使與后面的字符串型數(shù)字轉(zhuǎn)換成double比較,依然能根據(jù)double確定f_id的值,索引依然有效。后者是因?yàn)榉系趀條,只是右邊的常量做了轉(zhuǎn)換。
開(kāi)發(fā)人員可能都只要存在這么一個(gè)隱式類(lèi)型轉(zhuǎn)換的坑,但卻又經(jīng)常不注意,所以干脆無(wú)需記住那么多規(guī)則,該什么類(lèi)型就與什么類(lèi)型比較。
2.2 隱式類(lèi)型轉(zhuǎn)換的安全問(wèn)題
implicit type conversion 不僅可能引起性能問(wèn)題,還有可能產(chǎn)生安全問(wèn)題。
mysql> desc t_account; +-----------+-------------+------+-----+---------+----------------+ | Field | Type | Null | Key | Default | Extra | +-----------+-------------+------+-----+---------+----------------+ | fid | int(11) | NO | PRI | NULL | auto_increment | | fname | varchar(20) | YES | | NULL | | | fpassword | varchar(50) | YES | | NULL | | +-----------+-------------+------+-----+---------+----------------+ mysql> select * from t_account; +-----+-----------+-------------+ | fid | fname | fpassword | +-----+-----------+-------------+ | 1 | xiaoming | p_xiaoming | | 2 | xiaoming1 | p_xiaoming1 | +-----+-----------+-------------+
假如應(yīng)用前端沒(méi)有WAF防護(hù),那么下面的sql很容易注入:
mysql> select * from t_account where fname='A' ; fname傳入 A' OR 1='1 mysql> select * from t_account where fname='A' OR 1='1';
攻擊者更聰明一點(diǎn): fname
傳入 A'+'B ,fpassword
傳入 ccc'+0 :
mysql> select * from t_account where fname='A'+'B' and fpassword='ccc'+0; +-----+-----------+-------------+ | fid | fname | fpassword | +-----+-----------+-------------+ | 1 | xiaoming | p_xiaoming | | 2 | xiaoming1 | p_xiaoming1 | +-----+-----------+-------------+ 2 rows in set, 7 warnings (0.00 sec)
總結(jié)
以上就是為大家總結(jié)的MySQL隱式類(lèi)型的轉(zhuǎn)換陷阱和規(guī)則,希望這篇文章對(duì)大家學(xué)習(xí)或者mysql能有所幫助,如果有疑問(wèn)大家可以留言交流,謝謝大家對(duì)腳本之家的支持。