快捷導(dǎo)航

Python3中正則模塊re.compile、re.match及re.search函數(shù)用法詳解

更新時(shí)間：2018年06月11日 14:24:29 作者：Citizen_Wang

這篇文章主要介紹了Python3中正則模塊re.compile、re.match及re.search函數(shù)用法,結(jié)合實(shí)例形式較為詳細(xì)的分析了re模塊中re.compile、re.match及re.search函數(shù)的功能、參數(shù)、具體使用技巧與注意事項(xiàng),需要的朋友可以參考下

本文實(shí)例講述了Python3中正則模塊re.compile、re.match及re.search函數(shù)用法。分享給大家供大家參考，具體如下：

re模塊 re.compile、re.match、 re.search

re 模塊官方說(shuō)明文檔

正則匹配的時(shí)候，第一個(gè)字符是 r，表示 raw string 原生字符，意在聲明字符串中間的特殊字符不用轉(zhuǎn)義。

比如表示 ‘\n'，可以寫(xiě) r'\n'，或者不適用原生字符 ‘\n'。

推薦使用 re.match

re.compile() 函數(shù)

編譯正則表達(dá)式模式，返回一個(gè)對(duì)象。可以把常用的正則表達(dá)式編譯成正則表達(dá)式對(duì)象，方便后續(xù)調(diào)用及提高效率。

re.compile(pattern, flags=0)

pattern 指定編譯時(shí)的表達(dá)式字符串
flags 編譯標(biāo)志位，用來(lái)修改正則表達(dá)式的匹配方式。支持 re.L|re.M 同時(shí)匹配

flags 標(biāo)志位參數(shù)

re.I(re.IGNORECASE)
使匹配對(duì)大小寫(xiě)不敏感

re.L(re.LOCAL)
做本地化識(shí)別（locale-aware）匹配

re.M(re.MULTILINE)
多行匹配，影響 ^ 和 $

re.S(re.DOTALL)
使 . 匹配包括換行在內(nèi)的所有字符

re.U(re.UNICODE)
根據(jù)Unicode字符集解析字符。這個(gè)標(biāo)志影響 \w, \W, \b, \B.

re.X(re.VERBOSE)
該標(biāo)志通過(guò)給予你更靈活的格式以便你將正則表達(dá)式寫(xiě)得更易于理解。

示例：

import re
content = 'Citizen wang , always fall in love with neighbour，WANG'
rr = re.compile(r'wan\w', re.I) # 不區(qū)分大小寫(xiě)
print(type(rr))
a = rr.findall(content)
print(type(a))
print(a)

findall 返回的是一個(gè) list 對(duì)象

<class '_sre.SRE_Pattern'>
<class 'list'>
['wang', 'WANG']

re.match() 函數(shù)

總是從字符串‘開(kāi)頭曲匹配'，并返回匹配的字符串的 match 對(duì)象 <class '_sre.SRE_Match'>。

re.match(pattern, string[, flags=0])

pattern 匹配模式，由 re.compile 獲得
string 需要匹配的字符串

import re
pattern = re.compile(r'hello')
a = re.match(pattern, 'hello world')
b = re.match(pattern, 'world hello')
c = re.match(pattern, 'hell')
d = re.match(pattern, 'hello ')
if a:
  print(a.group())
else:
  print('a 失敗')
if b:
  print(b.group())
else:
  print('b 失敗')
if c:
  print(c.group())
else:
  print('c 失敗')
if d:
  print(d.group())
else:
  print('d 失敗')

hello
b 失敗
c 失敗
hello

match 的方法和屬性

參考鏈接

import re
str = 'hello world! hello python'
pattern = re.compile(r'(?P<first>hell\w)(?P<symbol>\s)(?P<last>.*ld!)') # 分組，0 組是整個(gè) hello world!, 1組 hello，2組 ld!
match = re.match(pattern, str)
print('group 0:', match.group(0)) # 匹配 0 組，整個(gè)字符串
print('group 1:', match.group(1)) # 匹配第一組，hello
print('group 2:', match.group(2)) # 匹配第二組，空格
print('group 3:', match.group(3)) # 匹配第三組，ld!
print('groups:', match.groups())  # groups 方法，返回一個(gè)包含所有分組匹配的元組
print('start 0:', match.start(0), 'end 0:', match.end(0)) # 整個(gè)匹配開(kāi)始和結(jié)束的索引值
print('start 1:', match.start(1), 'end 1:', match.end(1)) # 第一組開(kāi)始和結(jié)束的索引值
print('start 2:', match.start(1), 'end 2:', match.end(2)) # 第二組開(kāi)始和結(jié)束的索引值
print('pos 開(kāi)始于：', match.pos)
print('endpos 結(jié)束于：', match.endpos) # string 的長(zhǎng)度
print('lastgroup 最后一個(gè)被捕獲的分組的名字：', match.lastgroup)
print('lastindex 最后一個(gè)分組在文本中的索引：', match.lastindex)
print('string 匹配時(shí)候使用的文本：', match.string)
print('re 匹配時(shí)候使用的 Pattern 對(duì)象：', match.re)
print('span 返回分組匹配的 index （start(group),end(group))：', match.span(2))

返回結(jié)果：

group 0: hello world!
group 1: hello
group 2:
group 3: world!
groups: ('hello', ' ', 'world!')
start 0: 0 end 0: 12
start 1: 0 end 1: 5
start 2: 0 end 2: 6
pos 開(kāi)始于： 0
endpos 結(jié)束于： 25
lastgroup 最后一個(gè)被捕獲的分組的名字： last
lastindex 最后一個(gè)分組在文本中的索引： 3
string 匹配時(shí)候使用的文本： hello world! hello python
re 匹配時(shí)候使用的 Pattern 對(duì)象： re.compile('(?P<first>hell\\w)(?P<symbol>\\s)(?P<last>.*ld!)')
span 返回分組匹配的 index （start(group),end(group))： (5, 6)

re.search 函數(shù)

對(duì)整個(gè)字符串進(jìn)行搜索匹配，返回第一個(gè)匹配的字符串的 match 對(duì)象。

re.search(pattern, string[, flags=0])

pattern 匹配模式，由 re.compile 獲得
string 需要匹配的字符串

import re
str = 'say hello world! hello python'
pattern = re.compile(r'(?P<first>hell\w)(?P<symbol>\s)(?P<last>.*ld!)') # 分組，0 組是整個(gè) hello world!, 1組 hello，2組 ld!
search = re.search(pattern, str)
print('group 0:', search.group(0)) # 匹配 0 組，整個(gè)字符串
print('group 1:', search.group(1)) # 匹配第一組，hello
print('group 2:', search.group(2)) # 匹配第二組，空格
print('group 3:', search.group(3)) # 匹配第三組，ld!
print('groups:', search.groups())  # groups 方法，返回一個(gè)包含所有分組匹配的元組
print('start 0:', search.start(0), 'end 0:', search.end(0)) # 整個(gè)匹配開(kāi)始和結(jié)束的索引值
print('start 1:', search.start(1), 'end 1:', search.end(1)) # 第一組開(kāi)始和結(jié)束的索引值
print('start 2:', search.start(1), 'end 2:', search.end(2)) # 第二組開(kāi)始和結(jié)束的索引值
print('pos 開(kāi)始于：', search.pos)
print('endpos 結(jié)束于：', search.endpos) # string 的長(zhǎng)度
print('lastgroup 最后一個(gè)被捕獲的分組的名字：', search.lastgroup)
print('lastindex 最后一個(gè)分組在文本中的索引：', search.lastindex)
print('string 匹配時(shí)候使用的文本：', search.string)
print('re 匹配時(shí)候使用的 Pattern 對(duì)象：', search.re)
print('span 返回分組匹配的 index （start(group),end(group))：', search.span(2))

注意 re.search 和 re.match 匹配的 str 的區(qū)別

打印結(jié)果：

group 0: hello world!
group 1: hello
group 2:
group 3: world!
groups: ('hello', ' ', 'world!')
start 0: 4 end 0: 16
start 1: 4 end 1: 9
start 2: 4 end 2: 10
pos 開(kāi)始于： 0
endpos 結(jié)束于： 29
lastgroup 最后一個(gè)被捕獲的分組的名字： last
lastindex 最后一個(gè)分組在文本中的索引： 3
string 匹配時(shí)候使用的文本： say hello world! hello python
re 匹配時(shí)候使用的 Pattern 對(duì)象： re.compile('(?P<first>hell\\w)(?P<symbol>\\s)(?P<last>.*ld!)')
span 返回分組匹配的 index （start(group),end(group))： (9, 10)

PS：這里再為大家提供2款非常方便的正則表達(dá)式工具供大家參考使用：

JavaScript正則表達(dá)式在線測(cè)試工具：
http://tools.jb51.net/regex/javascript

正則表達(dá)式在線生成工具：
http://tools.jb51.net/regex/create_reg

更多關(guān)于Python相關(guān)內(nèi)容可查看本站專(zhuān)題：《Python正則表達(dá)式用法總結(jié)》、《Python數(shù)據(jù)結(jié)構(gòu)與算法教程》、《Python函數(shù)使用技巧總結(jié)》、《Python字符串操作技巧匯總》、《Python入門(mén)與進(jìn)階經(jīng)典教程》及《Python文件與目錄操作技巧匯總》

希望本文所述對(duì)大家Python程序設(shè)計(jì)有所幫助。

您可能感興趣的文章: