python difflib模塊示例講解

更新時(shí)間：2017年09月13日 09:06:18 作者：Lockeyi

這篇文章主要為大家詳細(xì)介紹了python difflib模塊的示例，具有一定的參考價(jià)值，感興趣的小伙伴們可以參考一下

difflib模塊提供的類和方法用來(lái)進(jìn)行序列的差異化比較，它能夠比對(duì)文件并生成差異結(jié)果文本或者h(yuǎn)tml格式的差異化比較頁(yè)面，如果需要比較目錄的不同，可以使用filecmp模塊。

class difflib.SequenceMatcher

此類提供了比較任意可哈希類型序列對(duì)方法。此方法將尋找沒(méi)有包含‘垃圾'元素的最大連續(xù)匹配序列。

通過(guò)對(duì)算法的復(fù)雜度比較，它由于原始的完形匹配算法，在最壞情況下有n的平方次運(yùn)算，在最好情況下，具有線性的效率。

它具有自動(dòng)垃圾啟發(fā)式，可以將重復(fù)超過(guò)片段1%或者重復(fù)200次的字符作為垃圾來(lái)處理?？梢酝ㄟ^(guò)將autojunk設(shè)置為false關(guān)閉該功能。

class difflib.Differ

此類比較的是文本行的差異并且產(chǎn)生適合人類閱讀的差異結(jié)果或者增量結(jié)果，結(jié)果中各部分的表示如下：

這里寫(xiě)圖片描述

class difflib.HtmlDiff

此類可以被用來(lái)創(chuàng)建HTML表格 (或者說(shuō)包含表格的html文件) ，兩邊對(duì)應(yīng)展示或者行對(duì)行的展示比對(duì)差異結(jié)果。

make_file(fromlines, tolines [, fromdesc][, todesc][, context][, numlines])

make_table(fromlines, tolines [, fromdesc][, todesc][, context][, numlines])

以上兩個(gè)方法都可以用來(lái)生成包含一個(gè)內(nèi)容為比對(duì)結(jié)果的表格的html文件，并且部分內(nèi)容會(huì)高亮顯示。

difflib.context_diff(a, b[, fromfile][, tofile][, fromfiledate][, tofiledate][, n][, lineterm])

比較a與b(字符串列表)，并且返回一個(gè)差異文本行的生成器
示例：

>>> s1 = ['bacon\n', 'eggs\n', 'ham\n', 'guido\n']
>>> s2 = ['python\n', 'eggy\n', 'hamster\n', 'guido\n']
>>> for line in context_diff(s1, s2, fromfile='before.py', tofile='after.py'):
...   sys.stdout.write(line) 
*** before.py
--- after.py
***************
*** 1,4 ****
! bacon
! eggs
! ham
 guido
--- 1,4 ----
! python
! eggy
! hamster
 guido

difflib.get_close_matches(word, possibilities[, n][, cutoff])

返回最大匹配結(jié)果的列表

示例：

>>> get_close_matches('appel', ['ape', 'apple', 'peach', 'puppy'])
['apple', 'ape']
>>> import keyword
>>> get_close_matches('wheel', keyword.kwlist)
['while']
>>> get_close_matches('apple', keyword.kwlist)
[]
>>> get_close_matches('accept', keyword.kwlist)
['except']

difflib.ndiff(a, b[, linejunk][, charjunk])

比較a與b(字符串列表)，返回一個(gè)Differ-style 的差異結(jié)果
示例：

>>> diff = ndiff('one\ntwo\nthree\n'.splitlines(1),
...       'ore\ntree\nemu\n'.splitlines(1))
>>> print ''.join(diff),
- one
? ^
+ ore
? ^
- two
- three
? -
+ tree
+ emu

difflib.restore(sequence, which)

返回一個(gè)由兩個(gè)比對(duì)序列產(chǎn)生的結(jié)果

示例

>>> diff = ndiff('one\ntwo\nthree\n'.splitlines(1),
...       'ore\ntree\nemu\n'.splitlines(1))
>>> diff = list(diff) # materialize the generated delta into a list
>>> print ''.join(restore(diff, 1)),
one
two
three
>>> print ''.join(restore(diff, 2)),
ore
tree
emu

difflib.unified_diff(a, b[, fromfile][, tofile][, fromfiledate][, tofiledate][, n][, lineterm])

比較a與b(字符串列表)，返回一個(gè)unified diff格式的差異結(jié)果.

示例：

>>> s1 = ['bacon\n', 'eggs\n', 'ham\n', 'guido\n']
>>> s2 = ['python\n', 'eggy\n', 'hamster\n', 'guido\n']
>>> for line in unified_diff(s1, s2, fromfile='before.py', tofile='after.py'):
...  sys.stdout.write(line) 
--- before.py
+++ after.py
@@ -1,4 +1,4 @@
-bacon
-eggs
-ham
+python
+eggy
+hamster
 guido

實(shí)際應(yīng)用示例

比對(duì)兩個(gè)文件，然后生成一個(gè)展示差異結(jié)果的HTML文件

#coding:utf-8
'''
file:difflibeg.py
date:2017/9/9 10:33
author:lockey
email:lockey@123.com
desc:diffle module learning and practising 
'''
import difflib
hd = difflib.HtmlDiff()
loads = ''
with open('G:/python/note/day09/0907code/hostinfo/cpu.py','r') as load:
 loads = load.readlines()
 load.close()

mems = ''
with open('G:/python/note/day09/0907code/hostinfo/mem.py', 'r') as mem:
 mems = mem.readlines()
 mem.close()

with open('htmlout.html','a+') as fo:
 fo.write(hd.make_file(loads,mems))
 fo.close()

運(yùn)行結(jié)果：

這里寫(xiě)圖片描述