淺談Python使用pickle模塊序列化數(shù)據(jù)優(yōu)化代碼的方法
pickle模塊序列化數(shù)據(jù)
pickle是Python標(biāo)準(zhǔn)庫(kù)中的一個(gè)二進(jìn)制序列化和反序列化庫(kù)。
可以以二進(jìn)制的形式將數(shù)據(jù)持久化保存到磁盤文件中??梢詫?shù)據(jù)和代碼分離,提高代碼可讀性和優(yōu)雅度。
一、pickle模塊介紹
pickle模塊可以對(duì)多種Python對(duì)象進(jìn)行序列化和反序列化,序列化稱為pickling,反序列化稱為unpickling。
序列化是將Python對(duì)象轉(zhuǎn)化為二進(jìn)制數(shù)據(jù),可以配合文件操作將序列化結(jié)果保存到文件中(也可以配合數(shù)據(jù)庫(kù)操作保存到數(shù)據(jù)庫(kù)中)。
反序列化則是將二進(jìn)制數(shù)據(jù)還原回Python對(duì)象,先從文件中(或數(shù)據(jù)庫(kù)中)讀取出保存的二進(jìn)制數(shù)據(jù)。
pickle模塊常用的方法如下:
- dump(obj, file): 將Python對(duì)象序列化,并將序列化結(jié)果寫(xiě)入到打開(kāi)的文件中。
- load(file): 從打開(kāi)的文件中讀取出保存的數(shù)據(jù),將數(shù)據(jù)反序列化成Python對(duì)象。
- dumps(obj): 將Python對(duì)象序列化,并直接返回序列化的二進(jìn)制數(shù)據(jù)(類型為bytes),而不寫(xiě)入文件。
- loads(data): 將字節(jié)碼數(shù)據(jù)反序列化成Python對(duì)象,傳入的數(shù)據(jù)必須為二進(jìn)制數(shù)據(jù)(bytes-like object)。
dump()和load()是互逆的方法,dumps()和loads()是互逆的方法,使用哪一對(duì)方法取決于是否要讀寫(xiě)文件。
二、pickle可以序列化哪些Python對(duì)象
pickle與json相比,json數(shù)據(jù)有嚴(yán)格的格式要求,只能保存滿足格式要求的數(shù)據(jù),而pickle幾乎可以序列化Python中的所有數(shù)據(jù)對(duì)象。
pickle可以序列化的Python對(duì)象如下:
- None、
True和False - 整數(shù)、浮點(diǎn)數(shù)、復(fù)數(shù)
- str、byte、bytearray
- 只包含可序列化對(duì)象的集合,包括tuple、list、set和dict
- 定義在模塊最外層的函數(shù)(使用def定義,lambda函數(shù)不可以)
- 定義在模塊最外層的內(nèi)置函數(shù)
- 定義在模塊最外層的類
- 某些類實(shí)例
三、案例分享
之前寫(xiě)過(guò)一篇使用matplotlib繪制柱狀圖的文章,參考:使用Python的matplotlib庫(kù)繪制柱狀圖。
文章里有一個(gè)56行的字典,本文利用pickle模塊來(lái)將字典序列化寫(xiě)入文件中,繪圖時(shí)從文件中讀取數(shù)據(jù)并反序列化,實(shí)現(xiàn)數(shù)據(jù)與代碼的分離。
1. 將數(shù)據(jù)序列化保存
創(chuàng)建一個(gè)Python腳本pickling.py,先將56行的字典序列化保存。
# coding=utf-8
import pickle
data = {
"DWG-DRX1": [[(3, 2, 4), (2, 0, 4), (1, 0, 1), (3, 1, 4), (0, 0, 4)],
[(2, 3, 1), (0, 2, 1), (1, 0, 0), (0, 2, 1), (0, 2, 2)]],
"DWG-DRX2": [[(1, 2, 8), (6, 1, 5), (2, 1, 8), (3, 1, 7), (0, 2, 7)],
[(3, 3, 1), (0, 2, 5), (1, 3, 4), (2, 2, 4), (1, 2, 4)]],
"DWG-DRX3": [[(2, 2, 10), (7, 0, 6), (5, 0, 8), (3, 1, 6), (4, 4, 4)],
[(3, 4, 0), (2, 6, 2), (1, 3, 0), (1, 3, 3), (0, 5, 3)]],
"SN-JDG1": [[(4, 2, 9), (3, 1, 9), (5, 1, 11), (7, 3, 10), (1, 6, 7)],
[(3, 5, 8), (1, 5, 7), (2, 5, 7), (7, 2, 6), (0, 3, 10)]],
"SN-JDG2": [[(7, 2, 12), (7, 2, 14), (2, 0, 16), (9, 0, 12), (1, 4, 13)],
[(2, 6, 2), (2, 6, 4), (0, 4, 7), (4, 4, 1), (0, 6, 7)]],
"SN-JDG3": [[(5, 1, 5), (5, 1, 9), (3, 1, 8), (3, 1, 7), (1, 3, 11)],
[(0, 4, 2), (1, 2, 4), (0, 4, 3), (3, 1, 4), (3, 6, 3)]],
"SN-JDG4": [[(2, 2, 4), (3, 2, 5), (1, 0, 10), (7, 1, 5), (0, 2, 12)],
[(2, 3, 1), (2, 3, 3), (1, 3, 4), (0, 2, 6), (2, 2, 3)]],
"TES-FNC1": [[(2, 3, 8), (4, 2, 6), (2, 0, 8), (6, 0, 8), (1, 0, 10)],
[(0, 3, 3), (1, 3, 3), (4, 0, 0), (0, 6, 2), (0, 3, 3)]],
"TES-FNC2": [[(0, 2, 10), (8, 1, 4), (4, 0, 6), (4, 1, 5), (1, 2, 13)],
[(3, 2, 3), (1, 4, 5), (1, 2, 3), (0, 2, 6), (1, 7, 1)]],
"TES-FNC3": [[(3, 1, 4), (3, 1, 9), (3, 1, 7), (7, 1, 2), (0, 2, 12)],
[(0, 4, 3), (2, 6, 4), (2, 3, 2), (2, 0, 4), (0, 3, 3)]],
"TES-FNC4": [[(1, 2, 7), (10, 1, 7), (6, 2, 5), (0, 4, 16), (1, 4, 12)],
[(2, 3, 3), (3, 1, 5), (1, 4, 8), (4, 3, 5), (3, 7, 5)]],
"TES-FNC5": [[(1, 2, 1), (4, 1, 6), (4, 0, 6), (4, 1, 5), (0, 1, 6)],
[(2, 2, 1), (2, 3, 1), (0, 4, 1), (0, 1, 2), (0, 3, 2)]],
"G2-GEN1": [[(4, 0, 7), (2, 2, 11), (4, 1, 11), (6, 1, 6), (3, 0, 10)],
[(0, 5, 2), (3, 4, 1), (1, 3, 2), (0, 4, 1), (0, 3, 2)]],
"G2-GEN2": [[(3, 3, 14), (4, 3, 12), (11, 0, 11), (9, 2, 13), (1, 3, 15)],
[(3, 8, 1), (2, 5, 3), (2, 6, 5), (4, 4, 2), (0, 5, 7)]],
"G2-GEN3": [[(2, 5, 11), (7, 2, 10), (6, 3, 13), (7, 3, 11), (1, 1, 18)],
[(4, 5, 8), (2, 6, 7), (5, 4, 6), (3, 2, 6), (0, 6, 7)]],
"DWG-G21": [[(4, 0, 12), (7, 2, 9), (4, 2, 11), (6, 0, 9), (1, 2, 8)],
[(1, 5, 1), (3, 5, 2), (2, 5, 3), (0, 2, 3), (0, 5, 4)]],
"DWG-G22": [[(4, 2, 7), (5, 1, 9), (6, 2, 11), (7, 3, 9), (3, 1, 11)],
[(0, 7, 1), (0, 4, 4), (4, 4, 2), (3, 4, 1), (1, 6, 2)]],
"DWG-G23": [[(3, 1, 9), (6, 2, 5), (5, 2, 6), (8, 2, 7), (0, 3, 13)],
[(1, 3, 3), (3, 3, 4), (1, 4, 3), (2, 3, 3), (3, 9, 4)]],
"DWG-G24": [[(5, 0, 3), (2, 0, 7), (2, 0, 10), (2, 1, 3), (4, 1, 4)],
[(0, 5, 1), (1, 3, 0), (0, 3, 1), (1, 2, 1), (0, 2, 1)]],
"SN-TES1": [[(5, 1, 5), (3, 1, 6), (1, 0, 4), (2, 3, 3), (0, 2, 3)],
[(2, 4, 0), (0, 1, 4), (1, 2, 2), (4, 2, 0), (0, 2, 4)]],
"SN-TES2": [[(5, 1, 4), (1, 2, 5), (3, 1, 7), (3, 3, 4), (0, 0, 7)],
[(2, 1, 2), (1, 3, 5), (2, 5, 4), (2, 2, 0), (0, 1, 5)]],
"SN-TES3": [[(3, 0, 7), (2, 2, 4), (2, 1, 4), (5, 2, 4), (1, 2, 7)],
[(0, 3, 3), (2, 3, 3), (3, 1, 1), (0, 4, 4), (2, 2, 2)]],
"SN-TES4": [[(5, 2, 4), (1, 3, 16), (8, 1, 8), (6, 4, 9), (1, 8, 13)],
[(1, 2, 10), (9, 5, 4), (1, 4, 9), (5, 6, 10), (2, 4, 12)]],
"DWG-SN1": [[(2, 2, 11), (5, 3, 9), (8, 1, 11), (4, 2, 12), (2, 4, 7)],
[(1, 5, 5), (5, 4, 4), (3, 3, 2), (2, 3, 3), (1, 6, 3)]],
"DWG-SN2": [[(10, 1, 4), (2, 1, 10), (3, 3, 11), (3, 3, 10), (2, 4, 7)],
[(0, 4, 8), (5, 4, 2), (5, 6, 2), (2, 3, 5), (0, 3, 9)]],
"DWG-SN3": [[(3, 3, 10), (5, 2, 8), (3, 3, 3), (5, 1, 6), (0, 2, 8)],
[(3, 6, 5), (1, 2, 2), (4, 3, 2), (2, 3, 3), (1, 2, 6)]],
"DWG-SN4": [[(2, 0, 12), (8, 0, 7), (1, 3, 5), (9, 1, 5), (4, 3, 4)],
[(2, 9, 1), (1, 5, 2), (2, 2, 0), (2, 4, 2), (0, 4, 3)]],
}
with open('S10.pkl', 'wb') as pkl_file:
pickle.dump(data, pkl_file)序列化只需要兩行代碼,打開(kāi)一個(gè)文件對(duì)象,使用dump()方法將字典序列化保存到了S10.pkl文件中,文件默認(rèn)在代碼的同級(jí)目錄下,也可以指定絕對(duì)路徑。注意,打開(kāi)文件對(duì)象時(shí)使用wb模式。
S10.pkl的后綴名可以自定義(后綴名不會(huì)改變文件保存的格式),不過(guò)因?yàn)槭怯胮ickle模塊序列化的數(shù)據(jù),通常都以.pkl作為后綴,方便識(shí)別。
直接用IDE打開(kāi)文件S10.pkl,顯示的內(nèi)容是亂碼的,因?yàn)楸4娴膬?nèi)容是二進(jìn)制數(shù)據(jù)。
2. 讀取數(shù)據(jù)并反序列化
# coding=utf-8
import matplotlib.pyplot as plt
from matplotlib import ticker
from numpy import mean
import pickle
with open('S10.pkl', 'rb') as pkl_file:
data = pickle.load(pkl_file)
location = ["上單", "打野", "中單", "下路", "輔助"]
win_loc_kill, win_loc_die, win_loc_assists = [[list() for _ in range(5)] for _ in range(3)]
lose_loc_kill, lose_loc_die, lose_loc_assists = [[list() for _ in range(5)] for _ in range(3)]
for i in range(5):
win_loc_kill[i] = [value[0][i][0] for value in data.values()]
win_loc_die[i] = [value[0][i][1] for value in data.values()]
win_loc_assists[i] = [value[0][i][2] for value in data.values()]
lose_loc_kill[i] = [value[1][i][0] for value in data.values()]
lose_loc_die[i] = [value[1][i][1] for value in data.values()]
lose_loc_assists[i] = [value[1][i][2] for value in data.values()]
# noinspection PyTypeChecker
win_avg_kill = [round(mean(kill), 2) for kill in win_loc_kill]
# noinspection PyTypeChecker
win_avg_die = [round(mean(die), 2) for die in win_loc_die]
# noinspection PyTypeChecker
win_avg_assists = [round(mean(assists), 2) for assists in win_loc_assists]
# noinspection PyTypeChecker
lose_avg_kill = [round(mean(kill), 2) for kill in lose_loc_kill]
# noinspection PyTypeChecker
lose_avg_die = [round(mean(die), 2) for die in lose_loc_die]
# noinspection PyTypeChecker
lose_avg_assists = [round(mean(assists), 2) for assists in lose_loc_assists]
fig, axs = plt.subplots(nrows=2, ncols=1, figsize=(20, 16), dpi=100)
x = range(len(location))
axs[0].bar([i-0.2 for i in x], win_avg_kill, width=0.2, color='b')
axs[0].bar(x, win_avg_die, width=0.2, color='r')
axs[0].bar([i+0.2 for i in x], win_avg_assists, width=0.2, color='g')
axs[1].bar([i-0.2 for i in x], lose_avg_kill, width=0.2, color='b')
axs[1].bar(x, lose_avg_die, width=0.2, color='r')
axs[1].bar([i+0.2 for i in x], lose_avg_assists, width=0.2, color='g')
for a, b in zip(x, win_avg_kill):
axs[0].text(a-0.2, b+0.1, '%.02f' % b, ha='center', va='bottom', fontsize=14)
for a, b in zip(x, win_avg_die):
axs[0].text(a, b+0.1, '%.02f' % b, ha='center', va='bottom', fontsize=14)
for a, b in zip(x, win_avg_assists):
axs[0].text(a+0.2, b+0.1, '%.02f' % b, ha='center', va='bottom', fontsize=14)
for a, b in zip(x, lose_avg_kill):
axs[1].text(a-0.2, b+0.1, '%.02f' % b, ha='center', va='bottom', fontsize=14)
for a, b in zip(x, lose_avg_die):
axs[1].text(a, b+0.1, '%.02f' % b, ha='center', va='bottom', fontsize=14)
for a, b in zip(x, lose_avg_assists):
axs[1].text(a+0.2, b+0.1, '%.02f' % b, ha='center', va='bottom', fontsize=14)
for i in range(2):
axs[i].xaxis.set_major_locator(ticker.FixedLocator(x))
axs[i].xaxis.set_major_formatter(ticker.FixedFormatter(location))
axs[i].set_yticks(range(0, 11, 2))
axs[i].grid(linestyle="--", alpha=0.5)
axs[i].legend(['擊殺', '死亡', '助攻'], loc='upper left', fontsize=16, markerscale=0.5)
axs[i].set_xlabel("位置", fontsize=18)
axs[i].set_ylabel("場(chǎng)均數(shù)據(jù)", fontsize=18, rotation=0)
axs[0].set_title("S10總決賽勝方各位置場(chǎng)均數(shù)據(jù)", fontsize=18)
axs[1].set_title("S10總決賽負(fù)方各位置場(chǎng)均數(shù)據(jù)", fontsize=18)
plt.show()反序列化代碼也只有兩行,打開(kāi)文件S10.pkl,然后使用load()方法對(duì)文件對(duì)象反序列化,返回?cái)?shù)據(jù)。打開(kāi)文件對(duì)象時(shí)使用rb模式。
運(yùn)行代碼,繪圖功能正常。

經(jīng)過(guò)pickle模塊的序列化和反序列化,將數(shù)據(jù)持久化到了文件S10.pkl中。實(shí)現(xiàn)了數(shù)據(jù)與代碼的分離,避免了直接在代碼中寫(xiě)一個(gè)很長(zhǎng)的字典數(shù)據(jù),代碼更加優(yōu)雅。
在上面的例子中,對(duì)一個(gè)56行的數(shù)據(jù)進(jìn)行序列化,已經(jīng)有不錯(cuò)的效果了。在實(shí)際的項(xiàng)目中,數(shù)據(jù)更大,將數(shù)據(jù)放到代碼中會(huì)占很大的篇幅,進(jìn)行序列化處理的優(yōu)化效果會(huì)更明顯。
如果有多個(gè)腳本使用同一份數(shù)據(jù),可以直接讀取同一個(gè)序列化數(shù)據(jù)文件,避免了在不同腳本中粘貼同一份數(shù)據(jù)。
到此這篇關(guān)于淺談Python使用pickle模塊序列化數(shù)據(jù)優(yōu)化代碼的方法的文章就介紹到這了,更多相關(guān)Python的pickle模塊序列化數(shù)據(jù)內(nèi)容請(qǐng)搜索腳本之家以前的文章或繼續(xù)瀏覽下面的相關(guān)文章希望大家以后多多支持腳本之家!
相關(guān)文章
基于Python編寫(xiě)復(fù)雜密碼圖形化生成工具
這篇文章主要為大家詳細(xì)介紹了如何基于Python編寫(xiě)復(fù)雜密碼圖形化生成工具,支持選擇生成10位和12位復(fù)雜密碼,需要的小伙伴可以參考一下2024-04-04
python實(shí)現(xiàn)簡(jiǎn)單加密解密機(jī)制
這篇文章主要為大家詳細(xì)介紹了python實(shí)現(xiàn)簡(jiǎn)單加密解密機(jī)制,具有一定的參考價(jià)值,感興趣的小伙伴們可以參考一下2019-03-03
Python批量修改xml的坐標(biāo)值全部轉(zhuǎn)為整數(shù)的實(shí)例代碼
這篇文章主要介紹了Python批量修改xml的坐標(biāo)值全部轉(zhuǎn)為整數(shù)的實(shí)例代碼,本文給大家介紹的非常詳細(xì),對(duì)大家的學(xué)習(xí)或工作具有一定的參考借鑒價(jià)值,需要的朋友可以參考下2020-11-11
Python實(shí)現(xiàn)爬取房源信息的示例詳解
站在一個(gè)租房人的立場(chǎng),租房平臺(tái)實(shí)在太多了,并且各平臺(tái)篩選和排序邏輯都不太一致。這篇文章將教教大家如何利用Python語(yǔ)言實(shí)現(xiàn)爬取房源信息,需要的可以參考一下2022-09-09
14個(gè)用Python實(shí)現(xiàn)的Excel常用操作總結(jié)
自從學(xué)了Python后就逼迫自己不用Excel,所有操作用Python實(shí)現(xiàn)。目的是鞏固Python,與增強(qiáng)數(shù)據(jù)處理能力。本文為大家總結(jié)了14個(gè)用Python實(shí)現(xiàn)的Excel常用操作,需要的可以參考一下2022-06-06

