Python實(shí)現(xiàn)從log日志中提取ip的方法【正則提取】
本文實(shí)例講述了Python實(shí)現(xiàn)從log日志中提取ip的方法。分享給大家供大家參考,具體如下:
log日志內(nèi)容如下(myjob.log):
124.90.53.68 - - [05/Feb/2018 11:37:07] "GET /favicon.ico HTTP/1.1" 404 - 61.148.245.145 - - [05/Feb/2018 12:37:44] "GET / HTTP/1.1" 200 - 61.148.245.145 - - [05/Feb/2018 12:37:44] "GET /apple-touch-icon-120x120-precomposed.png HTTP/1.1" 404 - 61.148.245.145 - - [05/Feb/2018 12:37:44] "GET /apple-touch-icon-120x120.png HTTP/1.1" 404 - 61.148.245.145 - - [05/Feb/2018 12:37:45] "GET /apple-touch-icon-precomposed.png HTTP/1.1" 404 - 61.148.245.145 - - [05/Feb/2018 12:37:45] "GET /apple-touch-icon.png HTTP/1.1" 404 - 61.148.245.145 - - [05/Feb/2018 12:37:45] "GET /static/favicon.ico HTTP/1.1" 200 - 101.226.33.218 - - [05/Feb/2018 13:07:39] "GET / HTTP/1.1" 200 - 101.226.33.219 - - [05/Feb/2018 13:09:46] "GET / HTTP/1.1" 200 - 101.226.33.219 - - [05/Feb/2018 13:09:46] "GET /static/youkulogo.png HTTP/1.1" 200 - 101.226.33.219 - - [05/Feb/2018 13:09:46] "GET /static/iqiyi.png HTTP/1.1" 200 - 101.226.33.219 - - [05/Feb/2018 13:09:46] "GET /static/qqlogo.png HTTP/1.1" 200 - 124.202.223.62 - - [05/Feb/2018 14:29:45] "GET / HTTP/1.1" 200 - 124.202.223.62 - - [05/Feb/2018 14:29:47] "GET /static/youkulogo.png HTTP/1.1" 200 - 124.202.223.62 - - [05/Feb/2018 14:29:48] "GET /static/qqlogo.png HTTP/1.1" 200 - 124.202.223.62 - - [05/Feb/2018 14:29:48] "GET /static/iqiyi.png HTTP/1.1" 200 - 124.202.223.62 - - [05/Feb/2018 14:29:49] "GET /static/favicon.ico HTTP/1.1" 200 -
提取ip:
# encoding: utf-8 import sys reload(sys) sys.setdefaultencoding('utf-8') import pandas as pd import re import time import requests time1=time.time() ######函數(shù)功能:能夠提取ip地址,并且去重################ def read_file(input_file_name,output_file_name): _fLog = open(input_file_name) sep = '\n' ip_list=[] for each in _fLog: ip=re.findall(r'(?<![\.\d])(?:\d{1,3}\.){3}\d{1,3}(?![\.\d])',str(each),re.S) ip_list.append(ip[0]) # 列表去重:通過(guò)set方法進(jìn)行處理 ids = list(set(ip_list)) print "共解析ip個(gè)數(shù):%s "% len(ids) ##寫出數(shù)據(jù)到本地 # 設(shè)置輸出文件路徑 out = open(output_file_name, "a") # out.write("ip" + sep) for each in ids: print each out.write(each + sep) ##關(guān)閉連接 out.close() _fLog.close() print "ip提取完畢~~" ####主函數(shù)################ if __name__ == '__main__': input_file_name = "C:/myjob.log" output_file_name = "c:/myjob.txt" read_file(input_file_name, output_file_name) time2 = time.time() print u'總共耗時(shí):' + str(time2 - time1) + 's'
運(yùn)行結(jié)果:
共解析ip個(gè)數(shù):5
61.148.245.145
124.90.53.68
124.202.223.62
101.226.33.219
101.226.33.218
ip提取完畢~~
總共耗時(shí):0.000999927520752s
Process finished with exit code 0
PS:這里再為大家提供2款非常方便的正則表達(dá)式工具供大家參考使用:
JavaScript正則表達(dá)式在線測(cè)試工具:
http://tools.jb51.net/regex/javascript
正則表達(dá)式在線生成工具:
http://tools.jb51.net/regex/create_reg
更多關(guān)于Python相關(guān)內(nèi)容可查看本站專題:《Python正則表達(dá)式用法總結(jié)》、《Python數(shù)據(jù)結(jié)構(gòu)與算法教程》、《Python函數(shù)使用技巧總結(jié)》、《Python字符串操作技巧匯總》、《Python入門與進(jìn)階經(jīng)典教程》及《Python文件與目錄操作技巧匯總》
希望本文所述對(duì)大家Python程序設(shè)計(jì)有所幫助。
相關(guān)文章
Pycharm中配置使用Anaconda的虛擬環(huán)境進(jìn)行項(xiàng)目開發(fā)的圖文教程
今天在一臺(tái)電腦上跑環(huán)境的時(shí)候,發(fā)現(xiàn)已經(jīng)裝了Pytorch了,但是運(yùn)行沒有用,提示報(bào)錯(cuò):OSError:?[WinError?126]?找不到指定的模塊,但其實(shí)cmd進(jìn)入虛擬環(huán)境是可以調(diào)用torch的,故本文給大家介紹了Pycharm中配置使用Anaconda的虛擬環(huán)境進(jìn)行項(xiàng)目開發(fā)的圖文教程2024-09-09淺談numpy 函數(shù)里面的axis參數(shù)的含義
這篇文章主要介紹了numpy 函數(shù)里面的axis參數(shù)的含義,具有很好的參考價(jià)值,希望對(duì)大家有所幫助。一起跟隨小編過(guò)來(lái)看看吧2021-05-05python爬蟲 urllib模塊發(fā)起post請(qǐng)求過(guò)程解析
這篇文章主要介紹了python爬蟲 urllib模塊發(fā)起post請(qǐng)求過(guò)程解析,文中通過(guò)示例代碼介紹的非常詳細(xì),對(duì)大家的學(xué)習(xí)或者工作具有一定的參考學(xué)習(xí)價(jià)值,需要的朋友可以參考下2019-08-08在Tensorflow中查看權(quán)重的實(shí)現(xiàn)
今天小編就為大家分享一篇在Tensorflow中查看權(quán)重的實(shí)現(xiàn),具有很好的參考價(jià)值,希望對(duì)大家有所幫助。一起跟隨小編過(guò)來(lái)看看吧2020-01-01Python Pandas學(xué)習(xí)之基本數(shù)據(jù)操作詳解
本文將通過(guò)讀取一個(gè)股票數(shù)據(jù),來(lái)進(jìn)行Pandas的一些基本數(shù)據(jù)操作的語(yǔ)法介紹。文中的示例代碼講解詳細(xì),感興趣的小伙伴可以跟隨小編一起學(xué)習(xí)一下2022-02-02Python 3.8正式發(fā)布,來(lái)嘗鮮這些新特性吧
今天 Python3.8 發(fā)布啦,它是 Python2 終結(jié)前最后一個(gè)大版本,我們一起看看這個(gè)版本都添加了那些新功能和特性2019-10-10