A Walkthrough of the Source Code of Rich, an Excellent Open-Source Python Project

Updated: 2020-07-06 11:32:16  Author: 渡碼


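This article walks through the source code of Rich, an excellent open-source project. OMG, let's dig in. Why do I recommend reading source code? There are four reasons. First, learning a language in isolation makes it hard to apply flexibly in practice; reading source code shows you where each language feature is actually used, so it sticks better and you can apply it when you write your own code. Second, reading good open-source code teaches you other people's coding conventions and design ideas. Third, it is a way into the open-source community and broader career prospects. Fourth, it is a plus in interviews. So if you have the time, I recommend reading the source of good open-source projects.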

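On to today's topic. The project is called Rich, and it lives at https://github.com/willmcgugan/rich . It was written by a developer from the UK, and helpfully it has Chinese documentation. It prints rich text and attractive visual formatting to the console (for example tables, progress bars, and Markdown). Here are some screenshots to give you a feel for it.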

[Screenshot: various formats]

[Screenshot: progress bar]

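The results look very cool, so I could not resist reading some of the code. I found that the author wrote it against Python 3.8, and there were quite a few newer features I did not know, so I brushed up on syntax while reading. Below I use one example to take a quick look at Rich's source. I will keep the explanation concise and focus on the key language features that show up in the code.

Let's start with something easy: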

from rich import print

print('Hello, [bold red]World[/bold red]!')

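Output:

As you can see, the word World is displayed in bold red.

Let's first look at the overall flow in a diagram.

Put simply, the text's formatting is converted into a form that standard output understands and is then written out. Now for the source code. When we call print, execution eventually reaches the print function in console.py and runs the following code: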

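It calls self._collect_renderables to process the input string and mark the parts that need formatting. The returned renderables variable is a list of Text objects; because the input is a single string, the list has length 1. Its value is as follows:

Span(7, 12, 'bold red') is the highlighted part that needs formatting: characters 7 up to (but not including) 12 of the plain text "Hello, World!", that is, the word World.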
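To make the Span(7, 12, 'bold red') result concrete, here is a minimal sketch of how markup could be turned into plain text plus spans. It is not Rich's implementation: the `parse_markup` function, the `TAG_RE` pattern, and this simplified `Span` are hypothetical, and the sketch assumes tags are balanced and properly nested.

```python
import re
from typing import List, NamedTuple, Tuple


class Span(NamedTuple):
    """A styled region of the plain text: [start, end) plus a style string."""
    start: int
    end: int
    style: str


# Matches an opening tag such as [bold red] or a closing tag such as [/bold red].
TAG_RE = re.compile(r"\[(/?)([^\[\]]+)\]")


def parse_markup(markup: str) -> Tuple[str, List[Span]]:
    """Strip tags from the markup and record where each style applies."""
    plain_parts: List[str] = []
    spans: List[Span] = []
    open_tags: List[Tuple[int, str]] = []  # stack of (start offset, style)
    position = 0      # read position in the markup string
    plain_length = 0  # length of the plain text emitted so far

    for match in TAG_RE.finditer(markup):
        literal = markup[position:match.start()]
        plain_parts.append(literal)
        plain_length += len(literal)
        position = match.end()

        closing, style = match.group(1), match.group(2)
        if closing:
            # A closing tag ends the most recently opened style.
            start, open_style = open_tags.pop()
            spans.append(Span(start, plain_length, open_style))
        else:
            open_tags.append((plain_length, style))

    plain_parts.append(markup[position:])
    return "".join(plain_parts), spans


plain, spans = parse_markup("Hello, [bold red]World[/bold red]!")
print(plain)  # Hello, World!
print(spans)  # [Span(start=7, end=12, style='bold red')]
```

The offsets refer to the plain text after the tags are removed, which is why World starts at 7 rather than at its position in the markup.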

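The code above also contains a with self; we will come back to what it does shortly. Continuing down the print function:

This loops over the renderables variable mentioned above. For each item it calls render to render the input text, then calls extend to append what render returns to the self._buffer list. A few points are worth a brief note: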

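self._buffer looks like an attribute but is actually a method call: the method is decorated with @property, so it is accessed without parentheses. It returns self._thread_locals.buffer, which has type List[Segment].
self._thread_locals.buffer is initialized with the field function from the dataclasses module: buffer: List[Segment] = field(default_factory=list). dataclasses is a module introduced in Python 3.7; field allows more flexible initialization, and the module's @dataclass decorator automatically adds methods such as __init__ to a class, which is convenient.
extend = self._buffer.extend stores the list's extend method in a local variable so that it can be called directly as extend(...), which is more concise than writing the object name and .extend each time.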
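The three points above can be sketched together in a few lines. This is an illustration under assumptions, not Rich's code: `ThreadLocals`, `MiniConsole`, and `write_all` are hypothetical names, and plain strings stand in for Segment objects.

```python
import threading
from dataclasses import dataclass, field
from typing import List


@dataclass
class ThreadLocals(threading.local):
    """Per-thread state: every thread sees its own `buffer` list."""
    # default_factory=list builds a fresh list for each instance
    # instead of sharing one mutable default.
    buffer: List[str] = field(default_factory=list)


class MiniConsole:
    def __init__(self) -> None:
        self._thread_locals = ThreadLocals()

    @property
    def _buffer(self) -> List[str]:
        """Accessed as `console._buffer`, without parentheses."""
        return self._thread_locals.buffer

    def write_all(self, items: List[str]) -> None:
        extend = self._buffer.extend  # bound method kept in a local name
        for item in items:
            extend([item])


console = MiniConsole()
console.write_all(["Hello, ", "World", "!"])
print(console._buffer)  # ['Hello, ', 'World', '!']
```

Besides being shorter to write, keeping the bound method in a local name avoids repeating the attribute lookup on every loop iteration.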

Next, let's look at the rendering logic of render(renderable, render_options). Inside, it runs the following code:

render_iterable = renderable.__rich_console__(self, options)
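The call above needs only an object that has a `__rich_console__` method, not a particular base class. Here is a minimal sketch of that idea using `typing.Protocol` (Python 3.8+). The names `SupportsRichConsole` and `Greeting` are hypothetical, and whether Rich declares its types exactly this way is an assumption.

```python
from typing import Iterable, List, Protocol, runtime_checkable


@runtime_checkable
class SupportsRichConsole(Protocol):
    """Hypothetical protocol: any object with __rich_console__ matches."""

    def __rich_console__(self, console: object, options: object) -> Iterable[str]:
        ...


class Greeting:
    """Does not inherit from SupportsRichConsole, yet satisfies it structurally."""

    def __rich_console__(self, console: object, options: object) -> Iterable[str]:
        yield "Hello"


def render(renderable: SupportsRichConsole) -> List[str]:
    # Only the presence of the method matters, not the class hierarchy.
    return list(renderable.__rich_console__(None, None))


print(isinstance(Greeting(), SupportsRichConsole))  # True
print(render(Greeting()))                           # ['Hello']
```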

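In the function signature, renderable is declared as RenderableType, but at runtime it is a Text object, and the two types are not related by inheritance. I did not quite work out why the author did it this way; a likely explanation is structural typing, where any object that provides a __rich_console__ method is accepted regardless of its base classes. So we have to look for __rich_console__ in text.py. That function eventually calls the Text object's render function, whose core code is: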

def render(self, console: "Console", end: str = "") -> Iterable["Segment"]:
    # ... (setup omitted)
    style_map = {index: get_style(span.style) for index, span in enumerated_spans}

    _Segment = Segment

    for (offset, leaving, style_id), (next_offset, _, _) in zip(spans, spans[1:]):
        # ... (omitted)
        yield _Segment(text[offset:next_offset], get_current_style())

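It calls get_style to convert each style string into a Style object (for example, 'bold red' becomes a Style), then slices the text according to the different display styles. Each slice becomes a Segment object that stores a piece of text together with its style.

get_style calls Style.parse(name) to build the Style object. The core code is: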
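The slicing step can be sketched as follows. This is a simplified illustration that assumes the spans do not overlap; Rich's own render handles overlapping styles. `split_into_segments` and the tuple-based `Span` and `Segment` aliases are hypothetical.

```python
from typing import List, Optional, Tuple

Span = Tuple[int, int, str]          # (start, end, style)
Segment = Tuple[str, Optional[str]]  # (text, style or None)


def split_into_segments(text: str, spans: List[Span]) -> List[Segment]:
    """Cut the text at every span boundary; assumes spans do not overlap."""
    segments: List[Segment] = []
    position = 0
    for start, end, style in sorted(spans):
        if position < start:
            # Unstyled text between the previous span and this one.
            segments.append((text[position:start], None))
        segments.append((text[start:end], style))
        position = end
    if position < len(text):
        segments.append((text[position:], None))
    return segments


print(split_into_segments("Hello, World!", [(7, 12, "bold red")]))
# [('Hello, ', None), ('World', 'bold red'), ('!', None)]
```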

@lru_cache(maxsize=1024)
def parse(cls, style_definition: str) -> "Style":
    # ... (initialization omitted)
    words = iter(style_definition.split())
    for original_word in words:
        word = original_word.lower()
        if word == "on":
            ...  # omitted
        elif word in style_attributes:
            attributes[style_attributes[word]] = True
        else:
            color = word
    style = Style(color=color, bgcolor=bgcolor, link=link, **attributes)
    return style

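The style_definition argument is 'bold red', which splits into the list ['bold', 'red']. When word is 'bold', the statement attributes[style_attributes[word]] = True runs, after which attributes equals {'bold': True}; it is a dict. When word is 'red', the statement color = word runs. Finally, the second-to-last line constructs the Style object. The two most important pieces of data in a Style are _attributes and _color. The former is an int; in our example its value is 1, which stands for 'bold'. The latter represents the color, here 'red'. It is of type Color, and that class has a number attribute that we will need later.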
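Here is a self-contained sketch of the same parsing loop. It is a simplification, not Rich's Style.parse: `SimpleStyle`, `parse_style`, and the small `STYLE_ATTRIBUTES` table are hypothetical, and the handling of "on" (taking the next word as the background color) is an assumption about the part the excerpt omits.

```python
from functools import lru_cache
from typing import FrozenSet, NamedTuple, Optional

# Hypothetical table mapping accepted words to attribute names.
STYLE_ATTRIBUTES = {"bold": "bold", "b": "bold", "italic": "italic", "i": "italic"}


class SimpleStyle(NamedTuple):
    color: Optional[str]
    bgcolor: Optional[str]
    attributes: FrozenSet[str]


@lru_cache(maxsize=1024)
def parse_style(style_definition: str) -> SimpleStyle:
    """Parse a definition such as 'bold red' or 'white on blue'."""
    color: Optional[str] = None
    bgcolor: Optional[str] = None
    attributes = set()
    words = iter(style_definition.split())
    for original_word in words:
        word = original_word.lower()
        if word == "on":
            # Assumption: the word after "on" names the background color.
            bgcolor = next(words, None)
            if bgcolor is None:
                raise ValueError("color expected after 'on'")
        elif word in STYLE_ATTRIBUTES:
            attributes.add(STYLE_ATTRIBUTES[word])
        else:
            color = word
    return SimpleStyle(color, bgcolor, frozenset(attributes))


style = parse_style("bold red")
print(style.color, sorted(style.attributes))      # red ['bold']
print(parse_style("bold red") is style)           # True: served from the cache
```

Because lru_cache keys on the argument, a style string that appears many times is parsed only once.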

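Now let's see which Segment objects __rich_console__ returns.

There are 4 of them, each carrying a piece of text and its Style object.

Back in render(renderable, render_options): we have covered the __rich_console__ part, and what remains is the code that returns the result. Let's take a look: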

iter_render = iter(render_iterable)
for render_output in iter_render:
    if isinstance(render_output, Segment):
        yield render_output

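render_iterable is the return value of __rich_console__, that is, the 4 Segment objects. The loop passes each one on with yield. Using yield turns the function into a generator: calling it returns an iterator, which you can loosely think of as a list whose items are produced on demand. One consequence is that the function body does not run when the function is called; it runs only when the result is actually iterated.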
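A small experiment shows this lazy behavior. The `produce` function and the `calls` list are made up for the demonstration.

```python
from typing import Iterator, List

calls: List[str] = []


def produce() -> Iterator[str]:
    calls.append("started")  # runs only once iteration begins
    for word in ("Hello, ", "World", "!"):
        yield word


generator = produce()     # nothing in the body has executed yet
print(calls)              # []

buffer: List[str] = []
buffer.extend(generator)  # extend() iterates, so the body runs now
print(calls)              # ['started']
print(buffer)             # ['Hello, ', 'World', '!']
```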

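That covers render(renderable, render_options). Back up one level, at extend(render(renderable, render_options)), the extend call stores the 4 Segment objects in the buffer. The result is as follows:

Then the print method is done. It looks as if everything has finished, yet we have not seen any code that actually prints to the console. The answer lies in the with self from earlier: the with statement means that once the body has executed, self's __exit__ function is called automatically. __exit__ calls _render_buffer to produce the final output. The core code is: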

output: List[str] = []
append = output.append
for line in Segment.split_and_crop_lines(buffer, self.width, pad=False):
    for text, style, is_control in line:
        if style and not is_control:
            append(
                style.render(
                    text,
                    color_system=color_system,
                    legacy_windows=legacy_windows,
                )
            )
rendered = "".join(output)

return rendered

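split_and_crop_lines exists to fit the output to the console width; we can ignore it for now. The line variable still holds the 4 Segment objects mentioned above. The loop for text, style, is_control in line unpacks each Segment's fields directly into the variables text, style, and is_control, and each style object then calls its render method to do the final rendering.

The core code of the render method is: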
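Unpacking in the for statement works for any tuple-like object. The sketch below assumes Segment is a NamedTuple with the three fields seen in the loop; this stripped-down `Segment` uses a plain string for the style and is only an illustration.

```python
from typing import List, NamedTuple, Optional


class Segment(NamedTuple):
    """Simplified stand-in: a piece of text, its style, and a control flag."""
    text: str
    style: Optional[str] = None
    is_control: bool = False


line: List[Segment] = [
    Segment("Hello, "),
    Segment("World", "bold red"),
    Segment("!"),
]

# Each Segment is unpacked into three names, as in the loop above.
styled = [text for text, style, is_control in line if style and not is_control]
print(styled)  # ['World']
```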

attrs = self._make_ansi_codes(color_system)
rendered = f"\x1b[{attrs}m{text}\x1b[0m" if attrs else text

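I will not expand on _make_ansi_codes. It simply uses the _attributes and number values mentioned above to build a format that standard output understands. The returned attrs is 1;31: the 1 comes from _attributes and means bold, and in 31 the trailing 1 comes from number and selects the color (30 + 1 for red). Other colors have other values; for example, yellow is 33 and magenta is 35. Finally an f-string (a newer language feature) builds the rendered variable, whose value is \x1b[1;31mWorld\x1b[0m. That is the format the standard output stream understands.

Back in _render_buffer, rendered = "".join(output) joins the 4 rendered pieces and returns the result. The code that runs after the return is: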
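To show where 1;31 comes from, here is a sketch that builds the codes from an attribute list and a color name. It covers only the 8 standard foreground colors and is not Rich's _make_ansi_codes; `ATTRIBUTE_CODES`, `COLOR_NUMBERS`, `make_ansi_codes`, and `render` are hypothetical names.

```python
from typing import List, Optional

# SGR attribute codes and the numbers of the 8 standard colors.
ATTRIBUTE_CODES = {"bold": "1", "italic": "3", "underline": "4"}
COLOR_NUMBERS = {
    "black": 0, "red": 1, "green": 2, "yellow": 3,
    "blue": 4, "magenta": 5, "cyan": 6, "white": 7,
}


def make_ansi_codes(attributes: List[str], color: Optional[str]) -> str:
    """Join attribute codes and the foreground color code with ';'."""
    codes = [ATTRIBUTE_CODES[name] for name in attributes]
    if color is not None:
        codes.append(str(30 + COLOR_NUMBERS[color]))  # foreground = 30 + number
    return ";".join(codes)


def render(text: str, attributes: List[str], color: Optional[str]) -> str:
    attrs = make_ansi_codes(attributes, color)
    # ESC[<codes>m turns the style on; ESC[0m resets it.
    return f"\x1b[{attrs}m{text}\x1b[0m" if attrs else text


print(make_ansi_codes(["bold"], "red"))       # 1;31
print(repr(render("World", ["bold"], "red")))  # '\x1b[1;31mWorld\x1b[0m'
```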

text = self._render_buffer()
if text:
    self.file.write(text)
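The lines above are where the buffered output finally reaches the file, triggered when the with self block exits. A minimal sketch of that pattern, assuming a much simpler class than Rich's Console; `BufferedConsole` is a hypothetical name and plain strings stand in for rendered segments:

```python
import io
import sys
from typing import List, Optional, TextIO


class BufferedConsole:
    """Hypothetical stand-in: print() buffers, leaving `with` writes."""

    def __init__(self, file: Optional[TextIO] = None) -> None:
        self.file = file or sys.stdout  # fall back to standard output
        self._buffer: List[str] = []

    def __enter__(self) -> "BufferedConsole":
        return self

    def __exit__(self, exc_type, exc_value, traceback) -> None:
        # Called automatically when the `with` body finishes.
        text = "".join(self._buffer)
        del self._buffer[:]
        if text:
            self.file.write(text)

    def print(self, message: str) -> None:
        with self:  # __exit__ runs when this block ends
            self._buffer.append(message + "\n")


stream = io.StringIO()
BufferedConsole(stream).print("Hello, World!")
print(repr(stream.getvalue()))  # 'Hello, World!\n'
```

Collecting everything first and writing once on exit keeps the output of a single print call together in one write.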

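self.file is assigned with self.file = file or sys.stdout. Since we did not pass a file argument, self.file is sys.stdout, so the final output is sys.stdout.write(text). That completes the whole flow. If you have followed the logic above, you should be able to produce the same effect with the code below: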
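A reconstruction of that final step, based on the escape sequence derived earlier rather than on the author's original snippet. It assumes a terminal that understands ANSI escape codes.

```python
import sys

# ESC[1;31m switches to bold red; ESC[0m resets all attributes.
text = "Hello, \x1b[1;31mWorld\x1b[0m!\n"
sys.stdout.write(text)
```

On a terminal without ANSI support (for example some legacy Windows consoles), the escape codes appear as literal characters instead of changing the style.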