快捷導(dǎo)航

C++中hashmap的一些使用建議

更新時(shí)間：2023年03月30日 10:48:11 作者：victorika

由于hashmap不是c++ stl中標(biāo)準(zhǔn)實(shí)現(xiàn),這樣在跨平臺(tái)使用時(shí)就可能會(huì)出現(xiàn)問(wèn)題,下面這篇文章主要給大家介紹了關(guān)于C++中hashmap的一些使用建議,需要的朋友可以參考下

前言

相信大部分C++開(kāi)發(fā)都會(huì)在項(xiàng)目里直接使用std::unorderd_map，實(shí)現(xiàn)方便快捷且沒(méi)有依賴，當(dāng)后來(lái)發(fā)現(xiàn)性能不足時(shí)則會(huì)用一些開(kāi)源的性能較好的hashmap去替換掉當(dāng)前std::unordered_map來(lái)獲得不錯(cuò)的提升，但也止步于此。而本文則是想分享一下使用開(kāi)源的hashmap時(shí)候的一些常規(guī)選型思路和使用技巧。

hash沖突的解決方案

首先回顧一下hash沖突的解決方案有哪些。

Open addressing

open addressing是通過(guò)探測(cè)或者搜索數(shù)組的方法找到未使用的bucket來(lái)解決hash沖突的。

優(yōu)點(diǎn)：

* 當(dāng)hash沖突很小的時(shí)候，只需要訪問(wèn)對(duì)應(yīng)的bucket就能獲得對(duì)應(yīng)的pair<key, value>，不需要再查找額外的數(shù)據(jù)結(jié)構(gòu)，性能較好。

缺點(diǎn)：

* 對(duì)hash函數(shù)的要求比較高，否則當(dāng)hash沖突很大的時(shí)，查找速度會(huì)很慢。

Separate chaining

或者叫closed addressing，當(dāng)發(fā)生hash沖突時(shí)需要通過(guò)額外的數(shù)據(jù)結(jié)構(gòu)來(lái)處理，比如鏈表或者紅黑樹(shù)。

優(yōu)點(diǎn)：

* hash沖突處理簡(jiǎn)單，比如采用鏈表來(lái)解決hash沖突的話，添加節(jié)點(diǎn)的時(shí)候直接在鏈表后面添加即可。

* 對(duì)hash函數(shù)要求會(huì)低一點(diǎn)，即便沖突稍微大一點(diǎn)，也能把查找速度控制得比較好。

缺點(diǎn)：

* 由于需要額外的數(shù)據(jù)結(jié)構(gòu)處理，性能在一般情況下不如Open addressing。

STL采用的是Separate chaining的方案。使用鏈表掛在bucket上解決沖突，當(dāng)鏈表超過(guò)一定長(zhǎng)度時(shí)轉(zhuǎn)換為紅黑樹(shù)。

Flat Or Node

一般的開(kāi)源庫(kù)都會(huì)提供兩種memory layout，一種叫flat，另一種叫node。

Flat

flat實(shí)現(xiàn)是指存儲(chǔ)pair<key, value>的時(shí)候是直接存到對(duì)應(yīng)的節(jié)點(diǎn)上。

優(yōu)點(diǎn)：

* 對(duì)比node少一次尋址，對(duì)cache更加友好，查找速度會(huì)更高。

缺點(diǎn)：

* 對(duì)象不穩(wěn)定，rehash的時(shí)候?qū)ο蟮刂窌?huì)修改，如果對(duì)象是個(gè)大結(jié)構(gòu)的話rehash時(shí)的開(kāi)銷會(huì)比node要大。

Node

node實(shí)現(xiàn)是指在節(jié)點(diǎn)上只存儲(chǔ)pair<key, value>的地址。而pair<key, value>則存儲(chǔ)在另一塊內(nèi)存上。

優(yōu)點(diǎn)：

* 對(duì)象穩(wěn)定，rehash的時(shí)候?qū)ο蟮刂凡恍薷?，且rehash的效率不會(huì)被結(jié)構(gòu)體影響。

缺點(diǎn)：

* 查找多一次尋址，對(duì)cache不友好，查找速度會(huì)比f(wàn)lat要慢。

STL用的是Node的實(shí)現(xiàn)。每個(gè)pair<key, value>都是stable的。

使用建議

在hash沖突上，大部分的開(kāi)源實(shí)現(xiàn)都選擇了Open addressing這種方式，畢竟理論性能會(huì)更好，而flat和node則是兩種實(shí)現(xiàn)都會(huì)提供。結(jié)合上面說(shuō)的各種優(yōu)缺點(diǎn)，我們可以簡(jiǎn)單得出一套通用的方案。

首先考慮下面幾個(gè)點(diǎn)：

1. 是否可以一開(kāi)始就可以確定好容量。

2. key的copy開(kāi)銷是否很大。

3. value的copy開(kāi)銷是否很大。

4. value的地址不穩(wěn)定是否會(huì)影響代碼邏輯。

可以簡(jiǎn)單歸納為以下四種情況：

情況1:value可以是不穩(wěn)定的，而且容量是已知的，可以一開(kāi)始確定。

推薦：使用flat實(shí)現(xiàn)，通過(guò)一開(kāi)始reserve兩倍的size來(lái)減少rehash帶來(lái)的開(kāi)銷。

情況2:value可以是不穩(wěn)定的，容量未知，key和value的copy開(kāi)銷很小。

推薦：使用flat實(shí)現(xiàn)。

情況3:value可以是不穩(wěn)定的，容量未知，key的copy開(kāi)銷很小但value的copy開(kāi)銷很大。

推薦：使用flat實(shí)現(xiàn)，value使用unique_ptr包裹起來(lái)。

其他情況均使用node結(jié)構(gòu)。

RobinHood

上面提到的規(guī)則基本可以適用大部分情況，但也不是沒(méi)有例外，比如筆者在用這套規(guī)則去測(cè)試robinhood的性能的時(shí)候發(fā)現(xiàn)行不通，robinhood在絕大部分情況下都是node的實(shí)現(xiàn)性能會(huì)更高，除非value是個(gè)十分簡(jiǎn)單的結(jié)構(gòu)。通過(guò)分析發(fā)現(xiàn)，這主要是因?yàn)橐韵聝蓚€(gè)原因：

1.robinhood在emplace的時(shí)候會(huì)有移動(dòng)pair<key, value>的操作，這使得如果pair<key, value>的copy代價(jià)很高，性能會(huì)大打折扣。

2.robinhood實(shí)現(xiàn)了自己的allocator來(lái)分配node的內(nèi)存，使得調(diào)用malloc的次數(shù)大約為log(n)次，并且內(nèi)存連續(xù)的情況會(huì)變多，對(duì)CPU Cache比一般的node實(shí)現(xiàn)要友好。

具體我們可以看看源代碼，首先是emplace的實(shí)現(xiàn)。

    void nextWhileLess(InfoType* info, size_t* idx) const noexcept {
        // unrolling this by hand did not bring any speedups.
        while (*info < mInfo[*idx]) {
            next(info, idx);
        }
    }
 
    // Finds key, and if not already present prepares a spot where to pot the key & value.
    // This potentially shifts nodes out of the way, updates mInfo and number of inserted
    // elements, so the only operation left to do is create/assign a new node at that spot.
    template <typename OtherKey>
    std::pair<size_t, InsertionState> insertKeyPrepareEmptySpot(OtherKey&& key) {
        for (int i = 0; i < 256; ++i) {
            size_t idx{};
            InfoType info{};
            // 找到對(duì)應(yīng)key的info
            keyToIdx(key, &idx, &info);
            // 跳過(guò)distance大于自己的
            nextWhileLess(&info, &idx);
 
            // while we potentially have a match
            while (info == mInfo[idx]) {
                // distance相等的情況需要判key是不是已經(jīng)存在了
                if (WKeyEqual::operator()(key, mKeyVals[idx].getFirst())) {
                    // key already exists, do NOT insert.
                    // see http://en.cppreference.com/w/cpp/container/unordered_map/insert
                    return std::make_pair(idx, InsertionState::key_found);
                }
                next(&info, &idx);
            }
 
            // unlikely that this evaluates to true
            if (ROBIN_HOOD_UNLIKELY(mNumElements >= mMaxNumElementsAllowed)) {
                if (!increase_size()) {
                    return std::make_pair(size_t(0), InsertionState::overflow_error);
                }
                continue;
            }
 
            // key not found, so we are now exactly where we want to insert it.
            // 此時(shí)的位置原來(lái)的distance一定是小于當(dāng)前的distance
            auto const insertion_idx = idx;
            auto const insertion_info = info;
            if (ROBIN_HOOD_UNLIKELY(insertion_info + mInfoInc > 0xFF)) {
                mMaxNumElementsAllowed = 0;
            }
 
            // find an empty spot
            // 找到下一個(gè)空白的位置
            while (0 != mInfo[idx]) {
                next(&info, &idx);
            }
            // 如果當(dāng)前的位置不是空白的，則把當(dāng)前位置->下一個(gè)空白位置的所有元素往后移
            if (idx != insertion_idx) {
                shiftUp(idx, insertion_idx);
            }
            // put at empty spot
            mInfo[insertion_idx] = static_cast<uint8_t>(insertion_info);
            ++mNumElements;
            return std::make_pair(insertion_idx, idx == insertion_idx
                                                     ? InsertionState::new_node
                                                     : InsertionState::overwrite_node);
        }
 
        // enough attempts failed, so finally give up.
        return std::make_pair(size_t(0), InsertionState::overflow_error);
    }
 
 
    template <typename OtherKey, typename... Args>
    std::pair<iterator, bool> try_emplace_impl(OtherKey&& key, Args&&... args) {
        ROBIN_HOOD_TRACE(this)
        auto idxAndState = insertKeyPrepareEmptySpot(key);
        switch (idxAndState.second) {
        case InsertionState::key_found:
            break;
 
        case InsertionState::new_node:
            ::new (static_cast<void*>(&mKeyVals[idxAndState.first])) Node(
                *this, std::piecewise_construct, std::forward_as_tuple(std::forward<OtherKey>(key)),
                std::forward_as_tuple(std::forward<Args>(args)...));
            break;
 
        case InsertionState::overwrite_node:
            mKeyVals[idxAndState.first] = Node(*this, std::piecewise_construct,
                                               std::forward_as_tuple(std::forward<OtherKey>(key)),
                                               std::forward_as_tuple(std::forward<Args>(args)...));
            break;
 
        case InsertionState::overflow_error:
            throwOverflowError();
            break;
        }
 
        return std::make_pair(iterator(mKeyVals + idxAndState.first, mInfo + idxAndState.first),
                              InsertionState::key_found != idxAndState.second);
    }

這里robinhood有一個(gè)很神奇的操作，它在info里面存了一個(gè)distance，這個(gè)distance表示當(dāng)前元素所在位置與初始位置的距離，簡(jiǎn)單舉例，假設(shè)我插入了4個(gè)key，分別為a,b,c,d。

sequence1：插入a，hash(a) == 0，此時(shí)index0是空的，直接插入。它的distance(0, 0) == 0。

sequence2：插入b，hash(b) == 1，此時(shí)index1是空的，直接插入。它的distance(1, 1) == 0。

sequence3：插入c，hash(c) == 1，此時(shí)index1存在，那么找到下一個(gè)位置2插入，它的distance(1, 2) == 1。

sequence4：插入d，hash(d) == 1，此時(shí)index1和2都存在，找到位置3插入，它的distance(1, 3) == 2。

至于它的移動(dòng)操作是怎么來(lái)的呢，假設(shè)基于上面的流程此時(shí)再加一個(gè)插入元素e的操作，并且這時(shí)候hash(e) == 0，首先是index0，發(fā)現(xiàn)這個(gè)位置兩者的distance是相同，所以跳過(guò)看下一個(gè)。而index1則滿足條件（新的距離>當(dāng)前b這個(gè)key的距離），所以會(huì)把e放到index1這個(gè)位置，并且找到下一個(gè)空白的位置index4，然后把c, d這兩個(gè)元素后移，最終會(huì)變成下面這個(gè)圖。

分配內(nèi)存這塊就比較簡(jiǎn)單了。

    T* allocate() {
        T* tmp = mHead;
        if (!tmp) {
            tmp = performAllocation();
        }
 
        mHead = *reinterpret_cast_no_cast_align_warning<T**>(tmp);
        return tmp;    }
 
    ROBIN_HOOD(NOINLINE) T* performAllocation() {
        size_t const numElementsToAlloc = calcNumElementsToAlloc();
 
        // alloc new memory: [prev |T, T, ... T]
        size_t const bytes = ALIGNMENT + ALIGNED_SIZE * numElementsToAlloc;
        ROBIN_HOOD_LOG("std::malloc " << bytes << " = " << ALIGNMENT << " + " << ALIGNED_SIZE
                                      << " * " << numElementsToAlloc)
        add(assertNotNull<std::bad_alloc>(std::malloc(bytes)), bytes);
        return mHead;
    }
 
 
    ROBIN_HOOD(NODISCARD) size_t calcNumElementsToAlloc() const noexcept {
        auto tmp = mListForFree;
        size_t numAllocs = MinNumAllocs;
 
        while (numAllocs * 2 <= MaxNumAllocs && tmp) {
            auto x = reinterpret_cast<T***>(tmp);
            tmp = *x;
            numAllocs *= 2;
        }
 
        return numAllocs;
    }

每次都分配比原來(lái)更多的內(nèi)存，所以大概是分配log(n)次。

所以如果是用的robinhood，筆者建議除非你的pair<key, value>真的是非常簡(jiǎn)單的結(jié)構(gòu)，否則都是用node實(shí)現(xiàn)會(huì)好一點(diǎn)?；蛘吣憧梢越唤orobinhood自己判斷，不顯示指定使用flat還是node，robinhood的模板會(huì)自動(dòng)根據(jù)你的size去判斷去使用哪個(gè)實(shí)現(xiàn)。

using unordered_map =
    detail::Table<sizeof(robin_hood::pair<Key, T>) <= sizeof(size_t) * 6 &&
                      std::is_nothrow_move_constructible<robin_hood::pair<Key, T>>::value &&
                      std::is_nothrow_move_assignable<robin_hood::pair<Key, T>>::value,
                  MaxLoadFactor100, Key, T, Hash, KeyEqual>;
 
template <bool IsFlat, size_t MaxLoadFactor100, typename Key, typename T, typename Hash,
          typename KeyEqual>
class Table
    : public WrapHash<Hash>,
      public WrapKeyEqual<KeyEqual>,
      detail::NodeAllocator<
          typename std::conditional<
              std::is_void<T>::value, Key,
              robin_hood::pair<typename std::conditional<IsFlat, Key, Key const>::type, T>>::type,
          4, 16384, IsFlat> {}

RobinHood VS Absl

這里附帶上一個(gè)簡(jiǎn)單的benchmark測(cè)試robinhood和absl的性能，測(cè)試的key是uint32_t類型，value是個(gè)120size的結(jié)構(gòu)。測(cè)試代碼如下

//
// Created by victorika on 2022/10/14.
//
 
#include "absl/container/flat_hash_map.h"
#include "absl/container/node_hash_map.h"
#include <vector>
#include <cstdio>
#include <iostream>
 
#define ANKERL_NANOBENCH_IMPLEMENT
#include "riemann/3rd/nanobench/nanobench.h"
#include "robin_hood.h"
 
/**
 * 測(cè)試結(jié)構(gòu)
 */
struct TestStruct {
  uint32_t* a;
  std::vector<uint32_t> b, c, d;
  std::string e, f, g;
  uint32_t h, i, j;
};
 
void TestEmplace() {
  ankerl::nanobench::Bench bench;
  bench.minEpochIterations(50);
  bench.title("Benchmarking rare value emplace");
  bench.run("absl_flat", [&] {
    absl::flat_hash_map<uint32_t, TestStruct> m;
    for (int i = 0; i < 10000; i++) {
      m.try_emplace(i, TestStruct());
    }
  });
 
  bench.run("absl_flat_and_set_value_pointer",  [&] {
    absl::flat_hash_map<uint32_t, std::unique_ptr<TestStruct>> m;
    for (int i = 0; i < 10000; i++) {
      m.try_emplace(i, std::make_unique<TestStruct>());
    }
  });
 
  bench.run("absl_node", [&] {
    absl::node_hash_map<uint32_t, TestStruct> m;
    for (int i = 0; i < 10000; i++) {
      m.try_emplace(i, TestStruct());
    }
  });
  bench.run("absl_flat_reserve", [&] {
    absl::flat_hash_map<uint32_t, TestStruct> m;
    m.reserve(20000);
    for (int i = 0; i < 10000; i++) {
      m.try_emplace(i, TestStruct());
    }
  });
 
  bench.run("absl_flat_and_set_value_pointer_reserve",  [&] {
    absl::flat_hash_map<uint32_t, std::unique_ptr<TestStruct>> m;
    m.reserve(20000);
    for (int i = 0; i < 10000; i++) {
      m.try_emplace(i, std::make_unique<TestStruct>());
    }
  });
 
  bench.run("absl_node_reserve", [&] {
    absl::node_hash_map<uint32_t, TestStruct> m;
    m.reserve(20000);
    for (int i = 0; i < 10000; i++) {
      m.try_emplace(i, TestStruct());
    }
  });
 
 
  bench.run("robinhood_flat", [&] {
    robin_hood::unordered_flat_map<uint32_t, TestStruct> m;
    for (int i = 0; i < 10000; i++) {
      m.try_emplace(i, TestStruct());
    }
  });
 
  bench.run("robinhood_flat_and_set_value_pointer",  [&] {
    robin_hood::unordered_flat_map<uint32_t, std::unique_ptr<TestStruct>> m;
    for (int i = 0; i < 10000; i++) {
      m.try_emplace(i, std::make_unique<TestStruct>());
    }
  });
 
  bench.run("robinhood_node", [&] {
    robin_hood::unordered_node_map<uint32_t, TestStruct> m;
    for (int i = 0; i < 10000; i++) {
      m.try_emplace(i, TestStruct());
    }
  });
  bench.run("robinhood_flat_reserve", [&] {
    robin_hood::unordered_flat_map<uint32_t, TestStruct> m;
    m.reserve(20000);
    for (int i = 0; i < 10000; i++) {
      m.try_emplace(i, TestStruct());
    }
  });
 
  bench.run("robinhood_flat_and_set_value_pointer_reserve",  [&] {
    robin_hood::unordered_flat_map<uint32_t, std::unique_ptr<TestStruct>> m;
    m.reserve(20000);
    for (int i = 0; i < 10000; i++) {
      m.try_emplace(i, std::make_unique<TestStruct>());
    }
  });
 
  bench.run("robinhood_node_reserve", [&] {
    robin_hood::unordered_node_map<uint32_t, TestStruct> m;
    m.reserve(20000);
    for (int i = 0; i < 10000; i++) {
      m.try_emplace(i, TestStruct());
    }
  });
}
 
void TestSearch() {
  absl::flat_hash_map<uint32_t, TestStruct> m1;
  absl::flat_hash_map<uint32_t, std::unique_ptr<TestStruct>> m2;
  absl::node_hash_map<uint32_t, TestStruct> m3;
 
  robin_hood::unordered_flat_map<uint32_t, TestStruct> m4;
  robin_hood::unordered_flat_map<uint32_t, std::unique_ptr<TestStruct>> m5;
  robin_hood::unordered_node_map<uint32_t, TestStruct> m6;
  for (int i = 0; i < 10000; i++) {
    m1.try_emplace(i, TestStruct());
    m2.try_emplace(i, std::make_unique<TestStruct>());
    m3.try_emplace(i, TestStruct());
 
    m4.try_emplace(i, TestStruct());
    m5.try_emplace(i, std::make_unique<TestStruct>());
    m6.try_emplace(i, TestStruct());
  }
  ankerl::nanobench::Bench bench;
  bench.minEpochIterations(50);
  std::vector<uint32_t> key_v;
  key_v.resize(10000);
  bench.title("Benchmarking rare value search");
  bench.run("absl_flat_normal", [&] {
    for (int i = 0; i < 10000; i++) {
      auto it = m1.find(i);
      key_v[i] = it->first;
    }
  });
  bench.run("absl_flat_unique_ptr", [&] {
    for (int i = 0; i < 10000; i++) {
      auto it = m2.find(i);
      key_v[i] = it->first;
    }
  });
  bench.run("absl_node", [&] {
    for (int i = 0; i < 10000; i++) {
      auto it = m3.find(i);
      key_v[i] = it->first;
    }
  });
 
  bench.run("robinhood_flat_normal", [&] {
    for (int i = 0; i < 10000; i++) {
      auto it = m4.find(i);
      key_v[i] = it->first;
    }
  });
  bench.run("robinhood_unique_ptr", [&] {
    for (int i = 0; i < 10000; i++) {
      auto it = m5.find(i);
      key_v[i] = it->first;
    }
  });
  bench.run("robinhood_node", [&] {
    for (int i = 0; i < 10000; i++) {
      auto it = m6.find(i);
      key_v[i] = it->first;
    }
  });
}
 
void TestIterator() {
  absl::flat_hash_map<uint32_t, TestStruct> m1;
  absl::flat_hash_map<uint32_t, std::unique_ptr<TestStruct>> m2;
  absl::node_hash_map<uint32_t, TestStruct> m3;
 
  robin_hood::unordered_flat_map<uint32_t, TestStruct> m4;
  robin_hood::unordered_flat_map<uint32_t, std::unique_ptr<TestStruct>> m5;
  robin_hood::unordered_node_map<uint32_t, TestStruct> m6;
  for (int i = 0; i < 10000; i++) {
    m1.try_emplace(i, TestStruct());
    m2.try_emplace(i, std::make_unique<TestStruct>());
    m3.try_emplace(i, TestStruct());
 
    m4.try_emplace(i, TestStruct());
    m5.try_emplace(i, std::make_unique<TestStruct>());
    m6.try_emplace(i, TestStruct());
  }
  ankerl::nanobench::Bench bench;
  bench.minEpochIterations(50);
  std::vector<uint32_t> key_v;
  key_v.resize(10000);
  bench.title("Benchmarking rare value iterator");
  bench.run("absl_flat_normal", [&] {
    int i = 0;
    for (auto &[key, value] : m1) {
      key_v[i] = key;
    }
  });
  bench.run("absl_flat_unique_ptr", [&] {
    int i = 0;
    for (auto &[key, value] : m2) {
      key_v[i] = key;
    }
  });
  bench.run("absl_node", [&]() {
    int i = 0;
    for (auto &[key, value] : m3) {
      key_v[i] = key;
    }
  });
 
  bench.run("robinhood_flat_normal", [&] {
    for (int i = 0; i < 10000; i++) {
      auto it = m4.find(i);
      key_v[i] = it->first;
    }
  });
  bench.run("robinhood_flat_unique_ptr", [&] {
    for (int i = 0; i < 10000; i++) {
      auto it = m5.find(i);
      key_v[i] = it->first;
    }
  });
  bench.run("robinhood_node", [&] {
    for (int i = 0; i < 10000; i++) {
      auto it = m6.find(i);
      key_v[i] = it->first;
    }
  });
}
 
int main() {
  std::cout << "size=" << sizeof(TestStruct) << std::endl;
  TestEmplace();
  TestSearch();
  TestIterator();
}

測(cè)試結(jié)果

可以看到，在value copy rare的場(chǎng)景，absl的性能完全遵循上面提到的規(guī)則，而robinhood在這種情況下，emplace+construct+deconstruct是node更快，查找和遍歷幾乎和flat沒(méi)區(qū)別。橫向?qū)Ρ萢bsl和robinhood兩者的話，在查找和遍歷上都是absl更快，emplace+construct+deconstruct在優(yōu)化到極致的情況下兩者差不多，robinhood并沒(méi)有比absl快多少。當(dāng)然，這只是簡(jiǎn)單測(cè)試，針對(duì)key類型不同的場(chǎng)景可能兩者速度不太一樣，具體就需要更加詳細(xì)的benchmark了。

筆者也嘗試過(guò)極簡(jiǎn)類型的場(chǎng)景，結(jié)論也沒(méi)有違背上面的規(guī)則，都是flat速度遠(yuǎn)大于node。

建議：從兩者里面選擇的話，如果選型是用flat的話建議用absl，是node的話建議用robinhood。