C語(yǔ)言哈希表概念超詳細(xì)講解

更新時(shí)間：2023年02月09日 15:08:51 作者：動(dòng)名詞

哈希是一種很高效的存儲(chǔ)數(shù)據(jù)的結(jié)構(gòu)，它是利用的一種映射關(guān)系，它也是STL中unordered_map和unordered_set?的底層結(jié)構(gòu)。本文，主要講解哈希的原理，哈希的實(shí)現(xiàn)，里面關(guān)鍵點(diǎn)在于如何解決哈希沖突

1. 哈希概念

哈希其實(shí)在學(xué)排序時(shí)已經(jīng)用過(guò)了，就是計(jì)數(shù)排序。計(jì)數(shù)排序也是用的一種映射關(guān)系。

比如對(duì)此數(shù)組進(jìn)行計(jì)數(shù)排序：1 1 9 9 9 3 3 8 8

我用的是絕對(duì)映射，所以開(kāi)辟的數(shù)組空間它的大小必須能映射到最大的元素。

但是對(duì)于哈希來(lái)講，可以用決定映射嘛？當(dāng)然不可以，如果是絕對(duì)映射會(huì)造成很大的空間浪費(fèi)。所以哈希用的是取模的方式來(lái)存數(shù)據(jù)。

比如：哈希表的空間我給定只能存放 10個(gè)元素

存進(jìn)來(lái)的數(shù) 對(duì)10進(jìn)行取模，那么必定可以存方到這個(gè)哈希表中。

比如：存 100 ，它對(duì)10取模得 0，那它就存在第一個(gè)位置；存 52 ，它對(duì)10進(jìn)行取模得 2，那它就存到下標(biāo)為 2的位置。

也就是說(shuō) 無(wú)論多大的數(shù)據(jù)，都可以存到哈希表中。但是有兩個(gè) 問(wèn)題：

數(shù)據(jù)都能進(jìn)行取模嗎？假如我要求哈希表中存的是一個(gè)字符串，字符串不能進(jìn)行取模運(yùn)算，該怎么辦？這就是數(shù)據(jù)可否哈希的問(wèn)題，我們要把存進(jìn)哈希表的數(shù)據(jù)，變?yōu)榭晒?shù)據(jù)。
如果我存的是 4，下一次我要存的是 14。由于 4的位置已經(jīng)被占了，我存的 14 該存放到何處？要是直接存，就意味著前面存的 4 會(huì)被覆蓋，造成數(shù)據(jù)丟失。這就是哈希沖突問(wèn)題。

2. 哈希沖突

造成了哈希沖突，得解決哈希沖突問(wèn)題。

這里給出兩種解決手段：

閉散列：也叫開(kāi)放定址法，當(dāng)發(fā)生哈希沖突時(shí)，如果哈希表未被裝滿，說(shuō)明在哈希表中必然還有空位置，那么可以把key存放到?jīng)_突位置中的“下一個(gè)” 空位置中去。

它相當(dāng)于如果我本來(lái)要存的位置，已經(jīng)被占了，那么我就要在哈希表中找一個(gè)空位置存放。開(kāi)散列：開(kāi)散列法又叫鏈地址法(開(kāi)鏈法)，首先對(duì)關(guān)鍵碼集合用散列函數(shù)計(jì)算散列地址，具有相同地址的關(guān)鍵碼歸于同一子集合，每一個(gè)子集合稱為一個(gè)桶，各個(gè)桶中的元素通過(guò)一個(gè)單鏈表鏈接起來(lái)，各鏈表的頭結(jié)點(diǎn)存儲(chǔ)在哈希表中。

這種辦法是常用的，它相當(dāng)于哈希表每個(gè)位置都存的是一個(gè)哈希桶，如果發(fā)送哈希沖突，直接就放在哈希桶里就行了。

3. 哈希實(shí)現(xiàn)

哈希表其實(shí)就是一個(gè)數(shù)組，數(shù)組中存的是節(jié)點(diǎn)數(shù)據(jù)，發(fā)生哈希沖突后，采用的是往后找空位置的方法。

圖解：

(1) 10 % 6 == 4,所以插入到下標(biāo)為4的位置

(2) 20%6==2,插入到下標(biāo)為2的位置

(3)12%6 == 0,插入到下標(biāo)為0的位置。

(4)22%6 == 4，插入到下標(biāo)為4的位置，發(fā)現(xiàn)已經(jīng)有數(shù)據(jù)了，所以向后找空位置。

(5)44%6 == 2，插入到下標(biāo)為2的位置，發(fā)現(xiàn)已經(jīng)有數(shù)據(jù)了，所以向后找空位置。

哈希桶其實(shí)就是一個(gè)數(shù)組，數(shù)組中存的是節(jié)點(diǎn)鏈表，發(fā)生哈希沖突后，是直接插入到節(jié)點(diǎn)鏈表中。

如果是哈希桶，存放上面的數(shù)據(jù)，是什么樣的呢？

圖解：

它相當(dāng)于把發(fā)生沖突的數(shù)據(jù) 掛在了沖突位置的下面。

3.1 閉散列(哈希表)

#include<vector>
#include<iostream>
using namespace std;
namespace hash_table
{
	enum status
	{
		Empty,
		Exist,
		Delete
	};
	template<class K,class V>
	struct hashdate
	{
		pair<K, V> _kv;
		status _status = Empty;
	};
	template<class K,class V>
	class close_hashtable
	{
		typedef hashdate<K, V> Node;
	private:
		vector<Node> _tables;
		size_t _n = 0;
	public:
		Node* find(const K& key)
		{
			if (_tables.size() == 0)
				return nullptr;
			size_t start = key % _tables.size();
			size_t i = 0;
			size_t index = start + i;
			while (_tables[index]._status != Empty)
			{
				if (_tables[index]._kv.first == key && _tables[index]._status == Exist)
					return &_tables[index];
				i++;
				index = start + i;
				index %= _tables.size();
			}
			return nullptr;
		}
        bool erase(const K& key)
		{
			Node* ret = find(key);
			if (ret == nullptr)
				return false;
			ret->_status = Delete;
			_n -= 1;
			return true;
		}
		bool insert(const pair<K,V>& kv)
		{
			Node* ret = find(kv.first);
			if (ret)
			{
				return false;
			}
			if (_tables.size() == 0 || _n * 10 / _tables.size() >= 7)
			{
				size_t newsize = _tables.size() == 0 ? 10 : _tables.size() * 2;
				close_hashtable<K, V> tmp;
				tmp._tables.resize(newsize);
				for (size_t i = 0; i < _tables.size(); i++)
				{
					tmp.insert(_tables[i]._kv);
				}
				_tables.swap(tmp._tables);
			}
			size_t start = kv.first % _tables.size();
			size_t i = 0;
			size_t index = start + i;
			while (_tables[index]._status == Exist)
			{
				i++;
				index = start + i;
				index %= _tables.size();
			}
			_tables[index]._kv = kv;
			_tables[index]._status = Exist;
			_n += 1;
			return true;
		}
	};
}

以上就是閉散列的實(shí)現(xiàn)。我們來(lái)一步一步的解析以上代碼。

(1) 用枚舉常量來(lái) 標(biāo)記哈希表中每個(gè)位置的狀態(tài)，狀態(tài)有空，不為空，被刪除。

大家可能會(huì)對(duì) 被刪除這個(gè)狀態(tài)產(chǎn)生疑問(wèn)，一個(gè)位置不就是有數(shù)據(jù)和沒(méi)數(shù)據(jù)嗎？主要是大家想如果直接物理上刪除，把位置狀態(tài)設(shè)置為空，那么就會(huì)影響后面的數(shù)據(jù)。

比如：刪除 5 這個(gè)數(shù)據(jù)、

直接將 5 的位置設(shè)置為空，那么 15 這個(gè)數(shù)據(jù) 會(huì)受到影響。因?yàn)?對(duì) 哈希表大小取模后，等于 5 的不一定只有 5，還有 15，25，35。如果將 5位置直接設(shè)置為空，就相當(dāng)于后面的數(shù)據(jù)中已經(jīng)沒(méi)有 15，25，35 了。具體我們往下看查找的實(shí)現(xiàn)。

    enum status
	{
		Empty,
		Exist,
		Delete
	};

(2) 哈希表中的數(shù)據(jù)類型，以及哈希表的底層結(jié)構(gòu)

哈希表中的數(shù)據(jù)類型，是一個(gè)結(jié)構(gòu)體，包括了一個(gè)鍵值對(duì)和狀態(tài)：

template<class K,class V>
	struct hashdate
	{
		pair<K, V> _kv;
		// 默認(rèn)狀態(tài)為空
		status _status = Empty;
	};

哈希表的底層結(jié)構(gòu)，可以是一個(gè)數(shù)組，還得有一個(gè) 無(wú)符號(hào)整數(shù)用來(lái)處理哈希表中數(shù)據(jù)的個(gè)數(shù)：

	typedef hashdate<K, V> Node;
	private:
		vector<Node> _tables;
		size_t _n = 0;

(3) 哈希表的查找

        Node* find(const K& key)
		{
			if (_tables.size() == 0)
				return nullptr;
			size_t start = key % _tables.size();
			size_t i = 0;
			size_t index = start + i;

			while (_tables[index]._status != Empty)
			{
				if (_tables[index]._kv.first == key && _tables[index]._status == Exist)
					return &_tables[index];
				i++;
				index = start + i;
				index %= _tables.size();
			}
			return nullptr;
		}

注意： while循環(huán)中，它的條件是 _tables[index]._status != Empty 說(shuō)明即使當(dāng)下位置狀態(tài)是 Delete 也會(huì)往后找要查找的數(shù)據(jù)。這也解釋了上文中所述。

找到了的條件是 (_tables[index]._kv.first == key && _tables[index]._status == Exist)

找到了返回數(shù)據(jù)的地址，找不到返回空。

(4) 哈希表的插入

        bool insert(const pair<K,V>& kv)
		{
		    // 去重 
			Node* ret = find(kv.first);
			if (ret)
			{
				return false;
			}
            // 擴(kuò)容，后面講，大家可能對(duì)這個(gè)條件有疑問(wèn)
			if (_tables.size() == 0 || _n * 10 / _tables.size() >= 7)
			{
				size_t newsize = _tables.size() == 0 ? 10 : _tables.size() * 2;
				close_hashtable<K, V> tmp;
				tmp._tables.resize(newsize);
				for (size_t i = 0; i < _tables.size(); i++)
				{
					tmp.insert(_tables[i]._kv);
				}
				_tables.swap(tmp._tables);
			}
			size_t start = kv.first % _tables.size();
			size_t i = 0;
			size_t index = start + i;
            // 找空的位置
			while (_tables[index]._status == Exist)
			{
				i++;
				index = start + i;
				index %= _tables.size();
			}
            // 插入操作
			_tables[index]._kv = kv;
			_tables[index]._status = Exist;
			_n += 1;
			return true;
		}

擴(kuò)容是有說(shuō)法的，首先我們要知道什么時(shí)候需要擴(kuò)容？

如果為空，必然需要擴(kuò)容，默認(rèn)給 10 個(gè)大小即可。
當(dāng)有效數(shù)據(jù)個(gè)數(shù) 除以數(shù)組大小大于等于 0.7 時(shí)，需要擴(kuò)容

其實(shí) 有效數(shù)據(jù)個(gè)數(shù) 除以數(shù)組大小被稱為載荷因子，當(dāng)載荷因子大于 0.7時(shí)，就說(shuō)明需要擴(kuò)容了。這是大佬們搞出來(lái)的，我們還需要知道，載荷因子越大就說(shuō)明填入哈希表的元素越多，越可能發(fā)送哈希沖突。

擴(kuò)容的操作，我是創(chuàng)建了一個(gè)新的哈希表，然后把原表中的數(shù)據(jù)插入到新表中。這里還有一個(gè)坑，就是，可不可以直接將舊表的數(shù)據(jù)拷貝到新表中，答案是不行。

舉個(gè)例子：

原表是：

新表是：

直接拷貝的話是這樣的：

看圖也懂了哈，擴(kuò)容后的表是需要重新插入數(shù)據(jù)，因?yàn)?位置可能會(huì)發(fā)送改變。

擴(kuò)容完了，就是插入了，如果當(dāng)下的位置是 Delete 或者 Eempty 那么就可以直接插入；否則就需要向后面查找空的位置，進(jìn)行插入。

(5) 哈希表的刪除

        bool erase(const K& key)
		{
			Node* ret = find(key);
			if (ret == nullptr)
				return false;
			ret->_status = Delete;
			_n -= 1;
			return true;
		}

刪除很簡(jiǎn)單，就是將那個(gè)位置的狀態(tài)改為 Delete，然后有效數(shù)據(jù)個(gè)數(shù) 減一就行了。

3.1.1 閉散列的細(xì)節(jié)

首先，上面的哈希表其實(shí)還有問(wèn)題。

比如：不是所有的數(shù)據(jù)都可以取模，這個(gè)問(wèn)題，并沒(méi)有解決，上面實(shí)現(xiàn)是直接取模。

所以還需要實(shí)現(xiàn)一個(gè) 將數(shù)據(jù)轉(zhuǎn)為可哈希數(shù)據(jù)的仿函數(shù)。為什么是仿函數(shù)呢？因?yàn)?數(shù)據(jù)類型較多，情況不一，這里還用到了模板特化的知識(shí)，大家坐穩(wěn)扶好。

    template<class K>
	struct Hash
	{
		size_t operator()(const K& key)
		{
			return key;
		}
	};
	template<>
	struct Hash<string>
	{
		size_t operator()(const string& key)
		{
			size_t value = 0;
			for (auto ch : key)
			{
				value *= 31;
				value += ch;
			}
			return value;
		}
	};

第二個(gè)就是模板的特化，它的作用就是將string對(duì)象可以轉(zhuǎn)換成整型(可哈希)。至于為什么每次都乘以 31 ，這也是大佬的手法，因?yàn)槎啻螠y(cè)試后發(fā)現(xiàn)，乘以 31 會(huì)使哈希沖突少一些。

默認(rèn)情況下，就是直接返回 key，也就是默認(rèn)情況下都是可哈希的。

如果你要哈希一個(gè)自定義對(duì)象，那么還得是用模板的特化，自己處理。

所以有了仿函數(shù)之后，我們就不必?fù)?dān)心，傳過(guò)去的數(shù)據(jù)是否能夠被哈希了，靠仿函數(shù)去處理。具體怎么用，后面會(huì)給出完整代碼。

其次，還有一個(gè)問(wèn)題，就是線性探索和二次探索：

大家可能對(duì)這倆詞不陌生，也就是哈希表中，發(fā)生哈希沖突后，查找空位置時(shí)，是連續(xù)的查找空位置還是平方次的跳躍的查找。

當(dāng)然是二次查找更優(yōu)秀一些，上面的程序用的是線性探索，也就是那個(gè) i++，它就是連續(xù)的往后查找。為什么呢？因?yàn)?如果是線性探索，它會(huì)比較擁擠，連續(xù)位置太多，從而引發(fā)踩踏效應(yīng)，也就導(dǎo)致，每次來(lái)的數(shù)據(jù)，都需要去找空位置。

二次探索很簡(jiǎn)單，把 i++ 變成 i =i *i。

3.1.2 優(yōu)化后的閉散列

enum status
	{
		Empty,
		Exist,
		Delete
	};
	template<class K>
	struct Hash
	{
		size_t operator()(const K& key)
		{
			return key;
		}
	};
	template<>
	struct Hash<string>
	{
		size_t operator()(const string& key)
		{
			size_t value = 0;
			for (auto ch : key)
			{
				value *= 31;
				value += ch;
			}
			return value;
		}
	};
	template<class K,class V>
	struct hashdate
	{
		pair<K, V> _kv;
		status _status = Empty;
	};
	template<class K,class V,class Hashfunc = hash<K>>
	class close_hashtable
	{
		typedef hashdate<K, V> Node;
	private:
		vector<Node> _tables;
		size_t _n = 0;
	public:
		Node* find(const K& key)
		{
			if (_tables.size() == 0)
				return nullptr;
			Hashfunc hf;
			size_t start = hf(key)% _tables.size();
			size_t i = 0;
			size_t index = start + i;
			while (_tables[index]._status != Empty)
			{
				if (_tables[index]._kv.first == key && _tables[index]._status == Exist)
					return &_tables[index];
				i = i*i;
				index = start + i;
				index %= _tables.size();
			}
			return nullptr;
		}
		bool erase(const K& key)
		{
			Node* ret = find(key);
			if (ret == nullptr)
				return false;
			ret->_status = Delete;
			_n -= 1;
			return true;
		}
		bool insert(const pair<K,V>& kv)
		{
			Node* ret = find(kv.first);
			if (ret)
			{
				return false;
			}
			if (_tables.size() == 0 || _n * 10 / _tables.size() >= 7)
			{
				size_t newsize = _tables.size() == 0 ? 10 : _tables.size() * 2;
				close_hashtable<K, V> tmp;
				tmp._tables.resize(newsize);
				for (size_t i = 0; i < _tables.size(); i++)
				{
					tmp.insert(_tables[i]._kv);
				}
				_tables.swap(tmp._tables);
			}
			Hashfunc hf;
			size_t start = hf(kv.first) % _tables.size();
			size_t i = 0;
			size_t index = start + i;
			while (_tables[index]._status == Exist)
			{
				i = i*i;
				index = start + i;
				index %= _tables.size();
			}
			_tables[index]._kv = kv;
			_tables[index]._status = Exist;
			_n += 1;
			return true;
		}
	};

3.2 擴(kuò)散列(哈希桶)

template<class K,class V>
	struct HashNode
	{
		pair<K, V> _kv;
		HashNode<K,V>* _next;
		HashNode(const pair<K, V>& kv)
			:_kv(kv),
			_next(nullptr)
		{
		}
	};
	template<class K,class V,class Hashfunc = Hash<K>>
	class link_hashtable
	{
		typedef HashNode<K, V> Node;
	private:
		vector<Node*> _tables;
		size_t _n = 0;
	public:
		Node* find(const K& key)
		{
			if (_tables.size() == 0)
				return nullptr;
			Hashfunc hf;
			size_t index = hf(key) % _tables.size();
			Node* cur = _tables[index];
			while (cur)
			{
				if (cur->_kv.first == key)
					return cur;
				else
					cur = cur->_next;
			}
			return nullptr;
		}
		bool erase(const K& key)
		{
			Node* ret = find(key);
			if (ret == nullptr)
			{
				return false;
			}
			Hashfunc hf;
			size_t index = hf(key) % _tables.size();
			Node* pre = nullptr;
			Node* cur = _tables[index];
			while (cur)
			{
				Node* next = cur->_next;
				if (cur->_kv.first == key)
				{
					if (pre == nullptr)
					{
						_tables[index] = next;
					}
					else
					{
						pre->_next = next;
					}
					delete cur;
					_n -= 1;
					return true;
				}
				else
				{
					pre = cur;
					cur = next;
				}
			}
			return false;
		}
		bool insert(const pair<K,V>& kv)
		{
			Node* ret = find(kv.first);
			if (ret)
			{
				return false;
			}
			Hashfunc hf;
			if (_n == _tables.size())
			{
				size_t newSize = _tables.size() == 0 ? 10 : _tables.size() * 2;
				vector<Node*> newTables;
				newTables.resize(newSize);
				for (size_t i = 0; i < _tables.size(); ++i)
				{
					Node* cur = _tables[i];
					while (cur)
					{
						Node* next = cur->_next;
						size_t index = hf(cur->_kv.first) % newTables.size();
						// 頭插
						cur->_next = newTables[index];
						newTables[index] = cur;
						cur = next;
					}
					_tables[i] = nullptr;
				}
				_tables.swap(newTables);
			}
			size_t index = hf(kv.first) % _tables.size();
			Node* newnode = new Node(kv);
			newnode->_next = _tables[index];
			_tables[index] = newnode;
		}
	};
}

(1) 哈希桶的節(jié)點(diǎn)以及底層結(jié)構(gòu)

哈希桶的節(jié)點(diǎn)是一個(gè)單向鏈表，它得有數(shù)據(jù)，是一個(gè)鍵值對(duì)，還得有下一個(gè)節(jié)點(diǎn)的指針。

template<class K,class V>
	struct HashNode
	{
		pair<K, V> _kv;
		HashNode<K,V>* _next;
		HashNode(const pair<K, V>& kv)
			:_kv(kv),
			_next(nullptr)
		{
		}
	};

哈希桶的底層，是一個(gè)數(shù)組，數(shù)組中存的是節(jié)點(diǎn)的指針，當(dāng)然還得有一個(gè)有效數(shù)據(jù)的個(gè)數(shù)，它是用于判斷是否需要擴(kuò)容的。

template<class K,class V,class Hashfunc = Hash<K>>
	class link_hashtable
	{
		typedef HashNode<K, V> Node;
	private:
		vector<Node*> _tables;
		size_t _n = 0;
	public:
	}

(2) 哈希桶的查找

查找也簡(jiǎn)單呢，就是迭代往下查找，如果找到就返回，位置的指針，找不到就返回空。

        Node* find(const K& key)
		{
			if (_tables.size() == 0)
				return nullptr;
			Hashfunc hf;
			size_t index = hf(key) % _tables.size();
			Node* cur = _tables[index];
			while (cur)
			{
				if (cur->_kv.first == key)
					return cur;
				else
					cur = cur->_next;
			}
			return nullptr;
		}

(3) 哈希桶的插入

       bool insert(const pair<K,V>& kv)
		{
			Node* ret = find(kv.first);
			if (ret)
			{
				return false;
			}
			Hashfunc hf;
			if (_n == _tables.size())
			{
				size_t newSize = _tables.size() == 0 ? 10 : _tables.size() * 2;
				vector<Node*> newTables;
				newTables.resize(newSize);
				for (size_t i = 0; i < _tables.size(); ++i)
				{
					Node* cur = _tables[i];
					while (cur)
					{
						Node* next = cur->_next;
						size_t index = hf(cur->_kv.first) % newTables.size();
						// 頭插
						cur->_next = newTables[index];
						newTables[index] = cur;
						cur = next;
					}
                    // 將舊桶置空
					_tables[i] = nullptr;
				}
				_tables.swap(newTables);
			}
			size_t index = hf(kv.first) % _tables.size();
			Node* newnode = new Node(kv);
			newnode->_next = _tables[index];
			_tables[index] = newnode;
		}

先考慮插入的數(shù)據(jù)的key有沒(méi)有重復(fù)，如果重復(fù)了那就直接返回。其實(shí)就是個(gè)頭插，中間代碼很多是擴(kuò)容，我們先不考慮擴(kuò)容，其實(shí) 插入的代碼就是：

           size_t index = hf(kv.first) % _tables.size();
			Node* newnode = new Node(kv);
			newnode->_next = _tables[index];
			_tables[index] = newnode;

擴(kuò)容的話，和哈希表同理，擴(kuò)完容之后，哈希桶的位置可能會(huì)變化，所以要自己完成重新插入工作，不過(guò)擴(kuò)容的條件不再是載荷因子 >=0.7，而是載荷因子等于 1時(shí)才擴(kuò)容。

(4) 哈希桶的刪除

        bool erase(const K& key)
		{
			Node* ret = find(key);
			if (ret == nullptr)
			{
				return false;
			}
			Hashfunc hf;
			size_t index = hf(key) % _tables.size();
            // 前一個(gè)節(jié)點(diǎn)
			Node* pre = nullptr;
			//桶的第一個(gè)節(jié)點(diǎn)
			Node* cur = _tables[index];
			while (cur)
			{
			    // 桶的下一個(gè)節(jié)點(diǎn)
				Node* next = cur->_next;
                // 找到要?jiǎng)h除的節(jié)點(diǎn)
				if (cur->_kv.first == key)
				{
				    // 頭刪
					if (pre == nullptr)
					{
						_tables[index] = next;
					}
					// 中間刪或者尾刪
					else
					{
						pre->_next = next;
					}
					delete cur;
					_n -= 1;
					return true;
				}
				else
				{
				    // 往桶下面迭代
					pre = cur;
					cur = next;
				}
	        }
		}

一上來(lái) 先檢查要?jiǎng)h除的數(shù)據(jù)是否存在，存在就往下走，不存在直接返回。

然后就是找要?jiǎng)h除的數(shù)據(jù)在那個(gè)桶中：

            Hashfunc hf;
			size_t index = hf(key) % _tables.size();

再就是在這個(gè)桶中刪除，我們需要考慮幾件事:

桶中是單向鏈表，刪除的話我需要維護(hù)鏈表的關(guān)系，所以需要記錄刪除數(shù)據(jù)的前一個(gè)數(shù)據(jù)
要?jiǎng)h除的節(jié)點(diǎn)如果是頭節(jié)點(diǎn)，就不需要維護(hù)和前一個(gè)數(shù)據(jù)的關(guān)系，因?yàn)樗褪堑谝粋€(gè)
要?jiǎng)h除的節(jié)點(diǎn)在中間或者最后，那就需要維護(hù)和前一個(gè)的關(guān)系

3.2.1 擴(kuò)散列的細(xì)節(jié)

擴(kuò)散列是有極端情況的，比如我開(kāi)辟的數(shù)組大小是 10 ，插入的數(shù)據(jù)是 10，20，30，40，50，60 …… 10000000000，這些數(shù)據(jù)都插入到了一個(gè)桶里面。

會(huì)導(dǎo)致哈希桶變成這樣：

會(huì)發(fā)現(xiàn)，效率退化了，哈希的查找一般情況是O(1) ，但是這種情況下，退化成O(n)了。所以應(yīng)該怎么辦？大佬其實(shí)是給出解決方案的，就是一個(gè)桶中的元素超過(guò)了某一個(gè)量，那么就會(huì)將這個(gè)桶中的數(shù)據(jù)用紅黑樹(shù)組織起來(lái)，對(duì)于這個(gè)量jave和C++還不一樣。

這就是所謂的桶中種樹(shù)。

但是上面的哈希桶，我沒(méi)有支持這種高級(jí)操作，我覺(jué)得只要了解這個(gè)事情就行了，至于實(shí)現(xiàn)，也是可以的，但是對(duì)于我們要學(xué)習(xí)哈希，沒(méi)太大幫助。

4. 哈希表和哈希桶的比較

哈希桶處理溢出，需要增設(shè)鏈接指針，似乎增加了存儲(chǔ)開(kāi)銷。

事實(shí)上：由于哈希表必須保持大量的空閑空間以確保搜索效率，如二次探查法要求裝載因子a <= 0.7，而表項(xiàng)所占空間又比指針大的多，所以使用鏈地址法反而比開(kāi)地址法節(jié)省存儲(chǔ)空間。

哈希表處理哈希沖突用的是搶占別的位置，可能會(huì)導(dǎo)致數(shù)據(jù)比較阻塞，也就是每進(jìn)來(lái)一個(gè)數(shù)據(jù)都需要去搶占別人的位置。

哈希桶處理哈希沖突用的是在沖突位置，增加鏈節(jié)點(diǎn)的方法，但是有可能造成，單向鏈表太長(zhǎng)從而影響效率，所以需要將單向鏈表變?yōu)榧t黑樹(shù)管理起來(lái)。

5. 結(jié)尾語(yǔ)

學(xué)完哈希，能干什么？說(shuō)實(shí)話哈希很重要，學(xué)數(shù)據(jù)結(jié)構(gòu)，你說(shuō)你不會(huì)哈希，那么就相當(dāng)于你白學(xué)數(shù)據(jù)結(jié)構(gòu)了，就是這么夸張哈，以后工作也會(huì)大量用到哈希的。所以大家加油。在我的下一篇文章中，會(huì)利用哈希桶去實(shí)現(xiàn)unordered_map和unordered_set，也算是用上了哈希。當(dāng)然位圖呀，布隆過(guò)濾器呀，海量處理數(shù)據(jù)等都會(huì)用到哈希。

到此這篇關(guān)于C語(yǔ)言哈希表概念超詳細(xì)講解的文章就介紹到這了,更多相關(guān)C語(yǔ)言哈希表內(nèi)容請(qǐng)搜索腳本之家以前的文章或繼續(xù)瀏覽下面的相關(guān)文章希望大家以后多多支持腳本之家！

您可能感興趣的文章: