哈希表、散列表

哈希表、散列表


author@jason_ql(lql0716)
http://blog.csdn.net/lql0716


1 哈希表原理

  • 哈希表,也稱散列表,其原理就是根據所給的值x ,利用給定的哈希函數index=f(x) 計算該函數值,所得的函數值作爲索引值index ,從而在查表的時候可以快速獲取其位置。

  • 哈希表是以空間換時間,即空間複雜度較高,時間複雜度很低,爲O(1)

  • 完美的哈希函數可以使得鍵值與索引值一一對應,但是,有時候多個鍵值對應一個索引值也是不可避免的,這時候就會發生哈希碰撞,解決碰撞的方法有拉鍊法線性探針法鏈地址法

  • 鍵值:可以是字符串,也可以是數字,還可以是其兩者的組合

2 哈希函數

  • 哈希表的核心是哈希函數,可以把哈希函數看成一個僞隨機數生成器。
  • 哈希函數的核心就是儘可能的保證一一映射,索引值的分佈儘可能的平均。

2.1 哈希函數原理

哈希函數的構造方法有很多,常用的有:直接定址法數字分析法平方取中法摺疊法除留餘數法隨機數法.

  • 除留餘數哈希連地址法示例
    鍵值 key= [ 19,14,23,01,68,20,84,27,55,11,10,79 ]

    哈希函數:index=key%13

    對應的索引值 index= [ 6, 1, 10, 1, 3, 7, 6, 1, 3, 11, 10, 1 ]

    哈希表如下圖:
    此處輸入圖片的描述

2.2 經典哈希函數代碼

2.2.1 RSHash

unsigned int RSHash(const std::string& str)
{
   unsigned int b    = 378551;
   unsigned int a    = 63689;
   unsigned int hash = 0;

   for(std::size_t i = 0; i < str.length(); i++)
   {
      hash = hash * a + str[i];
      a    = a * b;
   }

   return hash;
}
/* End Of RS Hash Function */

2.2.2 JSHash

unsigned int JSHash(const std::string& str)
{
   unsigned int hash = 1315423911;

   for(std::size_t i = 0; i < str.length(); i++)
   {
      hash ^= ((hash << 5) + str[i] + (hash >> 2));
   }

   return hash;
}
/* End Of JS Hash Function */

2.2.3 PJWHash

unsigned int PJWHash(const std::string& str)
{
   unsigned int BitsInUnsignedInt = (unsigned int)(sizeof(unsigned int) * 8);
   unsigned int ThreeQuarters     = (unsigned int)((BitsInUnsignedInt  * 3) / 4);
   unsigned int OneEighth         = (unsigned int)(BitsInUnsignedInt / 8);
   unsigned int HighBits          = (unsigned int)(0xFFFFFFFF) << (BitsInUnsignedInt - OneEighth);
   unsigned int hash              = 0;
   unsigned int test              = 0;

   for(std::size_t i = 0; i < str.length(); i++)
   {
      hash = (hash << OneEighth) + str[i];

      if((test = hash & HighBits)  != 0)
      {
         hash = (( hash ^ (test >> ThreeQuarters)) & (~HighBits));
      }
   }

   return hash;
}
/* End Of  P. J. Weinberger Hash Function */

2.2.4 ELFHash

unsigned int ELFHash(const std::string& str)
{
   unsigned int hash = 0;
   unsigned int x    = 0;

   for(std::size_t i = 0; i < str.length(); i++)
   {
      hash = (hash << 4) + str[i];
      if((x = hash & 0xF0000000L) != 0)
      {
         hash ^= (x >> 24);
      }
      hash &= ~x;
   }

   return hash;
}
/* End Of ELF Hash Function */

2.2.5 BKDRHash

unsigned int BKDRHash(const std::string& str)
{
   unsigned int seed = 131; // 31 131 1313 13131 131313 etc..
   unsigned int hash = 0;

   for(std::size_t i = 0; i < str.length(); i++)
   {
      hash = (hash * seed) + str[i];
   }

   return hash;
}
/* End Of BKDR Hash Function */

2.2.6 SDBMHash

unsigned int SDBMHash(const std::string& str)
{
   unsigned int hash = 0;

   for(std::size_t i = 0; i < str.length(); i++)
   {
      hash = str[i] + (hash << 6) + (hash << 16) - hash;
   }

   return hash;
}
/* End Of SDBM Hash Function */

2.2.7 DJBHash

unsigned int DJBHash(const std::string& str)
{
   unsigned int hash = 5381;

   for(std::size_t i = 0; i < str.length(); i++)
   {
      hash = ((hash << 5) + hash) + str[i];
   }

   return hash;
}
/* End Of DJB Hash Function */

2.2.8 DEKHash

unsigned int DEKHash(const std::string& str)
{
   unsigned int hash = static_cast<unsigned int>(str.length());

   for(std::size_t i = 0; i < str.length(); i++)
   {
      hash = ((hash << 5) ^ (hash >> 27)) ^ str[i];
   }

   return hash;
}
/* End Of DEK Hash Function */

2.2.9 BPHash

unsigned int BPHash(const std::string& str)
{
   unsigned int hash = 0;
   for(std::size_t i = 0; i < str.length(); i++)
   {
      hash = hash << 7 ^ str[i];
   }

   return hash;
}
/* End Of BP Hash Function */

2.2.10 FNVHash

unsigned int FNVHash(const std::string& str)
{
   const unsigned int fnv_prime = 0x811C9DC5;
   unsigned int hash = 0;
   for(std::size_t i = 0; i < str.length(); i++)
   {
      hash *= fnv_prime;
      hash ^= str[i];
   }

   return hash;
}
/* End Of FNV Hash Function */

2.2.11 APHash

unsigned int APHash(const std::string& str)
{
   unsigned int hash = 0xAAAAAAAA;

   for(std::size_t i = 0; i < str.length(); i++)
   {
      hash ^= ((i & 1) == 0) ? (  (hash <<  7) ^ str[i] * (hash >> 3)) :
                               (~((hash << 11) + (str[i] ^ (hash >> 5))));
   }

   return hash;
}
/* End Of AP Hash Function */

2.3 哈希表代碼


  • 鏈地址法哈希表

GitHub: https://github.com/lql0716/HashTable

參考

發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章