Using and Implementing the Golang hashmap

51reboot · 2017-08-22 10:05:03 · 13237 views · estimated reading time 4 minutes

This article was created on 2017-08-22 10:05:03; the information in it may have evolved or changed since then.

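Basic syntax

Defining a hashmap variable

Go is a strongly typed language, so a hashmap is typed as well: both the key and the value must have a declared type. For example, to declare a map whose key is a string and whose value is also a string, write: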

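var m map[string]string // declares a nil map: reads work, but writes panic until it is initialized
m = make(map[string]string) // initialize an empty map
m = make(map[string]string, 3) // initialize with an optional size hint (room for about 3 entries; a hint, not a limit)

m2 := map[string]string{} // declare and initialize with a literal

m3 := make(map[string]string) // declare and initialize with make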
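
To make the difference between a declared map and an initialized one concrete, here is a minimal sketch. The helper name writeToNil is made up for this illustration; the behavior shown (reads on a nil map succeed, writes panic, and the size passed to make is only a hint) follows the Go language rules.

```go
package main

import "fmt"

// writeToNil reports whether writing to a nil map panics.
func writeToNil() (panicked bool) {
	defer func() {
		if r := recover(); r != nil {
			panicked = true
		}
	}()
	var m map[string]string // nil map, never initialized
	m["a"] = "1"            // panics: assignment to entry in nil map
	return false
}

func main() {
	var m map[string]string
	fmt.Println(m == nil, len(m), m["missing"] == "") // true 0 true
	fmt.Println(writeToNil())                         // true

	m = make(map[string]string, 3) // size hint only; the map still grows on demand
	for i := 0; i < 10; i++ {
		m[fmt.Sprint(i)] = "v"
	}
	fmt.Println(len(m)) // 10
}
```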

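Most types can be used as keys. The ones that cannot share one trait: they cannot be compared with ==. These are slice, map, and function (and any array or struct that contains one of them).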
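
As a rough illustration of the key rule, the sketch below uses a struct and an array as keys and shows one possible workaround for slices. The point type and the joinKey helper are hypothetical names introduced only for this example.

```go
package main

import "fmt"

// point is a hypothetical struct type; all of its fields are comparable,
// so the struct itself can be used as a map key.
type point struct {
	x, y int
}

// joinKey encodes a slice as a quoted string so it can stand in as a key.
func joinKey(parts []string) string {
	return fmt.Sprintf("%q", parts)
}

func main() {
	visited := map[point]bool{}
	visited[point{1, 2}] = true
	fmt.Println(visited[point{1, 2}], visited[point{3, 4}]) // true false

	// Arrays of comparable elements work as keys too.
	grid := map[[2]int]string{{0, 0}: "origin"}
	fmt.Println(grid[[2]int{0, 0}]) // origin

	// map[[]string]int would not compile: a slice is not a valid key type.
	seen := map[string]int{}
	seen[joinKey([]string{"a", "b"})]++
	fmt.Println(seen[joinKey([]string{"a", "b"})]) // 1
}
```

Encoding the slice with %q keeps ["a b"] and ["a", "b"] distinct, which a plain join on spaces would not.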

get,set,delete

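m := map[string]int{}
m["a"] = 1

fmt.Println(m["a"]) // prints 1

// accessing a key that does not exist returns the zero value of the value type
fmt.Println(m["b"]) // prints 0

// test whether a key exists
v, ok := m["b"]

if ok {
    fmt.Println(v) // the key exists and v holds its value
}

// delete a key
delete(m, "a")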
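
Because a missing key yields the zero value, a stored 0 and an absent key look the same unless the two-value form is used. A small sketch of that distinction follows; lookup is a helper name invented for this example.

```go
package main

import "fmt"

// lookup describes a key, telling a stored zero apart from a missing key.
func lookup(m map[string]int, key string) string {
	if v, ok := m[key]; ok {
		return fmt.Sprintf("%s=%d", key, v)
	}
	return key + " is absent"
}

func main() {
	m := map[string]int{"a": 1, "zero": 0}
	fmt.Println(lookup(m, "zero")) // zero=0
	fmt.Println(lookup(m, "b"))    // b is absent

	m["b"]++ // the zero value lets counters work without an existence check
	fmt.Println(lookup(m, "b")) // b=1

	delete(m, "a")
	delete(m, "a") // deleting a missing key is a no-op
	fmt.Println(lookup(m, "a"), len(m)) // a is absent 2
}
```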

Iteration

// iterate over keys only
for k := range m {
    fmt.Println(k)
}

// iterate over keys and values together
for k, v := range m {
    fmt.Println(k, v)
}

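A map may be deleted from and updated while it is being iterated. The rules are:

Iteration order is unspecified and unrelated to insertion order
If a key that has not been reached yet is deleted during iteration, it will not be visited; a key that has already been visited is never visited again
If a key is added during iteration, it may or may not be visited
An uninitialized (nil) map can also be iterated; the loop body simply never runs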
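
The rules above can be sketched as follows. The helpers sortedKeys and dropOdd are names made up for this example; sorting the keys is one common way to get a stable order when the output must be reproducible.

```go
package main

import (
	"fmt"
	"sort"
)

// sortedKeys returns the keys of m in ascending order.
func sortedKeys(m map[string]int) []string {
	keys := make([]string, 0, len(m))
	for k := range m {
		keys = append(keys, k)
	}
	sort.Strings(keys)
	return keys
}

// dropOdd deletes entries with odd values while ranging over the map,
// which is safe in Go.
func dropOdd(m map[string]int) {
	for k, v := range m {
		if v%2 == 1 {
			delete(m, k)
		}
	}
}

func main() {
	m := map[string]int{"a": 1, "b": 2, "c": 3, "d": 4}
	dropOdd(m)
	for _, k := range sortedKeys(m) {
		fmt.Println(k, m[k]) // b 2, then d 4
	}

	var empty map[string]int // nil map
	n := 0
	for range empty {
		n++
	}
	fmt.Println(n) // 0
}
```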

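Other notes

A map value is not addressable, which means that syntax such as &m["a"] is illegal
len returns the number of key-value pairs currently in the map; cap cannot be applied to a map (it is a compile error), so the total capacity is not exposed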
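
Since map values cannot be addressed, a struct stored by value cannot be modified in place. Two common workarounds are sketched below; the counter type and both helper functions are hypothetical names used only for illustration.

```go
package main

import "fmt"

// counter is a hypothetical value type used only for this illustration.
type counter struct {
	hits int
}

// bumpByCopy updates a struct value by copying it out and storing it back.
func bumpByCopy(m map[string]counter, key string) {
	c := m[key] // copy of the stored value (zero value if missing)
	c.hits++
	m[key] = c
}

// bumpByPointer updates through a pointer stored in the map.
func bumpByPointer(m map[string]*counter, key string) {
	c, ok := m[key]
	if !ok {
		c = &counter{}
		m[key] = c
	}
	c.hits++
}

func main() {
	// m["a"].hits++ and &m["a"] do not compile for map[string]counter.
	byValue := map[string]counter{}
	bumpByCopy(byValue, "a")
	bumpByCopy(byValue, "a")

	byPtr := map[string]*counter{}
	bumpByPointer(byPtr, "a")

	fmt.Println(byValue["a"].hits, byPtr["a"].hits, len(byValue)) // 2 1 1
}
```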

Internal structure

hashmap structure

Go's map is a hash table, which means the average access time is O(1). Like a traditional hashmap, it is made up of buckets:

// A header for a Go map.
type hmap struct {
 // Note: the format of the Hmap is encoded in ../../cmd/internal/gc/reflect.go and
 // ../reflect/type.go.  Don't change this structure without also changing that code!
 count int // # live cells == size of map.  Must be first (used by len() builtin)
 flags uint8
 B     uint8  // log_2 of # of buckets (can hold up to loadFactor * 2^B items)
 hash0 uint32 // hash seed

 buckets    unsafe.Pointer // array of 2^B Buckets. may be nil if count==0.
 oldbuckets unsafe.Pointer // previous bucket array of half the size, non-nil only when growing
 nevacuate  uintptr        // progress counter for evacuation (buckets less than this have been evacuated)

 // If both key and value do not contain pointers and are inline, then we mark bucket
 // type as containing no pointers. This avoids scanning such maps.
 // However, bmap.overflow is a pointer. In order to keep overflow buckets
 // alive, we store pointers to all overflow buckets in hmap.overflow.
 // Overflow is used only if key and value do not contain pointers.
 // overflow[0] contains overflow buckets for hmap.buckets.
 // overflow[1] contains overflow buckets for hmap.oldbuckets.
 // The first indirection allows us to reduce static size of hmap.
 // The second indirection allows to store a pointer to the slice in hiter.
 overflow *[2]*[]*bmap
}
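
The comment on the B field says that 2^B buckets can hold up to loadFactor * 2^B items. The sketch below illustrates that relationship. It is a simplified model, not the runtime's code: the constants 8 and 6.5 are assumptions taken from the runtime source of that period, and the function names are chosen for this example.

```go
package main

import "fmt"

const (
	bucketCnt  = 8   // assumed number of slots per bucket
	loadFactor = 6.5 // assumed average load per bucket that triggers growth
)

// overLoadFactor reports whether count items are too many for 2^B buckets.
func overLoadFactor(count int, B uint8) bool {
	return count > bucketCnt && float64(count) > loadFactor*float64(uint64(1)<<B)
}

// bucketShift returns the smallest B that holds count items without growing.
func bucketShift(count int) uint8 {
	var B uint8
	for overLoadFactor(count, B) {
		B++
	}
	return B
}

func main() {
	for _, n := range []int{8, 9, 13, 14, 100, 1000} {
		B := bucketShift(n)
		fmt.Printf("items=%d B=%d buckets=%d\n", n, B, uint64(1)<<B)
	}
}
```

Under these assumptions a single bucket is enough for up to 8 items, and each further doubling of the bucket array raises the limit by a factor of two.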

Inside a bucket

// A bucket for a Go map.
type bmap struct {
 tophash [bucketCnt]uint8 // Followed by bucketCnt keys and then bucketCnt values.
 // NOTE: packing all the keys together and then all the values together makes the
 // code a bit more complicated than alternating key/value/key/value/... but it allows
 // us to eliminate padding which would be needed for, e.g., map[int64]int8.
 // Followed by an overflow pointer.
}
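
The note about padding in the bmap comment can be checked with unsafe.Sizeof. The two struct types below are illustrative stand-ins for a map[int64]int8 bucket with 8 slots, not the runtime's actual layout code.

```go
package main

import (
	"fmt"
	"unsafe"
)

// interleaved stores key/value/key/value..., so every int8 value is padded
// to keep the following int64 key aligned.
type interleaved struct {
	tophash [8]uint8
	pairs   [8]struct {
		key   int64
		value int8
	}
}

// grouped stores all keys first and then all values, as the bmap comment describes.
type grouped struct {
	tophash [8]uint8
	keys    [8]int64
	values  [8]int8
}

// sizes returns the in-memory size of both layouts.
func sizes() (interleavedSize, groupedSize uintptr) {
	return unsafe.Sizeof(interleaved{}), unsafe.Sizeof(grouped{})
}

func main() {
	a, b := sizes()
	fmt.Println(a, b) // 136 and 80 on a typical 64-bit platform
}
```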

Looking up a value by key

func mapaccess1(t *maptype, h *hmap, key unsafe.Pointer) unsafe.Pointer

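maptype is the type information of the map. The compiler generates it statically at compile time, and it holds metadata about the map such as the type information of the key and the value.

*hmap is the map header, that is, the reference to the map
key is a generic pointer that refers to the key
The return value is a pointer to the corresponding value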

Computing the hash to find the bucket

How do we get to the right bucket? First we need the hash value of the key:

alg := t.key.alg
hash := alg.hash(key, uintptr(h.hash0))
m := uintptr(1)<<h.B - 1
b := (*bmap)(add(h.buckets, (hash&m)*uintptr(t.bucketsize)))
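
The masking step can be tried outside the runtime. In this sketch FNV-1a from the standard library stands in for the runtime's per-type hash function, and hashString and bucketIndex are names invented for the example; only the mask arithmetic mirrors the snippet above.

```go
package main

import (
	"fmt"
	"hash/fnv"
)

// bucketIndex picks one of 2^B buckets from the low-order B bits of the hash.
func bucketIndex(hash uint64, B uint8) uint64 {
	mask := uint64(1)<<B - 1
	return hash & mask
}

// hashString is a stand-in hash (FNV-1a) mixed with a seed; the runtime uses
// its own per-type hash function instead.
func hashString(key string, seed uint32) uint64 {
	h := fnv.New64a()
	h.Write([]byte{byte(seed), byte(seed >> 8), byte(seed >> 16), byte(seed >> 24)})
	h.Write([]byte(key))
	return h.Sum64()
}

func main() {
	const B = 3 // 2^3 = 8 buckets
	for _, k := range []string{"a", "b", "c"} {
		h := hashString(k, 42)
		// Masking with 2^B-1 gives the same result as h % 2^B.
		fmt.Println(k, bucketIndex(h, B), h%8 == bucketIndex(h, B))
	}
}
```

Because the bucket count is always a power of two, the mask replaces a modulo operation with a single AND.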

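Using tophash and key to locate the entry within the bucket

tophash allows fast rejection: if the tophash does not match, the slot is skipped immediately
If the tophash matches, the keys themselves are compared; if they are equal, the entry has been found
If every slot in the current bucket has been tried without a match, the search jumps to the next (overflow) bucket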
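
The three steps above can be sketched with a toy map. Everything here is a simplified assumption for illustration: keys are strings, values are ints, FNV-1a replaces the runtime hash, a tophash of 0 marks an empty slot, and there is no deletion or growth.

```go
package main

import (
	"fmt"
	"hash/fnv"
)

const slots = 8 // entries per bucket, mirroring bucketCnt

// bucket is a simplified, typed stand-in for bmap.
type bucket struct {
	tophash  [slots]uint8 // 0 marks an empty slot in this sketch
	keys     [slots]string
	values   [slots]int
	overflow *bucket
}

type toyMap struct {
	B       uint8
	buckets []bucket
}

func newToyMap(B uint8) *toyMap {
	return &toyMap{B: B, buckets: make([]bucket, 1<<B)}
}

func hashOf(key string) uint64 {
	h := fnv.New64a()
	h.Write([]byte(key))
	return h.Sum64()
}

// locate returns the first bucket of the chain for key and the key's tophash.
func (m *toyMap) locate(key string) (*bucket, uint8) {
	hash := hashOf(key)
	top := uint8(hash >> 56) // top 8 bits of the hash
	if top == 0 {
		top = 1 // keep 0 reserved for "empty"
	}
	return &m.buckets[hash&(uint64(1)<<m.B-1)], top
}

func (m *toyMap) get(key string) (int, bool) {
	b, top := m.locate(key)
	for ; b != nil; b = b.overflow { // walk the overflow chain
		for i := 0; i < slots; i++ {
			if b.tophash[i] != top {
				continue // cheap one-byte rejection
			}
			if b.keys[i] == key { // full key comparison only on a tophash match
				return b.values[i], true
			}
		}
	}
	return 0, false
}

func (m *toyMap) set(key string, value int) {
	b, top := m.locate(key)
	var free *bucket
	freeIdx := 0
	for {
		for i := 0; i < slots; i++ {
			if b.tophash[i] == top && b.keys[i] == key {
				b.values[i] = value // existing key: overwrite
				return
			}
			if b.tophash[i] == 0 && free == nil {
				free, freeIdx = b, i // remember the first empty slot
			}
		}
		if b.overflow == nil {
			break
		}
		b = b.overflow
	}
	if free == nil { // chain is full: hang a new overflow bucket off the end
		b.overflow = &bucket{}
		free, freeIdx = b.overflow, 0
	}
	free.tophash[freeIdx], free.keys[freeIdx], free.values[freeIdx] = top, key, value
}

func main() {
	m := newToyMap(0) // a single bucket, so the 9th key must overflow
	for i := 0; i < 20; i++ {
		m.set(fmt.Sprintf("key%d", i), i)
	}
	v, ok := m.get("key13")
	fmt.Println(v, ok) // 13 true
	_, ok = m.get("nope")
	fmt.Println(ok) // false
}
```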

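Growth

Meaning of each parameter (the columns of the load-factor table in the runtime source comments):

%overflow: the percentage of buckets that have an overflow bucket
bytes/entry: the number of overhead bytes used per key-value pair
hitprobe: the average number of entries checked when looking up a key that is present
missprobe: the average number of entries checked when looking up a key that is absent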

The row currently in use is the one for a load factor of 6.5.

Author: icexin
Link: http://t.cn/RCXgEjr

This article comes from: Jianshu

Thanks to the author: 51reboot