returning []*T vs []T

polaris · · 380 次点击

这是一个分享于的资源，其中的信息可能已经有所发展或是发生改变。

When should we return []*T and when []T? Edit: Removed the article for less confusion. <hr/>**评论：** lobster_johnson: <pre>There are different use cases based on the pros/cons, and they're nearly exactly the same as non-slice versions (i.e. <code>T</code> vs. <code>*T</code>). <code>[]T</code>: <ul> <li>Pro: Contiguous in memory with respect to T (the runtime will allocate <code>sizeof(T) * cap</code>), which increases cache locality. For example, a loop over <code>[]int</code> can be very efficient and potentially even be vectorized.</li> <li>Con: To access a slice element, it has to be copied, which is more expensive than passing a pointer around. Similarly, modifying an element requires copying the element, modifying it and then copying it back.</li> </ul> <code>[]*T</code>: <ul> <li>Pro: No copying needed in order to read/write elements.</li> <li>Con: Requires an indirection to dereference the stored pointer, which can point anywhere in RAM and will be unlikely to take advantage of cache locality.</li> </ul> Cache locality also includes <a href="http://www.futurechips.org/chip-design-for-all/prefetching.html">RAM prefetching</a>; modern CPU architectures are complicated, but sequential access is generally faster than random access. <hr/> I would recommend using <code>[]T</code> unless you have a specific reason to want to minimize copying. For example, let's say we have this: <pre><code>type Document struct { // Lots of fields here, making Document large } func ClassifyDocuments(docs []Document) map[string][]Document </code></pre> Imagine we want to "classify" the documents based on some heuristic, like divide them into topics like "business", "sports", and so on. There's one input slice, and a map of topics to output slices. Some documents may be in multiple topics, though; and by using <code>[]Document</code>, we're potentially duplicating each document multiple times, which is wasteful. So we should probably do this instead: <pre><code>func ClassifyDocuments(docs []*Document) map[string][]*Document </code></pre> This allows the result to simply point to the same documents as the input. Except for the allocating the <code>map</code> and slices in the result, it's possible that this function doesn't need to allocate anything at all on the heap.</pre>connor4312: <pre>Also note that you can take a pointer to a slice element which avoids the copying and lets you call pointer methods on the type, while still maintaining that lovely chunk of continuous memory. Example: <a href="https://play.golang.org/p/pz0JVHj2dQ">https://play.golang.org/p/pz0JVHj2dQ</a> The small downside is that (at least the last time I checked) in cases where the slice could otherwise be allocated on the stack, taking pointers to elements will cause Go to allocate it on the heap.</pre>lobster_johnson: <pre>Good point, and also in a loop to avoid copying via a <code>range</code>: <pre><code>for i := range things { thing := &things[i] } </code></pre></pre>zemo: <pre><blockquote> modifying an element requires copying the element, modifying it and then copying it back. </blockquote> would this perform a copy? <code>items[8].x = 10</code></pre>xiegeo: <pre>Use []*T when you need to use []*T. Otherwise just use []T. The rules are the same as using *T vs T.</pre>nhooyr: <pre>I've always been using <code>*T</code> as the default for my methods, I thought it was the opposite. Use <code>*T</code> unless you need to use <code>T</code>.</pre>xiegeo: <pre>*T is a must for methods that modify T, or if you what *T to implement an interface, so methods tends to get called on pointers.</pre>nhooyr: <pre>But if I'm not modifying it, I should use <code>T</code> by default unless I profile and find the receiver is large enough that it is causing issues?</pre>xiegeo: <pre>For method receivers, this sums it up nicely: <a href="https://golang.org/doc/faq#methods_on_values_or_pointers" rel="nofollow">https://golang.org/doc/faq#methods_on_values_or_pointers</a> For me, if T is a rename of a simple type such as int, they I use T. but if T is a struct then I use *T in case I want to add a modifying method and the rest none modifying methods should be consistent.</pre>sh41: <pre>Relevant discussion in <code>go-github</code> library, started by Russ Cox: <a href="https://github.com/google/go-github/issues/180">https://github.com/google/go-github/issues/180</a></pre>nesigma: <pre>Nice find! That settles it. So apparently the correct answer is that it depends on the size of T. When dealing with large structs it is better to use []*T like Russ Cox recommends especially because of the code that will iterate on that slice. It's finally crystal clear in my head. Thank you.</pre>uncle_bad_touches: <pre>Less memory fragmentation and fewer pointers to GC?</pre>kl0nos: <pre>When you copy slice out of the function you are not copying any data from the slice, you are copying pointer that is pointing to that data. Just use []T for slices.</pre>nesigma: <pre><blockquote> Just use []T for slices. </blockquote> When is it more appropriate to return []*T?</pre>Deltigre: <pre>I used it temporarily for an object pool (to avoid excessive GC) but quickly changed to a linked list + free object stack implementation. Edit: another related use I can think of is maintaining small struct size with an array of pointers. Or you might want a list of objects you wish to mutate. Obviously these are all specialized use cases.</pre>materialdesigner: <pre>Isn't sync.Pool for this purpose?</pre>Deltigre: <pre>From the docs: <blockquote> On the other hand, a free list maintained as part of a short-lived object is not a suitable use for a Pool, since the overhead does not amortize well in that scenario. It is more efficient to have such objects implement their own free list. </blockquote> I use it to manage the per-routine pools because it includes synchronization, but the in-routine pools are singly-linked structs, placed in a stack when free.</pre>kl0nos: <pre>Don't use slice of pointers unless you really have to. It's another level of indirection, it will hurt your cache and prefetcher. Whenever you can just use []T instead of []*T.</pre>: <pre>[deleted]</pre>DocMerlin: <pre>No, an array is NOT a collection of pointers. It is a bunch of objects in memory. A slice is a pointer to an array, a length, and a capacity.</pre>Remi1115: <pre>I can't find any source that supports your statement. Could it be that you're confusing slices, which contain a pointer to the underlying array?</pre>

入群交流（和以上内容无关）：加入Go大咖交流群，或添加微信：liuxiaoyan-s 备注：入群；或加QQ群：692541889

380 次点击

加入收藏微博

slice

github

runtime

0 回复

添加一条新回复（您需要登录后才能回复没有账号？）

请尽量让自己的回复能够对别人有帮助
支持 Markdown 格式, **粗体**、~~删除线~~、`单行代码`
支持 @ 本站用户；支持表情（输入 : 提示），见 Emoji cheat sheet
图片支持拖拽、截图粘贴等方式上传

returning []*T vs []T

用户登录

今日阅读排行

一周阅读排行

最新主题