[Question] Help with concurrently inserting to redis

agolangf · · 466 次点击    
这是一个分享于 的资源,其中的信息可能已经有所发展或是发生改变。
<p>I have three elements consisting of 1 million, 2 million and 3 million ints respectively. I want to insert them to redis such that they all happen concurrently and the total execution time is not greater than the execution time of the 3 million ints. I tried using sync.Waitgroup but it doesn&#39;t speed up the execution. Here&#39;s my code.</p> <pre><code>package main import ( &#34;log&#34; &#34;strconv&#34; &#34;time&#34; &#34;gopkg.in/redis.v5&#34; ) func main() { oneMillion := makeRange(1, 1000000) twoMillion := makeRange(1000001, 3000000) threeMillion := makeRange(3000001, 6000000) elements := [][]int{oneMillion, twoMillion, threeMillion} client := redis.NewClient(&amp;redis.Options{ Addr: &#34;localhost:6379&#34;, Password: &#34;&#34;, DB: 0, DialTimeout: 60 * time.Second, WriteTimeout: 60 * time.Second, ReadTimeout: 60 * time.Second, PoolTimeout: 60 * time.Second, }) pipeline := client.Pipeline() for _, elem := range elements { for i := 0; i &lt; len(elem); i++ { key := &#34;KEY:&#34; + strconv.Itoa(elem[i]) val := &#34;VAL:&#34; + strconv.Itoa(elem[i]) cmd := pipeline.Set(key, val, 0) if cmd.Err() != nil { log.Fatal(&#34;cmd error: &#34;, cmd.Err()) } } _, err := pipeline.Exec() if err != nil { log.Fatal(&#34;error: &#34;, err) } } } func makeRange(min, max int) []int { a := make([]int, max-min+1) for i := range a { a[i] = min + i } return a } </code></pre> <hr/>**评论:**<br/><br/>cdoxsey: <pre><p>Hi newbgopher,</p> <p>Redis is a single-threaded database, so although concurrency may help a little here, in the end you&#39;re limited by the database itself. To improve performance you need run multiple Redis processes and partition your data.</p> <p>Some other suggestions:</p> <ol> <li>If you are truly inserting a range of integers, using a Lua script will likely perform far better (all the computation happens on the server it self so there&#39;s no network load)</li> <li>You should limit the size of the pipeline and periodically execute it. Doing all 3 million in one execute is probably asking too much.</li> <li>For the goroutines, make sure you are using a separate redis connection for each one.</li> </ol></pre>newbgopher: <pre><p>Thanks for the response! Im not really inserting ints, i just used it for simplicity sake. I&#39;ll try using a dedicated redis connection per data element. </p></pre>sgmansfield: <pre><p>It really doesn&#39;t matter how many different connections you add. Redis literally only does one thing at a time. Your total time will be the sum of all three, the best you can do is eliminate a couple round trips by concurrently sending the data. If your long pole is Redis inserting those huge lists, you&#39;ll be waiting regardless.</p></pre>newbgopher: <pre><p>thanks for the response! </p></pre>Morgahl: <pre><p>The package and configuration in use in the code sample is using a connection pool of 10 (default) so at least your 3rd point is covered somewhat :) </p></pre>lumost: <pre><p>In addition to the single-threaded nature of redis producing a bottleneck ( even with proper batching redis will only push 60-400k ops per second ). It&#39;s important to consider the number of bytes on the network - 3 million ints has a minimum overhead of 12 MB prior to serialization and connection overhead. A stock linux host will also consume a minimum of 4KB ram per connection. </p> <p>Depending on your needs for this data you may want to consider serializing the 3 million ints into a single record. This would avoid most of the per op overhead of redis, and with some cleverness will still allow per record access using the get range command. </p></pre>throwawayguy123xd: <pre><p>i think the other posters answered clearer, but to answer your specific question</p> <p>i would think you need 3 clients (or 3 insert-interfaces), so even though may not be able to with single-threaded redis, you would try something like:</p> <pre><code> func makeandinsert(array* []int, index int) { client := redis.newClient(...) for i,v := range array[index] { client.insert(v) //or whatever } } func main() { elements := ... go makeandinsert(&amp;elements, 0) go makeandinsert(&amp;elements, 1) go makeandinsert(&amp;elements, 2) } </code></pre> <p>its just a different way of doing it multi-threaded (aka refactoring)</p></pre>

入群交流(和以上内容无关):加入Go大咖交流群,或添加微信:liuxiaoyan-s 备注:入群;或加QQ群:692541889

466 次点击  
加入收藏 微博
暂无回复
添加一条新回复 (您需要 登录 后才能回复 没有账号 ?)
  • 请尽量让自己的回复能够对别人有帮助
  • 支持 Markdown 格式, **粗体**、~~删除线~~、`单行代码`
  • 支持 @ 本站用户;支持表情(输入 : 提示),见 Emoji cheat sheet
  • 图片支持拖拽、截图粘贴等方式上传