When does it make sense to use a goroutine?

Say I have an API. When someone calls /register, the password is hashed and stored in the database, and something is written to the log file. Would it make sense to start a goroutine for the logging part, or should I just keep it as it is?

Another example might be /information/placexyz. One goroutine would look up api1, a second would request data from another API, and a third would queue the database on the server itself. The "main" function waits for all goroutines to return their results, and a JSON response is sent to the client.

These are just examples. Is there any guideline for when a goroutine makes sense or improves performance?

Comments:

bonekeeper:

Anything that involves I/O (reading from or writing to a database, remote API requests, etc.) is a good candidate, unless it is in a tight loop. Also, sometimes having a fixed number of workers to offload that work to makes more sense than blindly spawning new goroutines. For example, there is usually a limit on how many open connections a database will keep, so spawning a new goroutine and opening a new database connection everywhere would be detrimental; that is one of the reasons database connection pools exist.
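
For illustration, here is a minimal sketch of the fixed-number-of-workers idea. The names processAll and square are hypothetical, and square only stands in for an I/O-bound call such as a database write; this is one way to do it, not the only one.

```go
package main

import (
	"fmt"
	"sync"
)

// processAll runs work on every job using a fixed number of worker
// goroutines instead of spawning one goroutine per job.
func processAll(jobs []int, workers int, work func(int) int) []int {
	results := make([]int, len(jobs))
	indexes := make(chan int)

	var wg sync.WaitGroup
	for w := 0; w < workers; w++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			// Each worker pulls indexes until the channel is closed.
			// Every index is handed to exactly one worker, so the
			// writes to results never overlap.
			for i := range indexes {
				results[i] = work(jobs[i])
			}
		}()
	}

	for i := range jobs {
		indexes <- i
	}
	close(indexes)
	wg.Wait()
	return results
}

func main() {
	// square is a placeholder for real I/O-bound work (assumption).
	square := func(n int) int { return n * n }
	fmt.Println(processAll([]int{1, 2, 3, 4, 5}, 2, square)) // prints [1 4 9 16 25]
}
```

With two workers, at most two jobs are in flight at any time, no matter how many jobs are queued.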

ArghusSquare:

Thank you

tmornini:

Use a goroutine when you want a function to run independently and asynchronously to the code that invokes the function.

Your example of an endpoint that invoked other HTTP requests is a good example.

Keep in mind that the net/http package calls your handlerfunc in a goroutine, so it runs asynchronously to all other requests already.

One thing I don't understand is what you mean by "queue the database"

And it's critical to understand that when you say "waits for all routines to return their results" that you mean "receives results for all goroutines on channels" :-)
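
A rough sketch of what "receives results on channels" could look like. The gather function, the result type, and the fake fetchers are all made up for illustration; real code would call the external APIs and the database instead of sleeping.

```go
package main

import (
	"fmt"
	"time"
)

// result carries the output of one lookup together with its source name.
type result struct {
	source string
	data   string
}

// gather starts every fetcher in its own goroutine and then receives
// exactly one result per fetcher from a shared channel.
func gather(fetchers map[string]func() string) map[string]string {
	// Buffered so that no sender blocks, even if the receiver is slow.
	ch := make(chan result, len(fetchers))
	for name, fetch := range fetchers {
		go func(name string, fetch func() string) {
			ch <- result{source: name, data: fetch()}
		}(name, fetch)
	}

	out := make(map[string]string, len(fetchers))
	for range fetchers {
		r := <-ch
		out[r.source] = r.data
	}
	return out
}

func main() {
	// slow builds a fake fetcher that takes 50ms to answer (assumption).
	slow := func(s string) func() string {
		return func() string {
			time.Sleep(50 * time.Millisecond)
			return s
		}
	}
	out := gather(map[string]func() string{
		"api1": slow("a"),
		"api2": slow("b"),
		"db":   slow("c"),
	})
	fmt.Println(out)
}
```

The three fake lookups sleep concurrently, so the handler waits roughly as long as the slowest one rather than the sum of all three.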

ArghusSquare:


> One thing I don't understand is what you mean by "queue the database"

I'm aware that the handler funcs already run in goroutines. What I mean by "queue the database" is querying my own database. For example, the handler does the following:

[go?] getDataFromExternalAPI1()
[go?] getDataFromExternalAPI2()
[go?] getDataFromDB() // with func (*DB) Query
[go?] logToDBFile()
wait for results from channels 

Currently I do not have a function for every single one of these actions. I guess the real question is: when is the overhead of goroutines large enough that just keeping the work in the handler function is better?

nhooyr:

You'd have to benchmark it to really know.
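
As a starting point, something like the sketch below compares running a small task inline with running it in a fresh goroutine. The work function is a hypothetical stand-in for whatever you actually want to measure; testing.Benchmark is used here so it runs as a normal program, though in a real project you would more likely put Benchmark functions in a _test.go file and run go test -bench.

```go
package main

import (
	"fmt"
	"sync"
	"testing"
)

// sink keeps the compiler from optimizing the work away.
var sink int

// work is a hypothetical stand-in for the task being measured.
func work() int {
	sum := 0
	for i := 0; i < 100; i++ {
		sum += i
	}
	return sum
}

// benchInline runs work directly in the calling goroutine.
func benchInline(b *testing.B) {
	for i := 0; i < b.N; i++ {
		sink = work()
	}
}

// benchGoroutine runs each call in a new goroutine and waits for it,
// so the measurement includes the spawn and synchronization cost.
func benchGoroutine(b *testing.B) {
	var wg sync.WaitGroup
	for i := 0; i < b.N; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			sink = work()
		}()
		wg.Wait()
	}
}

func main() {
	fmt.Println("inline:   ", testing.Benchmark(benchInline))
	fmt.Println("goroutine:", testing.Benchmark(benchGoroutine))
}
```

The numbers depend heavily on the machine and on what work really does, so treat the output as a local measurement, not a general answer.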

tmornini:

Depends on what you mean by overhead.

If the goal is CPU efficiency, then don't use goroutines.

If the goal is minimum response time, use goroutines.

jerf:

While I agree that you'd have to benchmark to really know, goroutine overhead is on the order of 1 microsecond to start them up (give or take an order of magnitude), and relatively small numbers for task switching when IO occurs. External APIs are generally milliseconds away, so goroutine overhead is very likely lost in the shuffle there. For the cost of more complicated code than raw serial code, you'll get better responsiveness if you put those in goroutines because you'll get the answers in parallel, or in the worst case, at least be running your timeouts in parallel.

A local DB is a closer call; it is possible to have queries that run fast enough that goroutine overhead could be noticeable. I say "noticeable" rather than dominant because you'd have to be doing some awfully simple stuff to some awfully simple databases for an in-process goroutine to be the dominant cost of such an access. But I'd say the vast majority of the time goroutine overhead on such a task will still be lost in the noise; hundreds of microseconds or entire milliseconds for a response from such things would be fairly normal.

Logging is its own complicated topic. It has costs that people generally find non- or even counter-intuitive. You'd have to benchmark it.

ArghusSquare:

Thanks, that helps :)

karma_vacuum123:

Remember that you need to synchronize your goroutines too, and that is a great place to make mistakes.
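
One common instance of this: several goroutines updating the same variable. A minimal sketch with a mutex follows; the counter type and countConcurrently are made up for illustration. Without the lock, the increments would race and the final count could come out wrong.

```go
package main

import (
	"fmt"
	"sync"
)

// counter is safe for concurrent use because every access to n
// goes through mu.
type counter struct {
	mu sync.Mutex
	n  int
}

// inc adds one to the counter while holding the lock.
func (c *counter) inc() {
	c.mu.Lock()
	defer c.mu.Unlock()
	c.n++
}

// value returns the current count while holding the lock.
func (c *counter) value() int {
	c.mu.Lock()
	defer c.mu.Unlock()
	return c.n
}

// countConcurrently increments a shared counter once from each of
// the given number of goroutines and returns the final count.
func countConcurrently(goroutines int) int {
	var c counter
	var wg sync.WaitGroup
	for i := 0; i < goroutines; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			c.inc()
		}()
	}
	wg.Wait()
	return c.value()
}

func main() {
	fmt.Println(countConcurrently(1000)) // prints 1000
}
```

Running tests or the program with the -race flag is a cheap way to catch the unsynchronized version of this kind of code.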

oh-thatguy:

Ugh. Yes.

dirkharrington:

Your handler is already being called in a goroutine by the standard library.

NikkoTheGreeko:

Goroutines should only be used if you need them. You only know whether you need them by benchmarking and finding bottlenecks, blocking, etc. after the program is built. I wouldn't recommend trying to plan out how to spawn goroutines when you first start your project unless you really know what you're doing, or else you risk introducing a lot of unnecessary debugging nightmares.

karma_vacuum123:

The only reason to do this would be if logging itself can block. If you are using rsyslog, it is typically configured with UDP, so this should not be a concern. Many other log aggregation mechanisms default to UDP for similar reasons: it is better for a logger to fail outright than to block mainline execution.
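
The same "fail rather than block" idea can be sketched in-process with a buffered channel and a select with a default case. This is a different mechanism from UDP syslog, and asyncLogger and its methods are hypothetical names, not part of any logging library.

```go
package main

import "fmt"

// asyncLogger hands messages to a background goroutine through a
// buffered channel and drops them when the buffer is full.
type asyncLogger struct {
	msgs chan string
	done chan struct{}
}

// newAsyncLogger starts the background writer. write is whatever
// actually persists a message (file, socket, ...).
func newAsyncLogger(buffer int, write func(string)) *asyncLogger {
	l := &asyncLogger{
		msgs: make(chan string, buffer),
		done: make(chan struct{}),
	}
	go func() {
		defer close(l.done)
		for m := range l.msgs {
			write(m)
		}
	}()
	return l
}

// log never blocks the caller. It reports false when the buffer was
// full and the message was dropped.
func (l *asyncLogger) log(msg string) bool {
	select {
	case l.msgs <- msg:
		return true
	default:
		return false
	}
}

// close flushes queued messages and waits for the writer to finish.
// log must not be called after close.
func (l *asyncLogger) close() {
	close(l.msgs)
	<-l.done
}

func main() {
	l := newAsyncLogger(16, func(m string) { fmt.Println(m) })
	l.log("user registered")
	l.close() // prints "user registered" before returning
}
```

Whether dropping log lines is acceptable depends on the application; for audit-style logs you would probably prefer to block or return an error instead.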
