Is GO fast enough for infrastructure programming?

polaris · 2016-08-21 07:00:11 · 610 次点击    
这是一个分享于 2016-08-21 07:00:11 的资源,其中的信息可能已经有所发展或是发生改变。

Hey guys,

I'm very interested and trying to break into infrastructure programming. I'd like to develop infrastructure components such as loadbalancers, caches, etc...

Most of the main stream tools are written in C, I'm assuming for the speed and memory handling.

I personally do not want to write in C, but I'm totally willing to learn it if this is necessary. However would Go be an applicable language for this type of work also? I've started playing around with Go and I love the syntax and concepts of the language thus far.


评论:

mwholt:

Go was designed for this.

BraveNewCurrency:

Is GO fast enough for infrastructure programming?

Yes.

trying to break into infrastructure programming

I am confused by this comment. You make it sound like Infrastructure Programming is a cabal that you break into. Instead, you should think of it as a skill that you learn. Infrastructure programming requires more knowledge than typical programming.

I'd like to develop infrastructure components such as loadbalancers, caches,

Again, that's an odd thing to want. Do you have someone lined up to pay you to develop that? If not, your target should be to learn about infrastructure components. It's much more likely that companies will pay you to grab one off the shelf instead of re-inventing the wheel.

Don't get me wrong, I'm not saying you shouldn't write those programs. Just that writing those is not an end, it's a means to an end of understanding infrastructure. If you have to ask others if a language is any good, you are too much of a novice to write the next great load balancer. But learning infrastructure programming will make you a better programmer.

ldelossa:

Well that's an interesting way of putting things.

Right now I come from infrastructure architecture, for many years I've setup off the shelf solutions, I've designed load balancing tiers, data centers, countless networks and performant environments. I've recently began to do a lot of development in this space also. I do a lot of coding for operational things.

To me, my perfect job is now to move into the actual development OF these solutions, and I need to start somewhere.

Just taking a look now Uber, Adobe, square space, and digital ocean all have openings specifically looking for software infrastructure engineers. These jobs really get me excited but I need to begin to practice this type of work and get a feel for what it consists of. So I'm looking to give myself some projects and I need to decide on a language.

metarx:

What about contributing to open source infra projects, such as kubernetes, docker, hashicorp stuff and many other open source infrastructure. All written in go btw

I think those would be a better foot in the door to those positions than writing your own from scratch.

ldelossa:

That's a great idea actually. I will take that advice.

BraveNewCurrency:

my perfect job is now to move into the actual development OF these solutions

OK. Let me warn you that there are only a handful of companies big enough to employ people to work on LBs and caches. Most companies just string together existing solutions, and write some glue code to get it all working. That's the reality.

I think part of the problem is terminology. When I hear "Infrastructure Software Engineering" it sounds like DevOps type stuff (writing Chef, setting up monitoring and alerts, etc). I'm not sure I can think of a Job Title that is specific to writing low-level infrastructure like you want.

(Source: I am a hiring manager for a cloud-based company with millions of users. I am looking to hire what HR calls "Software Engineering - Infrastructure". A.K.A. Systems Engineering / DevOps. I just need a good programmer who really knows Linux and Networking.)

ldelossa:

I can completely understand that. SquareSpace right now has a very awesome job posting for "Edge Infrastructure" engineer. That is a perfect description of what I'd like to be doing in year or so as I get my skills up.

https://www.squarespace.com/about/careers/?gh_jid=245517

I actually currently have a DevOps title.

BraveNewCurrency:

I admit that the job description does imply that you will be writing load balancers. But if you look at their technology stack, you'll see that they use a commercial product for balancing and caching. So writing code for a LB/cache might be out of the question.

(I admit this link is from 2009, so maybe it's changed by now. But my point is that you might be reading too much into job descriptions. It's very rare that companies write these things from scratch.)

ldelossa:

No I agree, I think most of the time it will be off the shelf. But even writing the glut code sounds awesome. I think a lot of times custom code will be in lua and manipulating internal structures

professorGroblin:

Docker, Kubernetes, etcd, prometheus, Vitess: all written in go

ldelossa:

I would argue none of these are performant by necessity. Docker is just a wrapper around an underlying containers library. Prometheus is a pull based model. And kubernetes is just a scheduler for the most part. This doesn't strict me the same as a piece of software responsible for pushing bits in different directions as near as real time as possible. But I could be wrong

TheMerovius:

Docker is just a wrapper around an underlying containers library.

Is it? I thought docker issues raw syscalls for container management and ldd docker doesn't give me anything I would identify as a "container library". I might be wrong, but I actually think the "container library" it wraps is go too.

Prometheus is a pull based model.

That's only the collection-part. It is also a performant time series database that need to support interactive live queries over relatively large datasets. It doesn't help at all to have collection, if I then can't use that data interactively in an outage to drill down into the problem, because it is too slow.

And kubernetes is just a scheduler for the most part.

Again, I don't see how this means it doesn't need to be performant. One instance can manage thousands of nodes and tens of thousands of pods while still being performant. That also means a lot of QPS to and from the master itself (after all, consensus over the state of all those pods must be maintained), while still being performant.

This doesn't strict me the same as a piece of software responsible for pushing bits in different directions as near as real time as possible.

What about rend, a memcached-proxy in use at netflix? Scott Mansfield mentioned on gotime that they achieve sub-ms latency on most requests (he is more specific with the percentiles, but I don't remember).

Maybe, instead of handwaving that all the existing infrastructure projects written in go in production at some of the largest internet companies in the world might not be fast enough, you should be more specific about why you think they are slow? I see no reason to doubt that they are more than fast enough.

You asked if go is fast enough. The answer was pretty much unanimously yes, with specific examples of the kind of projects you mentioned, written in go. I'd say the ball is in your court to either accept that answer or prove how they aren't fast enough :)

ldelossa:

I understand what you're saying here. And I think you're correct. I'm coming from a heavy 'network infrastructure' perspective, where I'm worried about the ability to handle packet rates of data. Which I typically see done in C. I'm biasing these questions and examples on my perspective. I think these are all valid points.

My intention was not to downplay go's current projects.

professorGroblin:

OK, perhaps i don't agree with your assessment of what infrastructure software is then. If you mean real-time software,then, essentially thats just software with deadlines for processing events. And if real-time is what you meant, then Vitess absolutely falls into your bracket. I would say that any software that manages the run state of an application (and that definitely includes scheduling, monitoring and logging) is infrastructure software

ldelossa:

I think your definition here is valuable. I haven't actually tried to identity what it IS, and that was well put. Appreciated

exch:

Youtube uses a Go service to do load balancing between its users and mysql databases. It seems to work well enough with millions of users.

ldelossa:

Thank you this is a great example

fakeNAcsgoPlayer:

I am one such platform engineer who write Caches and load balancers and geofencing aggregators and distributors and such stuff in C.

Given a choice I would rewrite a lot if stuff in Go or C++. It is in C because legacy reasons .

Here is what I learnt over a period of time. Measure every god damn thing, do not trust what you read on Internet, it is just a ball park figure, measure it yourself. Every single thing you need, test it for your use case. Only move when you understand the tradeoffs.

Oh and also, performance is subjective and very easily misinterpreted. Beware of it.

ldelossa:

Thank you! This is great to hear from someone with experience.

I understand the problems with performance analysis, I've spent years now tracking down performance issues, understanding the synergy between different components and so on. But I find it highly valuable to still ask and get responses like yours.

TheMerovius:

I think so, but I also think, that the architecture is much more important than the language. You have inherent limits when scaling on a single instance, even if you handroll assembler and run on bare metal to edge out the smallest bit of performance. So, in today's world, being able to scale horizontally to multiple instances is much more important; if you can do that, the actual raw performance is just a constant cost multiplier (e.g. you might need 10% more CPUs or something with Language A than with language B, to handle the same workload). I think go makes it reasonably easy to write these kinds of scalable networked services with competitive enough performance to not increase the costs over C too much.

BTW, if you believe you actually going to need the raw performance, I'd recommend rust a billion times over C, especially if you start something fresh and need to learn the language anyway. It's just as fast (and probably faster, if not now, then soon, due to better guarantees to the compiler), but it's safe and there really is no excuse to write networked software in C these days. More to the point: I assume most of the current infrastructure software is written in C, not because it's best suited, but because there where no alternatives when they started.

ldelossa:

Thanks for the response.

Rust is definitely under consideration but it seems to not have that much proliferation in the tech space. The only guys that I see using it are Mozilla themselves. This doesn't necessarily mean it's not worth wild, but community is a big deal with these things. I realllyyyy like the idea of rust but I'm wondering if I can achieve the same thing with go whether that's a better route to take.

chowmeined:

Dropbox is using rust now for part of their storage backend.

TheMerovius:

"The same thing" probably not (there are a couple of things that work much better in rust, for example interaction with the C world in either direction), but as I said, cloud-infrastructure-thingies probably fare better with go. I'm just saying you definitely shouldn't use C. I know it's a cliche, but I do actually consider C to be harmful :)

ldelossa:

I'd like to avoid C if possible. I'm coming into this with little C experience and I feel the ramp up to write C code that could actually be released into production would be rather long compared to a safe language.

jocull:

Rust also has a mode that lets you strip out the StdLib I believe. As far as I understand (never tried it) this lets you write embedded Rust that wouldn't even need an OS to power it. Complete lack of StdLib would be a real challenge, though. I can't imagine it being a very productive work environment.

MrRichyPants:

I am a big fan of the Caddy webserver, which is in Go. Look at their benchmarks, it's really really fast. Like nearly Nginx fast. I think it's certainly fast enough for systems where the bulk of the waiting is in network communication.

ldelossa:

Great example project! Thanks a ton.

fakeNAcsgoPlayer:

Caddy is nowhere close to nginx. Why do you say so?

MrRichyPants:

I've found it to be in the same performance tier as Nginx, and it looks like others do too. See the Caddy FAQ, near the bottom.

From the FAQ:

How does Caddy perform compared to nginx or Apache?

It's fast. If your site only serves tens of thousands of requests per second on good hardware, then don't worry about it. In our tests—which have been confirmed by multiple trial runs on different hardware, verified independently by different developers—we've found that Caddy and nginx generally perform similarly for most static file needs, a notch above Apache. But this depends on many factors including system environment, hardware capabilities, network, software tuning, etc. We've also seen some tests where Apache performs better than nginx. Given the high degree of variance among these factors combined with your own needs and configurations, we'd rather you discover Caddy's performance for yourself. If you require extremely high throughput and you know what you're doing, do your own benchmarks. If you verify that Caddy is a bottleneck, we welcome your pull request to make it faster.

anacrolix:

I just started https://github.com/anacrolix/goproxy this morning because HAProxy has terrible defaults and was more trouble than it's worth to set up again on a new box. So far it's working great. I also have a lot of custom tools that do all the things typically some from C.

ldelossa:

This is fantastic. I'm going to review this. Looks fun!


入群交流(和以上内容无关):加入Go大咖交流群,或添加微信:liuxiaoyan-s 备注:入群;或加QQ群:692541889

610 次点击  
加入收藏 微博
暂无回复
添加一条新回复 (您需要 登录 后才能回复 没有账号 ?)
  • 请尽量让自己的回复能够对别人有帮助
  • 支持 Markdown 格式, **粗体**、~~删除线~~、`单行代码`
  • 支持 @ 本站用户;支持表情(输入 : 提示),见 Emoji cheat sheet
  • 图片支持拖拽、截图粘贴等方式上传