Hey peeps full disclosure I work as one of Linode's RnD engineers. I want to try...

jamesblonde · on June 20, 2019

If you really want low cost to compute for Deep Learning and you needs lots of compute and don't want to pay for V100s, then the AMD Vega R7 is the card for you. 700 dollars, 16GB Ram, 1TB of GPU bandwidth (higher than the V100!), works with Tensorflow (pip install tensorflow-rocm), and about 60% of the performance on resnet-50.FP64 is not fully gimped (it is halved, i think - so still quite good). Put lots of them in servers with PCI 4.0, and you can do great things. Here's a recent talk on it:

https://www.youtube.com/watch?v=neb1C6JlEXc

microtonal · on June 20, 2019

If you really want low cost to compute for Deep Learning and you needs lots of compute and don't want to pay for V100s, then the AMD Vega R7 is the card for you. 700 dollars, 16GB Ram, 1TB of GPU bandwidth (higher than the V100!), works with Tensorflow (pip install tensorflow-rocm), and about 60% of the performance on resnet-50.FP64 is not fully gimped (it is halved, i think - so still quite good).

Two of my colleagues use high-end AMD GPUs to train RNNs and transformers with tensorflow-rocm. There are still some nasty bugs (e.g. [1]), so it is currently not for everyone. However, given how far they have come compared to 1-2 years ago, it is very likely that in a year or so, they are a real competitor to NVIDIA for compute. That competition was long needed.

[1] https://github.com/ROCmSoftwarePlatform/tensorflow-upstream/...

jamesblonde · on June 20, 2019

Agreed, it is not quite prime-time yet. They are trying to upstream all the ROCm stuff in TensorFlow, and when it gets into mainline and stabilizes, i agree that it has great potential for take-off - particularly from price-sensitive researchers and large companies who need huge GPU farms.

ksec · on June 20, 2019

Two Questions.

I wonder if Google is in any way helping AMD in the TensorFlow and ROCm?

What happen when Intel join the GPU race in 2020. Making their own ROCm again?

chadmeister · on June 20, 2019

This is a terrible suggestion/comparison. AMD has nowhere near the software support in the ML/AI space that Nvidia has. I wish that AMD would invest in a CUDA competitor and break Nvidia's monopoly, but that is not even close to being a reality, unfortunately.

trsohmers · on June 20, 2019

> The difference is 8 more GB of RAM that comes at a steep premium

This is incorrect. The RTX 6000 has 24GB of VRAM and is $4000, and the RTX 8000 has 48GB of VRAM (double the amount) and is $5500. Is it worth the price increase? For a lot of people I know it is.

Also, the RTX Titan is $2500 and is identical to the RTX 6000 (at the chip level) and also with 24GB of VRAM, with the only difference being software enabling of additional H.264/5 encoding features on the Quadro. Definitely not worth the cost increase, especially for anyone doing ML.

IlGrigiore · on June 20, 2019

If you reason as a consumer the RTX Titan makes a lot more sense than the RTX 6000, however datacenters are forbidden by Nvidia to use consumer cards [1], therefore their choice makes sense.

[1]: http://fortune.com/2018/01/07/nvidia-consumer-video-cards/

trsohmers · on June 20, 2019

Except datacenter is not defined by NVIDIA in their EULA at all, and plenty of large and small datacenters continue to use "consumer cards" regardless of NVIDIA's fear mongering. I know that Tesla, OpenAI, Microsoft, Apple, and many others all continue to primarily buy primarily 2080Ti's, RTX Titans, and Titan V's since the EULA change.

PedroBatista · on June 20, 2019

How is that even legal and how nvidia gets away with that type of shit?

_jyog · on June 20, 2019

Companies make unenforceable claims all the time. That's why we've got courts. Theyr'e almost certainly never going to take any one to court, because if they did, it would get tossed out. They can't pull the same "it's a license to a product" bs media services do. Though they still try with the driver. I think for now, they've just run the numbers and figured out it gives them slightly higher datacenter card sales.

tntn · on June 20, 2019

> This is the first time (TMK) that a cloud provider is bringing RT cores into the market.

Your knowledge is incomplete. T4 has been available in google cloud for many months.

Who_me · on June 20, 2019

I stand corrected thank you!

sieabahlpark · on June 20, 2019

Has linode improved their security intrusion and disclosure policy yet?

These are great improvements but are virtually worthless if linode didn't change their behavior.

tomxor · on June 20, 2019

What incident are you referring to? (genuine question)

As far as standards go, we use Linode and all of our customers (some of them quite demanding about internal security details) have been satisfied with the various acronyms they are accredited with... Although I understand this does not necessarily guarantee anything about response behavior, so interested to hear about past incidents.

dillonmckay · on June 20, 2019

There were some compromised accounts via a Coldfusion hack of their admin portal.

Not sure if that was isolated.

There was something more recent, too.

Anyway, happy Linode customer for quite a few years now. My stuff works, no fuss.

_jyog · on June 20, 2019

Any chance you can provide more information? Linode customer as well; slightly concerned.

dillonmckay · on June 20, 2019

Google ‘linode coldfusion’. I think it was over 5 years ago.

tkulick · on June 20, 2019

(Tory from the Linode team here)

We made some improvements to our disclosure / Bug Bounty program last year and launched this on HackerOne. The community and quality of submissions has been great. More information: https://blog.linode.com/2018/05/16/linodes-new-bug-bounty-pr...

We've also been making ongoing improvements to our application security and security infrastructure through the implementation of a DevSecOps culture. This is something we take very seriously.