How about an Arduino style super computer the size of a credit card

The problem with massively parallel computers is you need to have applications that have enormous data sets that you do all of the same processing on each subset of data. You need to be able to break this data down into subparts so that each processing element can process its own subset of data without having to get bogged down because you are waiting for data from other nodes.

Sure, there are are various things that are massively parallel (rendering for instance), but I suspect unless you already have an application that is massively parallel, it isn't the chip for you. When I was between jobs the last time about 4 years ago, Adapteva may have been one of the companies I was looking at, but it wasn't the right fit.