Nvidia, which reported an 89 per cent rise in profit, dominates the AI training chip market.
San Francisco - One of biggest growth areas in field will be deploying computers with chips that implement 'learned' tasks
Published: Fri 17 Aug 2018, 8:37 PM
Updated: Fri 17 Aug 2018, 10:39 PM
Nvidia dominates chips for training computers to think like humans, but it faces an entrenched competitor in a major avenue for expansion in the artificial intelligence chip market: Intel.
Nvidia chips dominate the AI training chip market, where huge amounts of data help algorithms "learn" a task such how to recognise a human voice, but one of the biggest growth areas in the field will be deploying computers that implement the "learned" tasks. Intel dominates data centres where such tasks are likely to be carried out.
"For the next 18 to 24 months, it's very hard to envision anyone challenging Nvidia on training," said Jon Bathgate, analyst and tech sector co-lead at Janus Henderson Investors.
But Intel processors are already widely used for taking a trained artificial intelligence algorithm and putting it to use, for example by scanning incoming audio and translating that into text-based requests, what is called "inference".
Intel's chips can still work just fine there, especially when paired with huge amounts of memory, said Bruno Fernandez-Ruiz, chief technology officer of Nexar, a startup using smartphone cameras to try to prevent car collisions.
That market could be bigger than the training market, said Abhinav Davuluri, an analyst at Morningstar, who sees an inference market of $11.8 billion by 2021, versus $8.2 billion for training. Intel estimates that the current market for AI chips is about $2.5 billion, evenly split between inference and training.
Nvidia, which posted an 89 per cent rise in profit, hasn't given a specific estimate for the inference chip market but CEO Jensen Huang said on an earnings call with analysts that believes it "is going to be a very large market for us."
Nvidia sales of inference chips are rising. In May, the company said it had doubled its shipments of them year-over-year to big data center customers, though it didn't give a baseline.
Earlier this month, Alphabet's Google Cloud unit said it had adopted Nvidia's inference chips and would rent them out to customers.
But Nvidia faces a headwind selling inference chips because the data centre market is blanketed with the CPUs Intel has been selling for 20 years. Intel is working to persuade customers that for both technical and business reasons, they should stick with what they have.
Take Taboola, a New York-based company that helps web publishers recommend related content to readers and that Intel has touted as an example of how its chips remain competitive. The firm uses Nvidia's training chips to teach its algorithm to learn what to recommend and considered Nvidia's inference chips to make the recommendations. Speed matters because users leave slow-loading pages.
But Taboola ended up sticking with Intel for reasons of speed and cost, said Ariel Pisetzky, the company's vice-president of information technology.
Nvidia's chip was far faster, but time spent shuffling data back and forth to the chip negated the gains, Pisetzky said. Second, Intel dispatched engineers to help Taboola tweak its computer code so that the same servers could handle more than twice as many requests.
"My options were, you already have the servers. They're in the racks," he said. "Working with Intel, I can now cut back my new server purchases by a factor of two because I can use my existing servers two times better."
Nvidia has been working to solve those challenges. The company has rolled out software this year to make its inference chips far faster to help overcome the issue of moving data back and forth. And it announced a new family of chips based on a technology called Turing earlier this week that it says will be 10 times faster still, Huang said on the analyst call on Thursday.
"We are actively working with just about every single Internet service provider in the world to incorporate inference acceleration into their stack," Huang said. He gave the example that "voice recognition is only useful if it responds in a relatively short period of time. And our platform is just really, really excellent for that."