At the AI Infrastructure Summit on Tuesday, Nvidia announced a new GPU called the Rubin CPX, designed for context windows larger than 1 million tokens. Part of the chip giant’s forthcoming Rubin ...
While no one has figured out how to make money from generative artificial intelligence, that hasn't stopped Google DeepMind from pushing the boundaries of what's possible with a big pile of inference.
Aug 29 (Reuters) - China's Alibaba (9988.HK), opens new tab, has developed a new chip that is more versatile than its older chips and is meant to serve a broader range of AI inference tasks, the Wall ...