FriendliAI, The Frontier AI Inference Cloud, is collaborating with Samsung SDS, a leading GPU infrastructure-as-a-service ...
In the next phase of the AI megatrend, inference will be the big focus, and Arm Holdings is poised to win big from that shift ...
Artificial intelligence infrastructure startup Parasail Inc. today announced that it has raised $32 million in funding.
Google Research unveiled TurboQuant, a novel quantization algorithm that compresses large language models’ Key-Value caches ...
Broadcom (AVGO) stock rallies 21% on major AI chip deals with Meta and Google. Analysts set $464 target as shares near ...
Novita AI and Hugging Face announced a strategic partnership to bring affordable, reliable inference for the latest AI models to over five million developers on Hugging Face. Notably, inference on ...
This company designs chips ideal for AI inference tasks, which explains the outstanding growth in its revenue and earnings.
A food fight erupted at the AI HW Summit earlier this year, where three companies all claimed to offer the fastest AI processing. All were faster than GPUs. Now Cerebras has claimed insanely fast AI ...
SambaNova and Intel have launched an inference architecture to support agentic AI workloads. The offering will combine GPUs, ...
DigitalOcean maintains a Buy rating with a new $105 price target, driven by a strategic pivot toward usage-based AI inference ...