LLM Inference Optimization

LLM Data Mixture Breaks When Training Pools Shift: Causal Inference Offers Fix

LLM training data mixture optimization breaks when training pools shift — every prior proxy experiment becomes stale.

OpenAI and Broadcom unveil 'Jalapeño' Intelligence Processor for LLM inference

"A blank-slate design for modern LLM inference, not a general-purpose accelerator adapted from earlier AI workloads" ...

From ChatGPT to Chips: OpenAI Unveils Jalapeño to Power Faster LLMs and More Affordable AI

OpenAI and Broadcom unveiled Jalapeño, a custom AI inference chip designed for LLMs, promising higher efficiency, lower costs ...

VentureBeat

New LLM optimization technique slashes memory costs up to 75%

Researchers at the Tokyo-based startup Sakana AI have developed a new technique that enables language models to use memory more efficiently, helping enterprises cut the costs of building applications ...

OpenAI, Broadcom (AVGO) Unveil “Jalapeño” AI Accelerator for Enhanced LLM Inference

Broadcom Inc. (NASDAQ:AVGO) is one of the best stocks for beginners to buy now. On June 24, OpenAI and Broadcom introduced ...

Electronics For You

Processor Designed to Run AI Models Faster

A chip built for AI models is already running AI workloads. Could it change how AI services handle speed, cost, and demand?

Tweakers

Senior LLM Inference Engineer

Senior LLM Inference Engineer. Netherlands - Amsterdam. PDT - Data Science & AI / 1. Role: Permanent / Hybrid. apply for this job. Join our AI team at Prosus, the largest cons ...

Business Wire

MangoBoost Launches Mango LLMBoost™: AI Inference Optimization Software with Up to 12.6x Relative Performance Improvement and 92% Cost Savings

BELLEVUE, Wash.--(BUSINESS WIRE)--MangoBoost, a provider of cutting-edge system solutions designed to maximize AI data center efficiency, is announcing the launch of Mango LLMBoost™, system ...

Tech Times

AI Inference and World Model Startups Pull $1.8B in Two Days as Foundation Models Commoditize

AI inference infrastructure investment pulled $1.8 billion in 48 hours as Baseten’s $1.5B round at a $13B valuation and ...

OpenAI and Broadcom unveil Jalapeño Intelligence Processor for LLM workloads

OpenAI and Broadcom have announced Jalapeño, OpenAI’s first Intelligence Processor. The AI accelerator is designed around ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results