News
LLM-D Serving for AMD Instinct GPUs on OCI
2+ day, 18+ hour ago (633+ words) All decode benchmarks are using v LLM's Decode Bench Connector, a KVConnector that fills the KV cache while skipping the prefill, allowing us to accurately take into account the cost of attention while running pure decode workloads. Testing was performed…...
Deploying Hermes Agent for Free on AMD Developer Cloud with open models and v LLM
3+ day, 16+ hour ago (1076+ words) The AI agent landscape is evolving fast. Coding copilots and chatbot wrappers have their place, but a new class of agent is emerging " one that doesn't just follow instructions but learns from experience. Hermes Agent'built by Nous Research, represents a…...
More Isn't Always Better: Rethinking Memory Choices for Modern Workloads
11+ mon, 2+ week ago (814+ words) What cloud taught us about memory-per-core " and how AMD EPYC" CPUs bring that flexibility to your data center. Enterprises securing servers for data center operations may approach memory with a simple mindset: more is better. But as workloads grow more…...
AMD Announces More Than $10 Billion in Taiwan Ecosystem Investments to Accelerate AI Infrastructure
3+ day, 11+ hour ago (538+ words) SANTA CLARA, Calif. , May 21, 2026 (GLOBE NEWSWIRE) -- To meet the growing demand for AI infrastructure, AMD (NASDAQ: AMD) today announced more than $10 billion in investments across the Taiwan ecosystem to expand strategic partnerships and scale advanced packaging manufacturing for next-generation AI…...
AMD Announces Production Ramp of Next-Generation AMD EPYC Processor "Venice" on TSMC 2nm Process Technology
3+ day, 19+ hour ago (482+ words) AMD AMD Announces Production Ramp of Next-Generation AMD EPYC Processor "Venice" on TSMC 2nm Process Technology - AMD has begun production ramp of its 6th Gen AMD EPYC" CPUs, codenamed "Venice," marking a major milestone for the AMD and TSMC collaboration on 2nm technology…...
Agent Computers: Pay Once for Cloud-Grade Intelligence
4+ day, 47+ min ago (1152+ words) AI has changed what a single person can do. A developer can build faster. A creator can generate video, music, images, and 3 D assets in hours instead of weeks. A business can automate research, support, analysis, and repetitive workflows with…...
Accelerating LLM Startup on AMD Ryzen AI with Two-Phase Custom Op Initialization
4+ day, 3+ hour ago (757+ words) Understanding the bottleneck requires looking at what happens inside each operator's constructor during model loading. In the original implementation, a single operator constructor performed both host-side work (reading model attributes and weight tensors from the ONNX graph) and device-side work…...
AMD Powers Next-Generation Agent Computers with New Ryzen AI Halo Developer Platform and Ryzen AI Max PRO 400 Series Processors
4+ day, 12+ hour ago (591+ words) Looks like you have no items in your shopping cart. AI is rapidly shifting from cloud-based systems to where work happens, with the PC evolving into both an interface for AI interaction and a local execution layer for real-time tasks....
Rubrik Boosts AI-Enhanced Cyber Resilience with AMD
1+ week, 5+ day ago (283+ words) Rubrik, the Security and AI Operations company, boosted cybersecurity with AMD EPYC" CPUs, gaining performance, cost savings and AI efficiency for next-gen protection. "We also saw an opportunity with AI," says Nithrakashyap. "A lot of customers struggle with collecting all…...
Cloud Bridge Drives AWS Cloud Cost Savings at Scale with AMD
1+ week, 2+ day ago (319+ words) Cloud Bridge Case Study Cloud Bridge deployed AWS instances powered by AMD EPYC" Server CPUs to cut costs and boost performance, delivering 30% savings with minimal effort for clients. Cloud Bridge then helped customers leverage the funding available to reduce migration…...