• About Us
  • Advertising
  • Digital Magazine
  • Supplements
  • Media Pack
  • Privacy Policy
  • Contact us
CXO Insight Middle East
  • News
  • Opinion
  • Business
    • Industries
      • Transport
      • Retail
      • Government
      • Real Estate
      • Education
      • Energy
      • Banking and Finance
    • Channel
  • Future
    • Tech
    • Gadgets
    • Science
    • Space
    • Sustainability
  • Events
    • Channel Awards
      • 2025
      • 2024
      • 2023
    • Channel Insights Summit 2025
    • Insight Innovation Summit
    • CXO50 Oman
    • CXO50
      • 2026
      • 2025
    • ICT Awards
      • Dubai 2025
      • Saudi Arabia
    • Cyber Strategists Summit
    • Cloud Connect 2025
    • All events
  • Digital Magazine
  • GITEX x AI EVERYTHING
No Result
View All Result
CXO Insight Middle East
  • News
  • Opinion
  • Business
    • Industries
      • Transport
      • Retail
      • Government
      • Real Estate
      • Education
      • Energy
      • Banking and Finance
    • Channel
  • Future
    • Tech
    • Gadgets
    • Science
    • Space
    • Sustainability
  • Events
    • Channel Awards
      • 2025
      • 2024
      • 2023
    • Channel Insights Summit 2025
    • Insight Innovation Summit
    • CXO50 Oman
    • CXO50
      • 2026
      • 2025
    • ICT Awards
      • Dubai 2025
      • Saudi Arabia
    • Cyber Strategists Summit
    • Cloud Connect 2025
    • All events
  • Digital Magazine
  • GITEX x AI EVERYTHING
No Result
View All Result
CXO Insight Middle East
No Result
View All Result

F5 and NVIDIA advance AI factory economics with new capabilities for accelerated AI inference

by CXO Staff
March 25, 2026
in Business, Channel, News

F5 announced expanded capabilities in its ongoing collaboration with NVIDIA to accelerate and optimise AI inference infrastructures

F5 and NVIDIA advance AI factory economics with new capabilities for accelerated AI inference

F5 announced expanded capabilities in its ongoing collaboration with NVIDIA to accelerate and optimise AI inference infrastructures.

The expanded integration combines F5 BIG-IP Next for Kubernetes with NVIDIA BlueField-3 DPUs, creating an intelligent, telemetry-aware infrastructure layer that increases token throughput with better GPU utilisation, reduces latency, and enables secure multi-tenant AI platforms at scale.

In AI systems, tokens represent the measurable unit of AI output—the words, symbols, or data fragments generated and processed during inference. The volume and velocity of token production ultimately determine user experience, infrastructure efficiency, and revenue per accelerator.

As enterprises and GPUaaS providers race to monetise AI and move from AI experimentation to revenue-generating services, infrastructure efficiency has become a defining metric. Success is increasingly measured not simply by deployed GPU capacity, but by token economics, sustained token throughput, time to first token (TTFT), cost per token, and revenue per GPU accelerator. The F5 and NVIDIA joint solution is designed to directly address these metrics.

Optimising tokenomics through intelligent AI infrastructure

The shift from application-centric inference to agent-driven AI workflows demands new architectural approaches to optimise token throughput and reduce costs. BIG-IP Next for Kubernetes now leverages NVIDIA NIM statistics, Dynamo runtime signals, and GPU telemetry to make inference-aware routing decisions before execution. By matching workloads to the most appropriate accelerators in real time, the solution increases sustained utilisation while reducing latency and re-compute.

“AI infrastructure is no longer just about access to GPU or scaling their deployments. It has evolved into maximising economic output per accelerator,” said Kunal Anand, Chief Product Officer, F5. “Together with NVIDIA, we are enabling AI factories to treat token production as a measurable business metric. BIG-IP Next for Kubernetes provides the intelligence and governance required to increase GPU yield, reduce cost per token, and scale shared AI platforms confidently.”

Validated infrastructure efficiency: A structural uplift

The performance numbers speak for themselves. In testing validated by The Tolly Group, BIG-IP Next for Kubernetes, accelerated by NVIDIA BlueField-3 DPUs, delivered up to a 40% increase in token throughput, a 61% faster time to first token (TTFT), and a 34% reduction in overall request latency.

These are not incremental gains. By offloading networking, TLS/encryption, AI-aware load balancing, and traffic management to NVIDIA BlueField-3 DPUs, BIG-IP Next for Kubernetes preserves host CPU capacity and frees GPUs to do what they were built for: sustained, high-throughput inference at scale. The result is improved GPU utilisation, reduced queuing delays, and increased token yield—enabling lower cost per token within a fixed infrastructure footprint. Critically, no model modifications were required, making these gains immediately deployable across existing AI factory infrastructure. For enterprises and NeoCloud providers competing on token economics, this is the difference between infrastructure that constrains AI output and infrastructure that accelerates it.

“NVIDIA’s accelerated computing infrastructure coupled with F5’s AI-aware Application Delivery and Security Platform unlocks superior AI factory tokenomics—delivering scalable and cost-effective inference without making any changes to the models,” said Kevin Deierling, SVP, Networking, NVIDIA. “Together, F5 and NVIDIA are empowering enterprises to scale AI factory inference efficiently and economically.”

Built for agent-driven AI and multi-tenant AI platforms

Modern AI workloads are increasingly agent-driven, persistent, and context-aware. They demand intelligent traffic control that traditional load balancing cannot provide. The enhanced BIG-IP Next for Kubernetes solution can now support:

  • Inference-aware routing for agentic AI workflows
  • Integration with NVIDIA DOCA Platform Framework (DPF) to simplify NVIDIA BlueField DPU deployment and lifecycle management
  • EVPN-VXLAN with dynamic VRFs for secure network-level multi-tenancy
  • Integrated security, token governance, and observability within Kubernetes AI environments

These capabilities enable enterprises and NeoCloud providers to securely share GPU infrastructure across business units or external customers while preserving performance isolation and predictable service levels.

A control plane for AI factory economics

F5 and NVIDIA provide enterprises with validated tools and best practices to optimise inference architecture. With these advancements, BIG-IP Next for Kubernetes is positioned to become a strategic control plane for AI factory economics, governing token consumption, optimising traffic flows, and maximising infrastructure return on investment.

Rather than overprovisioning to compensate for inefficiencies, organisations can now extract greater economic value from every GPU already in production. The result is improved revenue per GPU, lower operational overhead, and scalable AI services built for sustained growth. By combining NVIDIA’s infrastructure telemetry and DPU acceleration with F5’s traffic intelligence and security capabilities, the companies are helping enterprises transform AI factories into efficient, monetisable platforms ready for the agentic era.

Tags: AI factoryAI inferenceF5NVIDIA
ShareTweet

Related Posts

Snowflake targets productivity gap with new AI platform
Future

Snowflake targets productivity gap with new AI platform

March 25, 2026

Snowflake announced the research preview of Project SnowWork, a new autonomous enterprise AI platform designed to help business users massively...

Only 7% of enterprises are AI-ready, says Cloudera & Harvard
Future

Only 7% of enterprises are AI-ready, says Cloudera & Harvard

March 25, 2026

Cloudera announced findings from a new global study conducted by Harvard Business Review Analytic Services with Cloudera, revealing that while...

Discussion about this post

Latest Issue

Snowflake targets productivity gap with new AI platform

Snowflake targets productivity gap with new AI platform

March 25, 2026
Only 7% of enterprises are AI-ready, says Cloudera & Harvard

Only 7% of enterprises are AI-ready, says Cloudera & Harvard

March 25, 2026
Kaspersky shares tips for updating your digital habits for an AI-driven world

Kaspersky shares tips for updating your digital habits for an AI-driven world

March 25, 2026

The most trusted source of strategic intelligence for IT decision makers in the Middle East.

About

  • About Us
  • Advertising
  • Digital Magazine
  • Supplements
  • Media Pack
  • Contact Us

Policies

  • Privacy Policy
© 2025 – CXO Insight Middle East. All Rights Reserved.
Facebook-f X-twitter Linkedin
Separated they live in Bookmarksgrove right at the coast of the Semantics, a large language ocean. A small river named Duden.

About

  • About Us
  • Site Map
  • Contact Us
  • Career

Policies

  • Help Center
  • Privacy Policy
  • Cookie Setting
  • Term Of Use

Join Our Newsletter

© 2024 – CXO Insight Middle East. All Rights Reserved.

Facebook-f Twitter Youtube Instagram

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
Join our mailing list
Sign up here to get the latest news, updates and special offers delivered directly to your inbox.
No Result
View All Result
  • News
  • Opinions
  • Business
    • Industries
      • Transport
      • Retail
      • Government
      • Real Estate
      • Education
      • Energy
      • Banking and Finance
  • Channel
  • Future
    • Tech
    • Gadgets
    • Science
    • Space
    • Sustainability
  • Events
    • Channel Awards
      • 2025
      • 2024
      • 2023
    • Channel Insights Summit 2025
    • Insight Innovation Summit
    • CX50 Oman
    • CXO50
      • 2026
      • 2025
    • ICT Awards
      • Dubai
      • Saudi Arabia
    • Cyber Strategists Summit
    • Cloud Connect 2025
    • All events
  • Videos
  • GITEX x AI Everything
  • Digital Magazine

© 2025 - CXO Insight Middle East. All Rights Reserved.