F5 recently announced early access to F5 AI Gateway, which is designed to simplify the interaction between applications, APIs, and large language models (LLMs) as enterprises adopt AI. This containerized solution optimizes performance, observability, and protection, all of which help reduce costs. Integrated with F5's product portfolio, AI Gateway gives security and operations teams a seamless path to adopting AI services, improving data output quality and delivering a better user experience.
According to F5's State of AI Application Strategy Report, 75% of enterprises are deploying AI. Like many modern applications, AI services are mainly delivered and consumed through APIs. However, enterprises face additional challenges when designing and scaling AI-enabled applications and services.
For example, operating efficiently requires closely monitoring increasingly important metrics such as GPU compute costs and system response times, while also meeting emerging compliance requirements.
"LLMs are creating new levels of productivity and improving the user experience, but they also require monitoring, deep inspection, and protection against new threats during the inference phase," said Kunal Anand, Chief Innovation Officer at F5. "By meeting these new requirements and integrating with F5's trusted API traffic management solution, we will help customers confidently and efficiently deploy AI-driven applications in an increasingly complex threat environment."
Real-world AI solutions require optimizing requests, responses, and rapid interactions across the entire data ecosystem. F5 AI Gateway reduces costs, mitigates malicious threats, and ensures compliance by observing, optimizing, and protecting a large number of user and automation variables.
F5 AI Gateway is designed to meet customers and their applications wherever they are in their AI journey. It can be deployed in any cloud or data center and integrates natively with the F5 NGINX and BIG-IP platforms, taking full advantage of F5's leading application security and delivery services in traditional, multi-cloud, or edge deployments. In addition, the solution's open extensibility lets enterprises develop and customize the programmable security and control processes that F5 AI Gateway executes. These processes can be easily updated and dynamically applied, so that changes to security policies and compliance requirements take effect immediately.
Shari Lava, senior director of artificial intelligence and automation at IDC, said, "AI-driven applications will become the cornerstone of almost all enterprises and organizations in the coming years. F5's introduction of AI Gateway in its application stack service provides customers with greater flexibility in building AI application architectures while achieving stronger protection and model optimization."
F5 AI Gateway can bring the following value to users:
· Enforce security and compliance policies through automatic detection and remediation, addressing the risks identified in the OWASP Top 10 for LLM Applications;
· Use semantic caching to offload repetitive requests from LLMs, improving user experience and reducing operating costs;
· Simplify integration so that developers can focus on building AI-driven applications instead of managing complex infrastructure;
· Optimize load balancing, traffic routing, and rate limiting for local and third-party LLMs to maintain service availability and improve performance;
· Provide a single API interface through which developers can access the AI model of their choice.
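To illustrate the semantic caching idea mentioned in the list above, here is a minimal Python sketch. It is not F5's implementation and does not use any F5 API: the names `SemanticCache`, `embed`, `answer`, and `call_llm`, and the 0.8 similarity threshold, are all hypothetical. The toy bag-of-words "embedding" stands in for the real embedding model that a production gateway would presumably use. The point is only to show how a prompt that closely matches an earlier one can be answered from cache without another LLM call.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy embedding (assumption): lowercase bag-of-words token counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse token-count vectors."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class SemanticCache:
    """Hypothetical cache that matches prompts by similarity, not exact text."""

    def __init__(self, threshold: float = 0.8):
        self.threshold = threshold  # minimum similarity that counts as a hit
        self.entries = []           # list of (embedding, response) pairs

    def lookup(self, prompt: str):
        """Return the cached response for the most similar prompt, or None."""
        query = embed(prompt)
        best_score, best_response = 0.0, None
        for vector, response in self.entries:
            score = cosine(query, vector)
            if score > best_score:
                best_score, best_response = score, response
        return best_response if best_score >= self.threshold else None

    def store(self, prompt: str, response: str) -> None:
        self.entries.append((embed(prompt), response))


def answer(prompt: str, cache: SemanticCache, call_llm) -> str:
    """Serve from cache when possible; otherwise call the model and cache it."""
    cached = cache.lookup(prompt)
    if cached is not None:
        return cached
    response = call_llm(prompt)  # call_llm is a placeholder for any LLM client
    cache.store(prompt, response)
    return response
```

In this sketch, a second prompt that differs from an earlier one only in casing or punctuation is served from the cache, so the placeholder `call_llm` is invoked once rather than twice. The threshold governs the trade-off: a lower value saves more model calls but raises the chance of returning an answer to a question that was only superficially similar.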
“F5’s AI Gateway is an integral part of our AI strategy. With this technology, our customers can develop internal and external AI applications that can handle a large number of queries without degrading site and application performance,” said Austin Geraci, CTO of WorldTech IT. “F5’s leading application security and delivery capabilities accelerate large-scale AI experiences. By adopting the F5 AI Gateway, semantic caching and intelligent traffic routing alone can significantly reduce costs, and the unification of F5 services also saves customers hundreds of hours of integration work.”