Fast, Cheap, and Accurate: Optimizing LLM Inference with vLLM and Quantization (Conference session, Intermediate) - Legare Kerrison, Red Hat
Scaling AI on Hybrid Cloud for Production LLM Inference at Scale (Conference session, Intermediate) - Roberto Carratala, Red Hat