vLLM and High-Performance Inference: Memory Optimization, Parallel Execution, Token Streaming, Scalable Model Serving (Large Language Refinement Inference Series, Band 2)

Preise von
16,90

Hervorgehoben


Beschreibung

Amazon vLLM and High-Performance Inference: Memory Optimization, Parallel Execution, Token Streaming, and Scalable Model Serving (Large Language Model Refinement and Inference Series, Band 2)

Webshops vergleichen (1)

Shop
Preis
Versandkosten
Total price
16,90 
Gratis
16,90 
Zum Shop
Gratis Shipping Costs
Beschreibung (1)

vLLM and High-Performance Inference: Memory Optimization, Parallel Execution, Token Streaming, and Scalable Model Serving (Large Language Model Refinement and Inference Series, Band 2)


Produktspezifikationen

Marken Independently Published
EAN
  • 9798195860981

Preise zuletzt aktualisiert am:

Hervorgehobene Wahl
16,90 
Zum Shop