Open SourceLibraryFree

vLLM

High-throughput local LLM inference and serving engine.

Alternatives

1 workflow use vLLM