SMG gRPC servicer implementations for LLM inference engines (vLLM, SGLang)
Meta
Author: Chang Su, Simo Lin
Requires Python: >=3.10
Classifiers
Programming Language
- Python :: 3
- Python :: 3.10
- Python :: 3.11
- Python :: 3.12
- Python :: 3.13
License
- OSI Approved :: Apache Software License
Operating System
- OS Independent
smg-grpc-servicer
gRPC servicer implementations for LLM inference engines. Supports vLLM and SGLang.
Installation
For vLLM:
pip install smg-grpc-servicer[vllm]
For SGLang:
pip install smg-grpc-servicer[sglang]
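After installing one of the extras, it can be useful to confirm which backend is actually importable in the current environment. The following is a minimal sketch using only the standard library. The helper name `installed_backends` and the `BACKEND_MODULES` mapping are illustrative assumptions and are not part of smg-grpc-servicer; the sketch also assumes the backends are importable under the top-level module names `vllm` and `sglang`.

```python
from importlib.util import find_spec

# Assumed top-level import names for each backend (hypothetical mapping,
# not taken from the smg-grpc-servicer source).
BACKEND_MODULES = {"vllm": "vllm", "sglang": "sglang"}


def installed_backends(modules=BACKEND_MODULES):
    """Return the backend names whose module can be located, without importing it."""
    return [name for name, module in modules.items() if find_spec(module) is not None]


if __name__ == "__main__":
    found = installed_backends()
    print("Installed backends:", ", ".join(found) if found else "none")
```

`find_spec` only locates the module and does not execute it, so the check stays cheap even when a heavy backend is installed.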
Usage
vLLM
vllm serve meta-llama/Llama-2-7b-hf --grpc
SGLang
sglang serve --model-path meta-llama/Llama-2-7b-hf --grpc-mode
Architecture
smg-grpc-servicer[vllm] ──optional dep──> vllm (lazy import)
smg-grpc-servicer[sglang] ──optional dep──> sglang (lazy import)
smg-grpc-servicer ──depends on──> smg-grpc-proto (hard dependency)
vllm ──optional──> smg-grpc-servicer (via vllm serve --grpc)
sglang ──optional──> smg-grpc-servicer (via sglang serve --grpc-mode)
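Each backend is declared as an optional extra and imported lazily, so installing one backend does not pull in the other. This isolation avoids dependency conflicts between vLLM and SGLang.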
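The lazy-import arrangement in the diagram above can be sketched as follows. This illustrates the general pattern only and is not the package's actual code: the function `load_backend` and the `_BACKENDS` table are hypothetical names introduced for the example.

```python
import importlib

# Hypothetical table: backend name -> (module to import, pip requirement to suggest).
_BACKENDS = {
    "vllm": ("vllm", "smg-grpc-servicer[vllm]"),
    "sglang": ("sglang", "smg-grpc-servicer[sglang]"),
}


def load_backend(name, backends=_BACKENDS):
    """Import a backend module on first use instead of at package import time."""
    try:
        module_name, requirement = backends[name]
    except KeyError:
        raise ValueError(
            f"unknown backend {name!r}; expected one of {sorted(backends)}"
        ) from None
    try:
        # The import happens here, so an uninstalled backend costs nothing
        # until someone actually asks for it.
        return importlib.import_module(module_name)
    except ImportError as exc:
        raise ImportError(
            f"backend {name!r} is not installed; try: pip install '{requirement}'"
        ) from exc
```

Deferring the import to call time means the base package can be imported with neither backend present, and a missing backend surfaces as an error that names the extra to install.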
Development
See DEVELOPMENT.md for local development setup, CI, and release workflows.