diagnose_sglang
Validate your SGLang configuration for NVIDIA DGX Spark by matching against documented failure modes. Get critical issues, non-fatal warnings, and a recommended baseline config.
Instructions
Validate an SGLang configuration for NVIDIA DGX Spark (GB10/SM121A).
Pure pattern-matching against known failure modes documented in the Sovereign AI Blog. No inference, no external calls. Returns critical issues, non-fatal warnings, and a recommended baseline config.
All parameters are optional; supply only what you have. With no inputs you get the recommended config and an 'unknown' verdict.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| attention_backend | No | SGLang --attention-backend value (e.g. 'flashinfer', 'triton'). Empty string = skip this check. | |
| mem_fraction | No | SGLang --mem-fraction-static value (e.g. 0.88). 0.0 = skip this check. | |
| cuda_graph_max_bs | No | SGLang --cuda-graph-max-bs value. 0 = skip this check. | |
| image_tag | No | Docker image tag in use (e.g. 'lmsysorg/sglang:latest', 'lmsysorg/sglang:v0.4.0'). Empty = skip. | |
| hardware | No | Hardware description (e.g. 'GB10', 'DGX Spark', 'SM121A'). Empty = skip GB10-specific rules. | |
| error_message | No | Paste error log output here for pattern matching against known failure modes. | |
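
Because each parameter has a "skip" value, it can help to drop skipped parameters before calling the tool. The following is a minimal sketch, not part of the tool itself. The helper `build_arguments` and the `SKIP_VALUES` table are hypothetical names, and treating an empty `error_message` as "skip" is an assumption, since the schema does not state a skip value for that field.

```python
from typing import Any

# Values the input schema documents as "skip this check".
# The empty-string entry for error_message is an assumption.
SKIP_VALUES: dict[str, Any] = {
    "attention_backend": "",
    "mem_fraction": 0.0,
    "cuda_graph_max_bs": 0,
    "image_tag": "",
    "hardware": "",
    "error_message": "",
}


def build_arguments(**params: Any) -> dict[str, Any]:
    """Return only the parameters that carry a real (non-skip) value."""
    unknown = set(params) - set(SKIP_VALUES)
    if unknown:
        raise ValueError(f"unknown parameter(s): {sorted(unknown)}")
    return {
        name: value
        for name, value in params.items()
        if value is not None and value != SKIP_VALUES[name]
    }


# Example values are taken from the schema descriptions above.
args = build_arguments(
    attention_backend="triton",
    mem_fraction=0.88,
    cuda_graph_max_bs=0,  # 0 means "skip", so this key is dropped
    image_tag="lmsysorg/sglang:latest",
    hardware="GB10",
)
print(args)
```

Calling `build_arguments()` with no parameters returns an empty dict, which corresponds to the no-input case that yields the recommended config and an 'unknown' verdict.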
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| issues | Yes | Critical issues that will prevent SGLang from running correctly | |
| warnings | Yes | Non-fatal warnings (suboptimal but non-blocking) | |
| recommended_config | Yes | Verified-good baseline config for GB10/SM121A | |
| verdict | Yes | Overall verdict. 'unknown' = no inputs provided. | |
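
A caller might summarize the four output fields as sketched below. This is an illustration under assumptions: the schema does not specify field types, so `issues` and `warnings` are assumed to be lists of strings and `recommended_config` a mapping. The `summarize` helper is hypothetical, and the example result is a placeholder with empty values, not real tool output.

```python
from typing import Any

REQUIRED_FIELDS = ("issues", "warnings", "recommended_config", "verdict")


def summarize(result: dict[str, Any]) -> str:
    """Render a diagnose_sglang result as a short plain-text report."""
    missing = [name for name in REQUIRED_FIELDS if name not in result]
    if missing:
        raise KeyError(f"result is missing required field(s): {missing}")

    lines = [f"verdict: {result['verdict']}"]
    if result["verdict"] == "unknown":
        # Per the schema, 'unknown' means no inputs were provided.
        lines.append("no inputs provided; only the baseline config applies")
    # Assumed shape: each issue or warning is a string.
    lines += [f"CRITICAL: {item}" for item in result["issues"]]
    lines += [f"warning: {item}" for item in result["warnings"]]
    # Assumed shape: recommended_config is a mapping of setting to value.
    keys = sorted(result["recommended_config"])
    lines.append(f"recommended_config keys: {keys}")
    return "\n".join(lines)


# Placeholder result for the no-input case; real contents will differ.
example = {
    "issues": [],
    "warnings": [],
    "recommended_config": {},
    "verdict": "unknown",
}
report = summarize(example)
print(report)
```

Critical issues come first in the report because the schema describes them as blocking, while warnings are described as suboptimal but non-blocking.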