Skip to content
@sgl-project

sgl-project

Pinned Loading

  1. sglang sglang Public

    SGLang is a high-performance serving framework for large language models and multimodal models.

    Python 22.3k 4k

  2. sgl-learning-materials sgl-learning-materials Public

    Materials for learning SGLang

    716 54

  3. ome ome Public

    Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, TensorRT-LLM, and Triton

    Go 356 53

  4. genai-bench genai-bench Public

    Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.

    Python 251 45

  5. SpecForge SpecForge Public

    Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

    Python 620 132

  6. sglang-jax sglang-jax Public

    JAX backend for SGL

    Python 213 60

Repositories

Showing 10 of 19 repositories
  • sglang-jax Public

    JAX backend for SGL

    sgl-project/sglang-jax’s past year of commit activity
    Python 213 Apache-2.0 60 78 (6 issues need help) 24 Updated Jan 12, 2026
  • SpecForge Public

    Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

    sgl-project/SpecForge’s past year of commit activity
    Python 620 MIT 132 50 (1 issue needs help) 20 Updated Jan 12, 2026
  • sgl-project.github.io Public

    This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang

    sgl-project/sgl-project.github.io’s past year of commit activity
    HTML 96 25 9 1 Updated Jan 12, 2026
  • sglang Public

    SGLang is a high-performance serving framework for large language models and multimodal models.

    sgl-project/sglang’s past year of commit activity
    Python 22,272 Apache-2.0 4,021 652 (29 issues need help) 1,263 Updated Jan 12, 2026
  • whl Public

    Kernel Library Wheel for SGLang

    sgl-project/whl’s past year of commit activity
    HTML 17 MIT 5 1 0 Updated Jan 12, 2026
  • genai-bench Public

    Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.

    sgl-project/genai-bench’s past year of commit activity
    Python 251 MIT 45 4 10 Updated Jan 11, 2026
  • ome Public

    Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, TensorRT-LLM, and Triton

    sgl-project/ome’s past year of commit activity
    Go 356 Apache-2.0 53 32 (2 issues need help) 37 Updated Jan 11, 2026
  • sgl-flash-attn Public Forked from Dao-AILab/flash-attention

    Fast and memory-efficient exact attention

    sgl-project/sgl-flash-attn’s past year of commit activity
    Python 15 BSD-3-Clause 2,287 0 1 Updated Jan 11, 2026
  • sgl-kernel-npu Public

    SGLang kernel library for NPU

    sgl-project/sgl-kernel-npu’s past year of commit activity
    C++ 93 MIT 70 13 34 Updated Jan 9, 2026
  • rbg Public

    A workload for deploying LLM inference services on Kubernetes

    sgl-project/rbg’s past year of commit activity
    Go 153 Apache-2.0 41 17 (1 issue needs help) 11 Updated Jan 9, 2026