LLM Server Benchmark Dashboard

Automated dashboard for tracking and comparing performance metrics between Shortfin LLM Server and SGLang Server.

Overview

This project collects daily performance metrics from two LLM servers:

Shortfin LLM Server with SGLang frontend integration
SGLang's native LLM server (baseline)

Metrics Collected

For each server at varying request rates (1, 2, 4, 8, 16, 32):

Median E2E Latency (ms)
Median TTFT (Time to First Token)
Median ITL (Inter-Token Latency)
Request Throughput (req/s)
Benchmark Duration (s)

Data Collection

Input Format

Data is collected in jsonlines files, named according to the pattern:

{server}_{date}_{request_rate}.jsonl

Example:

shortfin_10_1.jsonl
shortfin_10_2.jsonl
...
sglang_10_1.jsonl
sglang_10_2.jsonl

Collection Frequency

Runs nightly via CI
Data refresh rate: Daily
Initial retention period: 3 months (configurable)

Dashboard Details

Grafana Integration

Integrated with existing Grafana instance
Tracks performance improvements over time
Compares Shortfin vs SGLang server performance

Visualization Goals

Track Shortfin server improvements
Benchmark against SGLang baseline
Identify performance trends and regressions

References

Shortfin LLM with SGLang Documentation
Performance test results can be viewed at the above link

Future Enhancements

Automate data collection
Additional metrics collection
Enhanced visualization options

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
config		config
data_pipeline		data_pipeline
Onboarding_Guide.MD		Onboarding_Guide.MD
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM Server Benchmark Dashboard

Overview

Metrics Collected

Data Collection

Input Format

Collection Frequency

Dashboard Details

Grafana Integration

Visualization Goals

References

Future Enhancements

About

Releases

Packages

Languages

nod-ai/shark-data-sglang

Folders and files

Latest commit

History

Repository files navigation

LLM Server Benchmark Dashboard

Overview

Metrics Collected

Data Collection

Input Format

Collection Frequency

Dashboard Details

Grafana Integration

Visualization Goals

References

Future Enhancements

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages