Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Burstgpt provides workload traces for ChatGPT (GPT-3.5) and GPT-4, released by HPMLL to facilitate the optimization of Large Language Model (LLM) serving systems. The data captures request patterns and arrival characteristics from production-scale models as of early 2024. It is designed to help researchers model the 'bursty' nature of inference traffic in high-performance computing environments.
Users should check the repository for specific trace formats and ensure compatibility with simulators like vLLM or DeepSpeed-MII before integration.