A new kind of Progress Bar, with real-time throughput, ETA, and very cool animations!
-
Updated
May 24, 2026 - Python
A new kind of Progress Bar, with real-time throughput, ETA, and very cool animations!
LLM Serving Performance Evaluation Harness
An intelligent tuner for vLLM that automatically monitors GPU metrics, uses Bayesian optimization to tune parameters
【一个小工具】用最短的代码完成对模型的分析,包含 ImageNet Val、FLOPs、Params、Throuthput、CAM 等
ROS Network Analysis Package: This is a ROS package that provide tools to analyze the wireless network such as the signal quality, latency, throughput, link utilization, connection rates, error metrics, etc., between two ROS nodes/computers/machines.
Fast and reliable distributed systems in Python
RPC protocol based on kafka. Horizontally scalable, fault-tolerant, wicked fast, just like kafka.
High performance functions to work with the async IO.
A python script to parse through ns2 tracefiles, calculate the throughput and plot the throughputs against packetsizes using gnuplot.
LLM inference benchmarking toolkit. Measure TTFT, inter-token latency, throughput, and P50–P99 across concurrency levels.
The Kafka Partition Count Recommender [Multithreading] tool analyzes historical topic consumption, identifies peak throughput over # of days, scales it for future demand, and translates it into optimal partition counts—delivering automated, data-driven topic sizing that ensures performance and scalability.
evaluate llm's generation speed via API
Software that calculates and plot Throughput, Delay and other metrics from a tcpdump script.
An implementation of Speculative RAG exploring latency-quality trade-offs in multi-draft retrieval. Features batched parallel drafting via vLLM and log-probability verifier selection for fast, high-quality QA on a single A100 GPU.
A script for benchmarking ZFS and otherwise for easy graphing with fio-plot
Evaluate latency and throughput of graphql federation vs grpc.
Track your machine's network throughput with a real time graph displaying bytes sent and received
Throughput + latency benchmark for OpenAI-compatible LLM endpoints (vLLM, TGI, llama.cpp, Ollama). TTFT, TPOT, throughput, percentiles. Model-agnostic.
Add a description, image, and links to the throughput topic page so that developers can more easily learn about it.
To associate your repository with the throughput topic, visit your repo's landing page and select "manage topics."