[None][perf] Skip request broadcast when world_size is 1 by yechank-nvidia · Pull Request #13412 · NVIDIA/TensorRT-LLM

yechank-nvidia · 2026-04-24T07:06:40Z

Skip the MPI broadcast in `RequestBroadcaster._broadcast_requests` when `world_size == 1`. The broadcast call still incurs pickle serialization overhead even when there is only one rank, which is wasted work for single-GPU runs, especially multimodal ones.
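To illustrate the kind of overhead described above, here is a minimal sketch that times a pickle round trip on a large payload. The payload shape (`fake_request`, an 8 MiB byte buffer standing in for image data) and the helper `roundtrip` are assumptions made for illustration; they are not taken from TensorRT-LLM, and the PR does not state exactly which serialization steps run on a single rank.

```python
import pickle
import time


def roundtrip(payload):
    """Serialize and deserialize a payload, as a pickle-based transport might."""
    blob = pickle.dumps(payload, protocol=pickle.HIGHEST_PROTOCOL)
    return pickle.loads(blob)


# Hypothetical stand-in for a multimodal request: a few MiB of raw bytes.
fake_request = {"request_id": 0, "pixels": bytes(8 * 1024 * 1024)}

start = time.perf_counter()
copy = roundtrip(fake_request)
elapsed_ms = (time.perf_counter() - start) * 1e3

# The result equals the input but is a fresh object: with a single rank,
# nobody else needed that copy, so the time spent producing it is overhead.
print(f"round trip: {elapsed_ms:.2f} ms, "
      f"equal={copy == fake_request}, same_object={copy is fake_request}")
```

The measured time depends on the machine and payload size, so no figure is quoted here.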

The check is placed before the has_pp branch so it covers every parallelism configuration (TP / PP / CP / EP / DP).
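The placement described above can be sketched as follows. This is not the code from `request_utils.py`: `FakeDist`, its `broadcast` method, and the free function `broadcast_requests` are hypothetical stand-ins that only show why putting the `world_size` check first covers every branch.

```python
from dataclasses import dataclass


@dataclass
class FakeDist:
    """Hypothetical stand-in for the distributed context (not the real class)."""
    world_size: int
    has_pp: bool = False
    broadcast_calls: int = 0

    def broadcast(self, obj):
        # Counts calls so the fast path can be observed.
        self.broadcast_calls += 1
        return obj


def broadcast_requests(dist, requests):
    # Fast path first: with one rank there is nobody to send to, so return
    # before any parallelism-specific branch is evaluated.
    if dist.world_size == 1:
        return requests
    if dist.has_pp:
        # Placeholder for the pipeline-parallel send/recv path.
        return dist.broadcast(requests)
    # Placeholder for the non-PP broadcast path.
    return dist.broadcast(requests)


reqs = ["req-0", "req-1"]
single = FakeDist(world_size=1, has_pp=True)
multi = FakeDist(world_size=2)
single_out = broadcast_requests(single, reqs)
multi_out = broadcast_requests(multi, reqs)
```

Because the check precedes the `has_pp` branch, the single-rank case returns the original list untouched even when the stand-in reports `has_pp=True`.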

Summary by CodeRabbit

Bug Fixes
- Optimized request distribution in single-process deployments to eliminate unnecessary communication overhead and improve performance.

Signed-off-by: yechank <[email protected]>

yechank-nvidia · 2026-04-24T07:09:17Z

/bot run

coderabbitai · 2026-04-24T07:09:32Z

📝 Walkthrough

Adds an early return optimization in _broadcast_requests that skips distributed communication when running in a single-process environment, preventing unnecessary broadcast logic and PP topology dependencies.

Changes

| Cohort / File(s) | Summary |
|---|---|
| Single-process optimization `tensorrt_llm/_torch/pyexecutor/request_utils.py` | Adds fast path early return in `_broadcast_requests` when `world_size` is 1, bypassing PP/TP/recv/send control flow for single-process scenarios. |

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~3 minutes

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 inconclusive)

| Check name | Status | Explanation | Resolution |
|---|---|---|---|
| Description check | ❓ Inconclusive | The PR description is concise and explains the optimization, but lacks structured sections from the template (Test Coverage, PR Checklist). | Add Test Coverage section explaining which tests validate the single-GPU scenario, and confirm PR Checklist items are addressed. |

✅ Passed checks (4 passed)

| Check name | Status | Explanation |
|---|---|---|
| Title check | ✅ Passed | The title accurately and specifically describes the main change: skipping request broadcast optimization when world_size is 1, matching the code's fast-path addition. |
| Docstring Coverage | ✅ Passed | No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check. |
| Linked Issues check | ✅ Passed | Check skipped because no linked issues were found for this pull request. |
| Out of Scope Changes check | ✅ Passed | Check skipped because no linked issues were found for this pull request. |

tensorrt-cicd · 2026-04-24T07:19:14Z

PR_Github #45364 [ run ] triggered by Bot. Commit: 05a2264

tensorrt-cicd · 2026-04-24T12:21:15Z

PR_Github #45364 [ run ] completed with state SUCCESS. Commit: 05a2264
/LLM/main/L0_MergeRequest_PR pipeline #35608 completed with status: 'SUCCESS'

[None][perf] Skip request broadcast when world_size is 1

05a2264

Signed-off-by: yechank <[email protected]>

yechank-nvidia self-assigned this Apr 24, 2026

yechank-nvidia requested a review from a team as a code owner April 24, 2026 07:06

yechank-nvidia requested a review from dongxuy04 April 24, 2026 07:06

Funatiq approved these changes Apr 24, 2026

yechank-nvidia merged commit 8e2bdfc into NVIDIA:main Apr 24, 2026
10 checks passed

yufeiwu-nv pushed a commit to yufeiwu-nv/TensorRT-LLM that referenced this pull request May 19, 2026

[None][perf] Skip request broadcast when world_size is 1 (NVIDIA#13412)

263d742

Signed-off-by: yechank <[email protected]>

yechank-nvidia merged 1 commit into NVIDIA:main from yechank-nvidia:remove_broadcast

3 participants
