Note: this PR should not be merged. This is a notebook that explores the P95 and P99 values of a few review metrics.
The purpose of putting it in a PR is to let folks explore the data
themselves and hopefully drive discussion on how we can improve.
How to use the notebook
cd cuda_core
pixi run -e rev-stats jupyter lab rev-stats.ipynb
then run all the cells.
Metrics
Time to First Review: Duration between PR creation and first review
Time to Merge: Duration between PR creation and merge
Time to Close: Duration between PR creation and either closing or merging it
Time from Final Review to Close: Duration between the final review comment and merge
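For reference, here is a minimal sketch of how these durations and their percentiles might be computed. The DataFrame and its column names (created_at, first_review_at, merged_at, closed_at) are hypothetical stand-ins for whatever the notebook actually pulls from the GitHub data; the real code may differ.

```python
import numpy as np
import pandas as pd

# Hypothetical PR records; the notebook builds the real equivalent from GitHub data.
prs = pd.DataFrame({
    "created_at": pd.to_datetime(["2024-01-01", "2024-01-05", "2024-02-10"]),
    "first_review_at": pd.to_datetime(["2024-01-02", "2024-01-20", "2024-02-11"]),
    "merged_at": pd.to_datetime(["2024-01-03", "2024-02-01", pd.NaT]),
    "closed_at": pd.to_datetime(["2024-01-03", "2024-02-01", "2024-03-01"]),
})

def p95_p99(durations: pd.Series) -> tuple[float, float]:
    """Return the P95 and P99 of a series of timedeltas, in days."""
    days = durations.dropna().dt.total_seconds() / 86400
    return np.percentile(days, 95), np.percentile(days, 99)

time_to_first_review = prs["first_review_at"] - prs["created_at"]
time_to_merge = prs["merged_at"] - prs["created_at"]
time_to_close = prs["closed_at"] - prs["created_at"]

print("Time to First Review (P95, P99 days):", p95_p99(time_to_first_review))
print("Time to Merge        (P95, P99 days):", p95_p99(time_to_merge))
print("Time to Close        (P95, P99 days):", p95_p99(time_to_close))
```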
Places where we are doing well
Time from Final Review to Close: This is in a solid place; we're not
waiting too long to click the merge button after approval. P95: 2 days, P99: 14 days (the P99 isn't ideal, but it isn't concerning).
Places we can improve
Time to First Review: P95 here is 9.7 days, meaning 95% of PRs get their
first review within that time. P99 is 45 days, which is something we should
address.
Time to Close: This metric includes merged PRs along with PRs that were closed but not merged. P95 is 28 days, P99 is 76 days.
While these distributions have long tails, I think we can greatly improve the P95 here.
Time to Merge: This is a subset of Time to Close, and it's a bit
better (though not by much). A PR that ends up being merged tends to
be merged faster than one that is closed without merging, but of course we don't know a priori which PRs will be merged.
Would love to see what others think!
Perhaps there are other interesting metrics to calculate that would help us
determine how to improve our PR turnaround times.
Just brainstorming:
In an ideal world, reviews would be spread evenly across all potential reviewers. In practice, most PRs are probably bottlenecked on the most experienced reviewers. It might be interesting to track this over time and make sure we are "growing more experts" (see the sketch after this list).
I don't think you could glean this from the data set, but it would be interesting to know why reviews are delayed -- is it because the reviewer doesn't feel like they have enough context or experience in the code base? Maybe we could encourage a culture of self-reporting "I don't feel like I can give this a good review", with bonus points for "I will spend the time reading / poking / experimenting enough so that I feel confident". That last part is expensive, of course.
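As a rough illustration of the first idea, here is a minimal sketch of how review-load concentration across reviewers could be measured. The reviewer names and counts are made up, and the share-of-reviews calculation is just one possible way to quantify it; in practice the pairs would come from the same data set the notebook uses.

```python
from collections import Counter

# Hypothetical (reviewer, PR number) pairs.
reviews = [
    ("alice", 101), ("alice", 102), ("alice", 103), ("alice", 104),
    ("bob", 105), ("bob", 106),
    ("carol", 107),
]

counts = Counter(reviewer for reviewer, _ in reviews)
total = sum(counts.values())

# Share of all reviews handled by each reviewer; tracking how concentrated
# this is over time would show whether we're "growing more experts".
for reviewer, n in counts.most_common():
    print(f"{reviewer}: {n} reviews ({n / total:.0%} of the total)")

# A single summary number: the share handled by the busiest reviewer.
top_share = counts.most_common(1)[0][1] / total
print(f"Top reviewer handles {top_share:.0%} of reviews")
```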