
feat: v0 tab completion #110

Draft
wjiayis wants to merge 22 commits into staging from feat/tab-completion

Conversation

@wjiayis
Member

@wjiayis wjiayis commented Feb 4, 2026

#33

In short, it'll:

  1. [Frontend] Recognize that the user is trying to add a citation (the trigger is \cite{); see the sketch after this list
  2. [Frontend] Temporarily suppress the default Overleaf dropdown suggestions
  3. [Frontend] Get the last sentence as context for the LLM
  4. [Backend] Fetch the bibliography from the .bib files stored in the project as raw text
  5. [Backend] Query a lightweight LLM (hardcoded to gpt-5-nano for now) to get at most 3 citation keys
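
A minimal TypeScript sketch of the frontend side (steps 1-3), assuming plain access to the text before the cursor; `requestCitationKeys` is a hypothetical backend call, not the plugin's actual API.

```typescript
// Sketch of steps 1-3. `requestCitationKeys` is a hypothetical backend call.
const CITE_TRIGGER = /\\cite\{[^}]*$/;

function lastSentence(text: string): string {
  // Use everything after the last sentence terminator as LLM context.
  const parts = text.split(/(?<=[.!?])\s+/);
  return parts[parts.length - 1] ?? "";
}

async function maybeSuggestCitations(
  textBeforeCursor: string,
  requestCitationKeys: (context: string) => Promise<string[]>,
): Promise<string[] | null> {
  // Step 1: only act inside an unclosed \cite{...}.
  if (!CITE_TRIGGER.test(textBeforeCursor)) return null;

  // Step 2 (suppressing Overleaf's default dropdown) is editor-specific and omitted here.

  // Step 3: take the last sentence before \cite{ as context.
  const context = lastSentence(textBeforeCursor.replace(CITE_TRIGGER, ""));

  // Steps 4-5 happen on the backend; expect at most 3 citation keys back.
  return requestCitationKeys(context);
}
```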

There are some outstanding issues:

  1. Latency is way too high (severe) -> I'll see what caching / design choices I can make to bring the latency down
  2. When the user is idle for a long time, the token expires and the user is logged out. They will not see citation suggestions until they log back in -> won't fix in this iteration

@wjiayis wjiayis mentioned this pull request Feb 4, 2026
@Junyi-99
Member

Junyi-99 commented Feb 5, 2026

Hi @wjiayis, thanks for the update. I’ve created a new issue to address the token expiration problem.

Regarding the latency issue, do we have visibility into which part of the pipeline contributes most to the high latency? For example, a rough breakdown across:

frontend → backend → LLM provider (reasoning + response)

I’ll take a look at this PR later this evening as well.

@Junyi-99 Junyi-99 linked an issue Feb 5, 2026 that may be closed by this pull request
@wjiayis
Member Author

wjiayis commented Feb 5, 2026

@Junyi-99 I haven't gotten to the latency breakdown yet, but I've settled everything else and I'm gonna work on it next. Thanks for helping to review when convenient; I'll update my findings when I have them too!

@Junyi-99
Member

Junyi-99 commented Feb 5, 2026

@wjiayis Got it, thanks for the update. Looking forward to your findings.

@wjiayis
Member Author

wjiayis commented Feb 5, 2026

Root Cause

There's a ~20s latency in the inline-suggestion loop, and >99% of it comes from waiting for the LLM to start responding. This issue arises because I'm passing in a large (but realistic) bibliography (the bibliography of PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing itself), and gpt-5-nano takes a while to parse it.
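
For illustration, a minimal sketch (not the plugin's actual code) of how the LLM share of the latency can be measured with the Node openai SDK's streaming API; the model name and prompt contents are placeholders.

```typescript
import OpenAI from "openai";

const client = new OpenAI();

// Measure time-to-first-token vs. total time for one suggestion request.
// Model name and prompt contents are placeholders.
async function measureLlmLatency(bibliography: string, context: string) {
  const start = performance.now();
  let firstTokenMs: number | null = null;

  const stream = await client.chat.completions.create({
    model: "gpt-5-nano",
    stream: true,
    messages: [
      { role: "system", content: bibliography },
      { role: "user", content: `Suggest at most 3 citation keys for: ${context}` },
    ],
  });

  for await (const chunk of stream) {
    if (firstTokenMs === null && chunk.choices[0]?.delta?.content) {
      firstTokenMs = performance.now() - start; // ~20s here, i.e. the bottleneck
    }
  }

  const totalMs = performance.now() - start;
  console.log(`first token: ${firstTokenMs?.toFixed(0)} ms, total: ${totalMs.toFixed(0)} ms`);
}
```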

Solution

I think it's reasonable to expect that a regular user's max latency tolerance is ~2s. I'll implement the following 3 solutions to achieve that.

Model Selection

gpt-5-nano takes a long time to process the long bibliography. Just swapping it out for gpt-5.2 brings latency down to 2-4s, but gpt-5.2 is expensive to call. I'll improve latency and cost with the next 2 solutions.

Prompt Caching

Since the bibliography remains generally constant and takes up the bulk of the prompt, I'll use OpenAI's prompt caching, which is advertised to reduce latency by up to 80% and input token costs by up to 90%.

  1. Place the bibliography at the start of the prompt (prompt caching uses exact prefix matching)
  2. Run a "no-reply" LLM query at the start of each session and whenever the database reloads, and configure it to cache for 24h
  3. Each time \cite{ is triggered, the cached bibliography is reused -> lower latency (see the sketch after this list)
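
A minimal sketch of points 1 and 2, assuming the Node openai SDK: the bibliography sits at the start of the prompt as a stable prefix, and a cheap warm-up call primes OpenAI's prefix-based prompt cache. The 24h retention configuration mentioned in point 2 is not shown, and the model name is a placeholder.

```typescript
import OpenAI from "openai";

const client = new OpenAI();
const MODEL = "gpt-5.2"; // placeholder

// Point 1: keep the bibliography as a stable prompt prefix so every request
// shares the same cacheable prefix.
function buildMessages(bibliography: string, context: string) {
  return [
    { role: "system" as const, content: `Bibliography (BibTeX):\n${bibliography}` },
    { role: "user" as const, content: `Suggest at most 3 citation keys for: ${context}` },
  ];
}

// Point 2: a "no-reply" warm-up at session start / bibliography reload,
// so the prefix is already cached before the user first types \cite{.
async function warmUpCache(bibliography: string): Promise<void> {
  await client.chat.completions.create({
    model: MODEL,
    messages: buildMessages(bibliography, "Warm-up only. Reply with OK."),
  });
}

// Point 3: the real suggestion call reuses the identical prefix.
async function suggestCitationKeys(bibliography: string, context: string): Promise<string> {
  const response = await client.chat.completions.create({
    model: MODEL,
    messages: buildMessages(bibliography, context),
  });
  return response.choices[0]?.message?.content ?? "";
}
```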

Prompt Refinement

I'll remove info-sparse fields (e.g. doi, url, pages) and retain only info-rich fields (e.g. title, booktitle), to reduce the total size of the bibliography by (hopefully) at least 40%.
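
A small sketch of this pruning; the field list is an assumption, and a regex pass is approximate (values with commas or nested braces would need a real BibTeX parser).

```typescript
// Strip info-sparse BibTeX fields before sending the bibliography to the LLM.
// The field list is an assumption; a regex pass is approximate.
const SPARSE_FIELDS = ["doi", "url", "pages", "isbn", "issn", "month", "note"];

function pruneBibliography(bib: string): string {
  const pattern = new RegExp(
    `^\\s*(${SPARSE_FIELDS.join("|")})\\s*=\\s*[^\\n]*$`,
    "gim",
  );
  return bib.replace(pattern, "");
}
```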

cc: @Junyi-99

@kah-seng kah-seng mentioned this pull request Feb 5, 2026

Development

Successfully merging this pull request may close these issues.

[Feature Request] Tab-Completion
