Review PR 1109 string character offset handling by ericcrosson-bitgo · Pull Request #1110 · BitGo/api-ts

ericcrosson-bitgo · 2026-01-15T22:09:47Z

No description provided.

…dates Bumps the dependencies group with 3 updates in the / directory: [@swc/core](https://github.com/swc-project/swc), [@swc/core-linux-x64-gnu](https://github.com/swc-project/swc) and [@swc/core-darwin-arm64](https://github.com/swc-project/swc). Updates `@swc/core` from 1.5.7 to 1.10.6 - [Release notes](https://github.com/swc-project/swc/releases) - [Changelog](https://github.com/swc-project/swc/blob/main/CHANGELOG.md) - [Commits](swc-project/swc@v1.5.7...v1.10.6) Updates `@swc/core-linux-x64-gnu` from 1.5.7 to 1.10.6 - [Release notes](https://github.com/swc-project/swc/releases) - [Changelog](https://github.com/swc-project/swc/blob/main/CHANGELOG.md) - [Commits](swc-project/swc@v1.5.7...v1.10.6) Updates `@swc/core-darwin-arm64` from 1.5.7 to 1.10.6 - [Release notes](https://github.com/swc-project/swc/releases) - [Changelog](https://github.com/swc-project/swc/blob/main/CHANGELOG.md) - [Commits](swc-project/swc@v1.5.7...v1.10.6) --- updated-dependencies: - dependency-name: "@swc/core" dependency-type: direct:production update-type: version-update:semver-minor dependency-group: dependencies - dependency-name: "@swc/core-linux-x64-gnu" dependency-type: direct:production update-type: version-update:semver-minor dependency-group: dependencies - dependency-name: "@swc/core-darwin-arm64" dependency-type: direct:production update-type: version-update:semver-minor dependency-group: dependencies ... Signed-off-by: dependabot[bot] <support@github.com>

…extraction SWC (written in Rust) provides byte-based span offsets, but JavaScript strings use character-based offsets (UTF-16 code units). When source code contains multibyte UTF-8 characters (e.g., À, 日, 😀), directly using SWC's byte offsets with String.slice() results in incorrect string extraction. This commit introduces byteOffsetToCharOffset() which properly converts byte offsets to character offsets by iterating through the string and accumulating byte lengths until reaching the target byte position. Test cases added for: - Extended Latin characters (2-byte UTF-8: À, ÿ, ñ, ü) - CJK characters (3-byte UTF-8: 日本語, 中文, 한국어) - Mixed multibyte characters at multiple positions - Multibyte characters at the very start of a file Fixes: DX-2788

ericcrosson-bitgo · 2026-01-16T18:31:00Z

Superseded by #1111 (nice number)

dependabot bot and others added 2 commits January 8, 2025 08:30

ericcrosson-bitgo changed the base branch from master to DX-2788-handle-multibyte January 15, 2026 22:10

Base automatically changed from DX-2788-handle-multibyte to master January 15, 2026 22:18

ericcrosson-bitgo closed this Jan 16, 2026

ericcrosson-bitgo deleted the claude/review-pr-1109-string-offsets-QvruR branch January 16, 2026 18:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Review PR 1109 string character offset handling#1110

Review PR 1109 string character offset handling#1110
ericcrosson-bitgo wants to merge 2 commits intomasterfrom
claude/review-pr-1109-string-offsets-QvruR

ericcrosson-bitgo commented Jan 15, 2026

Uh oh!

ericcrosson-bitgo commented Jan 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ericcrosson-bitgo commented Jan 15, 2026

Uh oh!

ericcrosson-bitgo commented Jan 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants