Add syllable-boundary braille text wrap using pyphen hyphenation by LeonarddeR · Pull Request #20186 · nvaccess/nvda

LeonarddeR · 2026-05-20T19:13:02Z

Link to issue number:

Closes #17010
Follow-up for #20146 and #20145. This is the last of three PRs replacing #19916.

Summary of the issue:

Word wrap is sometimes pretty aggressive, especially on shorter braille displays. The previous two PRs added the text wrap infrastructure and continuation marks; this PR adds the final mode that splits long words at syllable boundaries using hyphenation dictionaries.

Description of user facing changes:

A fourth option, "At word or syllable boundaries", is added to the Text wrap combo box in braille settings. Like "At word boundaries", it avoids splitting words mid-way, but when a word is too long to fit on the display it additionally tries to split at a syllable boundary (using hyphenation dictionaries from the pyphen library) so less of the word spills onto the next row. NVDA marks the split with the continuation mark (braille dots 7-8).

For locales without a pyphen dictionary, the mode falls back cleanly to word-boundary behaviour without any error.
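
A minimal sketch of that fallback, using a hypothetical in-memory table in place of the pyphen dictionaries. The names `HYPOTHETICAL_DICTIONARIES` and `get_hyphen_positions` and the listed positions are illustrative assumptions, not NVDA or pyphen API. The only point is that an unknown locale yields an empty tuple, which a caller can treat as "no syllable split available".

```python
# Hypothetical stand-in for per-language hyphenation data.
# In the PR the real data comes from pyphen dictionaries.
HYPOTHETICAL_DICTIONARIES: dict[str, dict[str, tuple[int, ...]]] = {
    "en": {"hyphenation": (2, 6, 7)},  # hy-phen-a-tion
    "nl": {"lettergreep": (3, 6)},  # let-ter-greep
}


def get_hyphen_positions(word: str, language: str) -> tuple[int, ...]:
    """Return candidate split offsets within word, or () when none are known."""
    table = HYPOTHETICAL_DICTIONARIES.get(language)
    if table is None:
        # No dictionary for this locale: report no candidates instead of raising,
        # so the caller keeps the word-boundary break it already found.
        return ()
    return table.get(word.lower(), ())
```

With this shape the wrap code needs no special case for unsupported locales: an empty result and a word with no usable split are handled the same way.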

Description of developer facing changes:

- BrailleTextWrapFlag.AT_WORD_OR_SYLLABLE_BOUNDARIES member added to config.featureFlagEnums.
- Region._languageIndexes (dict[int, str]) tracks language-span boundaries within a braille region. Populated during _addFieldText and _addTextWithFields when format fields carry a language attribute or when field text is in a different language than the surrounding content.
- Region._getLanguageAtPos(pos) looks up the language at a raw-text offset using a bisect on the (always-ascending) keys of _languageIndexes.
- BrailleBuffer._getLanguageAtBufferPos(pos) delegates to the region that owns that braille cell.
- louisHelper.getTableLanguage(table) queries louis.getTableInfo for the "language" key and normalises the result, providing the default language for a region when no format-field language is known.
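
The bisect lookup over the language indexes can be sketched as follows. This is a standalone illustration, not the PR's implementation: `get_language_at_pos` is a hypothetical free function standing in for `Region._getLanguageAtPos`, and it assumes the dictionary always contains an entry for offset 0 and that keys were inserted in ascending order.

```python
from bisect import bisect_right


def get_language_at_pos(language_indexes: dict[int, str], pos: int) -> str:
    """Return the language in effect at raw-text offset pos."""
    # Keys are assumed ascending, so the insertion-ordered list is already sorted.
    keys = list(language_indexes)
    # bisect_right gives the insertion point after any key equal to pos,
    # so the span that covers pos starts at the key just before that point.
    index = bisect_right(keys, pos) - 1
    return language_indexes[keys[max(index, 0)]]


# Example: English text with a Dutch phrase at offsets 10..14.
indexes = {0: "en", 10: "nl", 15: "en"}
```

Here `get_language_at_pos(indexes, 12)` selects the span starting at offset 10, while offsets 0 to 9 and 15 onwards resolve to the entries at 0 and 15 respectively.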

Description of development approach:

When AT_WORD_OR_SYLLABLE_BOUNDARIES is selected and a word straddles a row boundary, _calculateWindowRowBufferOffsets already finds the last space before the display edge. This PR adds a second pass: it looks up the full word (from that space to the next space), retrieves the language at the word's braille position, and calls textUtils.hyphenation.getHyphenPositions (introduced in #20145) to obtain candidate hyphen offsets. It then iterates the candidates from the end (closest to the display edge) and picks the first that falls within the current row, updating end accordingly and setting showContinuationMark.
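
The candidate selection in that second pass might look roughly like the sketch below. It is a simplified illustration under several assumptions: `pick_syllable_break` is a hypothetical helper, all offsets live in a single coordinate space (the PR has to map between raw text and braille positions), `row_end` is the exclusive end offset of the current row, and `mark_cells` reserves room for the continuation mark. Whether NVDA reserves a cell in exactly this way is not stated in the description.

```python
from typing import Optional, Sequence


def pick_syllable_break(
    word_start: int,
    hyphen_positions: Sequence[int],
    row_end: int,
    mark_cells: int = 1,
) -> Optional[int]:
    """Return the new row end for a syllable split, or None to keep the word boundary.

    hyphen_positions are ascending offsets relative to word_start.
    """
    # Walk the candidates from the end, so the first match is the one
    # closest to the display edge.
    for position in reversed(hyphen_positions):
        candidate_end = word_start + position
        # The split must leave part of the word on this row and still
        # leave room for the continuation mark (an assumption, see above).
        if word_start < candidate_end and candidate_end + mark_cells <= row_end:
            return candidate_end
    return None
```

A return value of `None` corresponds to the fallback cases: no hyphen positions at all, or every candidate lying past the display edge. In those cases the caller would keep the word-boundary `end` and leave `showContinuationMark` unset.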

Language tracking in Region ensures that the correct pyphen dictionary is selected even when a braille region contains multilingual content (e.g. a paragraph with inline foreign phrases).
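
As an illustration of how such tracking could record an inline foreign phrase, the sketch below adds a switch entry at the start of the phrase and a restore entry at its end. `record_field_language` is a hypothetical helper, not the PR's `_addFieldText`; it assumes text is only ever appended, so `start` is never smaller than an existing key.

```python
def record_field_language(
    language_indexes: dict[int, str],
    start: int,
    length: int,
    field_language: str,
) -> None:
    """Mark raw text [start, start + length) as being in field_language."""
    # The language currently in effect is the most recently inserted value.
    surrounding = list(language_indexes.values())[-1]
    if length == 0 or field_language == surrounding:
        # Nothing to track: same language as the surrounding content.
        return
    language_indexes[start] = field_language  # switch at the start of the span
    language_indexes[start + length] = surrounding  # restore after the span


# Region default language is English; a 5-character Dutch phrase starts at offset 10.
indexes = {0: "en"}
record_field_language(indexes, 10, 5, "nl")
```

After the call, `indexes` holds the entries for offsets 0, 10 and 15, with keys still in ascending order, which is what a bisect-based lookup relies on.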

Testing strategy:

New unit tests in test_calculateWindowRowBufferOffsets cover:

- Successful syllable split: correct end and showContinuationMark = True.
- Empty hyphen positions: falls back to word boundary, no continuation mark.
- All hyphen positions past the display edge: falls back to word boundary.
- Unknown language: getHyphenPositions returns (), falls back to word boundary.

New unit tests in test_regionLanguageIndexes cover:

- Fresh region returns the default language for any position.
- _addFieldText inserts a switch/restore pair when field language differs.
- _addTextWithFields records a language index for a formatChange command carrying a language attribute.
- TextInfoRegion.update resets _languageIndexes to {0: default}, discarding stale entries from the previous update cycle.

Manual testing: confirmed the new option appears in the braille settings panel with the correct label, that long words are split at syllable boundaries on a 20-cell display, and that the continuation mark is shown at the split point.

Known issues with pull request:

None.

Code Review Checklist:

Documentation:
- Change log entry
- User Documentation
- Developer / Technical Documentation
- Context sensitive help for GUI changes
Testing:
- Unit tests
- System (end to end) tests
- Manual testing
UX of all users considered:
- Speech
- Braille
- Low Vision
- Different web browsers
- Localization in other languages / culture than English
API is compatible with existing add-ons.
Security precautions taken.

Wires AT_WORD_OR_SYLLABLE_BOUNDARIES mode: language tracking via Region._languageIndexes selects the correct pyphen dictionary per locale. Breaks at the syllable boundary closest to the display edge within the last word, falling back to the word boundary if no better split is found.

Depends on: pyphen-abstraction + braille-textwrap-refactor

Part of nvaccess#17010

Copilot

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

Adds a new braille “Text wrap” mode that can split long words at syllable boundaries using hyphenation dictionaries, including language-aware behavior, and documents/tests the feature.

Changes:

- Introduces AT_WORD_OR_SYLLABLE_BOUNDARIES text wrap option and implements syllable-boundary splitting in braille window row calculations.
- Tracks language changes across region raw text to drive correct hyphenation dictionary selection.
- Updates user documentation / changelog and adds unit tests for language index tracking and syllable-boundary wrapping.

Reviewed changes

Copilot reviewed 8 out of 8 changed files in this pull request and generated 3 comments.


| File | Description |
| --- | --- |
| user_docs/en/userGuide.md | Documents the new “At word or syllable boundaries” wrap option with an example. |
| user_docs/en/changes.md | Updates release notes to reflect the new 4-valued “Text wrap” option and hyphenation behavior. |
| tests/unit/test_braille/test_regionLanguageIndexes.py | Adds unit tests for region language index tracking/reset behavior. |
| tests/unit/test_braille/test_calculateWindowRowBufferOffsets.py | Adds tests for syllable-boundary wrap behavior; cleans up per-test AutoProperty overrides. |
| source/setup.py | Removes `textUtils` from a manifest/module list. |
| source/louisHelper.py | Adds `getTableLanguage` helper to read/normalize table language metadata. |
| source/config/featureFlagEnums.py | Adds the new `BrailleTextWrapFlag` enum value and label. |
| source/braille.py | Implements language index tracking and syllable-boundary wrap using hyphenation positions. |

LeonarddeR · 2026-05-20T19:19:12Z

+	"""Build a TextInfoRegion without going through __init__ (which requires an NVDAObject)."""
+	region = braille.TextInfoRegion.__new__(braille.TextInfoRegion)
+	braille.Region.__init__(region)


This would be an interesting case if the unit tests actually failed, but they don't fail at all.

LeonarddeR · 2026-05-20T19:20:54Z

+
+	def _getLanguageAtPos(self, pos: int) -> str:
+		"""Get the language at a given position in L{rawText} based on L{_languageIndexes}."""
+		keys = list(self._languageIndexes)


False positive IMO. Dictionaries have preserved insertion order since Python 3.7, and the keys are always inserted in ascending order.
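
To illustrate the ordering argument (a sketch, not code from the PR): `list()` on a dictionary returns keys in insertion order, so the result is sorted only if the keys were inserted in ascending order. The offsets and language codes below are made up for the example.

```python
# Keys inserted in ascending order: the key list is already sorted,
# so it can be handed to bisect without an extra sort.
ascending: dict[int, str] = {}
for offset, language in [(0, "en"), (10, "nl"), (15, "en")]:
    ascending[offset] = language
keys = list(ascending)

# Counter-example: a key inserted out of order stays where it was inserted,
# and a bisect over this list would no longer be valid.
out_of_order = {0: "en", 15: "en"}
out_of_order[10] = "nl"
unsorted_keys = list(out_of_order)
```

The lookup is therefore only as safe as the invariant that entries are appended in ascending order. A debug-time check such as `keys == sorted(keys)` is one way to guard it.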

Copilot AI review requested due to automatic review settings May 20, 2026 19:13

LeonarddeR requested review from a team as code owners May 20, 2026 19:13

LeonarddeR requested review from Qchristensen and seanbudd May 20, 2026 19:13

LeonarddeR marked this pull request as draft May 20, 2026 19:13

LeonarddeR self-assigned this May 20, 2026

Copilot AI reviewed May 20, 2026


Merge remote-tracking branch 'origin/master' into braille-syllable-wrap

6671a66

LeonarddeR marked this pull request as ready for review May 21, 2026 07:52

LeonarddeR wants to merge 2 commits into nvaccess:master from LeonarddeR:braille-syllable-wrap


2 participants
