Region/Line-Type Filtering for Image-Level Actions (Transcribe, Train)

Description

Enable filtering by region type or line type when launching transcription-related operations (e.g., Recognize, Train). This filtering already exists for export but not for other tasks, which is inconsistent and limits workflows for complex documents.

Motivation

Many documents contain a mix of scripts, languages, or distinct editorial categories (e.g., main text vs. marginalia). When only part of a document is appropriate for a given model, users are currently forced to:

- Run recognition on the entire page, then manually delete incorrect outputs
- Build separate documents to isolate content

This is inefficient and error-prone. Consistent region/line filtering would support multilingual and multiscript projects, especially before fine-tuned models are available.

Proposed Feature

Add UI and API options to restrict image-level transcription operations to:
- Selected region types (e.g., MarginalZone only)
- Selected line types (e.g., TitleLine + MainLine)
Use the filter controls of the current Export modal as a starting point and improve on them.
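
To make the intended selection semantics concrete, here is a minimal sketch in Python. Everything in it is hypothetical: the `Line` class, the `filter_lines` function, and the `region_types` / `line_types` parameter names are illustrations, not the application's existing models or API, and the `MainZone` type is an example value only. It also assumes that when both filters are set, a line must satisfy both, which this proposal does not yet specify.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional, Set


@dataclass(frozen=True)
class Line:
    """Hypothetical stand-in for a segmented line; not the real model."""
    pk: int
    line_type: Optional[str]    # e.g. "MainLine"; None if the line is untyped
    region_type: Optional[str]  # type of the enclosing region; None if none


def filter_lines(
    lines: Iterable[Line],
    region_types: Optional[Set[str]] = None,
    line_types: Optional[Set[str]] = None,
) -> List[Line]:
    """Keep lines that match every filter that is set.

    A filter left as None means "no restriction" (assumed behaviour).
    """
    selected = []
    for line in lines:
        if region_types is not None and line.region_type not in region_types:
            continue
        if line_types is not None and line.line_type not in line_types:
            continue
        selected.append(line)
    return selected


# Illustrative page content.
page = [
    Line(1, "TitleLine", "MainZone"),
    Line(2, "MainLine", "MainZone"),
    Line(3, "MainLine", "MarginalZone"),
    Line(4, None, None),
]

marginal_only = filter_lines(page, region_types={"MarginalZone"})
titles_and_main = filter_lines(page, line_types={"TitleLine", "MainLine"})
```

With no filter set, the sketch returns every line, so existing behaviour would be unchanged by default. How untyped lines and lines outside any region should be treated is an open design question; here they are simply excluded whenever the corresponding filter is active.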

Scope

Applicable operations:
- Run Recognition (model inference)
- Training data selection
Operates at the page/document level like other batch jobs.
Non-selected content remains untouched in the target transcription layer.
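
The "remains untouched" requirement could look like the following sketch, which assumes a transcription layer can be modelled as a mapping from line id to text. The `apply_recognition` function and its arguments are hypothetical names used only for illustration.

```python
from typing import Dict, Set


def apply_recognition(
    existing: Dict[int, str],
    recognized: Dict[int, str],
    selected_ids: Set[int],
) -> Dict[int, str]:
    """Return a copy of the layer where only selected lines are overwritten."""
    merged = dict(existing)  # copy, so the input layer is not mutated
    for line_id, text in recognized.items():
        if line_id in selected_ids:
            merged[line_id] = text
    return merged


# Illustrative data: only line 3 (a marginal note) was selected.
layer = {1: "Chapter I", 2: "old main text", 3: "old marginal note"}
model_output = {3: "new marginal note"}

updated = apply_recognition(layer, model_output, selected_ids={3})
```

Lines 1 and 2 keep their previous text because they were not part of the selection, which is the behaviour described above.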

UI Integration

Include a filtering section in the operation configuration panel (checklist of region/line types).
Optionally remember last-used filters on a per-document basis.

Future Enhancements (Separate Issues)

Combine this filtering with transcription status filtering (Final-only, etc.).
Provide a preview showing how many regions/lines will be processed before running.

Rationale

Brings filtering parity across export, transcription, and training operations. Reduces manual cleanup and enables practical work on heterogeneous documents.