Results¶

`results` ¶

`RecognizedText` `dataclass` ¶

Recognized text class

This class represents a result from a text recognition model.

Attributes:

Name	Type	Description
`texts`	`list[str]`	A sequence of candidate texts
`scores`	`list[float]`	The scores of the candidate texts

`top_candidate` ¶

The candidate with the highest confidence score

Source code in src/htrflow/results.py

def top_candidate(self) -> str:
    """The candidate with the highest confidence score"""
    return self.texts[self.scores.index(self.top_score())]

`top_score` ¶

The highest confidence score

Source code in src/htrflow/results.py

def top_score(self):
    """The highest confidence score"""
    return max(self.scores)

`Result` ¶

A result from an arbitrary model (or process)

One result instance corresponds to one input image.

Attributes:

Name	Type	Description
`metadata`		Metadata regarding the result, model-dependent.
`segments`		`Segment` instances representing results from an object detection or instance segmentation model, or similar. May be empty if not applicable.
`data`		Any other data associated with the result.

Create a Result

See also the alternative constructors Result.text_recognition_result, Result.segmentation_result and Result.word_segmentation_result.

Source code in src/htrflow/results.py

def __init__(
    self,
    metadata: dict[str, str] | None = None,
    segments: Sequence[Segment] | None = None,
    data: dict[str, Any] = None,
    text: RecognizedText | None = None,
):
    """Create a Result

    See also the alternative constructors Result.text_recognition_result,
    Result.segmentation_result and Result.word_segmentation_result.
    """
    self.metadata = metadata or {}
    self.segments = segments or []
    self.data = data or {}
    if text is not None:
        self.data.update({TEXT_RESULT_KEY: text})

`bboxes` `property` ¶

Bounding boxes relative to input image

`class_labels` `property` ¶

Class labels of segments

`confidences` `property` ¶

A list of confidence scores for the recognized text. Returns empty list if no text result.

`global_masks` `property` ¶

Global masks relative to input image

`local_mask` `property` ¶

Local masks relative to bounding boxes

`polygons` `property` ¶

Polygons relative to input image

`text_result` `property` ¶

The RecognizedText object, if available.

`texts` `property` ¶

A list of recognized text candidates. Returns empty list if no text result.

`drop_indices` ¶

Drop segments from result

Example: Given a Result with three segments s0, s1 and s2, index = [0, 2] will drop segments s0 and s2.

Parameters:

Name	Type	Description	Default
`index`	`Sequence[int]`	Indices of segments to drop	required

Source code in src/htrflow/results.py

def drop_indices(self, index: Sequence[int]) -> None:
    """Drop segments from result

    Example: Given a `Result` with three segments s0, s1 and s2,
    index = [0, 2] will drop segments s0 and s2.

    Arguments:
        index: Indices of segments to drop
    """
    keep = [i for i in range(len(self.segments)) if i not in index]
    self.reorder(keep)

`filter` ¶

Filter segments and data based on a predicate applied to a specified key.

Parameters:

Name	Type	Description	Default
`key`	`str`	The key in the data dictionary to test the predicate against.	required
`predicate`	`[Callable]`	A function that takes a value associated with the key	required

Example:

>>> def remove_certain_text(text_results):
>>>    return text_results != 'lorem'
>>> result.filter('text_results', remove_certain_text)
True

Source code in src/htrflow/results.py

def filter(self, key: str, predicate: Callable[[Any], bool]) -> None:
    """Filter segments and data based on a predicate applied to a specified key.

    Args:
        key: The key in the data dictionary to test the predicate against.
        predicate [Callable]: A function that takes a value associated with the key
        and returns True if the segment should be kept.

    Example:
    ```
    >>> def remove_certain_text(text_results):
    >>>    return text_results != 'lorem'
    >>> result.filter('text_results', remove_certain_text)
    True
    ```
    """
    keep = [i for i, item in enumerate(self.data) if predicate(item.get(key, None))]
    self.reorder(keep)

`reorder` ¶

Reorder result

Example: Given a Result with three segments s0, s1 and s2, index = [2, 0, 1] will put the segments in order [s2, s0, s1]. Any indices not in index will be dropped from the result.

Parameters:

Name	Type	Description	Default
`index`	`Sequence[int]`	A list of indices representing the new ordering.	required

Source code in src/htrflow/results.py

def reorder(self, index: Sequence[int]) -> None:
    """Reorder result

    Example: Given a `Result` with three segments s0, s1 and s2,
    index = [2, 0, 1] will put the segments in order [s2, s0, s1].
    Any indices not in `index` will be dropped from the result.

    Arguments:
        index: A list of indices representing the new ordering.
    """
    if self.segments:
        self.segments = [self.segments[i] for i in index]

`rescale` ¶

Rescale the Result's segments

Source code in src/htrflow/results.py

def rescale(self, factor: float):
    """Rescale the Result's segments"""
    for segment in self.segments:
        segment.rescale(factor)

`segmentation_result` `classmethod` ¶

Create a segmentation result

Parameters:

Name	Type	Description	Default
`image`		The original image	required
`metadata`	`dict[str, Any]`	Result metadata	required
`segments`		The segments	required

Returns:

Type	Description
`Result`	A Result instance with the specified data and no texts.

Source code in src/htrflow/results.py

@classmethod
def segmentation_result(
    cls,
    orig_shape: tuple[int, int],
    metadata: dict[str, Any],
    bboxes: Sequence[Bbox | Iterable[int]] | None = None,
    masks: Sequence[Mask] | None = None,
    polygons: Sequence[Polygon] | None = None,
    scores: Iterable[float] | None = None,
    labels: Iterable[str] | None = None,
) -> "Result":
    """Create a segmentation result

    Arguments:
        image: The original image
        metadata: Result metadata
        segments: The segments

    Returns:
        A Result instance with the specified data and no texts.
    """
    segments = []
    for item in _zip_longest_none(bboxes, masks, scores, labels, polygons):
        segment = Segment(*item, orig_shape=orig_shape)
        if segment.bbox.area > 0:
            segments.append(segment)
    return cls(metadata, segments=segments)

`text_recognition_result` `classmethod` ¶

Create a text recognition result

Parameters:

Name	Type	Description	Default
`metadata`	`dict[str, Any]`	Result metadata	required
`text`		The recognized text	required

Returns:

Type	Description
`Result`	A Result instance with the specified data and no segments.

Source code in src/htrflow/results.py

@classmethod
def text_recognition_result(cls, metadata: dict[str, Any], texts: list[str], scores: list[float]) -> "Result":
    """Create a text recognition result

    Arguments:
        metadata: Result metadata
        text: The recognized text

    Returns:
        A Result instance with the specified data and no segments.
    """
    return cls(metadata, text=RecognizedText(texts, scores))

`Segment` ¶

Segment class

Class representing a segment of an image, typically a result from a segmentation model or a detection model.

Attributes:

Name	Type	Description
`bbox`	`Bbox`	The bounding box of the segment
`mask`	`Mask \| None`	The segment's mask, if available. The mask is stored relative to the bounding box. Use the `global_mask()` method to retrieve the mask relative to the original image.
`score`	`float \| None`	Segment confidence score, if available.
`class_label`	`str \| None`	Segment class label, if available.
`polygon`	`Polygon \| None`	An approximation of the segment mask, relative to the original image. If no mask is available, `polygon` defaults to a polygon representation of the segment's bounding box.
`orig_shape`	`tuple[int, int] \| None`	The shape of the orginal input image.

Create a Segment instance

A segment can be created from a bounding box, a polygon, a mask or any combination of the three.

Parameters:

Name	Type	Description	Default
`bbox`	`tuple[int, int, int, int] \| Bbox \| None`	The segment's bounding box, as either a `geometry.Bbox` instance or as a (xmin, ymin, xmax, ymax) tuple. Required if `mask` and `polygon` are None. Defaults to None.	`None`
`mask`	`Mask \| None`	The segment's mask relative to the original input image. Required if both `polygon` and `bbox` are None. Defaults to None.	`None`
`score`	`float \| None`	Segment confidence score. Defaults to None.	`None`
`class_label`	`str \| None`	Segment class label. Defaults to None.	`None`
`polygon`	`Polygon \| Sequence[tuple[int, int]] \| None`	A polygon defining the segment, relative to the input image. Defaults to None. Required if both `mask` and `bbox` are None.	`None`
`orig_shape`	`tuple[int, int] \| None`	The shape of the orginal input image. Defaults to None.	`None`

Source code in src/htrflow/results.py

def __init__(
    self,
    bbox: tuple[int, int, int, int] | Bbox | None = None,
    mask: Mask | None = None,
    score: float | None = None,
    class_label: str | None = None,
    polygon: Polygon | Sequence[tuple[int, int]] | None = None,
    orig_shape: tuple[int, int] | None = None,
    data: dict[str, Any] | None = None,
):
    """Create a `Segment` instance

    A segment can be created from a bounding box, a polygon, a mask
    or any combination of the three.

    Arguments:
        bbox: The segment's bounding box, as either a `geometry.Bbox`
            instance or as a (xmin, ymin, xmax, ymax) tuple. Required
            if `mask` and `polygon` are None. Defaults to None.
        mask: The segment's mask relative to the original input image.
            Required if both `polygon` and `bbox` are None. Defaults
            to None.
        score: Segment confidence score. Defaults to None.
        class_label: Segment class label. Defaults to None.
        polygon: A polygon defining the segment, relative to the input
            image. Defaults to None. Required if both `mask` and `bbox`
            are None.
        orig_shape: The shape of the orginal input image. Defaults to
            None.
    """
    if all(item is None for item in (bbox, mask, polygon)):
        raise ValueError("Cannot create a Segment without bbox, mask or polygon")

    # Mask is given: Compute a polygon and a bounding box from the mask
    if mask is not None:
        bbox = geometry.mask2bbox(mask)
        polygon = geometry.mask2polygon(mask)
        mask = imgproc.crop(mask, bbox)

    # Polygon is given: Compute a bounding box and possibly mask
    elif polygon is not None:
        polygon = geometry.Polygon(polygon)
        bbox = polygon.bbox()
        if orig_shape:
            mask = geometry.polygon2mask(polygon, orig_shape)
            mask = imgproc.crop(mask, Bbox(*bbox))

    self.bbox = geometry.Bbox(*bbox)
    self.polygon = polygon or self.bbox.polygon()
    self.mask = mask
    self.score = score
    self.class_label = class_label
    self.orig_shape = orig_shape
    self.data = data or {}

`global_mask` `property` ¶

The segment mask relative to the original input image.

Parameters:

Name	Type	Description	Default
`orig_shape`		Pass this argument to use another original shape than the segment's `orig_shape` attribute. Defaults to None.	required

`local_mask` `property` ¶

The segment mask relative to the bounding box (alias for self.mask)

`approximate_mask` ¶

A lower resolution version of the global mask

Parameters:

Name	Type	Description	Default
`ratio`	`float`	Size of approximate mask relative to the original.	required

Source code in src/htrflow/results.py

def approximate_mask(self, ratio: float) -> Mask | None:
    """A lower resolution version of the global mask

    Arguments:
        ratio: Size of approximate mask relative to the original.
    """
    global_mask = self.global_mask
    if global_mask is None:
        return None
    return imgproc.rescale(global_mask, ratio)

`rescale` ¶

Rescale the segment's mask, bounding box and polygon by factor

Source code in src/htrflow/results.py

def rescale(self, factor: float) -> None:
    """Rescale the segment's mask, bounding box and polygon by `factor`"""
    if self.mask is not None:
        self.mask = imgproc.rescale_linear(self.mask, factor)
    self.bbox = self.bbox.rescale(factor)
    if self.polygon is not None:
        self.polygon = self.polygon.rescale(factor)

Results¶

results ¶

RecognizedText dataclass ¶

top_candidate ¶

top_score ¶

Result ¶

bboxes property ¶

class_labels property ¶

confidences property ¶

global_masks property ¶

local_mask property ¶

polygons property ¶

text_result property ¶

texts property ¶

drop_indices ¶

filter ¶

reorder ¶

rescale ¶

segmentation_result classmethod ¶

text_recognition_result classmethod ¶

Segment ¶

global_mask property ¶

local_mask property ¶

approximate_mask ¶

rescale ¶

`results` ¶

`RecognizedText` `dataclass` ¶

`top_candidate` ¶

`top_score` ¶

`Result` ¶

`bboxes` `property` ¶

`class_labels` `property` ¶

`confidences` `property` ¶

`global_masks` `property` ¶

`local_mask` `property` ¶

`polygons` `property` ¶

`text_result` `property` ¶

`texts` `property` ¶

`drop_indices` ¶

`filter` ¶

`reorder` ¶

`rescale` ¶

`segmentation_result` `classmethod` ¶

`text_recognition_result` `classmethod` ¶

`Segment` ¶

`global_mask` `property` ¶

`local_mask` `property` ¶

`approximate_mask` ¶

`rescale` ¶