Add Model Evaluation panel callbacks for segmentation tasks #5332

Open

brimoor wants to merge 5 commits into develop from segmentation-callbacks2

Conversation

Contributor

@brimoor brimoor commented Jan 1, 2025

Change log

  • Adds class/confusion matrix callbacks when evaluating segmentations

TODO

  • Benchmark performance of using label IDs to implement load_view()
  • Investigate alternatives that don't require calling load_evaluation_results()? cf this comment
  • Figure out a way to show TP/FP/FN data in the ME panel?
  • Figure out a way to support TP/FP/FN filtering in the App sidebar?

Example usage

import random

import fiftyone as fo
import fiftyone.zoo as foz
import fiftyone.core.fields as fof
import fiftyone.utils.labels as foul

dataset = foz.load_zoo_dataset("quickstart", max_samples=10, shuffle=True)

model = foz.load_zoo_model("deeplabv3-resnet101-coco-torch")
dataset.apply_model(model, "resnet101")

model = foz.load_zoo_model("deeplabv3-resnet50-coco-torch")
dataset.apply_model(model, "resnet50")

CLASSES = (
    "background,aeroplane,bicycle,bird,boat,bottle,bus,car,cat,chair,cow," +
    "diningtable,dog,horse,motorbike,person,pottedplant,sheep,sofa,train," +
    "tvmonitor"
)
classes = CLASSES.split(",")
mask_targets = {idx: label for idx, label in enumerate(classes)}

rgb_mask_targets = {
    fof.int_to_hex(random.getrandbits(64)): label
    for label in CLASSES.split(",")
}

# Build a map from each class's integer target to its RGB hex target
_mask_targets = {v: k for k, v in mask_targets.items()}
_rgb_mask_targets = {v: k for k, v in rgb_mask_targets.items()}
targets_map = {_mask_targets[c]: _rgb_mask_targets[c] for c in classes}

dataset.clone_sample_field("resnet101", "resnet101_rgb")
foul.transform_segmentations(
    dataset,
    "resnet101_rgb",
    targets_map,
)

dataset.clone_sample_field("resnet50", "resnet50_rgb")
foul.transform_segmentations(
    dataset,
    "resnet50_rgb",
    targets_map,
)

dataset.mask_targets["resnet101"] = mask_targets
dataset.mask_targets["resnet50"] = mask_targets
dataset.mask_targets["resnet101_rgb"] = rgb_mask_targets
dataset.mask_targets["resnet50_rgb"] = rgb_mask_targets
dataset.save()

# Evaluation with int mask targets
dataset.evaluate_segmentations(
    "resnet50",
    gt_field="resnet101",
    eval_key="eval",
)

# Evaluation with RGB mask targets
dataset.evaluate_segmentations(
    "resnet50_rgb",
    gt_field="resnet101_rgb",
    eval_key="eval_rgb",
    mask_targets=dataset.mask_targets["resnet50_rgb"],
)

session = fo.launch_app(dataset)

Summary by CodeRabbit

  • New Features

    • Added color conversion utility functions in core fields module
    • Introduced new caching mechanism for datasets in server utilities
  • Improvements

    • Enhanced segmentation evaluation with more detailed result tracking
    • Streamlined segmentation evaluation code by utilizing library utility functions
  • Technical Updates

    • Implemented hex and integer color conversions (see the usage sketch below)
    • Added support for tracking pixel-wise matches in segmentation evaluation
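
A quick sketch of the new color conversion utilities in use (module path and function names per the diff summary below; the round-trip behavior is assumed from the implementations shown later in this thread):

import numpy as np

import fiftyone.core.fields as fof

# Scalar round trip between "#RRGGBB" strings and 24-bit integers
value = fof.hex_to_int("#ff6d04")  # 16739588
assert fof.int_to_hex(value) == "#ff6d04"

# Array round trip between (H, W, 3) RGB masks and 2D integer masks
rgb_mask = np.zeros((4, 4, 3), dtype=np.uint8)
rgb_mask[..., 0] = 255  # solid red
int_mask = fof.rgb_array_to_int(rgb_mask)  # shape (4, 4), values 0xFF0000
assert np.array_equal(fof.int_array_to_rgb(int_mask), rgb_mask)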

@brimoor brimoor requested review from imanjra and prernadh January 1, 2025 00:32
Contributor

coderabbitai bot commented Jan 1, 2025

Walkthrough

This pull request introduces enhancements to color data handling and segmentation evaluation across multiple files in the FiftyOne library. The changes include new utility functions for color conversion in fields.py, improvements to segmentation evaluation in segmentation.py, and a new dataset caching mechanism in server/utils.py. The modifications focus on providing more robust color processing capabilities and streamlining the evaluation process for segmentation tasks.

Changes

  • fiftyone/core/fields.py: Added 4 new color conversion functions: hex_to_int(), int_to_hex(), rgb_array_to_int(), int_array_to_rgb()
  • fiftyone/utils/eval/segmentation.py: Updated the SimpleEvaluation and SegmentationResults classes to include matches tracking and refactored color conversion methods
  • fiftyone/server/utils.py: Added a new cache_dataset() function for direct dataset caching

Sequence Diagram

sequenceDiagram
    participant User
    participant Fields
    participant Segmentation
    participant Server

    User->>Fields: Convert color formats
    Fields-->>User: Return converted colors
    User->>Segmentation: Evaluate segmentation
    Segmentation->>Segmentation: Track pixel matches
    Segmentation-->>User: Return evaluation results
    User->>Server: Cache dataset
    Server-->>User: Confirm dataset cached

Suggested labels

enhancement

Suggested reviewers

  • imanjra
  • Br2850

Poem

🐰 Colors dance, integers prance,
In FiftyOne's magical advance,
Masks convert with rabbit's might,
Segmentation shines so bright!
Code hops forward with glee 🌈

@brimoor brimoor changed the title Add Model Evaluation panel callbacks for segmentation tasks (Option 1) Add Model Evaluation panel callbacks for segmentation tasks (Option 2) Jan 1, 2025
@brimoor brimoor force-pushed the segmentation-callbacks2 branch from 33a2dac to 33f06a8 Compare January 1, 2025 00:35
Contributor

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 0

🧹 Nitpick comments (5)
plugins/panels/model_evaluation.py (4)

17-17: Unused import
import fiftyone.core.fields as fof is never referenced. Consider removing it to satisfy linters and reduce clutter.

🧰 Tools
🪛 Ruff (0.8.2)

17-17: fiftyone.core.fields imported but unused

Remove unused import: fiftyone.core.fields

(F401)


485-485: mask_targets2 assigned.
The variable is set here but appears to be overwritten later, leading to redundant assignments.


491-491: mask_targets2 is never effectively used.
Kindly remove or integrate it if necessary; currently it generates lint warnings.

🧰 Tools
🪛 Ruff (0.8.2)

491-491: Local variable mask_targets2 is assigned to but never used

Remove assignment to unused variable mask_targets2

(F841)


685-734: _init_segmentation_results: assembling ID dictionaries.
This function modifies the passed-in results object to map (i, j) pairs to lists of IDs. Be cautious about naming collisions if this is run multiple times; consider clearing any stale data.
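
A minimal sketch of the kind of (i, j) → ID index being described, with attribute names borrowed from elsewhere in this PR; the helper name is hypothetical, and rebuilding the dictionaries from scratch on each call addresses the stale-data concern:

from collections import defaultdict

def _index_ids_by_cell(results):
    """Hypothetical helper: maps each confusion matrix cell (i, j) to the
    lists of ground truth/predicted label IDs that contributed to it."""
    classes_map = {c: k for k, c in enumerate(results.classes)}

    ytrue_ids_by_cell = defaultdict(list)
    ypred_ids_by_cell = defaultdict(list)

    for yt, yp, yt_id, yp_id in zip(
        results.ytrue, results.ypred, results.ytrue_ids, results.ypred_ids
    ):
        cell = (classes_map[yt], classes_map[yp])
        if yt_id is not None:
            ytrue_ids_by_cell[cell].append(yt_id)
        if yp_id is not None:
            ypred_ids_by_cell[cell].append(yp_id)

    return dict(ytrue_ids_by_cell), dict(ypred_ids_by_cell)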

fiftyone/core/fields.py (1)

1627-1636: Implementation of hex_to_int.
Simple bit-shift logic is correct. Provide error handling for malformed hex strings if user input is allowed.

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between fddb6a4 and 33a2dac.

📒 Files selected for processing (3)
  • fiftyone/core/fields.py (1 hunks)
  • fiftyone/utils/eval/segmentation.py (11 hunks)
  • plugins/panels/model_evaluation.py (5 hunks)
🧰 Additional context used
🪛 Ruff (0.8.2)
plugins/panels/model_evaluation.py

17-17: fiftyone.core.fields imported but unused

Remove unused import: fiftyone.core.fields

(F401)


491-491: Local variable mask_targets2 is assigned to but never used

Remove assignment to unused variable mask_targets2

(F841)

🔇 Additional comments (25)
fiftyone/utils/eval/segmentation.py (13)

11-11: The added import is properly used for creating repeat iterators.
No issues identified; it is used in the _from_dict method when handling “no ID” scenarios.


337-337: Validate the hex string keys for consistent usage.
This dictionary comprehension properly converts hex string keys to integers. Consider verifying that all user-supplied hex strings follow the #RRGGBB format before conversion to avoid potential ValueError.
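
A small sketch of the suggested validation, assuming a dict keyed by hex strings like the rgb_mask_targets in the example above (helper name is illustrative):

import re

_HEX_COLOR = re.compile(r"^#[0-9a-fA-F]{6}$")

def _validate_hex_keys(mask_targets):
    # Fail fast with a clear message before converting keys via hex_to_int()
    bad = [k for k in mask_targets if not _HEX_COLOR.match(str(k))]
    if bad:
        raise ValueError(
            "Invalid hex color key(s) %s; expected '#RRGGBB' format" % bad
        )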


353-353: Initialization of the matches list.
No issues here; it neatly collects pixel-wise mapping data for subsequent analysis.


396-406: Appending match details for segmentation.
This loop accurately records ground truth/prediction associations and pixel counts. However, this can grow large for massive images or datasets. Be mindful of memory usage if used repeatedly in large-scale evaluations.
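
A hypothetical sketch of the bookkeeping being described, using the (gt_label, pred_label, pixel_count, gt_id, pred_id) tuple structure documented later in the diff; the function and argument names are illustrative, not the PR's actual code:

import numpy as np

def _collect_matches(sample_conf_mat, classes, gt_id, pred_id):
    # One tuple per nonzero confusion matrix cell for the current sample
    matches = []
    for i, j in zip(*np.nonzero(sample_conf_mat)):
        matches.append(
            (classes[i], classes[j], int(sample_conf_mat[i, j]), gt_id, pred_id)
        )
    return matches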


440-440: Passing the newly built matches to SegmentationResults.
Clean approach to provide the collected matches in the results constructor.


455-457: Docstring accurately reflects the new matches field.
The description matches the tuple structure from the evaluation loop.


469-469: New matches parameter defaults to None.
This is a good backward-compatible signature update.


Line range hint 475-492: Conditional handling of matches.
The fallback to parse pixel_confusion_matrix when matches is None ensures compatibility with legacy workflows. Watch for potential ValueError if ytrue, ypred, weights do not align in length.


510-529: Consistent _from_dict logic for matches.
Correctly handles both new and legacy (no IDs) formats, merging them into a uniform list of tuples.


534-534: Passing the reconstructed matches in _from_dict.
Good consistency with the constructor.


594-597: RGB to integer masking for dimensional consistency.
Properly uses fof.rgb_array_to_int to handle multi-channel arrays.


670-670: Use of new utility for RGB array conversion.
Reusing fof.rgb_array_to_int avoids code duplication.


677-677: Generating hex class labels from integer-based values.
Efficient approach for color-coded segmentation classes.

plugins/panels/model_evaluation.py (8)

13-13: Necessary import for ObjectId usage.
This is used in _to_object_ids.


350-358: Loading segmentation results and initializing them.
Assigning mask_targets and calling _init_segmentation_results is a clear approach to unify the data before proceeding with metrics. Make sure to handle any potential logging or warnings if results are partially missing.


596-611: Segmentations with legacy format.
Early returns handle older data where IDs don’t exist. Ensure end users receive a clear message if early-exiting leads to partial data in the UI.


612-664: Match expressions for segmentation subviews.
This logic effectively filters segmentation results by class/matrix cell. It might be beneficial to confirm performance on large datasets, as multiple .is_in() calls could be costly.


736-752: _get_segmentation_class_ids: retrieving matching IDs by class.
Check for key existence in results._classes_map[x] to avoid KeyError if x is not recognized.


755-760: _get_segmentation_conf_mat_ids: confusion matrix IDs.
Straightforward approach to isolate matches. This is well-structured.


762-780: _get_segmentation_tp_fp_fn_ids: basic classification logic for pixel-level segmentation.
The approach is consistent with typical definitions of TP, FP, and FN. If large sets are expected, consider memory usage.


782-783: _to_object_ids: converting string IDs to ObjectId.
Simple utility that is helpful for consistent typed usage. Ensure _id is always a valid string to avoid conversion errors.

fiftyone/core/fields.py (4)

1624-1625: hex_to_int function declaration and docstring.
Docstring is clear; confirm that input always starts with '#' and contains exactly 6 hex characters.


1639-1652: int_to_hex: Reverse conversion from int to hex.
Logic is standard. No issues observed.


1654-1668: rgb_array_to_int: Transforming 3D RGB arrays to 2D integer arrays.
The use of NumPy bit-shifts is efficient and readable. Ensure mask is always [..., 3] shape or raise warnings.


1670-1684: int_array_to_rgb: Restoring 3D RGB arrays from integer-based masks.
Works in parallel with rgb_array_to_int. Usage of np.stack is correct.

Comment on lines +455 to +457
matches (None): a list of
``(gt_label, pred_label, pixel_count, gt_id, pred_id)``
matches
Contributor

I like this data model better - reads cleaner and like you said avoids calling _parse_confusion_matrix when present.

Comment on lines +517 to +524
if ytrue_ids is None:
    ytrue_ids = itertools.repeat(None)

if ypred_ids is None:
    ypred_ids = itertools.repeat(None)

Contributor

Suggested change
if ytrue_ids is None:
    ytrue_ids = itertools.repeat(None)
if ypred_ids is None:
    ypred_ids = itertools.repeat(None)

Contributor

Nit: Won't need the None check here because of the previous if-statement

Contributor Author

Actually they are required:

In previous versions of this code, ytrue, ypred, and weights were not persisted as properties of SegmentationResults. If a user loads such a pre-existing segmentation evaluation and then calls results.save(), this will create a new version of the results that does persist ytrue, ypred and weights (as parsed from _parse_confusion_matrix()). However, there still won't be ytrue_ids and ypred_ids for these results, so these if statements are needed to ensure that the next time these results are loaded, we'll be able to construct the matches object.

Contributor

Makes sense!
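
For reference, a minimal sketch of the zip pattern these checks enable; the function name is hypothetical and the tuple order follows the matches structure documented above:

import itertools

def _rebuild_matches(ytrue, ypred, weights, ytrue_ids=None, ypred_ids=None):
    # Legacy results persisted before this PR have no label IDs, so fall
    # back to endless None placeholders that zip cleanly with the other
    # per-match sequences
    if ytrue_ids is None:
        ytrue_ids = itertools.repeat(None)

    if ypred_ids is None:
        ypred_ids = itertools.repeat(None)

    return list(zip(ytrue, ypred, weights, ytrue_ids, ypred_ids))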

Comment on lines 636 to 637
expr = F(gt_id).is_in(ytrue_ids)
expr &= F(pred_id).is_in(ypred_ids)
Contributor

Mmmm, I see this expression should evaluate much faster than select_labels since you are specifying which labels to look for in which fields. Nice

Contributor Author

Exactly!
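
For context, a hedged sketch contrasting the two approaches; the field names, ID values, and helper are illustrative, not taken from this PR:

from bson import ObjectId

from fiftyone import ViewField as F

def _compare_approaches(dataset, gt_ids, pred_ids):
    # select_labels() must search the given field(s) for the label IDs
    selected = dataset.select_labels(ids=gt_ids, fields="ground_truth")

    # Matching on ID membership in known fields (the approach used here)
    # tells the database exactly where to look
    expr = F("ground_truth._id").is_in([ObjectId(_id) for _id in gt_ids])
    expr &= F("predictions._id").is_in([ObjectId(_id) for _id in pred_ids])
    matched = dataset.match(expr)

    return selected, matched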

Comment on lines 761 to 776
if field == "tp":
    # True positives
    inds = results.ytrue == results.ypred
    ytrue_ids = _to_object_ids(results.ytrue_ids[inds])
    ypred_ids = _to_object_ids(results.ypred_ids[inds])
    return ytrue_ids, ypred_ids
elif field == "fn":
    # False negatives
    inds = results.ypred == results.missing
    ytrue_ids = _to_object_ids(results.ytrue_ids[inds])
    return ytrue_ids, None
else:
    # False positives
    inds = results.ytrue == results.missing
    ypred_ids = _to_object_ids(results.ypred_ids[inds])
    return None, ypred_ids
Contributor

Would we want to move this tp/fp/fn calculation to utils/eval/segmentation.py and make it a sample level field so we can filter on it - similar to detections?

Contributor Author

I'm not sure what we'd store as a sample-level field. The TP/FP/FN designation applies to each region in the segmentation mask, so there would usually be multiple per sample (as in object detection tasks, for example), but the mask is stored in a single Segmentation field (unlike object detection).

Contributor

Hmm, I see what you are saying - but I guess my confusion is arising from the fact that if we are marking labels as TP/FP/FN, we should be able to filter by them at the sample level too.

Contributor Author

I agree that it would be nice to support more sample-level filtering, but I don't know what to do!

Contributor

Yep understood. Makes sense to leave as is for now then. Maybe something to discuss with the ML team

@prernadh prernadh self-requested a review January 2, 2025 17:41
Contributor

@prernadh prernadh left a comment

Just a question about high level TP/FP/FN design

Comment on lines 639 to 661
elif view_type == "field":
    if field == "tp":
        # All true positives
        ytrue_ids, ypred_ids = _get_segmentation_tp_fp_fn_ids(
            results, field
        )
        expr = F(gt_id).is_in(ytrue_ids)
        expr &= F(pred_id).is_in(ypred_ids)
        view = eval_view.match(expr)
    elif field == "fn":
        # All false negatives
        ytrue_ids, _ = _get_segmentation_tp_fp_fn_ids(
            results, field
        )
        expr = F(gt_id).is_in(ytrue_ids)
        view = eval_view.match(expr)
    else:
        # All false positives
        _, ypred_ids = _get_segmentation_tp_fp_fn_ids(
            results, field
        )
        expr = F(pred_id).is_in(ypred_ids)
        view = eval_view.match(expr)
Contributor

We currently don't display TP/FP/FN in the summary table - my understanding is that the get_tp_fp_fn function will have to be updated for segmentations if we ever want to reach this section of the code.

Contributor Author

@brimoor brimoor Jan 2, 2025

Yeah I went ahead and implemented the callback so it would be available if we found a reasonable way to show TP/FP/FN icons in the panel. Not quite sure what the best way to show this info would be though.

Contributor

Oh I understand now! Realized you had also left a comment on the other PR explaining this. Apologies

@@ -424,6 +437,7 @@ def evaluate_samples(
eval_key,
confusion_matrix,
classes,
matches=matches,
Contributor

Suggested change
matches=matches,
matches=matches if matches!=[] else None,

Contributor

This line is causing tests to fail

Contributor Author

Good catch! Just fixed this in a slightly different way, for consistency with how object detection handles this:

if matches:
    ytrue, ypred, ious, confs, ytrue_ids, ypred_ids = zip(*matches)
else:
    ytrue, ypred, ious, confs, ytrue_ids, ypred_ids = (
        [],
        [],
        [],
        [],
        [],
        [],
    )

@brimoor brimoor force-pushed the segmentation-callbacks2 branch from 33f06a8 to 5e45561 Compare January 2, 2025 23:44
@prernadh prernadh self-requested a review January 3, 2025 02:14
Contributor

@prernadh prernadh left a comment

LGTM! 😄

@brimoor brimoor changed the title Add Model Evaluation panel callbacks for segmentation tasks (Option 2) Add Model Evaluation panel callbacks for segmentation tasks Jan 10, 2025
Contributor

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 0

🧹 Nitpick comments (4)
fiftyone/server/utils.py (2)

52-65: Implementation looks good but needs additional validation.

The function is well-documented and serves a clear purpose in preventing garbage collection of dataset singletons between async calls.

Consider adding these validations for robustness:

 def cache_dataset(dataset):
     """Caches the given dataset.
 
     This method ensures that subsequent calls to
     :func:`fiftyone.core.dataset.load_dataset` in async calls will return this
     dataset singleton.
 
     See :meth:`load_and_cache_dataset` for additional details.
 
     Args:
         dataset: a :class:`fiftyone.core.dataset.Dataset`
+
+    Raises:
+        ValueError: if the dataset is None or not a Dataset instance
+        ValueError: if the dataset name is None
     """
+    if dataset is None or not isinstance(dataset, fod.Dataset):
+        raise ValueError("Expected a Dataset instance, but got %r" % dataset)
+
+    if dataset.name is None:
+        raise ValueError("Cannot cache dataset with None name")
+
     _cache[dataset.name] = dataset

52-65: Consider architectural improvements for the caching mechanism.

The current TTL cache configuration (maxsize=10, ttl=900s) might need adjustment based on usage patterns:

  1. Limited cache size could lead to premature evictions in high-concurrency scenarios
  2. No monitoring of cache effectiveness

Consider these improvements:

  1. Make cache size and TTL configurable via environment variables
  2. Add cache statistics/metrics for monitoring
  3. Implement a more sophisticated eviction strategy based on dataset size and access patterns
  4. Add logging for cache hits/misses to help tune the configuration

Would you like me to propose a detailed implementation for any of these improvements?
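
A minimal sketch of the first suggestion (configurable size/TTL); the environment variable names are hypothetical and cachetools is used for illustration:

import os

from cachetools import TTLCache

# Fall back to the current hard-coded values (maxsize=10, ttl=900s) when the
# hypothetical environment variables are not set
_CACHE_SIZE = int(os.environ.get("FIFTYONE_SERVER_CACHE_SIZE", 10))
_CACHE_TTL = int(os.environ.get("FIFTYONE_SERVER_CACHE_TTL", 900))

_cache = TTLCache(maxsize=_CACHE_SIZE, ttl=_CACHE_TTL)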

plugins/panels/model_evaluation.py (2)

17-17: Remove unused import.

The import fiftyone.core.fields is not used in the code.

-import fiftyone.core.fields as fof
🧰 Tools
🪛 Ruff (0.8.2)

17-17: fiftyone.core.fields imported but unused

Remove unused import: fiftyone.core.fields

(F401)


687-748: Consider adding error handling for invalid mask targets.

The initialization logic is robust, but it might benefit from additional error handling when dealing with mask targets.

Consider adding validation:

 def _init_segmentation_results(dataset, results, gt_field):
     if results.ytrue_ids is None or results.ypred_ids is None:
         # Legacy format segmentations
         return

     if getattr(results, "_classes_map", None):
         # Already initialized
         return

     fosu.cache_dataset(dataset)

     classes_map = {c: i for i, c in enumerate(results.classes)}

     mask_targets = _get_mask_targets(dataset, gt_field)
     if mask_targets is not None:
+        # Validate mask targets
+        if not isinstance(mask_targets, dict):
+            raise ValueError(f"Expected dict for mask_targets, got {type(mask_targets)}")
+
         mask_targets = {str(k): v for k, v in mask_targets.items()}
         classes = [mask_targets.get(c, c) for c in results.classes]
         classes_map.update({c: i for i, c in enumerate(classes)})
📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 5e45561 and d5fc685.

📒 Files selected for processing (2)
  • fiftyone/server/utils.py (1 hunks)
  • plugins/panels/model_evaluation.py (4 hunks)
🧰 Additional context used
🪛 Ruff (0.8.2)
plugins/panels/model_evaluation.py

17-17: fiftyone.core.fields imported but unused

Remove unused import: fiftyone.core.fields

(F401)

🔇 Additional comments (7)
plugins/panels/model_evaluation.py (7)

340-348: LGTM! Well-structured initialization of segmentation results.

The code properly initializes segmentation-specific data by getting mask targets and initializing results.


582-653: LGTM! Comprehensive segmentation view filtering.

The implementation handles all necessary cases for segmentation tasks:

  • Class-based filtering
  • Confusion matrix cell filtering
  • TP/FP/FN filtering

676-684: LGTM! Clear mask target retrieval logic.

The function follows a clear precedence order for retrieving mask targets.


750-767: LGTM! Efficient class ID retrieval.

The function efficiently retrieves ground truth and predicted IDs for a given class.


769-774: LGTM! Clear confusion matrix ID retrieval.

The function provides a straightforward way to get IDs for specific confusion matrix cells.


777-794: LGTM! Comprehensive TP/FP/FN handling.

The function handles all cases for true positives, false positives, and false negatives correctly.


796-797: LGTM! Simple ObjectId conversion.

The utility function correctly converts string IDs to BSON ObjectIds.

@brimoor brimoor force-pushed the segmentation-callbacks2 branch from d5fc685 to 7e45bd2 Compare January 12, 2025 03:50
Contributor

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 4

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between d5fc685 and 7e45bd2.

📒 Files selected for processing (2)
  • fiftyone/core/fields.py (1 hunks)
  • fiftyone/server/utils.py (1 hunks)
🚧 Files skipped from review as they are similar to previous changes (1)
  • fiftyone/server/utils.py
⏰ Context from checks skipped due to timeout of 90000ms (5)
  • GitHub Check: lint / eslint
  • GitHub Check: e2e / test-e2e
  • GitHub Check: build / build
  • GitHub Check: test / test-app
  • GitHub Check: build
🔇 Additional comments (1)
fiftyone/core/fields.py (1)

1624-1684: Add unit tests for the new color conversion functions.

The new color conversion functions need comprehensive unit tests to verify their behavior with:

  • Valid inputs
  • Edge cases (e.g., black, white, primary colors)
  • Invalid inputs (to verify error handling)

Would you like me to generate a comprehensive test suite for these functions?
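
A minimal pytest-style sketch of such tests, exercising round trips for the functions shown in this PR (black, white, a primary color, and a random RGB mask):

import numpy as np

import fiftyone.core.fields as fof

def test_hex_int_round_trip():
    for hex_str in ("#000000", "#ffffff", "#ff0000", "#ff6d04"):
        assert fof.int_to_hex(fof.hex_to_int(hex_str)) == hex_str

def test_rgb_array_round_trip():
    mask = np.random.randint(0, 256, size=(4, 5, 3), dtype=np.uint8)
    int_mask = fof.rgb_array_to_int(mask)
    assert int_mask.shape == (4, 5)
    np.testing.assert_array_equal(fof.int_array_to_rgb(int_mask), mask)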

Comment on lines +1670 to +1684
def int_array_to_rgb(mask):
    """Converts a 2D hex integer mask array to an RGB mask array.

    Args:
        mask: a 2D integer mask array

    Returns:
        an RGB mask array
    """
    return np.stack(
        [(mask >> 16) & 255, (mask >> 8) & 255, mask & 255],
        axis=2,
        dtype=np.uint8,
    )

Contributor

🛠️ Refactor suggestion

Add input validation and type hints for numpy array.

The function should validate the input array's shape and type to prevent runtime errors.

Add validation and type hints:

-def int_array_to_rgb(mask):
+def int_array_to_rgb(mask: np.ndarray) -> np.ndarray:
+    if not isinstance(mask, np.ndarray):
+        raise TypeError("Input must be a numpy array")
+    if mask.ndim != 2:
+        raise ValueError("Input array must be 2-dimensional")
+    if not np.issubdtype(mask.dtype, np.integer):
+        raise TypeError("Input array must contain integers")
+
     return np.stack(
         [(mask >> 16) & 255, (mask >> 8) & 255, mask & 255],
         axis=2,
         dtype=np.uint8,
     )

Comment on lines +1639 to +1652
def int_to_hex(value):
    """Converts an RRGGBB integer value to hex string like `"#ff6d04"`.

    Args:
        value: an integer value

    Returns:
        a hex string
    """
    r = (value >> 16) & 255
    g = (value >> 8) & 255
    b = value & 255
    return "#%02x%02x%02x" % (r, g, b)

Contributor

🛠️ Refactor suggestion

Add input validation for integer range.

The function should validate that the input is a non-negative integer within the valid color range.

Add validation before processing:

 def int_to_hex(value):
+    if not isinstance(value, int) or value < 0 or value > 0xFFFFFF:
+        raise ValueError("Invalid color value. Must be an integer between 0 and 16777215 (0xFFFFFF)")
+
     r = (value >> 16) & 255
     g = (value >> 8) & 255
     b = value & 255
     return "#%02x%02x%02x" % (r, g, b)

Comment on lines +1624 to +1637
def hex_to_int(hex_str):
    """Converts a hex string like `"#ff6d04"` to a hex integer.

    Args:
        hex_str: a hex string

    Returns:
        an integer
    """
    r = int(hex_str[1:3], 16)
    g = int(hex_str[3:5], 16)
    b = int(hex_str[5:7], 16)
    return (r << 16) + (g << 8) + b

Contributor

🛠️ Refactor suggestion

Add input validation for hex string format.

The function should validate that the input is a valid hex color string to prevent runtime errors.

Add validation before processing:

 def hex_to_int(hex_str):
+    if not isinstance(hex_str, str) or not re.match(r'^#[0-9a-fA-F]{6}$', hex_str):
+        raise ValueError("Invalid hex color string. Expected format: '#RRGGBB'")
+
     r = int(hex_str[1:3], 16)
     g = int(hex_str[3:5], 16)
     b = int(hex_str[5:7], 16)
     return (r << 16) + (g << 8) + b

Committable suggestion skipped: line range outside the PR's diff.

Comment on lines +1654 to +1668
def rgb_array_to_int(mask):
    """Converts an RGB mask array to a 2D hex integer mask array.

    Args:
        mask: an RGB mask array

    Returns:
        a 2D integer mask array
    """
    return (
        np.left_shift(mask[:, :, 0], 16, dtype=int)
        + np.left_shift(mask[:, :, 1], 8, dtype=int)
        + mask[:, :, 2]
    )

Contributor

🛠️ Refactor suggestion

Add input validation and type hints for numpy array.

The function should validate the input array's shape and type to prevent runtime errors.

Add validation and type hints:

-def rgb_array_to_int(mask):
+def rgb_array_to_int(mask: np.ndarray) -> np.ndarray:
+    if not isinstance(mask, np.ndarray):
+        raise TypeError("Input must be a numpy array")
+    if mask.ndim != 3 or mask.shape[2] != 3:
+        raise ValueError("Input array must have shape (H, W, 3)")
+    if not np.issubdtype(mask.dtype, np.integer):
+        raise TypeError("Input array must contain integers")
+
     return (
         np.left_shift(mask[:, :, 0], 16, dtype=int)
         + np.left_shift(mask[:, :, 1], 8, dtype=int)
         + mask[:, :, 2]
     )
