fix(embedded): adding logic to check dataset used by filters #24808

Vitor-Avila · 2023-07-26T01:50:49Z

SUMMARY

When granting dashboard access to a guest user, it's only granted access to datasets used by its charts. If the dashboard has any native filters powered by datasets that aren't used by any chart, the filter wouldn't load with a permission error. This PR changes this logic to also allow access to datasets used by filters.

BEFORE/AFTER SCREENSHOTS OR ANIMATED GIF

Before

After

TESTING INSTRUCTIONS

Create a chart using any dataset.
Save the chart and add it to a dashboard.
Create a virtual dataset for the same table (a select * ... would be enough).
Create a dashboard filter using the virtual dataset.
Enable embedded access for the dashboard.
Create a guest_token and grant access to this dashboard.
Access this dashboard in embedded mode.
Validate that the dashboard filter loads properly.

ADDITIONAL INFORMATION

Has associated issue: Fixes Guest user (embedded) doesn't get access to datasets used only in dashboard filters #24807
Required feature flags: EMBEDDED_SUPERSET = True
Changes UI
Includes DB Migration (follow approval process in SIP-59)
- Migration is atomic, supports rollback & is backwards-compatible
- Confirm DB migration upgrade and downgrade tested
- Runtime estimates and downtime expectations provided
Introduces new feature or API
Removes existing feature or API

codecov · 2023-07-26T02:01:34Z

Codecov Report

Merging #24808 (aa2c6f7) into master (c17accc) will increase coverage by 10.44%.
Report is 20 commits behind head on master.
The diff coverage is 73.63%.

❗ Current head aa2c6f7 differs from pull request most recent head 7668aec. Consider uploading reports for the commit 7668aec to get more accurate results

@@             Coverage Diff             @@
##           master   #24808       +/-   ##
===========================================
+ Coverage   58.40%   68.85%   +10.44%     
===========================================
  Files        1902     1903        +1     
  Lines       73996    74089       +93     
  Branches     8195     8194        -1     
===========================================
+ Hits        43220    51013     +7793     
+ Misses      28657    20955     -7702     
- Partials     2119     2121        +2

Flag	Coverage Δ
hive	`?`
mysql	`79.21% <71.96%> (?)`
postgres	`79.31% <71.96%> (?)`
presto	`?`
python	`83.05% <73.64%> (+21.78%)`	⬆️
sqlite	`77.88% <71.12%> (?)`
unit	`54.97% <50.62%> (+0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Files Changed	Coverage Δ
...et-chart-deckgl/src/layers/Geojson/controlPanel.ts	`50.00% <ø> (ø)`
...egacy-preset-chart-deckgl/src/layers/Path/Path.jsx	`0.00% <ø> (ø)`
...reset-chart-deckgl/src/layers/Path/controlPanel.ts	`50.00% <ø> (ø)`
...preset-chart-deckgl/src/layers/Polygon/Polygon.jsx	`0.00% <ø> (ø)`
...et-chart-deckgl/src/layers/Polygon/controlPanel.ts	`33.33% <ø> (ø)`
...reset-chart-deckgl/src/utilities/Shared_DeckGL.jsx	`86.48% <ø> (ø)`
superset-frontend/src/SqlLab/App.jsx	`0.00% <ø> (ø)`
...d/src/SqlLab/components/SaveDatasetModal/index.tsx	`60.63% <ø> (+10.63%)`	⬆️
...frontend/src/SqlLab/components/SouthPane/index.tsx	`79.54% <ø> (ø)`
superset-frontend/src/SqlLab/fixtures.ts	`100.00% <ø> (ø)`
... and 109 more

... and 222 files with indirect coverage changes

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

eschutho

LGTM, thanks @Vitor-Avila!

john-bodley · 2023-07-27T16:32:20Z

superset/security/manager.py

+                            for target in filter_.get("targets", [])
+                        ]
+                        if datasource.id in filter_dataset_ids:
+                            exists = True


Suggested change

exists = True

return True

We should short circuit when possible.

Makes perfect sense! Made this change.

john-bodley · 2023-07-27T16:33:38Z

superset/security/manager.py

+                            exists = True
+                except ValueError:
+                    pass
+
        return exists


Probably cleaner on line #2066 to have:

if db.session.query(query.exists()).scalar(): return True

and then on line #2088 have

return False

Here as well.

john-bodley · 2023-07-27T16:35:17Z

superset/security/manager.py

@@ -2063,6 +2064,27 @@ def can_access_based_on_dashboard(datasource: "BaseDatasource") -> bool:
        )

        exists = db.session.query(query.exists()).scalar()
+
+        # check for datasets that are only used by filters


I think this logic makes sense. The one thing I struggle with regarding this method (for both the existing and proposed logic) is how is agnostic of the specific dashboard in question and thus iterates over all dashboards said user has access to. This raises two questions i) correctness, and ii) efficiency.

Currently I can't formulate a situation where (i) is a problem, however for (ii) this method seems highly inefficient, e.g. we loop over all the dashboards a user has access to in relation to said dataset/datasource, whereas in actuality we likely know the context a priori.

Note in #24789 this method is slated for removal, but the addition of the integration test will ensure that the logic will be preserved.

@john-bodley I totally agree that this approach is far from ideal from a performance point of view. I think with the bigger changes that are going to be discussed, we can reformulate this process and make sure this validation happens with a target dashboard ID in mind. I saw in #24804 that you have updated the raise_for_access function so it can also receive a dashboard, so I think it would be easier to implement this improvement once those changes (and other decisions made in regards to expected behavior) are made.

Personally this is my second contribution to Superset so I would rather avoid doing bigger estructural changes until I get more familiar with the code as a whole.

eschutho · 2023-07-28T20:15:10Z

@john-bodley thanks for the feedback! We'll wait for your approval before merging.

…24808) (cherry picked from commit 7f9b038)

sadpandajoe · 2023-08-01T16:52:00Z

🏷️ preset:2023.31

…pache#24808)" This reverts commit 7f9b038.

(cherry picked from commit 7f9b038)

…ters (#24808) (#24892)

…ters (#24808) (#24892) (cherry picked from commit 9f7f2c6)

…d by filters (apache#24808) (apache#24892)" This reverts commit 9f7f2c6.

…24808)

…ters (apache#24808) (apache#24892)

Vitor-Avila added 5 commits July 25, 2023 14:28

adding logic to check dataset used by filters

e39f4aa

handling exceptions

27afd2c

Adding integration test

e5ae33b

simplifying role

36fa0c3

Improving test logic

6dffe8a

pull-request-size bot added the size/M label Jul 26, 2023

fixing tests

5579a93

betodealmeida mentioned this pull request Jul 26, 2023

fix: Dashboard aware RBAC dataset permission #24789

Merged

9 tasks

eschutho approved these changes Jul 27, 2023

View reviewed changes

john-bodley reviewed Jul 27, 2023

View reviewed changes

Vitor-Avila added 2 commits July 28, 2023 02:39

Addressing PR feedback

bcf1fc5

Small changes

7668aec

john-bodley approved these changes Jul 31, 2023

View reviewed changes

john-bodley merged commit 7f9b038 into apache:master Jul 31, 2023

michael-s-molina added the v3.0 Label added by the release manager to track PRs to be included in the 3.0 branch label Aug 1, 2023

sadpandajoe pushed a commit to preset-io/superset that referenced this pull request Aug 1, 2023

fix(embedded): adding logic to check dataset used by filters (apache#…

c730900

…24808) (cherry picked from commit 7f9b038)

john-bodley added a commit to john-bodley/superset that referenced this pull request Aug 4, 2023

Revert "fix(embedded): adding logic to check dataset used by filters (a…

2d00c63

…pache#24808)" This reverts commit 7f9b038.

john-bodley mentioned this pull request Aug 4, 2023

fix: revert "fix(embedded): adding logic to check dataset used by filters (#24808) #24892

Merged

9 tasks

michael-s-molina pushed a commit that referenced this pull request Aug 4, 2023

fix(embedded): adding logic to check dataset used by filters (#24808)

bbe4e01

(cherry picked from commit 7f9b038)

john-bodley added a commit that referenced this pull request Aug 4, 2023

fix: revert "fix(embedded): adding logic to check dataset used by fil…

9f7f2c6

…ters (#24808) (#24892)

john-bodley mentioned this pull request Aug 4, 2023

chore: Refactor dashboard security access #24804

Merged

9 tasks

michael-s-molina pushed a commit that referenced this pull request Aug 7, 2023

fix: revert "fix(embedded): adding logic to check dataset used by fil…

215b3b5

…ters (#24808) (#24892) (cherry picked from commit 9f7f2c6)

jinghua-qa added a commit to preset-io/superset that referenced this pull request Aug 16, 2023

Revert "fix: revert "fix(embedded): adding logic to check dataset use…

96bc3c2

…d by filters (apache#24808) (apache#24892)" This reverts commit 9f7f2c6.

mistercrunch added 🍒 3.0.0 🍒 3.0.1 🍒 3.0.2 🍒 3.0.3 labels Mar 8, 2024

mistercrunch added 🍒 3.0.4 🏷️ bot A label used by `supersetbot` to keep track of which PR where auto-tagged with release labels 🚢 3.1.0 labels Mar 8, 2024

vinothkumar66 pushed a commit to vinothkumar66/superset that referenced this pull request Nov 11, 2024

fix(embedded): adding logic to check dataset used by filters (apache#…

3969b39

…24808)

vinothkumar66 pushed a commit to vinothkumar66/superset that referenced this pull request Nov 11, 2024

fix: revert "fix(embedded): adding logic to check dataset used by fil…

ac740c5

…ters (apache#24808) (apache#24892)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(embedded): adding logic to check dataset used by filters #24808

fix(embedded): adding logic to check dataset used by filters #24808

Vitor-Avila commented Jul 26, 2023

codecov bot commented Jul 26, 2023 •

edited

Loading

eschutho left a comment

john-bodley Jul 27, 2023

Vitor-Avila Jul 28, 2023

john-bodley Jul 27, 2023

Vitor-Avila Jul 28, 2023

john-bodley Jul 27, 2023 •

edited

Loading

Vitor-Avila Jul 28, 2023

eschutho commented Jul 28, 2023

sadpandajoe commented Aug 1, 2023

fix(embedded): adding logic to check dataset used by filters #24808

fix(embedded): adding logic to check dataset used by filters #24808

Conversation

Vitor-Avila commented Jul 26, 2023

SUMMARY

BEFORE/AFTER SCREENSHOTS OR ANIMATED GIF

Before

After

TESTING INSTRUCTIONS

ADDITIONAL INFORMATION

codecov bot commented Jul 26, 2023 • edited Loading

Codecov Report

eschutho left a comment

Choose a reason for hiding this comment

john-bodley Jul 27, 2023

Choose a reason for hiding this comment

Vitor-Avila Jul 28, 2023

Choose a reason for hiding this comment

john-bodley Jul 27, 2023

Choose a reason for hiding this comment

Vitor-Avila Jul 28, 2023

Choose a reason for hiding this comment

john-bodley Jul 27, 2023 • edited Loading

Choose a reason for hiding this comment

Vitor-Avila Jul 28, 2023

Choose a reason for hiding this comment

eschutho commented Jul 28, 2023

sadpandajoe commented Aug 1, 2023

codecov bot commented Jul 26, 2023 •

edited

Loading

john-bodley Jul 27, 2023 •

edited

Loading