add workarounds for servers that don't paginate correctly #167

vmpjdc · 2021-03-18T02:04:25Z

Some Swift API implementations (I've observed this with Ceph
RADOSGW) can return fewer results than specified by the "limit"
parameter, even when we have not reached the end of the listing.

It's unclear to me from reading the API docs whether this is
a violation of the API specification, but since it happens in the
wild, it's best to be able to handle it.

One way of doing this is to simply keep fetching pages until we
receive an empty page. Another is to assume that pages within
a certain percentage of limit are not the last page. Given the
tradeoffs involved, let's support both.

vmpjdc · 2021-03-18T02:06:39Z

I discovered this problem while trying to use rclone to copy data from an OpenStack Swift into a Ceph RADOSGW.

Listings of some RADOSGW containers would terminate after 1999 entries even though there were over 10,000 objects in them, and rclone copy would copy the same objects again and again on multiple runs.

With this change applied to a local build of rclone I'm now getting full listings and clean syncs with no spurious copies.

ncw

This change will do one more API transaction than it needs to on conforming servers won't it?

So for your average directory listing this will now do two transactions rather than one.

I can't see a way round this in the protocol - can you?

vmpjdc · 2021-03-25T00:15:40Z

It will indeed result in an extra transaction, and unfortunately I can't see another way around it. (The number of objects in a container can be looked up by issuing a HEAD request, but of course that's still an extra request, and racy to boot.)

Correcting myself, I found a separate page describing the Swift API's pagination and RADOSGW is clearly in the wrong, at least in terms of that document.

However, I also discovered that Swift's own Python client code (which I think we could probably call the reference client) doesn't implement pagination as described above when fetching full listings, rather just fetching new pages until it receives an empty one. (Link is into get_container; get_account does the same.)

The code dates from 2012, so perhaps this was implemented before API pagination was nailed down. Either way, swift list and get_container(..., full_listing=True, limit=1000) do not trip over this RADOSGW bug.

ncw · 2021-03-25T18:21:51Z

One thing we could do is make a feature flag for this and only do the new behaviour if the feature flag is set.

So make a new flag in the Connection struct and check it to enable the new behaviour.

I have a feeling that this has already been reported as a radosgw bug - it would be worth searching their issue tracker to see.

I hate the idea of doubling the number of transactions for directory traversals - I can see that being very bad for performance.

ncw · 2021-03-26T09:00:54Z

...or we could use some kind of heuristic - if we got more than 90% of the max listing then do an extra transaction just to check.

This would probably work quite well but has the potential to go wrong.

vmpjdc · 2021-03-28T21:52:14Z

The 90% heuristic should work nicely, based on the lengths of the replies I'm getting from RADOSGW for my problematic buckets:

reply lengths: 1000 999 1000 1000 1000 1000 1000 1000 1000 1000 119
reply lengths: 1000 992 1000 1000 1000 1000 1000 935 1000 1000 257
reply lengths: 1000 1000 1000 1000 1000 975 1000 948
reply lengths: 953 1000 1000 1000 1000 1000 954 1000 1000 70
reply lengths: 1000 1000 1000 1000 998 15
reply lengths: 1000 1000 1000 1000 974 1000 1000 1000 1000 1000 1000 1000 1000 1000 1000 1000 939 1000 1000 1000 1000 949 1000 1000 1000 644
reply lengths: 1000 1000 1000 1000 999 1000 1000 937 1000 1000 538
reply lengths: 1000 998 1000 1000 1000 1000 1000 1000 1000 1000 1000 1000 1000 551
reply lengths: 1000 1000 1000 1000 1000 1000 1000 931 1000 986 1000 1000 1000 975 1000 989 1000 1000 1000 966 1000 998 921 994 1000 1000 973 58
reply lengths: 1000 1000 1000 1000 1000 1000 1000 1000 1000 1000 1000 1000 976 1000 366
reply lengths: 1000 1000 1000 1000 1000 983 1000 1000 1000 1000 1000 1000 1000 517
reply lengths: 1000 1000 1000 984 1000 1000 971 1000 1000 401
reply lengths: 949 1000 1000 1000 1000 1000 1000 403
reply lengths: 1000 998 532
reply lengths: 951 1000 1000 1000 1000 1000 976 1000 877

I was not able to find a matching bug in the Ceph tracker, but I'll start a thread on ceph-users to confirm.

And I'll also take a look at implementing the flag and heuristic.

ncw · 2021-03-31T10:26:15Z

The 90% heuristic should work nicely, based on the lengths of the replies I'm getting from RADOSGW for my problematic buckets:

useful data - thanks

Those missing items are probably filtered out items (eg deleted items) or something like that.

I was not able to find a matching bug in the Ceph tracker, but I'll start a thread on ceph-users to confirm.

If you find something out can you link it here?

And I'll also take a look at implementing the flag and heuristic.

:-)

vmpjdc · 2021-04-06T23:31:10Z

Those missing items are probably filtered out items (eg deleted items) or something like that.

I'm reasonably certain that it's not deleted items, at least -- it's a fairly new cluster, and these buckets have only ever been written to by rclone copy.

I was not able to find a matching bug in the Ceph tracker, but I'll start a thread on ceph-users to confirm.

If you find something out can you link it here?

Most definitely!

And I'll also take a look at implementing the flag and heuristic.

:-)

Implemented, and from my testing with hacked rclone builds, the workarounds seem to perform as expected when enabled.

vmpjdc · 2021-04-11T23:43:24Z

I forgot to mention I made a draft rclone PR that makes this change much easier to try out: rclone/rclone#5224

ncw

Apologies for the delay!

This looks great now - I'll merge it

Thank you :-)

Ceph's Swift API emulation does not fully confirm to the API spec. As a result, it sometimes returns fewer items in a container than the requested limit, which according to the spec should means that there are no more objects left in the container. (Note that python-swiftclient always fetches unless the current page is empty.) This commit adds a pair of new Swift backend settings to handle this. Set `fetch_until_empty_page` to true to always fetch another page of the container listing unless there are no items left. Alternatively, set `partial_page_fetch_threshold` to an integer percentage. In this case rclone will fetch a new page only when the current page is within this percentage of the limit. Swift API reference: https://docs.openstack.org/swift/latest/api/pagination.html PR against ncw/swift with research and discussion: ncw/swift#167

Ceph's Swift API emulation does not fully confirm to the API spec. As a result, it sometimes returns fewer items in a container than the requested limit, which according to the spec should means that there are no more objects left in the container. (Note that python-swiftclient always fetches unless the current page is empty.) This commit adds a pair of new Swift backend settings to handle this. Set `fetch_until_empty_page` to true to always fetch another page of the container listing unless there are no items left. Alternatively, set `partial_page_fetch_threshold` to an integer percentage. In this case rclone will fetch a new page only when the current page is within this percentage of the limit. Swift API reference: https://docs.openstack.org/swift/latest/api/pagination.html PR against ncw/swift with research and discussion: ncw/swift#167 Fixes #7924

Ceph's Swift API emulation does not fully confirm to the API spec. As a result, it sometimes returns fewer items in a container than the requested limit, which according to the spec should means that there are no more objects left in the container. (Note that python-swiftclient always fetches unless the current page is empty.) This commit adds a pair of new Swift backend settings to handle this. Set `fetch_until_empty_page` to true to always fetch another page of the container listing unless there are no items left. Alternatively, set `partial_page_fetch_threshold` to an integer percentage. In this case rclone will fetch a new page only when the current page is within this percentage of the limit. Swift API reference: https://docs.openstack.org/swift/latest/api/pagination.html PR against ncw/swift with research and discussion: ncw/swift#167 Fixes rclone#7924

ncw reviewed Mar 23, 2021

View reviewed changes

vmpjdc force-pushed the empty-result-sentinel branch from dfa1f5b to 1818271 Compare April 6, 2021 23:22

vmpjdc added 3 commits April 7, 2021 11:30

factor out isLastPage

52d2425

add FetchUntilEmptyPage workaround

3853f6a

add PartialPageFetchThreshold workaround

ac42f5d

vmpjdc force-pushed the empty-result-sentinel branch from 1818271 to ac42f5d Compare April 6, 2021 23:30

vmpjdc changed the title ~~use empty result as sentinel when fetching container listings~~ add workarounds for servers that don't paginate correctly Apr 7, 2021

vmpjdc mentioned this pull request Apr 11, 2021

add workarounds for Ceph RGW to Swift backend rclone/rclone#5224

Merged

5 tasks

ncw approved these changes Oct 6, 2021

View reviewed changes

ncw merged commit fef4f1d into ncw:master Oct 6, 2021

ncw mentioned this pull request Jun 28, 2024

rclone size & rclone about commands returns different results rclone/rclone#7924

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add workarounds for servers that don't paginate correctly #167

add workarounds for servers that don't paginate correctly #167

vmpjdc commented Mar 18, 2021 •

edited

Loading

vmpjdc commented Mar 18, 2021

ncw left a comment

vmpjdc commented Mar 25, 2021 •

edited

Loading

ncw commented Mar 25, 2021

ncw commented Mar 26, 2021

vmpjdc commented Mar 28, 2021

ncw commented Mar 31, 2021

vmpjdc commented Apr 6, 2021

vmpjdc commented Apr 11, 2021

ncw left a comment

add workarounds for servers that don't paginate correctly #167

add workarounds for servers that don't paginate correctly #167

Conversation

vmpjdc commented Mar 18, 2021 • edited Loading

vmpjdc commented Mar 18, 2021

ncw left a comment

Choose a reason for hiding this comment

vmpjdc commented Mar 25, 2021 • edited Loading

ncw commented Mar 25, 2021

ncw commented Mar 26, 2021

vmpjdc commented Mar 28, 2021

ncw commented Mar 31, 2021

vmpjdc commented Apr 6, 2021

vmpjdc commented Apr 11, 2021

ncw left a comment

Choose a reason for hiding this comment

vmpjdc commented Mar 18, 2021 •

edited

Loading

vmpjdc commented Mar 25, 2021 •

edited

Loading