Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Question] range分页查询出现数据重复 #1677

Closed
1 task done
z7658329 opened this issue Dec 1, 2021 · 4 comments
Closed
1 task done

[Question] range分页查询出现数据重复 #1677

z7658329 opened this issue Dec 1, 2021 · 4 comments
Labels

Comments

@z7658329
Copy link
Member

z7658329 commented Dec 1, 2021

Problem Type (问题类型)

No response

Before submit

  • 我已经确认现有的 IssuesFAQ 中没有相同 / 重复问题

Environment (环境信息)

Server Version: v0.11.x
Backend: MYSQL
OS: xx CPUs, xx G RAM, Centos 7.x
Data Size: 100W vertices, 200W edges

Your Question (问题描述)

g.V().hasLabel("v_app").has("appSrcType","taf").count()总数量为1W条,然后想通过range实现分页,发现从第3条语句开始返回数据是一样的,请问为什么?
1.g.V().hasLabel("v_app").has("appSrcType","taf").range(0,500)
2.g.V().hasLabel("v_app").has("appSrcType","taf").range(500,1000)
3.g.V().hasLabel("v_app").has("appSrcType","taf").range(1000,1500)
4.g.V().hasLabel("v_app").has("appSrcType","taf").range(1500,2000)
5.g.V().hasLabel("v_app").has("appSrcType","taf").range(2000,2500)

下图是第3和第4次查询的结果,结果里都是重复的数据
image

Vertex/Edge example (问题点 / 边数据举例)

No response

Schema [VertexLabel, EdgeLabel, IndexLabel] (元数据结构)

No response

@imbajin
Copy link
Member

imbajin commented Dec 1, 2021

这个不确定, 可以等其他熟悉的同学看看

另外, 你可以试试自带的 rest-api 直接分页查询, 看看是否正常.

@javeme
Copy link
Contributor

javeme commented Dec 1, 2021

@z7658329 看起来可能是因为有残留索引(关于残留索引可参考:https://github.com/hugegraph/hugegraph/issues/1495 ),请问是否有大量覆盖过数据?

不过分页确实建议使用rest api,以page参数的方式获取。

@z7658329
Copy link
Member Author

z7658329 commented Dec 1, 2021

感谢回复,确实该顶点类型我们覆盖过数据 这种要怎么修数?
有问题的顶点 v_app(数据量1W)这个分页查询有问题
没问题的顶点 v_host(数量2W) 这个分页查询没有问题

本质是v_app和v_host的区别就在于v_app有一个全文索引的appCoreTag属性,这个属性我这边是通过SPI自定义分词的实现,而且会经常修改 不知道和这个有没有关系

@github-actions
Copy link

Due to the lack of activity, the current issue is marked as stale and will be closed after 20 days, any update will remove the stale label

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants