Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix bugs in bulk loader. Add Dgraph debug tool. #2449

Merged
merged 7 commits into from
Jun 20, 2018
Merged

Conversation

manishrjain
Copy link
Contributor

@manishrjain manishrjain commented Jun 20, 2018

When building multiple shards, we want each predicate to only lie to one shard, and nowhere else. Bulk loader was doing that, but then adding the schema for all predicates on all shards, which causes confusion about ownership of the predicates. This changes that to only add schema for the predicates that the shard holds.

Similarly, when outputting _predicate_ edge, the shard being used was the one corresponding to the original predicate. Instead, we should use the shard corresponding to _predicate_.

This PR fixes #2129 .

Added a new debug tool which can iterate over posting store and spit out stats per predicate. Useful for debugging.


This change is Reviewable

@manishrjain manishrjain merged commit 1c602b0 into master Jun 20, 2018
@manishrjain manishrjain deleted the mrjn/bulk branch June 20, 2018 02:29
dna2github pushed a commit to dna2fork/dgraph that referenced this pull request Jul 19, 2019
When building multiple shards, we want each predicate to only lie to one shard, and nowhere else. Bulk loader was doing that, but then adding the schema for all predicates on all shards, which causes confusion about ownership of the predicates. This changes that to only add schema for the predicates that the shard holds.

Similarly, when outputting _predicate_ edge, the shard being used was the one corresponding to the original predicate. Instead, we should use the shard corresponding to _predicate_.

This PR fixes dgraph-io#2129 .

Added a new debug tool which can iterate over posting store and spit out stats per predicate. Useful for debugging.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

Successfully merging this pull request may close these issues.

Data missing in Dgraph cluster after bulk loading
1 participant