-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bulk Loader On Single Server Not Loading All Predicates #2616
Comments
That script is not officially supported by Dgraph. But, @MichelDiz might be able to help you with it. |
Updated 9/28/18 @ 1000 PSTI added additional steps to reproducing the issue. I neglected to specifically mention to export the data, then bulk upload it. Thank you for your prompt response! Are these 13 entries also showing in Dgraph Ratel? Just want to clarify that it's the not the *.schema file itself that isn't showing the entries, but in Ratel after the bulk upload completes. My *.schema file defines 12 entries so I expect 12 in Dgraph Ratel, but only see 9. |
In Ratel only 12, but " |
By doing what you said in:
I was able to reproduce the issue. This happens in the insertion of Schema by bulkload, the data is okay. You can work around this by adding your schema directly in Alter.
@danielmai Can you help me with this? Maybe I did something wrong. If you confirm add a Bug flag No need to use my script, just do this:
In my example only the predicates with UID were recorded. The others were not recorded. info:
|
* Don't skip predicates with value type of default when loading the schema. (#2616) * Allow running test.sh from another directory. * Keep all predicates from bulk import schema, not just the ones used. * Make set of predicates the union of predicates in the schema and rdf. * Add test for schema after export/bulk load. * Add more schema test cases.
* Don't skip predicates with value type of default when loading the schema. (dgraph-io#2616) * Allow running test.sh from another directory. * Keep all predicates from bulk import schema, not just the ones used. * Make set of predicates the union of predicates in the schema and rdf. * Add test for schema after export/bulk load. * Add more schema test cases.
If you suspect this could be a bug, follow the template.
What version of Dgraph are you using?
v1.0.8
Have you tried reproducing the issue with latest release?
On latest release
What is the hardware spec (RAM, OS)?
macOS High Sierra 10.13.6
16 GB RAM
Docker Version 18.06.0-ce-mac73 (26764)
Steps to reproduce the issue (command/config used to run Dgraph).
default
with count(uid)=1, and the last has count(uid)=0 but also with an index.I would expect that after exporting using
curl localhost:8080/admin/export
, I would see all three predicates I added to the 1million movie dataset, which would be 12 entries in the schema.Instead, I see still only 9 entries after using the Bulk script (after changing the path of the schema and rdf).
Additionally, if I modify the schema file so that
predicate_with_default_type
is changed fromdefault
tostring
, then the predicate will display and entries will total to 10 in the schema.I have forked the Dgraph-Bulk-Script repo and included the data I exported (using the steps above):
https://github.com/dareneiri/Dgraph-Bulk-Script
service/bulk-it-or-not-bulk-it.sh
, you can use the.schema
file where I changed the predicate type fromdefault
tostring
and shows 10 entries. Or you can use.schema.gz
, which will result in 9 entries instead of the expected 12.The text was updated successfully, but these errors were encountered: