All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog and this project will adhere to Semantic Versioning starting v1.0.0.
1.0.12 - Unreleased
- Support gzip compression for gRPC and HTTP requests. (#2843)
- Restore is available from a full binary backup. This is an enterprise feature licensed under the Dgraph Community License.
- Strict schema mode via
--mutations
flag. By default--mutations=allow
is set to allow all mutations;--mutations=disallow
disables all mutations;--mutations=strict
allows mutations only for predicates which are defined in the schema. Fixes #2277. - Add
dgraph increment
tool for debugging and testing. The increment tool queries for the specified predicate (default:counter.val
), increments its integer counter value, and mutates the result back to Dgraph. Useful for testing end-to-end txns to verify cluster health. (#2955)
- Use the new Stream API from Badger instead of Dgraph's Stream framework. (#2852)
- Discard earlier versions of posting lists. (#2859)
- Make HTTP JSON response encoding more efficient by operating on a bytes buffer directly. (ae1d9f3)
- Optimize and refactor facet filtering. (#2829)
- Show badger.Item meta information in
dgraph debug
output. - Add new option to
dgraph debug
tool to get a histogram of key and value sizes. (#2844) - Add new option to
dgraph debug
tool to get info from a particular read timestamp. - Refactor rebuild index logic. (#2851, #2866)
- For gRPC clients, schema queries are returned in the Json field. The Schema proto field is deprecated.
- Simplify design and make tablet moves robust. (#2800)
- Switch all node IDs to hex in logs (e.g., ID 0xa instead of ID 10), so they are consistent with Raft logs.
- Refactor reindexing code to only reindex specific tokenizers. (#2948)
- Introduce group checksums (#2964)
- Return aborted error if commit ts is 0.
- Reduce number of "ClusterInfoOnly" requests to Zero by making VerifyUid wait for membership information. (#2974)
- Remove LRU cache from Alpha for big wins in query latency reduction (5-10x)
and mutation throughput (live loading 1.7x faster). Setting
--lru_mb
is still required but will not have any effect since the cache is removed. The flag will be used later version when LRU cache is introduced within Badger and configurable from Dgraph. - Remove
--nomutations
flag. Its functionality has moved into strict schema mode with the--mutations
flag (see Added section).
- Use json.Marshal for strings and blobs. Fixes #2662.
- Let eq use string "uid" as value. Fixes #2827.
- Skip empty posting lists in
has
function. - Fix Rollup to pick max update commit ts.
- Fix a race condition when processing concurrent queries. Fixes #2849.
- Show an error when running multiple mutation blocks. Fixes #2815
- Bring in optimizations and bug fixes over from Badger.
1.0.11 - 2018-12-17
- Integrate OpenCensus in Dgraph. (#2739)
- Add Dgraph Community License for proprietary features.
- Feature: Full binary backups. This is an enterprise feature licensed under the Dgraph Community License. (#2710)
- Add
--enterprise_features
flag to enable enterprise features. By enabling enterprise features, you accept the terms of the Dgraph Community License. - Add minio dep and its deps in govendor. (94daeaf7, 35a73e81)
- Add network partitioning tests with blockade tool. (./contrib/blockade)
- Add Zero endpoints
/assign?what=uids&num=10
and/assign?what=timestamps&num=10
to assign UIDs or transaction timestamp leases. - Adding the acl subcommand to support acl features (still work-in-progress). (#2795)
- Support custom tokenizer in bulk loader (#2820)
- Support JSON data with Dgraph Bulk Loader. (#2799)
- Make posting list memory rollup happen right after disk. (#2731)
- Do not retry proposal if already found in CommittedEntries. (#2740)
- Remove ExportPayload from protos. Export returns Status and ExportRequest. (#2741)
- Allow more escape runes to be skipped over when parsing string literal. (#2734)
- Clarify message of overloaded pending proposals for live loader. (#2732)
- Posting List Evictions. (e2bcfdad)
- Log when removing a tablet. (#2746)
- Deal better with network partitions in leaders. (#2749)
- Keep maxDelay during timestamp req to 1s.
- Updates to the version output info.
- Checking nil values in the equal function (#2769)
- Optimize query: UID expansion. (#2772)
- Split membership sync endpoints and remove PurgeTs endpoint. (#2773)
- Set the Prefix option during iteration. (#2780)
- Replace Zero's
/assignIds?num=10
endpoint with/assign?what=uids&num=10
(see Added section).
- Remove type hinting for JSON and RDF schema-less types. (#2742)
- Remove deprecated logic that was found using vet. (#2758)
- Remove assert for zero-length posting lists. (#2763)
- Restore schema states on error. (#2730)
- Refactor bleve tokenizer usage (#2738). Fixes #2622 and #2601.
- Switch to Badger's Watermark library, which has a memory leak fix. (0cd9d82e)
- Fix tiny typo. (#2761)
- Fix Test: TestMillion.
- Fix Jepsen bank test. (#2764)
- Fix link to help_wanted. (#2774)
- Fix invalid division by zero error. Fixes #2733.
- Fix missing predicates after export and bulk load. Fixes #2616.
- Handle various edge cases around cluster memberships. (#2791)
- Change Encrypt to not re-encrypt password values. Fixes #2765.
- Correctly parse facet types for both JSON and RDF formats. Previously the parsing was handled differently depending on the input format. (#2797)
1.0.10 - 2018-11-05
Note: This release requires you to export and re-import data. We have changed the underlying storage format.
- The Alter endpoint can be protected by an auth token that is set on the Dgraph Alphas via the
--auth_token
option. This can help prevent accidental schema updates and drop all operations. (#2692) - Optimize has function (#2724)
- Expose the health check API via gRPC. (#2721)
- Dgraph is relicensed to Apache 2.0. (#2652)
- Breaking change. Rename Dgraph Server to Dgraph Alpha to clarify discussions of the Dgraph cluster. The top-level command
dgraph server
is nowdgraph alpha
. (#2667) - Prometheus metrics have been renamed for consistency for alpha, memory, and lru cache metrics. (#2636, #2670, #2714)
- The
dgraph-converter
command is available as the subcommanddgraph conv
. (#2635) - Updating protobuf version. (#2639)
- Allow checkpwd to be aliased (#2641)
- Better control excessive traffic to Dgraph (#2678)
- Export format now exports on the Alpha receiving the export request. The naming scheme of the export files has been simplified.
- Improvements to the
dgraph debug
tool that can be used to inspect the contents of the posting lists directory. - Bring in Badger updates (#2697)
- Make raft leader resume probing after snapshot crash (#2707)
- Breaking change: Create a lot simpler sorted uint64 codec (#2716)
- Increase the size of applyCh, to give Raft some breathing space. Otherwise, it fails to maintain quorum health.
- Zero should stream last commit update
- Send commit timestamps in order (#2687)
- Query blocks with the same name are no longer allowed.
- Fix out-of-range values in query parser. (#2690)
1.0.9 - 2018-10-02
- This version switches Badger Options to reasonable settings for p and w directories. This removes the need to expose
--badger.options
option and removes thenone
option from--badger.vlog
. (#2605) - Add support for ignoring parse errors in bulk loader with the option
--ignore_error
. (#2599) - Introduction of new command
dgraph cert
to simplify initial TLS setup. See TLS configuration docs for more info. - Add
expand(_forward_)
andexpand(_reverse_)
to GraphQL+- query language. If_forward_
is passed as an argument toexpand()
, all predicates at that level (minus any reverse predicates) are retrieved. If_reverse_
is passed as an argument toexpand()
, only the reverse predicates are retrieved.
- Rename intern pkg to pb (#2608)
- Remove LinRead map logic from Dgraph (#2570)
- Sanity length check for facets mostly.
- Make has function correct w.r.t. transactions (#2585)
- Increase the snapshot calculation interval, while decreasing the min number of entries required; so we take snapshots even when there's little activity.
- Convert an assert during DropAll to inf retry. (#2578)
- Fix a bug which caused all transactions to abort if
--expand_edge
was set to false. Fixes #2547. - Set the Applied index in Raft directly, so it does not pick up an index older than the snapshot. Ensure that it is in sync with the Applied watermark. Fixes #2581.
- Pull in Badger updates. This also fixes the Unable to find log file, retry error.
- Improve efficiency of readonly transactions by reusing the same read ts (#2604)
- Fix a bug in Raft.Run loop. (#2606)
- Fix a few issues regarding snapshot.Index for raft.Cfg.Applied. Do not overwrite any existing data when apply txn commits. Do not let CreateSnapshot fail.
- Consider all future versions of the key as well, when deciding whether to write a key or not during txn commits. Otherwise, we'll end up in an endless loop of trying to write a stale key but failing to do so.
- When testing inequality value vars with non-matching values, the response was sent as an error although it should return empty result if the query has correct syntax. (#2611)
- Switch traces to glogs in worker/export.go (#2614)
- Improve error handling for
dgraph live
for errors when processing RDF and schema files. (#2596) - Fix task conversion from bool to int that used uint32 (#2621)
- Fix
expand(_all_)
in recurse queries (#2600). - Add language aliases for broader support for full text indices. (#2602)
1.0.8 - 2018-08-29
- Introduce a new /assignIds HTTP endpoint in Zero, so users can allocate UIDs to nodes externally.
- Add a new tool which retrieves and increments a counter by 1 transactionally. This can be used to test the sanity of Dgraph cluster.
- This version introduces tracking of a few anonymous metrics to measure Dgraph adoption (#2554). These metrics do not contain any specifically identifying information about the user, so most users can leave it on. This can be turned off by setting
--telemetry=false
flag if needed in Dgraph Zero.
- Correctly handle a list of type geo in json (#2482, #2485).
- Fix the graceful shutdown of Dgraph server, so a single Ctrl+C would now suffice to stop it.
- Fix various deadlocks in Dgraph and set ConfState in Raft correctly (#2548).
- Significantly decrease the number of transaction aborts by using SPO as key for entity to entity connections. (#2556).
- Do not print error while sending Raft message by default. No action needs to be taken by the user, so it is set to V(3) level.
1.0.7 - 2018-08-10
- Set the
--conc
flag in live loader default to 1, as a temporary fix to avoid tons of aborts.
- All Oracle delta streams are applied via Raft proposals. This deals better with network partition like edge-cases. #2463
- Fix deadlock in 10-node cluster convergence. Fixes #2286.
- Make ReadIndex work safely. #2469
- Simplify snapshots, leader now calculates and proposes snapshots to the group. #2475.
- Make snapshot streaming more robust. #2487
- Consolidate all txn tracking logic into Oracle, remove inSnapshot logic. #2480.
- Bug fix in Badger, to stop panics when exporting.
- Use PreVote to avoid leader change on a node join.
- Fix a long-standing bug where
raft.Step
was being called via goroutines. It is now called serially. - Fix context deadline issues with proposals. #2501.
1.0.6 - 2018-06-20
- Support GraphQL vars as args for Regexp function. #2353
- Support GraphQL vars with filters. #2359
- Add JSON mutations to raw HTTP. #2396
- Fix math >= evaluation. #2365
- Avoid race condition between mutation commit and predicate move. #2392
- Ability to correctly distinguish float from int in JSON. #2398
- Remove dummy data key. #2401
- Serialize applying of Raft proposals. Concurrent application was complex and cause of multiple bugs. #2428.
- Improve Zero connections.
- Fix bugs in snapshot move, refactor code and improve performance significantly. #2440, #2442
- Add error handling to GetNoStore. Fixes #2373.
- Fix bugs in Bulk loader. #2449
- Posting List and Raft bug fixes. #2457
- Pull in Badger v1.5.2.
- Raft storage is now done entirely via Badger. This reduces RAM consumption by previously used MemoryStorage. #2433
- Trace how node.Run loop performs.
- Allow tweaking Badger options.
Note: This change modifies some flag names. In particular, Badger options
are now exposed via flags named with --badger.
prefix.
1.0.5 - 2018-04-20
- Option to have server side sequencing.
- Ability to specify whitelisted IP addresses for admin actions.
- Fix bug where predicate with string type sometimes appeared as
_:uidffffffffffffffff
in exports. - Validate facet value should be according to the facet type supplied when mutating using NQuads (#2074).
- Use
time.Equal
function for comparing predicates withdatetime
(#2219). - Skip
BitEmptyPosting
forhas
queries. - Return error from query if we don't serve the group for the attribute instead of crashing (#2227).
- Send
maxpending
in connection state to server (#2236). - Fix bug in SP* transactions (#2148).
- Batch and send during snapshot to make snapshots faster.
- Don't skip schema keys while calculating tablets served.
- Fix the issue which could lead to snapshot getting blocked for a cluster with replicas (#2266).
- Dgraph server retries indefinitely to connect to Zero.
- Allow filtering and regex queries for list types with lossy tokenizers.
- Dgraph server segfault in worker package (#2322).
- Node crashes can lead to the loss of inserted triples (#2290).
- Cancel pending transactions for a predicate when predicate move is initiated.
- Move Go client to its own repo at
dgraph-io/dgo
. - Make
expand(_all_)
return value and uid facets. - Add an option to specify a
@lang
directive in schema for predicates with lang tags. - Flag
memory_mb
has been changed tolru_mb
. The default recommended value forlru_mb
is one-third of the total RAM available on the server.
1.0.4 - 2018-03-09
- Support for empty strings in query attributes.
- Support GraphQL vars in first, offset and after at root.
- Add support for query_edge_limit flag which can be used to limit number of results for shortest path, recurse queries.
- Make rebalance interval a flag in Zero.
- Return latency information for mutation operations.
- Support @upsert directive in schema.
- Issues with predicate deletion in a cluster.
- Handle errors from posting.Get.
- Correctly update commitTs while committing and startTs == deleteTs.
- Error handling in abort http handler.
- Get latest membership state from Zero if uid in mutation > maxLeaseId.
- Fix bug in Mutate where mutated keys were not filled.
- Update membership state if we can't find a leader while doing snapshot retrieval.
- Make snapshotting more frequent, also try aborting long pending transactions.
- Trim null character from end of strings before exporting.
- Sort facets after parsing RDF's using bulk loader.
- Fig bug in SyncIfDirty.
- Fix fatal error due to TxnTooBig error.
- Fix bug in dgraph live where some batches could be skipped on conflict error.
- Fix a bug related to expand(all) queries.
- Run cleanPredicate and proposeKeyValues sequentially.
- Serialize connect requests in Zero.
- Retry snapshot retrieval and join cluster indefinitely.
- Make client directory optional in dgraph live.
- Do snapshot in Zero in a goroutine so that Run loop isn't blocked.
1.0.3 - 2018-02-08
- Support for specifying blank nodes as part of JSON mutation.
dgraph version
command to check current version.curl
to Docker image.moveTablet
endpoint to Zero to allow initiating a predicate move.
- Out of range error while doing
eq
query. - Reduce
maxBackOffDelay
to 10 sec so that leader election is faster after restart. - Fix bugs with predicate move where some data was not sent and schema not loaded properly on replicas.
- Fix the total number of RDF's processed when live loader ends.
- Reindex data when schema is changed to list type to fix adding and deleting new data.
- Correctly upate uidMatrix when facetOrder is supplied.
- Inequality operator(
gt
andlt
) result for non lossy tokenizers.
--zero_addr
flag changed to--zero
fordgraph bulk
command.- Default ports for Zero have been changed
7080
=>5080
(grpc) and8080
=>6080
(http). - Update badger version and how purging is done to fix CPU spiking when Dgraph is idle.
- Print predicate name as part of the warning about long term for exact index.
1.0.2 - 2018-01-17
- Always return predicates of list type in an array.
- Edges without facet values are also returned when performing sort on facet.
- Don't derive schema while deleting edges.
- Better error checking when accessing posting lists. Fixes bug where parts of queries are sometimes omitted when system is under heavy load.
- Fix missing error check in mutation handling when using CommitNow (gave incorrect error).
- Fix bug where eq didn't work correctly for the fulltext index.
- Fix race because of which
replicas
flag was not respected. - Fix bug with key copy during predicate move.
- Fix race in merging keys keys from btree and badger iterator.
- Fix snapshot retrieval for new nodes by retrieving it before joining the cluster.
- Write schema at timestamp 1 in bulk loader.
- Fix unexpected meta fatal error.
- Fix groupby result incase the child being grouped open has multiple parents.
- Remove StartTs field from
api.Operation
. - Print error message in live loader if its not ErrAborted. Also, stop using membership state and instead use the address given by user.
- Only send keys corresponding to data that was mutated.
1.0.1 - 2017-12-20
- Wait for background goroutines to finish in posting package on shutdown.
- Return error if we cant parse the uid given in json input for mutations.
- Don't remove
_predicate_
schema from disk during drop all. - Fix panic in expand(all)
- Make sure at least one field is set while doing Alter.
1.0.0 - 2017-12-18
- Allow doing Mutate and Alter Operations using dgraph UI.
- Provide option to user to ignore conflicts on index keys.
- Language tag parsing in queries now accepts digits (in line with RDF parsing).
- Ensure that GraphQL variables are declared before use.
- Export now uses correct blank node syntax.
- Membership stream doesn't get stuck if node steps down as leader.
- Fix issue where sets were not being returned after doing a S P * deletion when part of same transaction.
- Empty string values are stored as it is and no strings have special meaning now.
- Correctly update order of facetMatrix when orderdesc/orderasc is applied.
- Allow live and bulk loaders to work with multiple zeros.
- Fix sorting with for predicates with multiple language tags.
- Fix alias edge cases in normalize directive.
- Allow reading new index key mutated as part of same transaction.
- Fix bug in value log GC in badger.
- SIGINT now forces a shutdown after 5 seconds when there are pending RPCs.
DropAttr
now also removes the schema for the attribute (previously it just removed the edges).- Tablet metadata is removed from zero after deletion of predicate.
- LRU size is changed dynamically now based on
max_memory_mb
- Call RunValueLogGC for every GB increase in size of value logs. Upgrade vendored version of Badger.
- Prohibit string to password schema change.
- Make purging less aggressive.
- Check if GraphQL Variable is defined before using.
0.9.3 - 2017-12-01
- Support for alias while asking for facets.
- Support for general configuration via environment variables and configuration files.
IgnoreIndexConflict
field in Txn which allows ignoring conflicts on index keys.
expand(_all_)
now correctly gives all language variants of a string.- Indexes now correctly maintained when deleting via
S * *
andS P *
. expand(_all_)
now follows reverse edges.- Don't return uid for nodes without any children when requested through debug flag.
- GraphQL variables for HTTP endpoints. Variable map can be set as a JSON
object using the
X-Dgraph-Vars
header. - Abort if CommitNow flag is set and the mutation fails.
- Live loader treats subjects/predicates that look like UIDs as existing nodes rather than new nodes.
- Fix bug in
@groupby
queries where predicate was converted to lower case in queries.
- When showing a predicate with list type, only values without a language tag are shown. To get the values of the predicate that are tagged with a language, query the predicate with that language explicitly.
- Validate the address advertised by dgraph nodes.
- Store/Restore peer map on snapshot.
- Fix rdfs per second reporting in live loader.
- Fix bug in lru eviction.
- Proto definitions are split into intern and api.
0.9.2 - 2017-11-20
- Support for removing dead node from quorum.
- Support for alias in groupby queries.
- Add DeleteEdges helper function for Go client.
- Dgraph tries to abort long running/abandoned transactions.
- Fix TLS flag parsing for Dgraph server and live loader.
- Reduce dependencies for Go client.
depth
andloop
arguments should be inside @recurse().- Base36 encode keys that are part of TxnContext and are sent to the client.
- Fix race condition in expand(all) queries.
- Fix (--ui) flag not being parsed properly.
0.9.1 - 2017-11-15
- Transaction HTTP API has been modified slightly.
start_ts
is now a path parameter instead of a header. For/commit
API, keys are passed in the body.
0.9.0 - 2017-11-14
The latest release has a lot of breaking changes but also brings powerful features like Transactions, support for CJK and custom tokenization.
- Dgraph adds support for distributed ACID transactions (a blog post is in works). Transactions can be done via the Go, Java or HTTP clients (JS client coming). See docs here.
- Support for Indexing via Custom tokenizers.
- Support for CJK languages in the full-text index.
- We have consolidated all the
server
,zero
,live/bulk-loader
binaries into a singledgraph
binary for convenience. Instructions for running Dgraph can be found in the docs. - For Dgraph server, Raft ids can be assigned automatically. A user can optionally still specify an ID, via
--idx
flag. --peer
flag which was used to specify another Zero instance’s IP address is being replaced by--zero
flag to indicate the address corresponds to Dgraph zero.port
,grpc_port
andworker_port
flags have been removed from Dgraph server and Zero. The ports are:
- Internal Grpc: 7080
- HTTP: 8080
- External Grpc: 9080 (Dgraph server only)
Users can set port_offset
flag, to modify these fixed ports.
- Queries, mutations and schema updates are done through separate endpoints. Queries can no longer have a mutation block.
- Queries can be done via
Query
Grpc endpoint (it was calledRun
before) or the/query
HTTP handler. _uid_
is renamed touid
. So queries now need to request foruid
. Example
{
bladerunner(func: eq(name@en, "Blade Runner")) {
uid
name@en
}
}
- Facets response structure has been modified and is a lot flatter. Facet key is now
predicate|facet_name
. Examples for Go client and HTTP. - Query latency is now returned as numeric (ns) instead of string.
Recurse
is now a directive. So queries withrecurse
keyword at root won't work anymore.- Syntax for
count
at root has changed. You need to ask forcount(uid)
, instead ofcount()
.
- Mutations can only be done via
Mutate
Grpc endpoint or via/mutate
HTTP handler. Mutate
Grpc endpoint can be used to set/ delete JSON, or set/ delete a list of NQuads and set/ delete raw RDF strings.- Mutation blocks don't require the mutation keyword anymore. Here is an example of the new syntax.
{
set {
<name> <is> <something> .
<hometown> <is> "San Francisco" .
}
}
Upsert
directive and mutation variables go away. Both these functionalities can now easily be achieved via transactions.
<*> <pred> <*>
operations, that is deleting a predicate can't be done via mutations anymore. They need to be done viaAlter
Grpc endpoint or via the/alter
HTTP handler.- Drop all is now done via
Alter
. - Schema updates are now done via
Alter
Grpc endpoint or via/alter
HTTP handler.
Query
Grpc endpoint returns response in JSON underJson
field instead of protocol buffer.client.Unmarshal
method also goes away from the Go client. Users can usejson.Unmarshal
for unmarshalling the response.- Response for predicate of type
geo
can be unmarshalled into a struct. Example here. Node
andEdge
structs go away along with theSetValue...
methods. We recommend usingSetJson
andDeleteJson
fields to do mutations.- Examples of how to use transactions using the client can be found at https://docs.dgraph.io/clients/#go.
- Embedded dgraph goes away. We haven’t seen much usage of this feature. And it adds unnecessary maintenance overhead to the code.
- Dgraph live no longer stores external ids. And hence the
xid
flag is gone.