Optimize go N steps over edge #1471

dangleptr · 2019-12-18T08:57:58Z

Optimize the go executor.
This PR improves the performance of "go 2 steps over edge" by about 20x when about 1M vertex ids are returned.
It should also improve the performance of other queries, but I haven't tested them yet.

Main changes:

1. Modify the structure returned by storaged: separate the vertexId from the props so that "getPropByName" is not called when getting the dstId. (This is a common request for go.)
2. Return directly if the current executor is the rightmost one. This avoids an extra encode/decode round, which costs a lot of time when many rows are returned.
3. Precalculate the yield column types instead of calculating them for every edge.
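
Changes 1 and 3 can be illustrated with a minimal sketch. Everything below (ColType, YieldColumn, Edge, deduceType, precomputeTypes, collectDstIds, countStringCells) is a hypothetical stand-in invented for illustration, not the code in this PR; it only shows the idea of keeping the destination id as a plain field and hoisting the type deduction out of the per-edge loop.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical stand-ins for illustration; these are not Nebula's real types.
enum class ColType { INT, STRING };

struct YieldColumn {
    std::string name;
    bool isString;  // stands in for whatever drives the real type deduction
};

struct Edge {
    int64_t dst;               // change 1: destination id kept outside the props
    std::string encodedProps;  // opaque payload, untouched when only dst is needed
};

// Stand-in for a costly type deduction; `calls` counts how often it runs.
ColType deduceType(const YieldColumn& col, int* calls) {
    assert(calls != nullptr);
    ++*calls;
    return col.isString ? ColType::STRING : ColType::INT;
}

// Change 3: deduce every yield column type once, before the per-edge loop.
std::vector<ColType> precomputeTypes(const std::vector<YieldColumn>& cols, int* calls) {
    std::vector<ColType> types;
    types.reserve(cols.size());
    for (const auto& col : cols) {
        types.push_back(deduceType(col, calls));
    }
    return types;
}

// Change 1: collecting destination ids reads a plain field, with no lookup by name.
std::vector<int64_t> collectDstIds(const std::vector<Edge>& edges) {
    std::vector<int64_t> ids;
    ids.reserve(edges.size());
    for (const auto& edge : edges) {
        ids.push_back(edge.dst);
    }
    return ids;
}

// The per-edge loop only reads the cached types; it never calls deduceType.
std::size_t countStringCells(const std::vector<Edge>& edges,
                             const std::vector<ColType>& types) {
    std::size_t cells = 0;
    for (std::size_t i = 0; i < edges.size(); ++i) {
        for (ColType type : types) {
            if (type == ColType::STRING) {
                ++cells;
            }
        }
    }
    return cells;
}
```

In this sketch the number of deduceType calls depends only on the number of yield columns, not on the number of edges, which is the property change 3 is after.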

nebula-community-bot · 2019-12-18T10:46:54Z

Unit testing passed.

nebula-community-bot · 2019-12-18T11:28:26Z

Unit testing passed.

nebula-community-bot · 2019-12-20T14:26:44Z

Unit testing passed.

nebula-community-bot · 2019-12-20T15:31:23Z

Unit testing failed.

nebula-community-bot · 2019-12-20T18:06:49Z

Unit testing passed.

nebula-community-bot · 2019-12-24T11:04:58Z

Unit testing passed.

nebula-community-bot · 2019-12-25T02:53:31Z

Unit testing passed.

wadeliuyi · 2019-12-26T16:48:32Z

src/graph/GoExecutor.cpp

+    int64_t totalRows = 0;
+    for (auto& resp : rpcResp.responses()) {
+        if (resp.get_total_edges() != nullptr) {
+            totalRows += *resp.get_total_edges();


Maybe *(resp.get_total_edges()) would be more readable.
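
For reference, *resp.get_total_edges() and *(resp.get_total_edges()) mean the same thing in C++, because member access and the call bind more tightly than unary *. A third option is to bind the pointer once and dereference the local. The sketch below assumes a hypothetical Resp struct standing in for the generated response type, with a getter that returns a null pointer when the optional field is unset; sumTotalEdges is likewise an illustrative name.

```cpp
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a response with an optional total_edges field.
struct Resp {
    bool hasTotal = false;
    int64_t total = 0;

    // Returns nullptr when the optional field is not set.
    const int64_t* get_total_edges() const {
        return hasTotal ? &total : nullptr;
    }
};

// Sums total_edges over all responses, skipping those without the field.
int64_t sumTotalEdges(const std::vector<Resp>& responses) {
    int64_t totalRows = 0;
    for (const auto& resp : responses) {
        // Call the getter once, then test and dereference the same pointer.
        if (const int64_t* edges = resp.get_total_edges()) {
            totalRows += *edges;
        }
    }
    return totalRows;
}
```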

wadeliuyi · 2019-12-26T17:08:05Z

src/graph/GoExecutor.cpp

+                    if (!edgeSchema.empty()) {
+                        auto it = edgeSchema.find(edgeType);
+                        DCHECK(it != edgeSchema.end());
+                        reader = RowReader::getRowReader(edge.props, it->second);


The reader depends on edgeSchema: if edgeSchema is empty, the reader is nullptr, and CHECK(reader != nullptr) will then crash. Can we return directly when edgeSchema is empty?

If getAliasProp is invoked, the reader should not be NULL. Otherwise it is a bug.
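
The invariant discussed in this thread can be sketched as follows. Schema, Reader, EdgeSchemaMap, makeReader and this getAliasProp are simplified, hypothetical stand-ins rather than the real classes, and assert plays the role of DCHECK/CHECK: no reader is built when no schema was requested, and a property getter that is reached with a null reader indicates a caller bug.

```cpp
#include <cassert>
#include <cstdint>
#include <memory>
#include <string>
#include <unordered_map>

// Hypothetical stand-ins for the schema and the row reader.
struct Schema {
    std::string name;
};

struct Reader {
    const Schema* schema;
    std::string props;
};

using EdgeSchemaMap = std::unordered_map<int32_t, Schema>;

// A reader is built only when edge schemas were requested at all.
std::unique_ptr<Reader> makeReader(const EdgeSchemaMap& edgeSchema,
                                   int32_t edgeType,
                                   const std::string& props) {
    if (edgeSchema.empty()) {
        return nullptr;  // no edge props requested, nothing to decode
    }
    auto it = edgeSchema.find(edgeType);
    assert(it != edgeSchema.end());  // a requested edge type must have a schema
    return std::unique_ptr<Reader>(new Reader{&it->second, props});
}

// Reaching this getter with a null reader is treated as a bug in the caller.
std::string getAliasProp(const Reader* reader) {
    assert(reader != nullptr);
    return reader->schema->name + ":" + reader->props;
}
```

Under this reading, an empty edgeSchema is safe as long as no yield expression asks for an edge property, since the getter is then never called.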

jude-zhu · 2020-01-08T08:16:36Z

close #1604

laura-ding · 2020-01-17T08:40:09Z

src/graph/GoExecutor.cpp

@@ -1408,5 +1468,73 @@ SupportedType GoExecutor::getPropTypeFromInterim(const std::string &prop) const
    return index_->getColumnType(prop);
 }

+nebula::cpp2::SupportedType GoExecutor::calculateExprType(Expression* exp) const {


If the schema caches of graphd and storaged differ, this may cause problems.

Yes, we use the schema in graphd.
The problem also exists in the current master branch, because different storaged instances may hold different schema versions while the schema is being updated.

laura-ding · 2020-01-20T09:17:47Z

src/graph/GoExecutor.cpp

+                      << time::WallClock::fastNowInMicroSec() - start << "us";
+        }
+        if (!ret.ok()) {
+            LOG(ERROR) << "Get rows failed: " << ret.status();


Add doError() here.

dangleptr requested review from monadbobo, dutor, darionyaphet, sherman-the-tank, CPWstatic, critical27, laura-ding, whitewum, liuyu85cn and bright-starry-sky December 18, 2019 08:57

dangleptr force-pushed the trace branch from eb7c807 to 0b1faf4 Compare December 18, 2019 10:05

dangleptr added the ready-for-testing PR: ready for the CI test label Dec 18, 2019

dangleptr force-pushed the trace branch from 0409cab to 6241239 Compare December 18, 2019 11:03

dangleptr force-pushed the trace branch from 6241239 to 221c975 Compare December 20, 2019 14:00

dangleptr force-pushed the trace branch from 221c975 to 2621fe3 Compare December 20, 2019 15:04

dangleptr removed the ready-for-testing PR: ready for the CI test label Dec 20, 2019

dangleptr force-pushed the trace branch from 2621fe3 to d2eeb2e Compare December 20, 2019 17:45

dangleptr added the ready-for-testing PR: ready for the CI test label Dec 20, 2019

dangleptr force-pushed the trace branch from d2eeb2e to 75a3320 Compare December 24, 2019 10:41

CPWstatic reviewed Dec 24, 2019
