
Parse Server much slower than Parse.com with same database #2654

Closed
zingano opened this issue Sep 6, 2016 · 42 comments
Labels
type:feature New feature or improvement of existing feature

Comments

@zingano

zingano commented Sep 6, 2016

I have migrated (and finalised) my database from Parse.com to mLab. My (production) client applications are still using the parse.com API, but in test I am using both that and my own Parse Server running on Heroku for comparison.

Steps to reproduce


  1. Run OS X test of finding Job objects (with many includeKeys) on parse.com
  2. Run the same test on parse server
  3. Run cloud code test of finding Task objects in a large geographical area on parse.com
  4. Run the same test on parse server

Expected Results

The times for the tests should be roughly the same.

Actual Outcome

Parse Server tests take 2-3x the time of the Parse.com tests.
Find jobs takes around 4.8 seconds on Parse.com, 14 seconds on Parse Server.
Find tasks takes around 7.7 seconds on Parse.com, 16 seconds on Parse Server.

This does not happen just in these tests, but permeates the app, with each "page" being 2-3x as slow to load on parse server.

Environment Setup

  • Server
    • parse-server version 2.2.8
    • Operating System:
    • Hardware:
    • Localhost or remote server?

Heroku, US East: six 1x web dynos and one 1x worker dyno. I have tried other combinations, even up to Performance-M, but the difference is not close to being eradicated. Note the worker dyno is not used in either of the tests.

  • Database
    • MongoDB version: 3.0.12
    • Storage engine: MMAPv1
    • Hardware: Shared Cluster 8GB
    • Localhost or remote server? mLab via Heroku add-on (both in US East)

There are around 250K Task objects in the database, and around 900 job objects.

I have added indices for every query that mLab flags as slow, except one that cannot be automatically indexed. None of those queries comes close to the actual transaction time, which is often in seconds; the slowest has been on the order of 100 milliseconds.

I also added individual column indices, since that helped so much in #2537, but it made no difference for me.

Logs/Trace

I am using New Relic to trace and analyse, but I am getting nowhere: all New Relic shows is that a high percentage of the time (almost 100% in my most recent test) is taken up by "Middleware: anonymous", which I know from similar issues here is Parse Server itself.

Since the database is the same in both cases, I assume the blame must be within Parse Server, but would like to find out more. Could it be that this would help me? #2316

Or am I seeing something like this with authentication? #2556

Basically without more instrumentation I won't find out.

The New Relic people are saying that perhaps I'm not getting more out of this analysis because Parse Server is passing through async boundaries in which the transaction is lost. They say that using custom instrumentation could help - trouble is I would have no idea where to put it. If anyone could help with that - or add commented out lines to Parse server to help all of us who use New Relic to instrument, that would be great.

https://docs.newrelic.com/docs/agents/nodejs-agent/supported-features/nodejs-custom-instrumentation#expanding-instrumentation
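In the meantime, timings can be chased by hand. Below is a minimal manual-instrumentation sketch (this is NOT the New Relic API — the labels, delays, and pipeline stages are invented for illustration) that wraps each async step and records its wall-clock time, so a slow stage can be located even when an APM agent loses the transaction across async boundaries:

```javascript
// Wrap an async step so its wall-clock duration is recorded.
// Hypothetical helper for illustration; not a New Relic API.
function timed(label, fn) {
  return function (...args) {
    const start = process.hrtime.bigint();
    return Promise.resolve(fn(...args)).then((result) => {
      const ms = Number(process.hrtime.bigint() - start) / 1e6;
      timed.log.push({ label, ms }); // collect per-step timings
      return result;
    });
  };
}
timed.log = [];

// A fake two-stage pipeline standing in for query -> include resolution.
const delay = (ms, v) => new Promise((res) => setTimeout(() => res(v), ms));

const runQuery = timed('runQuery', () => delay(30, ['row']));
const resolveIncludes = timed('resolveIncludes', (rows) => delay(50, rows));

const pipeline = runQuery().then(resolveIncludes).then(() => timed.log);
```

With real code you would wrap each promise-returning stage the same way and inspect timed.log when the request completes.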

The code for my tests, run from an OS X app, is below.

- (IBAction)timeFetchJobs:(id)sender {

    [self.testConsoleView setString:@""];
    __block NSString* consoleText = @"";
    __block NSDate* debugStartTime = [NSDate date];

    PFQuery* postQuery = [PFQuery queryWithClassName:@"Job"];
    postQuery.limit = appDelegate.mainWC.numberOfRecordsInPage;
    postQuery.skip = appDelegate.mainWC.numberOfRecordsToSkip;
    [postQuery whereKey:@"status" containedIn:@[kOFEJobStatusAwaitingApproval,kOFEJobStatusDraft,kOFEJobStatusLive,kOFEJobStatusReadyToGoLive,kOFEJobStatusClosing]];
    [postQuery includeKey:@"account"];
    [postQuery includeKey:@"account.customer"];
    [postQuery includeKey:@"account.customer.locationListsArray"];
    [postQuery includeKey:@"locationList"];
    [postQuery includeKey:@"publicJob"];
    [postQuery includeKey:@"publicJob.stepsArray"];
    [postQuery includeKey:@"agentList"];
    [postQuery orderByDescending:@"createdAt"];

    [postQuery findObjectsInBackgroundWithBlock:^(NSArray *objects, NSError *error) {

        PFQuery* postQuery2 = [PFQuery queryWithClassName:@"Job"];
        postQuery2.limit = appDelegate.mainWC.numberOfRecordsInPage;
        postQuery2.skip = appDelegate.mainWC.numberOfRecordsToSkip;
        [postQuery2 whereKey:@"status" containedIn:@[kOFEJobStatusAwaitingApproval,kOFEJobStatusDraft,kOFEJobStatusLive,kOFEJobStatusReadyToGoLive,kOFEJobStatusClosing]];
        [postQuery2 includeKey:@"account"];
        [postQuery2 includeKey:@"account.customer"];
        [postQuery2 includeKey:@"account.customer.locationListsArray"];
        [postQuery2 includeKey:@"locationList"];
        [postQuery2 includeKey:@"publicJob"];
        [postQuery2 includeKey:@"publicJob.stepsArray"];
        [postQuery2 includeKey:@"agentList"];
        [postQuery2 orderByDescending:@"createdAt"];

        [postQuery2 findObjectsInBackgroundWithBlock:^(NSArray *objects, NSError *error) {
            NSDate* debugEndTime = [NSDate date];
            NSTimeInterval parseLoadTime = [debugEndTime timeIntervalSinceDate:debugStartTime];
            consoleText = [NSString stringWithFormat:@"Find jobs (2x): %f",parseLoadTime];
            [self.testConsoleView setString:consoleText];
        }];
    }];
}

- (IBAction)timeFindTasks:(id)sender {

    [self.testConsoleView setString:@""];
    __block NSString* consoleText = @"";
    __block NSDate* debugStartTime = [NSDate date];

    __block NSDictionary* params = @{@"appversion" : @(1.9),
                             @"swlat": @(49.84),
                             @"swlon": @(-9.23),
                             @"nelat": @(60.85),
                             @"nelon": @(2.69)};

    [PFCloud callFunctionInBackground:@"findTasks" withParameters:params block:^(id objects, NSError *error) {

        if (!error) {
            [PFCloud callFunctionInBackground:@"findTasks" withParameters:params block:^(id objects, NSError *error) {

                if (!error) {
                    NSDate* debugEndTime = [NSDate date];
                    NSTimeInterval parseLoadTime = [debugEndTime timeIntervalSinceDate:debugStartTime];
                    consoleText = [NSString stringWithFormat:@"Find tasks (twice): %f",parseLoadTime];
                    [self.testConsoleView setString:consoleText];

                }
                else {
                    consoleText = [NSString stringWithFormat:@"ERROR in find tasks: %@",error];
                    [self.testConsoleView setString:consoleText];
                }
            }];
        }
        else {
            consoleText = [NSString stringWithFormat:@"ERROR in find tasks: %@",error];
            [self.testConsoleView setString:consoleText];
        }
    }];

}


screenshot 2016-09-06 10 00 23

@flovilmart
Contributor

flovilmart commented Sep 6, 2016

@zingano Thanks for the detailed report. What kind of dyno are you running on? When running parse-server on your local development machine, do you see similar times? I'm asking that as I want to rule out deployment related problems before any further investigation.

The number of includes you ask for is probably the source of the issue, as I believe we resolve them sequentially instead of in parallel.

Another thing you could test is removing some includes. If you see the time drop, that would indicate we're taking O(n) round-trips to resolve the includes instead of something closer to O(1) (running them in parallel).
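The sequential-vs-parallel point can be sketched as follows; fetchInclude and its 40ms delay are stand-ins for the per-includeKey round-trip, not parse-server's actual code:

```javascript
// fetchInclude simulates one database round-trip per includeKey.
const fetchInclude = (key) =>
  new Promise((resolve) => setTimeout(() => resolve(key), 40));

const includeKeys = ['account', 'account.customer', 'locationList', 'publicJob'];

// Sequential resolution: total time ~= keys.length * 40ms.
async function resolveSequentially(keys) {
  const out = [];
  for (const key of keys) out.push(await fetchInclude(key));
  return out;
}

// Parallel resolution: total time ~= one round-trip (the slowest include).
function resolveInParallel(keys) {
  return Promise.all(keys.map(fetchInclude));
}

async function compare() {
  let t = Date.now();
  await resolveSequentially(includeKeys);
  const seqMs = Date.now() - t;
  t = Date.now();
  await resolveInParallel(includeKeys);
  const parMs = Date.now() - t;
  return { seqMs, parMs };
}
const result = compare();
```

With eight includeKeys as in the test above, the sequential cost grows with every key added, while the parallel cost stays near a single round-trip.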

@flovilmart
Contributor

@zingano Do you feel like working on an improvement for that feature running all handleIncludes in parallel instead of sequentially like we do?

@flovilmart flovilmart changed the title Parse Server much slower than Parse.com with same database RestQuery includes are run in sequence, they should run in parallel / batch Sep 6, 2016
@zingano
Author

zingano commented Sep 6, 2016

@flovilmart Dynos, it says above :-) - Heroku, US East: six 1x web dynos and one 1x worker dyno. I have tried other combinations, even up to Performance-M, but the difference is not close to being eradicated. Note the worker dyno is not used in either of the tests.

@flovilmart
Contributor

@zingano sorry, I misread. So I can definitely say that the includes occur sequentially. Do you want to improve that?

@zingano
Author

zingano commented Sep 6, 2016

However, I disagree with changing this to be only about running the includes in parallel. As I said, my findTasks is also slow (and this is a more complex cloud function with more at play than just include keys - there are multiple finds over various classes).

I also have a minimal findJobs, the same as the code above but without any includeKeys or whereKey, but this time with a limit of 1000.

In this case parse.com has a time of around 3.5 secs and parse server has a much more variable time of around 6-8 seconds.

@zingano
Author

zingano commented Sep 6, 2016

And if I drop the limit to 200, parse.com falls to 1.1s pretty consistently, but parse server is around 3s, but varying wildly up to around 9s.

@zingano zingano changed the title RestQuery includes are run in sequence, they should run in parallel / batch Parse Server much slower than Parse.com with same database Sep 6, 2016
@flovilmart
Contributor

Cloud functions take extra time because they use the HTTP API rather than direct access. This PR will enable direct interaction with parse-server's APIs instead of going through HTTP requests. You can try it by setting the parse-server version to 2.3.0-alpha1.

@zingano
Author

zingano commented Sep 6, 2016

Thanks, I'll do that.

The minimal findJobs mentioned above, with no includeKeys, is OS X client code with no cloud code, like so:

- (IBAction)timeSimpleFindJobs:(id)sender {

    [self.testConsoleView setString:@""];
    __block NSString* consoleText = @"";
    __block NSDate* debugStartTime = [NSDate date];

    PFQuery* postQuery = [PFQuery queryWithClassName:@"Job"];
    postQuery.limit = 200;
    postQuery.skip = 0;
    [postQuery findObjectsInBackgroundWithBlock:^(NSArray *objects, NSError *error) {

        PFQuery* postQuery2 = [PFQuery queryWithClassName:@"Job"];
        postQuery2.limit = 200;
        postQuery2.skip = 0;
        [postQuery2 findObjectsInBackgroundWithBlock:^(NSArray *objects, NSError *error) {

            NSDate* debugEndTime = [NSDate date];
            NSTimeInterval parseLoadTime = [debugEndTime timeIntervalSinceDate:debugStartTime];
            consoleText = [NSString stringWithFormat:@"Simple find jobs (2x): %f",parseLoadTime];
            [self.testConsoleView setString:consoleText];
        }];
    }];
}

@flovilmart
Contributor

On your dynos, do you run with NODE_ENV=production ?

@zingano
Author

zingano commented Sep 7, 2016

@flovilmart - yes, I do.

I'd like to return to one of my main points, which is how do I instrument parse server to discover what is slow, before I jump to fix something that may not be the cause at all? Please can you share how you do it, whether with New Relic or some other way.

@GuichiZhao

I deployed it to Elastic Beanstalk, and Parse Server is consistently much faster than Parse.com.

@zingano
Author

zingano commented Sep 7, 2016

@GuichiZhao what database are you using?

@GuichiZhao

I followed this guide http://docs.aws.amazon.com/quickstart/latest/mongodb/step2a.html for the Mongo server. I didn't check which storage engine it uses, but it's not the RocksDB that Parse recommends; I think it should be MMAPv1, which is the default.

I'm using Elastic Beanstalk as the application server.

Also, the two servers are in the same subnet, i.e. mongodb://10.0.1.123
Maybe the traffic between the database and the application server is slowing you down.

@zingano
Author

zingano commented Sep 7, 2016

My database was set up through Heroku, so much the same setup, I think.

@zingano
Author

zingano commented Sep 7, 2016

I've got AppDynamics running at last and am starting to get some profile data. What I can't understand is why AppDynamics shows a call trace that is e.g. 848ms but shows the whole execution time as 2166ms.

I found two cases of stringify being called in PromiseRouter even though the debug log would not be called - each of those took around 200ms on one trace.

I'm still looking into another stringify, and MongoTransform.js : 711
screenshot 2016-09-07 11 41 05
screenshot 2016-09-07 11 41 22

@flovilmart
Contributor

I found two cases of stringify being called in PromiseRouter even though the debug log would not be called - each of those took around 200ms on one trace

Looking at your trace, it originates from lib/response.js which is the actual expressjs response handle that serializes the JSON response.

@zingano
Author

zingano commented Sep 7, 2016

@flovilmart I didn't show those particular traces. I'll attach one of them.

screenshot 2016-09-07 14 12 40

@zingano
Author

zingano commented Sep 7, 2016

MongoTransform seems to be most consistently to blame, but I'm still confused about the difference in the entire time (in the tab) and the execution time (left, top). MongoTransform can be a large percentage of the latter, but not usually of the former. Any AppDynamics people help me out? Or anyone use another trace tool?

@zingano
Author

zingano commented Sep 8, 2016

For anyone else who's interested in instrumenting Parse Server: you'll be glad to know that New Relic do think something is amiss, and their support engineers have filed a request to add support for Parse Server (possibly just on Heroku, depending on whether the issue is seen more widely). I should be notified when that support arrives.

@flovilmart
Contributor

@zingano, your first screenshot, originating from PromiseRouter, is actually the request serialization. In your tests, what's the size of the response you're sending?

@flovilmart
Contributor

@zingano I also see in the stack trace that the stringify comes from the ES6 polyfill.

@zingano
Author

zingano commented Sep 13, 2016

@flovilmart - not sure, but some responses are pretty large. I feel that if the issues we've referenced were all that's causing my problems, someone else would have come across them too, rather than reporting that Parse Server is much faster. But after I have completed my current work (removing stale data from my database to speed things up that way, since that is a certainty for getting me ready for production), I will look at these defects and also at your direct access, which I think may have more impact. Unless it is the latter, I don't think we've found the whole issue yet.

@flovilmart
Contributor

Can you try that branch and see if the response times are better, or at least confirm that stringify is no longer the cause of the problem?

@gateway

gateway commented Sep 14, 2016

I also seem to be hitting an issue with the JSON parser, at least as far as I can tell from the New Relic dashboard (#2711). My AppDynamics trial expired so I can't really drill down deeper :(

@zingano
Author

zingano commented Sep 20, 2016

I have tried enabling the direct access and I see no clear difference in my test times.

I'm trying to find out at App Dynamics why the massive difference between the call time and the sum of the execution times - https://community.appdynamics.com/t5/AppDynamics-Discussions/Difference-in-times-on-top-tab-entire-execution-time-of-a/m-p/22075/highlight/false#M7725

I'll attach a particularly egregious example - in this call the summed execution time is 671ms but the total execution time (in the tab) is 4,662ms. It seems to me that it's not worth chasing small improvements in the execution time until I find what is responsible for this 4 second "wait time".

My App Dynamics trial runs out tomorrow, I think.

call with session affinity 2x dyno no experimental direct access

@flovilmart
Contributor

@zingano the direct access only affects cloud code, by removing all HTTP calls and replacing them with direct JS calls. What are you running? Still the same function?

@zingano
Author

zingano commented Sep 20, 2016

My findTasks test calls cloud code. It's coming in pretty much the same with and without direct access.

@zingano
Author

zingano commented Sep 20, 2016

@flovilmart - do you have any ideas about what could be causing the massive "wait time" above?

@steven-supersolid
Contributor

Could the total time include waiting for the response from the database, etc.?

@zingano
Author

zingano commented Sep 20, 2016

I haven't had any slow responses from the database - well, only one, not involved in that test, and even that was under 150ms.

@flovilmart
Contributor

Not really. What if you time it manually, running the function itself with pure Node.js?

@zingano
Author

zingano commented Sep 22, 2016

DIRECT ACCESS

OK, I have a few stats back from running the direct access and it does seem faster. I had to run it on real production so there was enough data to play with, and didn't see any glitches in the matrix. However I don't want to leave it on there until I'm confident (i.e. it's no longer experimental in a release) so those few tests are all I'm going to do.

Heroku - 3 2x dynos @1024MB, 8GB variable-RAM shared-tier database, session affinity, NO experimental direct access
Find tasks test (cloud) - 10.0, 15.8, 16.8, 12, 11.1

Heroku - 3 2x dynos @1024MB, 8GB variable-RAM shared-tier database, session affinity, experimental direct access
Find tasks test (cloud) - 10.1, 9.0, 7.8, 9.8, 8.0

For comparison, here's Parse.com - 9.8, 6.9, 7.7

So, without more major testing, I'd say that's now getting pretty similar results to Parse.com, which pleases me, as the main bottlenecks for my mobile clients will be the three cloud code calls they make, which I can also work on optimising further.

@zingano
Author

zingano commented Sep 22, 2016

OTHER TIMINGS

Here for comparison are a few tests across some of the configurations I've tried:

Parse.com
QUERY WITH MANY INCLUDES, run twice - 8.4, 5.4, 8.4, 6.1
QUERY NO INCLUDES, run twice - 1.03, 0.87, 1.09
Dashboard stats - 15.76

Heroku - 3 2x dynos @1024MB, 8GB variable-RAM shared-tier database, session affinity
QUERY WITH MANY INCLUDES, run twice - 12.8, 8.6, 8.6, 9.1
QUERY NO INCLUDES, run twice - 2.66, 5.07, 3, 2.61
Dashboard stats - 31.94

NodeChef - 3 servers @1024MB, 10GB 1GB RAM database
QUERY WITH MANY INCLUDES, run twice - 6.3, 6.2, 6.2
QUERY NO INCLUDES, run twice - 1.9, 2.3, 1.78, 2.3
Dashboard stats - 24.87

Conclusions

NodeChef is better than Parse.com at queries with many includes (which makes me suspect they did some optimisation in this area as Parse Server itself is slower).

NodeChef is slower than Parse.com on simple queries with no includes. (Parse Server on node.js is slower than the Parse.com GO code)

NodeChef is slower than Parse.com on massive database counts (Dashboard stats)

My Heroku setup is in every way worse than NodeChef, although not by an order of magnitude, and therefore probably manageable if I optimise my own code.

Makes me suspect that either:

a) NodeChef don't contribute optimisations back to Parse Server, or

b) it could be my database, as I was running on a shared cluster with variable RAM. I am currently stepping up to the M1 dedicated plan with 1.7GB RAM for comparison.

Finally, something else that may be true (especially over other tests I've not shown here) is that my Heroku setup is more variable in timings than either Parse.com or NodeChef.

@steven-supersolid
Contributor

Just a thought, but Heroku 2x dynos are also shared, so it may be worth trying Performance-M to see whether you get more consistent test times with dedicated dynos. I believe those have 2 CPUs, so you can safely use a Node cluster size of 2.

NodeChef uses the RocksDB storage engine which Parse.com also uses I think. RocksDB or WiredTiger should give significant performance gains over MMAPv1. mLab offers WT on some of their dedicated cluster plans so you could try that for a test.

These are all hosting details though. Profiling is definitely useful if you can narrow the problem down to a specific part of the code. Database profiling is a great help too as, for example, we have previously seen parse-server make extraneous queries before caching was introduced.

@zingano
Author

zingano commented Sep 22, 2016

One other quick question - I'm seeing three authentications a second on my database. Is that to be expected?

@zingano
Author

zingano commented Sep 22, 2016

Thanks Steven - I thought that myself and tested Performance-M briefly, and didn't see what I hoped for. However, that was a while ago and things have changed since. I'm not set up to scale within a dyno, but I could at least test with one parse-server per dyno.

I think that because my mLab comes via the Heroku add-on, I can't move to WiredTiger. That might be different once I'm on M1 dedicated; I'll see.

@zingano
Author

zingano commented Sep 22, 2016

And while this thread is active, what do people think the following do for performance?

  1. Session affinity
  2. Node version (I'm on 4.3 - worth trying > 6.0?)
  3. Client SDK - both my apps are still on Parse.com SDKs

@steven-supersolid
Contributor

You should hopefully see a performance increase with 2.2.24 as the number of _SCHEMA queries should be reduced per request.
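A rough sketch of why fewer _SCHEMA queries helps (this is NOT parse-server's actual implementation — loadSchemaFromDb and the cache shape are invented for illustration): memoizing the schema lookup for the duration of a request turns N reads into one.

```javascript
// Stand-in for the _SCHEMA collection query; counts how often it runs.
let schemaReads = 0;
function loadSchemaFromDb() {
  schemaReads += 1;
  return Promise.resolve({ Job: { fields: { status: 'String' } } });
}

// Request-scoped memoization: cache the promise itself, so concurrent
// callers within one request share a single in-flight database read.
function makeRequestScopedSchemaCache() {
  let cached = null;
  return function getSchema() {
    if (!cached) cached = loadSchemaFromDb();
    return cached;
  };
}

const getSchema = makeRequestScopedSchemaCache();
// Three lookups in the same "request" trigger only one database read.
const done = Promise.all([getSchema(), getSchema(), getSchema()]);
```

Caching the promise rather than the resolved value is what prevents the thundering-herd case where several queries in one request all miss the cache at once.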

@flovilmart
Contributor

Worth updating as we revamped the inclusion strategies in 2.2.23.

@paivaric

Why not use MongoDB Bulk Operations?
This is pretty fast:

var bulk = db.users.initializeUnorderedBulkOp();
bulk.insert( { user: "abc123", status: "A", points: 0 } );
bulk.insert( { user: "ijk123", status: "A", points: 0 } );
bulk.insert( { user: "mop123", status: "P", points: 0 } );
bulk.execute();

https://docs.mongodb.com/manual/reference/method/Bulk/
https://docs.mongodb.com/manual/reference/method/db.collection.initializeUnorderedBulkOp/#db.collection.initializeUnorderedBulkOp

@dplewis
Member

dplewis commented Dec 24, 2019

Moving conversation to #6193

@dplewis dplewis closed this as completed Dec 24, 2019
@ozbillwang

ozbillwang commented Aug 27, 2020

@zingano

Do you still recall which file you added the code below to, to get the AppDynamics metrics working?

require("appdynamics").profile({
 controllerHostName: 'wharf20200xxxxxxx.saas.appdynamics.com',
 controllerPort: 443,

 // If SSL, be sure to enable the next line
 controllerSslEnabled: true,
 accountName: 'wharf202008200xxxxxxxx',
 accountAccessKey: 'gb7pufbxxxx',
 applicationName: 'parse',
 tierName: 'nodejs',
 nodeName: 'test' // The controller will automatically append the node name with a unique number
});

I tried /bin/parse-server, lib/cli/parse-server.js and others; none of them seems to work.

@mtrezza mtrezza added type:feature New feature or improvement of existing feature and removed type:improvement labels Dec 6, 2021

9 participants