Autocomplete tester #28

orangejulius · 2015-12-10T21:06:15Z

This is it! A way too big pull request that includes a lot of refactoring (also in #25, which is now closed), and adds the ability to test autocomplete.

It includes not just a new output format for autocomplete, but some additional interesting autocomplete related statistics.

Here's the latest screenshot of the acceptance test output

Fixes #23

The callback function doesn't have to be use for an output generator as it was previously named.

This helps reinforce that they are related

exec_test_case.js has a lot of responsiblities now handled by smaller more focused modules: validate_test_suites.js: perform validation on test suites. Some errors are recoverable, some are not gather_test_suites.js: find all test suites, filter out tests that shouldn't be run based on test type command line parameter gather_test_urls.js: collect all the URLs from all the test cases that should be fetched request_urls.js: perform all the fetching of URLs eval_tests.js: run all the scoring code on each of our tests based on the responses previously fetched analyze_results.js: perform any final analysis required on the tests results. currently it simply tallys up the pass/fail/regression/improvement count test_suite_helpers.js: miscellaneous helper code used in a few places

The return code is used as the return code for the entire process. Previously each output generator was responsible for calling process.exit which was easy to forget about.

The functionality that used to be in these files is now separated more cleanly into many other files.

The `request_urls` module uses the [HTTP.Agent](https://nodejs.org/api/http.html#http_class_http_agent) class to limit the maximum number of open HTTP sockets (by default with no agent configured, no artificial limit on the number of sockets is set). This means that instead of attempting to space out the actual calling of request(), all the request() calls can be made at once, and then they will only actually be performed a few at a time. In my testing this is able to completely eliminate the possibility of overloading our server with too many requests at once, while allowing for much faster test suite execution. As an added (small) bonus, HTTP keepalives are now set so the overhead of initializing a connection for each request is removed, making things even faster. Finally, when results are cached in Fastly, the test suite can execute extremely fast: I've seen over 400 request/second handled just fine. Oh, and it's a lot less code :)

These are somewhat confusing because, for example, the autocomplete tests can show that a completely correct result was first, but the test still failed (because there was another expected result that wasn't found).

This is much more friendly to dev.

There seems to an [issue](nodejs/node#2148) with some versions of Node that cause one of stderr and stdout to be buffered, while the other is not. This jumbles output if both are used.

It now has to be able to print the circular test case structure, and URL info has slightly changed.

The test evaluation code can handle 500 responses, we just need to store them.

These basically mean autocomplete was not helpful over search.

Production and dev have different timeouts

This is slower when testing against fastly, but multiple sockets can easily overwhelm prodbuild or dev.

Otherwise, when some requests are currently being retried, the script can exit early before printing results (but usually after waiting through almost all of the fetching).

This reverts commit a21eb16. Because of the timeout functionality of Elasticsearch, even using only one socket can overwhelm a server when sending off requests as fast as possible, if timeouts start coming back, requests need to be spaced out and ExponentialBackoff is really good at that.

0 is now acceptable for minimum delay since all requests are queued through a single socket. However, in the case of retries it's desireable to slow down more severely to allow the Elasticsearch cluster to finish processing any timed-out requests, so the exponent is increased. Finally the maximum timeout is doubled to 20 seconds since an overloaded Elasticsearch server tends to thrash (our 4 core prodbuild server will sometimes have a load of 20+)

setTimeout and setInterval can't handle a 0ms delay, and ExponentialBackoff defaults to 50ms when passed 0ms

Autocomplete tester

orangejulius added the in progress label Dec 10, 2015

orangejulius mentioned this pull request Dec 10, 2015

Big refactor to move functionality of exec_test_suite.js into smaller modules #25

Closed

orangejulius added in review and removed in progress labels Dec 10, 2015

orangejulius mentioned this pull request Dec 17, 2015

Create autocomplete specific test cases pelias/pelias#196

Closed

13 tasks

orangejulius added 25 commits December 18, 2015 11:39

Use forEach instead of recurisve function

f67ea82

Move argument processing code into separate file

b3b7f0b

Simplify finding of test files

719f0fa

Improve help output and simplify help code

cfbdb2a

Use generic variable name in ExecTestSutes

1c2a775

The callback function doesn't have to be use for an output generator as it was previously named.

Separate config parsing, test running, and output generating

42902ff

Extract methods used in execTestSuites

7dd1c53

Clarify global nature of stats object

93d7274

Move colors module include to output generators

a43d8a1

Rename main method of processArguments

dd4f589

Nest enpoint name and url in a single object

7aa07ba

This helps reinforce that they are related

Pass only config and callback to execTestSuites

629e29e

Use helper method to print progress while fetching URLs

f85400e

Use return code from output generators

29b5def

The return code is used as the return code for the entire process. Previously each output generator was responsible for calling process.exit which was easy to forget about.

Use getLocations from test_suite_helpers

0d3d19c

Improve comments in main program body

d1d362e

Remove old files

c30bb58

The functionality that used to be in these files is now separated more cleanly into many other files.

Fix indentation

cd6f44e

Calculate running time in analyze_results

58c9334

WIP: autocomplete tester

f2ed56d

Remove error output

3dc9f8a

Index results by url for faster lookup

fb0899e

Filter out autocomplete tests that completely fail

1e7c92c

orangejulius added 22 commits December 18, 2015 11:39

Add output note when there are multiple test expectations

140ae8b

These are somewhat confusing because, for example, the autocomplete tests can show that a completely correct result was first, but the test still failed (because there was another expected result that wasn't found).

Use only 3 sockets for requesting URLs

6ec6ce4

This is much more friendly to dev.

Extract retry logic to function

65659ec

Replace console.error with console.log in output generator

b8d6c4f

There seems to an [issue](nodejs/node#2148) with some versions of Node that cause one of stderr and stdout to be buffered, while the other is not. This jumbles output if both are used.

Update error response printing code

d51d308

It now has to be able to print the circular test case structure, and URL info has slightly changed.

Store results of any error code

57cd469

The test evaluation code can handle 500 responses, we just need to store them.

Print number of retries (from rate limiting) while fetching

d9280bf

Fix clearing of interval ID when fetching URLs

31645b2

Add interesting autocomplete stats

236532a

Add count of test cases that pass autocomplete only on last character

5d1cbfc

These basically mean autocomplete was not helpful over search.

Add example of autocomplete output

de215a1

Add autocomplete section to readme

0169e4c

Retry on any request timeout

45fe13d

Production and dev have different timeouts

Use only a single socket for all requests

9d16c5d

This is slower when testing against fastly, but multiple sockets can easily overwhelm prodbuild or dev.

Only clear the request interval when all are completely done

471a25c

Otherwise, when some requests are currently being retried, the script can exit early before printing results (but usually after waiting through almost all of the fetching).

Print progress after retrying to increment number of retries

e000e4c

Use ExponentialBackoff in request_urls module

2694651

Fix missing parameter in ExponentialBackoff docs

5d892e9

Clear interval before resetting it

1e4d504

Properly check for request timeout error messages

3290242

orangejulius force-pushed the autocomplete-tester branch from d0c5468 to 79ff088 Compare December 18, 2015 16:39

orangejulius added 2 commits December 18, 2015 12:48

Only clear interval when it changes

c4e5143

Use 1ms delay between tests

496a8c9

setTimeout and setInterval can't handle a 0ms delay, and ExponentialBackoff defaults to 50ms when passed 0ms

riordan assigned orangejulius Jan 6, 2016

orangejulius added a commit that referenced this pull request Jan 6, 2016

Merge pull request #28 from pelias/autocomplete-tester

bcd59f7

Autocomplete tester

orangejulius merged commit bcd59f7 into master Jan 6, 2016

orangejulius removed the in review label Jan 6, 2016

orangejulius deleted the autocomplete-tester branch March 23, 2016 17:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Autocomplete tester #28

Autocomplete tester #28

orangejulius commented Dec 10, 2015

Autocomplete tester #28

Autocomplete tester #28

Conversation

orangejulius commented Dec 10, 2015