
Wrong TP reporting for long delay lines like xDSL uplink #910

Closed
zokl opened this issue Sep 2, 2019 · 7 comments


zokl commented Sep 2, 2019

Hello Developers,

I would like to report an issue with measuring a real xDSL line. We are trying to measure the uplink direction (512 kbps), but the results contain no throughput, even though RTT, CWND and retransmissions are collected. The throughput field is always printed, but only the first interval has a non-zero value; all other values are zero.

The example output follows:
iperf_dsl_issue.txt

We used iperf3 versions 3.6 and 3.7; both have the same problem.

  • Version of iperf3: 3.7
  • Hardware: x86
  • Operating system (and distribution, if any): OpenWRT - master branch

The issue can be reproduced with tc using the following command:
tc qdisc add dev eth1 root tbf rate 512kbit burst 1540 limit 384k
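As a side note (my own arithmetic, not from the thread): the tbf "limit 384k" allows up to 384 KiB to queue, which at the 512 kbit/s rate implies several seconds of standing queue on top of the line's own latency. A quick sketch:

```python
# Back-of-the-envelope check of the tbf parameters in the tc command above.
# "limit 384k" = max queue of 384 KiB; at "rate 512kbit" that queue takes
# several seconds to drain, adding substantial delay to the emulated link.
RATE_BPS = 512_000          # tbf rate: 512 kbit/s
LIMIT_BYTES = 384 * 1024    # tbf limit: 384k

drain_seconds = LIMIT_BYTES * 8 / RATE_BPS
print(f"max queuing delay: {drain_seconds:.2f} s")  # ~6.14 s
```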


bmah888 commented Sep 10, 2019

I'm starting to look at this. I have partially reproduced the problem you're seeing. It would help a little bit if you could send me the command-line arguments you are using on the client and server side... I can sort of dig them out of the JSON output, but having the actual arguments would be useful.

My first guess is the problem is caused by a combination of: 1) the really slow 512 kbps link speed, 2) the fairly large default send size used by TCP tests (128 KB), and 3) trying to do 3 parallel streams. At that link speed, it takes about two seconds to finish a send (yes, it's chopped up into smaller TCP segments), and if you're doing 3 parallel streams there might very well be intervals where iperf3 doesn't actually record the sending or reception of a complete send.
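The "about two seconds" figure is easy to verify from the numbers in the comment above (128 KB default send size, 512 kbit/s link):

```python
# How long one default-sized iperf3 TCP send takes on a 512 kbit/s link.
SEND_BYTES = 128 * 1024     # iperf3 default TCP send size: 128 KB
RATE_BPS = 512_000          # link speed: 512 kbit/s

send_seconds = SEND_BYTES * 8 / RATE_BPS
print(f"one send: {send_seconds:.2f} s")  # ~2.05 s, i.e. longer than the
                                          # default 1-second report interval
```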

If you try something smaller, like --length 1k, it forces iperf3 to do finer-grained measurements by doing small sends, and you get more intuitive results.
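Again checking the arithmetic myself: with --length 1k, each send completes in a few milliseconds, so every 1-second reporting interval contains dozens of completed sends and the per-interval throughput becomes meaningful:

```python
# With --length 1k, sends complete quickly enough that every 1-second
# reporting interval sees many completed sends.
SEND_BYTES = 1024           # --length 1k
RATE_BPS = 512_000          # link speed: 512 kbit/s

send_seconds = SEND_BYTES * 8 / RATE_BPS
sends_per_interval = 1.0 / send_seconds
print(f"one send: {send_seconds * 1000:.0f} ms, "
      f"~{sends_per_interval:.0f} sends per 1 s interval")
```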

(I was going to insert some output here to illustrate my point, but I have to figure out how to get it out of the VM I was testing on, sigh.)

Note that iperf3 was originally designed for high-speed networks that are several orders of magnitude faster than the environment that you're testing, and so some adjustment of parameters might be necessary to get useful results.

bmah888 self-assigned this Sep 10, 2019

acooks commented Sep 10, 2019

The log file shows that the test did not complete successfully.

I regularly use iperf3 (with patches) on these kinds of links. I think this patch is relevant to this issue, but it was previously submitted and rejected as redundant.


zokl commented Sep 10, 2019

> The log file shows that the test did not complete successfully.
>
> I regularly use iperf3 (with patches) on these kinds of links. I think this patch is relevant to this issue, but it was previously submitted and rejected as redundant.

This is not our problem. We are missing data during the test.


acooks commented Sep 10, 2019 via email


zokl commented Sep 10, 2019

> I'm starting to look at this. I have partially reproduced the problem you're seeing. It would help a little bit if you could send me the command-line arguments you are using on the client and server side... I can sort of dig them out of the JSON output, but having the actual arguments would be useful.

For our tests comparing several network technologies, we use the same parameters on the server and client, for example:

/usr/bin/iperf3 -c 147.32.211.37 --connect-timeout 1000 -t 90 --logfile XYZ.iperf3 -p 5201 --get-server-output -J --parallel 3 --window 1500k --set-mss 1400 -C cubic

> My first guess is the problem is caused by a combination of: 1) the really slow 512 kbps link speed, 2) the fairly large default send size used by TCP tests (128 KB), and 3) trying to do 3 parallel streams. At that link speed, it takes about two seconds to finish a send (yes, it's chopped up into smaller TCP segments), and if you're doing 3 parallel streams there might very well be intervals where iperf3 doesn't actually record the sending or reception of a complete send.

Yes, what you describe is likely the cause of this behavior. Would it be possible to improve the bit-rate calculation so that it records throughput even with these parameters? RTT, CWND and retransmissions work; only the throughput has a problem.

> If you try something smaller, like --length 1k, it forces iperf3 to do finer-grained measurements by doing small sends, and you get more intuitive results.

A TCP window size below 128k works.

Communication graph with TCP window 64k (works): iperf3_TP-problem-w64k.txt

Communication graph with wrong throughput results: iperf3_TP-problem.txt

> (I was going to insert some output here to illustrate my point, but I have to figure out how to get it out of the VM I was testing on, sigh.)

> Note that iperf3 was originally designed for high-speed networks that are several orders of magnitude faster than the environment that you're testing, and so some adjustment of parameters might be necessary to get useful results.


bmah888 commented Oct 1, 2019

@zokl: I think really the key here is to change the --length parameter. iperf3 can only compute the throughput on the basis of complete send calls into the network, and the granularity of those is what the --length parameter is supposed to control. Put another way, if on average there is less than one send call that can complete during a measurement interval, you're going to have some measurement intervals with zero throughput, because iperf3 can only measure the time between complete send calls.
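A toy model of the effect described above (my own illustration, not iperf3's actual accounting): if each send takes ~2.05 s to complete but throughput is sampled per 1-second interval, most intervals contain no completed send and therefore report zero bytes.

```python
# Toy model (NOT iperf3's real accounting): sends complete every ~2.05 s,
# but throughput is tallied per 1 s interval, so intervals with no completed
# send report zero bytes -- matching the zero-throughput lines in the issue.
SEND_SECONDS = 2.05         # one 128 KB send at 512 kbit/s
SEND_BYTES = 128 * 1024
INTERVAL = 1.0              # default iperf3 reporting interval
DURATION = 10.0             # 10-second test

bytes_per_interval = [0] * int(DURATION / INTERVAL)
t = SEND_SECONDS
while t < DURATION:
    bytes_per_interval[int(t / INTERVAL)] += SEND_BYTES  # credit completion
    t += SEND_SECONDS

zero = sum(1 for b in bytes_per_interval if b == 0)
print(bytes_per_interval)
print(f"{zero} of {len(bytes_per_interval)} intervals report zero throughput")
```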

Hmmm. Now that I'm thinking about it, here are a couple other ideas. Another thing you could try is specifying a larger value for --interval (the default is 1 second).

Also you can try (weirdly) reducing the TCP window size, because a burst of packets can get absorbed by the sender buffers on the client side. (This is a funny variant of the buffer bloat problem.)
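For scale (my own arithmetic, using the --window 1500k from the client command earlier in the thread): a full 1500 KB sender buffer takes tens of seconds to drain at 512 kbit/s, so data can sit in the buffer far longer than any reporting interval, smearing the per-interval throughput.

```python
# How long a full 1500k socket buffer takes to drain at 512 kbit/s.
# A buffer this large can absorb a whole burst, so per-interval throughput
# on the wire no longer tracks when the application's sends complete.
WINDOW_BYTES = 1500 * 1024  # --window 1500k from the client command
RATE_BPS = 512_000          # link speed: 512 kbit/s

drain_seconds = WINDOW_BYTES * 8 / RATE_BPS
print(f"buffer drain time: {drain_seconds:.1f} s")  # 24.0 s
```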


zokl commented Oct 2, 2019

Hi Bruce,
Thank you very much for your answer. What you write is true: changing the interval, and especially the TCP window, helps. Changing the packet size does not make much difference on xDSL; with GPRS/EDGE or BPL communication, adjusting the packet size works better. Overall, the measurement is still confusing to me: the test runs and most parameters are calculated, yet only the throughput is 0.

zokl closed this as completed Oct 2, 2019
No branches or pull requests

3 participants