Strange restriction bandwidth #73

Kotische · 2015-02-12T14:35:34Z

If I connect 2 servers directly,
4 Ethernet ports to 4 Ethernet ports by cable,
I get about 4 Gbit/s of bandwidth on 1 flow.

If I connect the 2 servers through a switch,
4 Ethernet ports to a Cisco Catalyst and then to 4 Ethernet ports,
I get only about 1.6 Gbit/s of bandwidth on 1 flow.
But I get about 4 Gbit/s of total bandwidth on 4 parallel flows.

obonaventure · 2015-02-12T15:43:24Z

Please ask the question on the mptcp-dev mailing list and provide a bit more information on the configuration of the catalyst switch. Do you use the same MTU in both cases?

Kotische · 2015-02-16T15:40:57Z

It is a real bug!
Linux distribution: Debian squeeze
MPTCP version: 0.88
net.mptcp.mptcp_path_manager = fullmesh

All Ethernet adapters are on the same network.
This setup generates interfering (crossover) traffic.
As a result, the total bandwidth is too low.

I defined iptables rules that accept the one-to-one traffic
and drop the crossover traffic, and the bandwidth is now 3.82 Gbit/s!
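
The exact rules were not posted in this thread. As a rough sketch of what such a rule set could look like, the Python snippet below only prints iptables commands; it does not run them. The addresses (local 192.168.1.1-4, remote 192.168.1.100-103), the one-to-one pairing by position, and the choice of the OUTPUT chain are all assumptions made for illustration.

```python
# Hypothetical addresses; replace with the real interface addresses.
LOCAL = ["192.168.1.1", "192.168.1.2", "192.168.1.3", "192.168.1.4"]
REMOTE = ["192.168.1.100", "192.168.1.101", "192.168.1.102", "192.168.1.103"]


def crossover_rules(local, remote, chain="OUTPUT"):
    """Build one iptables command per (local, remote) address pair.

    The pair at the same position in both lists is accepted (one-to-one);
    every other combination is treated as crossover traffic and dropped.
    """
    rules = []
    for i, src in enumerate(local):
        for j, dst in enumerate(remote):
            target = "ACCEPT" if i == j else "DROP"
            rules.append(f"iptables -A {chain} -s {src} -d {dst} -j {target}")
    return rules


if __name__ == "__main__":
    # Print the commands for review instead of executing them.
    for rule in crossover_rules(LOCAL, REMOTE):
        print(rule)
```

Similar rules would be needed on the other server with the address lists swapped, so that crossover subflows are blocked in both directions.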

cpaasch · 2015-02-16T15:50:04Z

Yes, having the crossover traffic is bad for the bandwidth. As the number of subflows increases, the CPU-overhead increases and scheduling across the subflows also gets less optimal.

Ideally, you want to minimize the number of subflows, so that exactly one subflow crosses each bottleneck.

Automatically preventing these crossover subflows within the kernel is quite a difficult problem. If you have an idea, that would be great! :)
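
To put numbers on the point about subflow count, here is a small sketch that assumes fullmesh attempts one subflow per (local address, remote address) pair and that each host has 4 addresses. The addresses are hypothetical.

```python
from itertools import product

# Hypothetical addresses for the two servers.
LOCAL = ["192.168.1.1", "192.168.1.2", "192.168.1.3", "192.168.1.4"]
REMOTE = ["192.168.1.100", "192.168.1.101", "192.168.1.102", "192.168.1.103"]

# Full mesh: every local address paired with every remote address.
mesh = list(product(LOCAL, REMOTE))

# One subflow per link: local address i paired only with remote address i.
direct = list(zip(LOCAL, REMOTE))

# Everything in the mesh that is not a one-to-one pair is crossover.
crossover = [pair for pair in mesh if pair not in direct]

print(len(mesh), len(direct), len(crossover))  # 16 4 12
```

Under these assumptions, 12 of the 16 subflows share a link with another subflow, which is the situation the iptables workaround removes.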

Kotische · 2015-02-16T16:26:44Z

The fullmesh path manager does not allow specifying the number of subflows :(

The ndiffports path manager with num_subflows=4 does not give a positive result either :(

It would be nice if there were a state file showing the current state of the connection, with a default preset for it.

for example:
node1
ip1 192.168.1.1
ip2 192.168.1.2
ip3 192.168.1.3
ip4 192.168.1.4
subflow preset 4
subflow1 192.168.1.1 - 192.168.1.100
subflow2 192.168.1.2 - 192.168.1.101
subflow3 192.168.1.3 - 192.168.1.102
subflow4 192.168.1.4 - 192.168.1.103
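
To make the proposed preset concrete, here is a minimal Python sketch that reads the subflow lines of the example above into (source, destination) pairs. The file format is only the suggestion from this comment, not an existing MPTCP interface, and the function name `parse_subflows` is made up for illustration.

```python
import ipaddress
import re

# The example preset from the comment above, embedded for illustration.
PRESET = """\
node1
ip1 192.168.1.1
ip2 192.168.1.2
ip3 192.168.1.3
ip4 192.168.1.4
subflow preset 4
subflow1 192.168.1.1 - 192.168.1.100
subflow2 192.168.1.2 - 192.168.1.101
subflow3 192.168.1.3 - 192.168.1.102
subflow4 192.168.1.4 - 192.168.1.103
"""

# Matches "subflowN <src> - <dst>", tolerating missing spaces around "-".
SUBFLOW_RE = re.compile(r"^subflow(\d+)\s+(\S+)\s*-\s*(\S+)$")


def parse_subflows(text):
    """Return the (source, destination) address pairs listed in the preset."""
    pairs = []
    for line in text.splitlines():
        match = SUBFLOW_RE.match(line.strip())
        if match is None:
            continue  # not a subflow line (node name, ipN, preset count)
        # ip_address() raises ValueError on a malformed address.
        src = ipaddress.ip_address(match.group(2))
        dst = ipaddress.ip_address(match.group(3))
        pairs.append((str(src), str(dst)))
    return pairs


if __name__ == "__main__":
    for src, dst in parse_subflows(PRESET):
        print(f"{src} -> {dst}")
```

A path manager fed with such pairs would open exactly these four subflows instead of the full mesh.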

cpaasch · 2015-02-16T20:37:58Z

Yes, a proper interface to specify which subflows should be created would be good. E.g., over netlink,...

Patches are very welcome! :)

(I think, we can close this issue-report and move discussion about an API for path-management to the mailing-list)

commit 9f0bbf3 upstream.

Because there may be random garbage beyond a string's null terminator, it's not correct to copy the complete character array for use as a hist trigger key. This results in multiple histogram entries for the 'same' string key.

So, in the case of a string key, use strncpy instead of memcpy to avoid copying in the extra bytes.

Before, using the gdbus entries in the following hist trigger as an example:

  # echo 'hist:key=comm' > /sys/kernel/debug/tracing/events/sched/sched_waking/trigger
  # cat /sys/kernel/debug/tracing/events/sched/sched_waking/hist

  ...

  { comm: ImgDecoder #4 } hitcount: 203
  { comm: gmain } hitcount: 213
  { comm: gmain } hitcount: 216
  { comm: StreamTrans #73 } hitcount: 221
  { comm: mozStorage #3 } hitcount: 230
  { comm: gdbus } hitcount: 233
  { comm: StyleThread#5 } hitcount: 253
  { comm: gdbus } hitcount: 256
  { comm: gdbus } hitcount: 260
  { comm: StyleThread#4 } hitcount: 271

  ...

  # cat /sys/kernel/debug/tracing/events/sched/sched_waking/hist | egrep gdbus | wc -l
  51

After:

  # cat /sys/kernel/debug/tracing/events/sched/sched_waking/hist | egrep gdbus | wc -l
  1

Link: http://lkml.kernel.org/r/50c35ae1267d64eee975b8125e151e600071d4dc.1549309756.git.tom.zanussi@linux.intel.com

Cc: Namhyung Kim <[email protected]>
Cc: [email protected]
Fixes: 79e577c ("tracing: Support string type key properly")
Signed-off-by: Tom Zanussi <[email protected]>
Signed-off-by: Steven Rostedt (VMware) <[email protected]>
Signed-off-by: Greg Kroah-Hartman <[email protected]>

…nsigned fw_level

[ Upstream commit e75d18cecbb3805895d8ed64da4f78575ec96043 ]

Though acpi_find_last_cache_level() always returned signed value and the document states it will return any errors caused by lack of a PPTT table, it never returned negative values before.

Commit 0c80f9e165f8 ("ACPI: PPTT: Leave the table mapped for the runtime usage") however changed it by returning -ENOENT if no PPTT was found. The value returned from acpi_find_last_cache_level() is then assigned to unsigned fw_level.

It will result in the number of cache leaves calculated incorrectly as a huge value which will then cause the following warning from __alloc_pages as the order would be greater than MAX_ORDER because of incorrect and huge cache leaves value.

  | WARNING: CPU: 0 PID: 1 at mm/page_alloc.c:5407 __alloc_pages+0x74/0x314
  | Modules linked in:
  | CPU: 0 PID: 1 Comm: swapper/0 Not tainted 5.19.0-10393-g7c2a8d3ac4c0 multipath-tcp#73
  | pstate: 20000005 (nzCv daif -PAN -UAO -TCO -DIT -SSBS BTYPE=--)
  | pc : __alloc_pages+0x74/0x314
  | lr : alloc_pages+0xe8/0x318
  | Call trace:
  |  __alloc_pages+0x74/0x314
  |  alloc_pages+0xe8/0x318
  |  kmalloc_order_trace+0x68/0x1dc
  |  __kmalloc+0x240/0x338
  |  detect_cache_attributes+0xe0/0x56c
  |  update_siblings_masks+0x38/0x284
  |  store_cpu_topology+0x78/0x84
  |  smp_prepare_cpus+0x48/0x134
  |  kernel_init_freeable+0xc4/0x14c
  |  kernel_init+0x2c/0x1b4
  |  ret_from_fork+0x10/0x20

Fix the same by changing fw_level to be signed integer and return the error from init_cache_level() early in case of error.

Reported-and-Tested-by: Bruno Goncalves <[email protected]>
Signed-off-by: Sudeep Holla <[email protected]>
Link: https://lore.kernel.org/r/[email protected]
Signed-off-by: Will Deacon <[email protected]>
Signed-off-by: Sasha Levin <[email protected]>

cpaasch closed this as completed Feb 16, 2015
