Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DNS-SD PTR record ill formed on Pluto... #1072

Closed
rgetz opened this issue Oct 13, 2023 · 1 comment · Fixed by #1080
Closed

DNS-SD PTR record ill formed on Pluto... #1072

rgetz opened this issue Oct 13, 2023 · 1 comment · Fixed by #1080

Comments

@rgetz
Copy link
Contributor

rgetz commented Oct 13, 2023

In iiod, we wait until host name is accessible (in a background thread), I thought that would mean that networking/dbus is ready, and then eventually start avahi
https://github.com/analogdevicesinc/libiio/blob/main/iiod/dns-sd.c#L257-L288

However - it looks like this doesn't actually work robustly. PTR records don't include the name like they are supposed to.

rgetz@brain:~/github/libiio/deps/mdns$  avahi-discover
...
Found service 'iiod on (none)' of type '_iio._tcp' in domain 'local' on 68.0.
Found service 'iiod on analog' of type '_iio._tcp' in domain 'local' on 2.1.
...

the (none) is wrong, it should be the hostname (just like it is from the zedboard - which is named analog).

We can see that it's also a problem in mdns (meaning it's not the client side).

rgetz@brain:~/github/libiio/deps/mdns$ gcc -o mdns mdns.c
rgetz@brain:~/github/libiio/deps/mdns$ ./mdns --query _iio._tcp.local.
...
192.168.2.1:5353 : answer _iio._tcp.local. PTR "iiod on (none)._iio._tcp.local." rclass 0x1 ttl 10 length 17
192.168.1.120:5353 : answer _iio._tcp.local. PTR "iiod on analog._iio._tcp.local." rclass 0x1 ttl 10 length 17
...

according to wireshark, that is what the Pluto Avahi is transmitting...

No.	Time	Source	Destination	Protocol	Length	Info
2	0.000720776	192.168.2.1	192.168.2.10	MDNS	159	Standard query response 0x0000 PTR _iio._tcp.local, "QU" question PTR iiod on (none)._iio._tcp.local TXT SRV 0 0 30431 pluto.local A 192.168.2.1

0000   00 e0 22 6a 6a 3c 00 05 f7 10 1f e3 08 00 45 00   .."jj<........E.
0010   00 91 7a 6b 40 00 ff 11 7b 94 c0 a8 02 01 c0 a8   ..zk@...{.......
0020   02 0a 14 e9 cb 93 00 7d ad bf 00 00 84 00 00 01   .......}........
0030   00 04 00 00 00 00 04 5f 69 69 6f 04 5f 74 63 70   ......._iio._tcp
0040   05 6c 6f 63 61 6c 00 00 0c 80 01 c0 0c 00 0c 00   .local..........
0050   01 00 00 00 0a 00 11 0e 69 69 6f 64 20 6f 6e 20   ........iiod on 
0060   28 6e 6f 6e 65 29 c0 0c c0 2d 00 10 00 01 00 00   (none)...-......
0070   00 0a 00 01 00 c0 2d 00 21 00 01 00 00 00 0a 00   ......-.!.......
0080   0e 00 00 00 00 76 df 05 70 6c 75 74 6f c0 16 c0   .....v..pluto...
0090   5d 00 01 00 01 00 00 00 0a 00 04 c0 a8 02 01      ]..............

Meaning the problem is in the code that I pointed to (I think) - since in Kuiper (where things start later) it works fine...

-Robin

@rgetz rgetz changed the title mdns PTR record ill formed on Pluto... DNS-SD PTR record ill formed on Pluto... Oct 13, 2023
rgetz added a commit to rgetz/libiio that referenced this issue Oct 29, 2023
In the past, we were only checking hostname, as a way to see if the
network was ready, but unfortantely - that doesn't work. The kernel
really only needs to have a hostname set for uname() to work, which is
what glibc and uclibc actaully do in gethostname().

So - now we actualyl check there is a working, configured ethernet card,
that supports mdns (ie multicast) that is up before moving on to making
sure there is a hostname.

We also don't allow "none" or "(none)" to be the hostname until after
things time out (3 min).

This fixes analogdevicesinc#1072 and turns:
192.168.2.1:5353 : answer _iio._tcp.local. PTR "iiod on (none)._iio._tcp.local." rclass 0x1 ttl 10 length 17
into:
192.168.2.1:5353 : answer _iio._tcp.local. PTR "iiod on pluto._iio._tcp.local." rclass 0x1 ttl 10 length 16

Signed-off-by: Robin Getz <[email protected]>
@rgetz
Copy link
Contributor Author

rgetz commented Oct 29, 2023

with the fix in #1080 - things seems to be working:

rgetz@brain:~/github/libiio/deps/mdns$  avahi-discover
...
Found service 'iiod on pluto' of type '_iio._tcp' in domain 'local' on 81.0.

and

rgetz@brain:~/github/libiio/deps/mdns$ ./mdns --query _iio._tcp.local.
...
192.168.2.1:5353 : answer _iio._tcp.local. PTR "iiod on pluto._iio._tcp.local." rclass 0x1 ttl 10 length 16
192.168.2.1:5353 : answer iiod on pluto._iio._tcp.local. SRV pluto.local. priority 0 weight 0 port 30431
192.168.2.1:5353 : answer pluto.local. A 192.168.2.1

when things get merged - I will close.

rgetz added a commit to rgetz/libiio that referenced this issue Oct 30, 2023
In the past, we were only checking hostname, as a way to see if the
network was ready, but unfortantely - that doesn't work. The kernel
really only needs to have a hostname set for uname() to work, which is
what glibc and uclibc actaully do in gethostname().

So - now we actualyl check there is a working, configured ethernet card,
that supports mdns (ie multicast) that is up before moving on to making
sure there is a hostname.

We also don't allow "none" or "(none)" to be the hostname until after
things time out (3 min).

This fixes analogdevicesinc#1072 and turns:
192.168.2.1:5353 : answer _iio._tcp.local. PTR "iiod on (none)._iio._tcp.local." rclass 0x1 ttl 10 length 17
into:
192.168.2.1:5353 : answer _iio._tcp.local. PTR "iiod on pluto._iio._tcp.local." rclass 0x1 ttl 10 length 16

Signed-off-by: Robin Getz <[email protected]>
rgetz added a commit to rgetz/libiio that referenced this issue Oct 30, 2023
In the past, we were only checking hostname, as a way to see if the
network was ready, but unfortunately - that doesn't work. The kernel
really only needs to have a hostname set for uname() to work, which is
what glibc and uclibc implement in gethostname().

So - now we check there is a working, configured ethernet card,
that supports mdns (ie. multicast) that is up before moving on to making
sure there is a hostname.

We also don't allow "none" or "(none)" to be the hostname until after
things time out (3 min).

This fixes analogdevicesinc#1072 and turns:
192.168.2.1:5353 : answer _iio._tcp.local. PTR "iiod on (none)._iio._tcp.local." rclass 0x1 ttl 10 length 17
into:
192.168.2.1:5353 : answer _iio._tcp.local. PTR "iiod on pluto._iio._tcp.local." rclass 0x1 ttl 10 length 16

Signed-off-by: Robin Getz <[email protected]>
rgetz added a commit to rgetz/libiio that referenced this issue Nov 10, 2023
In the past, we were only checking hostname, as a way to see if the
network was ready, but unfortunately - that doesn't work. The kernel
really only needs to have a hostname set for uname() to work, which is
what glibc and uclibc implement in gethostname().

So - now we check there is a working, configured ethernet card,
that supports mdns (ie. multicast) that is up before moving on to making
sure there is a hostname.

We also don't allow "none" or "(none)" to be the hostname until after
things time out (3 min).

This fixes analogdevicesinc#1072 and turns:
192.168.2.1:5353 : answer _iio._tcp.local. PTR "iiod on (none)._iio._tcp.local." rclass 0x1 ttl 10 length 17
into:
192.168.2.1:5353 : answer _iio._tcp.local. PTR "iiod on pluto._iio._tcp.local." rclass 0x1 ttl 10 length 16

and when you have multiple on network (with same name):
192.168.1.110:5353 : answer _iio._tcp.local. PTR "iiod on pluto._iio._tcp.local." rclass 0x1 ttl 10 length 16
192.168.1.110:5353 : answer pluto.local. A 192.168.1.110
192.168.1.115:5353 : answer _iio._tcp.local. PTR "iiod on pluto analogdevicesinc#2._iio._tcp.local." rclass 0x1 ttl 10 length 19
192.168.1.115:5353 : answer pluto-2.local. A 192.168.1.115

Signed-off-by: Robin Getz <[email protected]>
rgetz added a commit to rgetz/libiio that referenced this issue Nov 10, 2023
In the past, we were only checking hostname, as a way to see if the
network was ready, but unfortunately - that doesn't work. The kernel
really only needs to have a hostname set for uname() to work, which is
what glibc and uclibc implement in gethostname().

So - now we check there is a working, configured ethernet card,
that supports mdns (ie. multicast) that is up before moving on to making
sure there is a hostname.

We also don't allow "none" or "(none)" to be the hostname until after
things time out (3 min).

This fixes analogdevicesinc#1072 and turns:
192.168.2.1:5353 : answer _iio._tcp.local. PTR "iiod on (none)._iio._tcp.local." rclass 0x1 ttl 10 length 17
into:
192.168.2.1:5353 : answer _iio._tcp.local. PTR "iiod on pluto._iio._tcp.local." rclass 0x1 ttl 10 length 16

and when you have multiple on network (with same name):
192.168.1.110:5353 : answer _iio._tcp.local. PTR "iiod on pluto._iio._tcp.local." rclass 0x1 ttl 10 length 16
192.168.1.110:5353 : answer pluto.local. A 192.168.1.110
192.168.1.115:5353 : answer _iio._tcp.local. PTR "iiod on pluto analogdevicesinc#2._iio._tcp.local." rclass 0x1 ttl 10 length 19
192.168.1.115:5353 : answer pluto-2.local. A 192.168.1.115

Signed-off-by: Robin Getz <[email protected]>
rgetz added a commit to rgetz/libiio that referenced this issue Nov 10, 2023
In the past, we were only checking hostname, as a way to see if the
network was ready, but unfortunately - that doesn't work. The kernel
really only needs to have a hostname set for uname() to work, which is
what glibc and uclibc implement in gethostname().

So - now we check there is a working, configured ethernet card,
that supports mdns (ie. multicast) that is up before moving on to making
sure there is a hostname.

We also don't allow "none" or "(none)" to be the hostname until after
things time out (3 min).

This fixes analogdevicesinc#1072 and turns:
192.168.2.1:5353 : answer _iio._tcp.local. PTR "iiod on (none)._iio._tcp.local." rclass 0x1 ttl 10 length 17
into:
192.168.2.1:5353 : answer _iio._tcp.local. PTR "iiod on pluto._iio._tcp.local." rclass 0x1 ttl 10 length 16

and when you have multiple on network (with same name):
192.168.1.110:5353 : answer _iio._tcp.local. PTR "iiod on pluto._iio._tcp.local." rclass 0x1 ttl 10 length 16
192.168.1.110:5353 : answer pluto.local. A 192.168.1.110
192.168.1.115:5353 : answer _iio._tcp.local. PTR "iiod on pluto analogdevicesinc#2._iio._tcp.local." rclass 0x1 ttl 10 length 19
192.168.1.115:5353 : answer pluto-2.local. A 192.168.1.115

Signed-off-by: Robin Getz <[email protected]>
pcercuei pushed a commit that referenced this issue Nov 13, 2023
In the past, we were only checking hostname, as a way to see if the
network was ready, but unfortunately - that doesn't work. The kernel
really only needs to have a hostname set for uname() to work, which is
what glibc and uclibc implement in gethostname().

So - now we check there is a working, configured ethernet card,
that supports mdns (ie. multicast) that is up before moving on to making
sure there is a hostname.

We also don't allow "none" or "(none)" to be the hostname until after
things time out (3 min).

This fixes #1072 and turns:
192.168.2.1:5353 : answer _iio._tcp.local. PTR "iiod on (none)._iio._tcp.local." rclass 0x1 ttl 10 length 17
into:
192.168.2.1:5353 : answer _iio._tcp.local. PTR "iiod on pluto._iio._tcp.local." rclass 0x1 ttl 10 length 16

and when you have multiple on network (with same name):
192.168.1.110:5353 : answer _iio._tcp.local. PTR "iiod on pluto._iio._tcp.local." rclass 0x1 ttl 10 length 16
192.168.1.110:5353 : answer pluto.local. A 192.168.1.110
192.168.1.115:5353 : answer _iio._tcp.local. PTR "iiod on pluto #2._iio._tcp.local." rclass 0x1 ttl 10 length 19
192.168.1.115:5353 : answer pluto-2.local. A 192.168.1.115

Signed-off-by: Robin Getz <[email protected]>
pcercuei pushed a commit that referenced this issue Nov 13, 2023
In the past, we were only checking hostname, as a way to see if the
network was ready, but unfortunately - that doesn't work. The kernel
really only needs to have a hostname set for uname() to work, which is
what glibc and uclibc implement in gethostname().

So - now we check there is a working, configured ethernet card,
that supports mdns (ie. multicast) that is up before moving on to making
sure there is a hostname.

We also don't allow "none" or "(none)" to be the hostname until after
things time out (3 min).

This fixes #1072 and turns:
192.168.2.1:5353 : answer _iio._tcp.local. PTR "iiod on (none)._iio._tcp.local." rclass 0x1 ttl 10 length 17
into:
192.168.2.1:5353 : answer _iio._tcp.local. PTR "iiod on pluto._iio._tcp.local." rclass 0x1 ttl 10 length 16

and when you have multiple on network (with same name):
192.168.1.110:5353 : answer _iio._tcp.local. PTR "iiod on pluto._iio._tcp.local." rclass 0x1 ttl 10 length 16
192.168.1.110:5353 : answer pluto.local. A 192.168.1.110
192.168.1.115:5353 : answer _iio._tcp.local. PTR "iiod on pluto #2._iio._tcp.local." rclass 0x1 ttl 10 length 19
192.168.1.115:5353 : answer pluto-2.local. A 192.168.1.115

Signed-off-by: Robin Getz <[email protected]>
[pcercuei: change %d to %ld in print format string]
Signed-off-by: Paul Cercueil <[email protected]>
pcercuei pushed a commit that referenced this issue Nov 13, 2023
In the past, we were only checking hostname, as a way to see if the
network was ready, but unfortunately - that doesn't work. The kernel
really only needs to have a hostname set for uname() to work, which is
what glibc and uclibc implement in gethostname().

So - now we check there is a working, configured ethernet card,
that supports mdns (ie. multicast) that is up before moving on to making
sure there is a hostname.

We also don't allow "none" or "(none)" to be the hostname until after
things time out (3 min).

This fixes #1072 and turns:
192.168.2.1:5353 : answer _iio._tcp.local. PTR "iiod on (none)._iio._tcp.local." rclass 0x1 ttl 10 length 17
into:
192.168.2.1:5353 : answer _iio._tcp.local. PTR "iiod on pluto._iio._tcp.local." rclass 0x1 ttl 10 length 16

and when you have multiple on network (with same name):
192.168.1.110:5353 : answer _iio._tcp.local. PTR "iiod on pluto._iio._tcp.local." rclass 0x1 ttl 10 length 16
192.168.1.110:5353 : answer pluto.local. A 192.168.1.110
192.168.1.115:5353 : answer _iio._tcp.local. PTR "iiod on pluto #2._iio._tcp.local." rclass 0x1 ttl 10 length 19
192.168.1.115:5353 : answer pluto-2.local. A 192.168.1.115

Signed-off-by: Robin Getz <[email protected]>
[pcercuei: change %d to %ld in print format string]
Signed-off-by: Paul Cercueil <[email protected]>
SRaus pushed a commit that referenced this issue Feb 27, 2024
In the past, we were only checking hostname, as a way to see if the
network was ready, but unfortunately - that doesn't work. The kernel
really only needs to have a hostname set for uname() to work, which is
what glibc and uclibc implement in gethostname().

So - now we check there is a working, configured ethernet card,
that supports mdns (ie. multicast) that is up before moving on to making
sure there is a hostname.

We also don't allow "none" or "(none)" to be the hostname until after
things time out (3 min).

This fixes #1072 and turns:
192.168.2.1:5353 : answer _iio._tcp.local. PTR "iiod on (none)._iio._tcp.local." rclass 0x1 ttl 10 length 17
into:
192.168.2.1:5353 : answer _iio._tcp.local. PTR "iiod on pluto._iio._tcp.local." rclass 0x1 ttl 10 length 16

and when you have multiple on network (with same name):
192.168.1.110:5353 : answer _iio._tcp.local. PTR "iiod on pluto._iio._tcp.local." rclass 0x1 ttl 10 length 16
192.168.1.110:5353 : answer pluto.local. A 192.168.1.110
192.168.1.115:5353 : answer _iio._tcp.local. PTR "iiod on pluto #2._iio._tcp.local." rclass 0x1 ttl 10 length 19
192.168.1.115:5353 : answer pluto-2.local. A 192.168.1.115

Signed-off-by: Robin Getz <[email protected]>
[pcercuei: change %d to %ld in print format string]
Signed-off-by: Paul Cercueil <[email protected]>
(cherry picked from commit 38483f3)
SRaus pushed a commit that referenced this issue Feb 28, 2024
In the past, we were only checking hostname, as a way to see if the
network was ready, but unfortunately - that doesn't work. The kernel
really only needs to have a hostname set for uname() to work, which is
what glibc and uclibc implement in gethostname().

So - now we check there is a working, configured ethernet card,
that supports mdns (ie. multicast) that is up before moving on to making
sure there is a hostname.

We also don't allow "none" or "(none)" to be the hostname until after
things time out (3 min).

This fixes #1072 and turns:
192.168.2.1:5353 : answer _iio._tcp.local. PTR "iiod on (none)._iio._tcp.local." rclass 0x1 ttl 10 length 17
into:
192.168.2.1:5353 : answer _iio._tcp.local. PTR "iiod on pluto._iio._tcp.local." rclass 0x1 ttl 10 length 16

and when you have multiple on network (with same name):
192.168.1.110:5353 : answer _iio._tcp.local. PTR "iiod on pluto._iio._tcp.local." rclass 0x1 ttl 10 length 16
192.168.1.110:5353 : answer pluto.local. A 192.168.1.110
192.168.1.115:5353 : answer _iio._tcp.local. PTR "iiod on pluto #2._iio._tcp.local." rclass 0x1 ttl 10 length 19
192.168.1.115:5353 : answer pluto-2.local. A 192.168.1.115

Signed-off-by: Robin Getz <[email protected]>
[pcercuei: change %d to %ld in print format string]
Signed-off-by: Paul Cercueil <[email protected]>
(cherry picked from commit 38483f3)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant