Issue 28134: socket.socket(fileno=fd) does not work as documented (original) (raw)

Created on 2016-09-13 19:13 by christian.heimes, last changed 2022-04-11 14:58 by admin. This issue is now closed.

Messages (25)

msg276327 - (view)

Author: Christian Heimes (christian.heimes) * (Python committer)

Date: 2016-09-13 19:13

Documentation of socket.socket(fileno) https://docs.python.org/3/library/socket.html#socket.socket says:

If fileno is specified, the other arguments are ignored, causing the socket with the specified file descriptor to return.

The feature does not work. fileno does not infer the values for family, type and proto from the fd. Instead if uses the other arguments. I don't see how this feature should have ever worked on POSIX. There are no calls to getsockopt() with SO_DOMAIN, SO_TYPE and SO_PROTOCOL.

$ ./python Python 3.7.0a0 (default:6bcedf96d25f, Sep 13 2016, 20:48:50) [GCC 6.1.1 20160621 (Red Hat 6.1.1-3)] on linux Type "help", "copyright", "credits" or "license" for more information.

import socket uds = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM) uds <socket.socket fd=3, family=AddressFamily.AF_UNIX, type=SocketKind.SOCK_STREAM, proto=0> s = socket.socket(fileno=uds.fileno()) s <socket.socket fd=3, family=AddressFamily.AF_INET, type=SocketKind.SOCK_STREAM, proto=0> s.family == uds.family False s2 = socket.socket(type=socket.SOCK_DGRAM, fileno=uds.fileno()) s2 <socket.socket fd=3, family=AddressFamily.AF_INET, type=SocketKind.SOCK_DGRAM, proto=0>

msg276328 - (view)

Author: Christian Heimes (christian.heimes) * (Python committer)

Date: 2016-09-13 19:17

I expected that socket.socket(fileno) would fill in all socket options like my own implementation https://github.com/tiran/socketfromfd/blob/master/socketfromfd.py

msg276330 - (view)

Author: R. David Murray (r.david.murray) * (Python committer)

Date: 2016-09-13 19:23

See also issue 27377?

msg276363 - (view)

Author: Martin Panter (martin.panter) * (Python committer)

Date: 2016-09-14 00:07

The documentation says that the family, type and proto attributes correspond to the constructor arguments. Although it is unfortunate and quirky, I think your behaviour does match the documentation.

Do the mismatched settings cause any serious problems with socket methods, or just affect the Python-level attributes and repr()?

Even without using fileno=..., you could argue that the proto attribute is not ideal:

s = socket() s.proto 0 s.getsockopt(SOL_SOCKET, SO_PROTOCOL) 6 IPPROTO_TCP 6

Perhaps the way forward is to deprecate fileno=... in favour of Neil’s fromfd2() function. That would avoid any confusion about conflicting socket() constructor arguments and defaults. I.e. what does socket(AF_INET, SOCK_STREAM, fileno=unix_datagram_fd) mean, and is it equivalent to socket(filno=unix_datagram_fd)?

msg276395 - (view)

Author: Christian Heimes (christian.heimes) * (Python committer)

Date: 2016-09-14 08:09

Martin, the documentation says "If fileno is specified, the other arguments are ignored, causing the socket with the specified file descriptor to return." It's a direct quote from Python 3's socket library documentation. For a non-native speaker like me, this sentence implies that socket.socket(fileno) not only ignores the arguments (which it does not) but that the constructor uses the fileno to set family, type and proto.

The socket module uses self->sock_family in several places to calculate the addr len or when it handles arguments and address information. Just look at this simple example:

import socket uds = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM) s = socket.socket(fileno=uds.fileno()) s.bind('/tmp/sock') Traceback (most recent call last): File "", line 1, in TypeError: getsockaddrarg: AF_INET address must be tuple, not str uds.bind('/tmp/sock')

msg276660 - (view)

Author: Martin Panter (martin.panter) * (Python committer)

Date: 2016-09-16 01:56

I agree the doc is far from perfect. The bit I was going off is just above <https://docs.python.org/3.5/library/socket.html#socket.socket.family>, saying “these (read-only) attributes that correspond to the values given to the socket constructor”.

My instinct would be to clarify that for existing versions 2.7, 3.5, etc, that the constructor arguments are not ignored and should correspond to the file descriptor. Then in the next Python version we can make it more automatic using the getsockopt() techniques.

msg276684 - (view)

Author: Christian Heimes (christian.heimes) * (Python committer)

Date: 2016-09-16 08:44

How about we fix the code and only document the limitations instead? :) After all it works fine on Windows and it is documented to work on all operating systems. Since it's a bug we can fix it in 3.5, too.

My patch implements a best-effort to get type, family and proto from the socket. It ignores any errors.

msg276866 - (view)

Author: Martin Panter (martin.panter) * (Python committer)

Date: 2016-09-18 01:28

Personally, I’m not too enthusiastic, because it is rather magical, and does not work in all cases. It seems more like a feature than a bug fix. But I have rarely used the fileno=... parameter, and it shouldn’t have much negative impact, so I’m not too fussed.

According to Issue 27377, these are some cases where parts won’t work:

Also, if we are going to read SO_PROTOCOL when fileno=... is given, why not also read it in the normal case when proto=0 (unspecified) is given?

msg276879 - (view)

Author: Christian Heimes (christian.heimes) * (Python committer)

Date: 2016-09-18 11:05

I'm well aware that it does not work in all cases. However it works good enough in most cases. Right now the fileno argument must be considered broken because it leads to wrong results. It is a problem and possible security issue for a couple of use cases, e.g. passing of sockets through AF_UNIX AUX data or systemd socket activation.

On Windows it is less problematic because socket(filno) works correctly with WSAPROTOCOL_INFO. It's only broken for integer fd.

I have considered to set type, family and proto to 0 (unspec) when the getsockopt and getsockname fail.

I have a differnt ticket for the protocol issue, #27816.

msg285972 - (view)

Author: Christian Heimes (christian.heimes) * (Python committer)

Date: 2017-01-21 23:43

Here is another example of how broken and dangerous fileno argument is. getpeername() is neither a valid IPv4 tuple nor a valid IPv6 tuple. It's all messed up:

import socket s = socket.create_connection(('www.python.org', 443)) s <socket.socket fd=3, family=AddressFamily.AF_INET6, type=SocketKind.SOCK_STREAM, proto=6, laddr=('2003:58:bc4a:3b00:56ee:75ff:fe47:ca7b', 59730, 0, 0), raddr=('2a04:4e42:1b::223', 443, 0, 0)> socket.socket(fileno=s.fileno()) <socket.socket fd=3, family=AddressFamily.AF_INET, type=SocketKind.SOCK_STREAM, proto=0, laddr=('2003:58:bc4a:3b00::%2550471192', 59730, 0, 2550471192), raddr=('2a04:4e42:1b:0:700c:e70b:ff7f:0%2550471192', 443, 0, 2550471192)>

msg308433 - (view)

Author: Yury Selivanov (yselivanov) * (Python committer)

Date: 2017-12-16 00:21

Let's quickly iterate over what's possible first:

Given the above I propose the following:

  1. Passing a wrong type can be considered as a serious error. SOCK_STREAM is fundamentally different from SOCK_DGRAM. Because we have a way to validate socket type on all platforms, I propose to do this validation when we create a socket from an FD, and raise an error if the passed socket type isn't correct.

  2. When Python is running with '-X dev' (new dev mode added by Victor), I propose to also validate socket.family and socket.proto on Linux, and raise a RuntimeWarning when they don't match.

msg309207 - (view)

Author: Christian Heimes (christian.heimes) * (Python committer)

Date: 2017-12-29 23:42

Yuri,

msg309225 - (view)

Author: Christian Heimes (christian.heimes) * (Python committer)

Date: 2017-12-30 09:43

Issue #32454 adds socket.close(fd) function.

msg311175 - (view)

Author: Christian Heimes (christian.heimes) * (Python committer)

Date: 2018-01-29 21:38

New changeset b6e43af669f61a37a29d8ff0785455108e6bc29d by Christian Heimes in branch 'master': bpo-28134: Auto-detect socket values from file descriptor (#1349) https://github.com/python/cpython/commit/b6e43af669f61a37a29d8ff0785455108e6bc29d

msg311176 - (view)

Author: Christian Heimes (christian.heimes) * (Python committer)

Date: 2018-01-29 21:39

I'm leaving the ticket open to remind me that I have to add a whatsnew entry and maybe consider a backport.

msg311178 - (view)

Author: STINNER Victor (vstinner) * (Python committer)

Date: 2018-01-29 22:06

x86-64 Sierra 3.x is grumpy:

http://buildbot.python.org/all/#/builders/14/builds/659

====================================================================== FAIL: test_uknown_socket_family_repr (test.test_socket.GeneralModuleTests)

Traceback (most recent call last): File "/Users/buildbot/buildarea/3.x.billenstein-sierra/build/Lib/test/test_socket.py", line 1645, in test_uknown_socket_family_repr self.assertEqual(s.proto, 23) AssertionError: 0 != 23

msg311181 - (view)

Author: Christian Heimes (christian.heimes) * (Python committer)

Date: 2018-01-29 22:11

The test for proto isn't super critical. It's mostly ignored any way. I'll submit a band-aid.

msg311252 - (view)

Author: Christian Heimes (christian.heimes) * (Python committer)

Date: 2018-01-30 07:55

New changeset 2e0ecde8d74f5fc0e3e3e39216975cc70efc4796 by Christian Heimes in branch 'master': bpo-28134: Ignore proto in unknown socket test (GH-5435) https://github.com/python/cpython/commit/2e0ecde8d74f5fc0e3e3e39216975cc70efc4796

msg312681 - (view)

Author: (YoSTEALTH) *

Date: 2018-02-24 00:53

I am using 3.7.0b1 i don't think this issue is fixed!

simple mockup:

--------------

def accept(sock): client, addr = sock.accept() inside = socket(fileno=client.fileno()) print(inside) # <__main__.Socket fd=5, family=AddressFamily.AF_INET, type=SocketKind.SOCK_STREAM, proto=6, laddr=('127.0.0.1', 8000), raddr=('127.0.0.1', 42532)> return inside

outside = accept(sock) print(outside)

<__main__.Socket fd=5, family=AddressFamily.AF_INET, type=SocketKind.SOCK_STREAM, proto=6>

So the "laddr" and "raddr" goes missing the second its out of the function???

This has wasted days of my time, to even get to this point of figuring out whats going on wasn't easy! extremely frustrating.

msg312684 - (view)

Author: Christian Heimes (christian.heimes) * (Python committer)

Date: 2018-02-24 01:31

The problem is fixed and your example behaves as expected.

The laddr string is the result of inside.getsockname() and raddr string is the result of inside.getpeername(). In your example, inside and client share the same file descriptor. When the function exits client goes out of scope, the reference count of client drops to 0, the object gets deallocated and Python closes the shared file descriptor. In outside, the shared fd is gone and outside.getsockname() and outside.getpeername() fail. Basically the outside socket is dead because its fd has been closed.

You have to duplicate the fd or detach the socket to keep the fd open.

msg312690 - (view)

Author: (YoSTEALTH) *

Date: 2018-02-24 02:22

Christian thank you for your reply, i really appreciate it.

Lets analyze this a bit:

I could understand making a duplicate of the fd if it were being passed across thread/process but in its original thread making a duplicate! This behavior is extremely odd.

I fell like everyone that uses socket(fileno) will run into days wasted and frustration just to figure out these solutions.

msg312708 - (view)

Author: Christian Heimes (christian.heimes) * (Python committer)

Date: 2018-02-24 09:50

File descriptors are a advanced features and expose low level operating system resources. You really have to understand how the OS works. They cannot be reference counted. In fact they are the reference to entries in the Kernel's global open file table. I gave a talked about FDs at PyCon US two years ago, maybe https://speakerdeck.com/tiran/pycon-2016-file-descriptors-unix-sockets-and-other-posix-magic will help you understand fds better.

You can make your example work with https://docs.python.org/3/library/socket.html#socket.socket.detach

def accept(sock): client, addr = sock.accept() inside = socket(fileno=client.fileno()) client.detach() print(inside) return inside # after return, the client socket is closed by due to detach the fd isn't no longer close

The feature works as intended. It's designed to turn an inherited file descriptor (e.g. systemd socket activation) or transfered fds (e.g. through AF_UNIX SCM_RIGHTS). In both cases the fd is already a duplicated fd.

msg313153 - (view)

Author: (YoSTEALTH) *

Date: 2018-03-02 18:10

It would be nice if "python" accounted for such low level os things. None the less client.detach() method works fine.

I really did enjoy your talk, kinda bummed it was short and didn't get into more details.

Thanks for your help and patience Christian :)

msg332704 - (view)

Author: Nathaniel Smith (njs) * (Python committer)

Date: 2018-12-29 06:10

Am I right that this is considered fixed (or at least as fixed as it can be), and the issue should be closed?

Also, small note in case anyone stumbles across this in the future and is confused: Python does not handle this correctly on Windows. I suspect Christian was confused because there's an undocumented features on Windows where if you pass fileno=<opaque string returned by socket.share()>, then that correctly reinstantiates the socket object. But fileno= doesn't seem to do any special autodetection of type/family/proto.

msg404590 - (view)

Author: Christian Heimes (christian.heimes) * (Python committer)

Date: 2021-10-21 12:05

Yes, the fix works well enough.

History

Date

User

Action

Args

2022-04-11 14:58:36

admin

set

github: 72321

2021-10-21 12:05:20

christian.heimes

set

status: open -> closed
resolution: fixed
messages: +

stage: patch review -> resolved

2018-12-29 06:10:49

njs

set

nosy: + njs
messages: +

2018-03-02 18:10:47

YoSTEALTH

set

messages: +

2018-02-24 09:50:34

christian.heimes

set

messages: +

2018-02-24 02:22:16

YoSTEALTH

set

messages: +

2018-02-24 01:31:12

christian.heimes

set

messages: +

2018-02-24 00:53:39

YoSTEALTH

set

nosy: + YoSTEALTH
messages: +

2018-01-30 07:55:49

christian.heimes

set

messages: +

2018-01-29 22:16:44

christian.heimes

set

pull_requests: + <pull%5Frequest5268>

2018-01-29 22:11:01

christian.heimes

set

messages: +

2018-01-29 22:06:02

vstinner

set

messages: +

2018-01-29 21:39:17

christian.heimes

set

messages: +

2018-01-29 21:38:02

christian.heimes

set

messages: +

2017-12-30 09:43:20

christian.heimes

set

messages: +

2017-12-29 23:42:18

christian.heimes

set

messages: +

2017-12-16 00:21:50

yselivanov

set

nosy: + pitrou, yselivanov, vstinner
messages: +

2017-04-28 19:07:30

christian.heimes

set

pull_requests: + <pull%5Frequest1461>

2017-01-21 23:43:08

christian.heimes

set

messages: +

2016-09-21 20:25:03

christian.heimes

set

assignee: christian.heimes
components: + Extension Modules
stage: needs patch -> patch review

2016-09-18 11:05:00

christian.heimes

set

messages: +

2016-09-18 01:28:41

martin.panter

set

messages: +

2016-09-16 08:44:32

christian.heimes

set

files: + sock_initobj_types.patch
keywords: + patch
messages: +

2016-09-16 01:56:08

martin.panter

set

messages: +

2016-09-14 08:09:53

christian.heimes

set

messages: +

2016-09-14 00:07:59

martin.panter

set

nosy: + martin.panter
messages: +

2016-09-13 19:23:53

r.david.murray

set

nosy: + r.david.murray
messages: +

2016-09-13 19:17:55

christian.heimes

set

messages: +

2016-09-13 19:16:48

christian.heimes

set

type: behavior
stage: needs patch

2016-09-13 19:13:31

christian.heimes

create