Issue 14872: subprocess is not safe from deadlocks (original) (raw)

Created on 2012-05-21 20:33 by techtonik, last changed 2022-04-11 14:57 by admin. This issue is now closed.

Messages (6)
msg161294 - (view)	Author: anatoly techtonik (techtonik)	Date: 2012-05-21 20:33
There is no way to write a program in Python capable to process large/unlimited output coming from a subprocess stream without deadlocks. http://docs.python.org/library/subprocess.html#subprocess.Popen.communicate "Note The data read is buffered in memory, so do not use this method if the data size is large or unlimited." http://docs.python.org/library/subprocess.html#subprocess.Popen.stdin http://docs.python.org/library/subprocess.html#subprocess.Popen.stdout http://docs.python.org/library/subprocess.html#subprocess.Popen.stderr "Warning Use communicate() rather than .stdin.write, .stdout.read or .stderr.read to avoid deadlocks due to any of the other OS pipe buffers filling up and blocking the child process." So, what should I use?
msg161303 - (view)	Author: Richard Oudkerk (sbt) *	Date: 2012-05-21 22:24
I think the note for communicate() just means that you might get MemoryError (or some other exception) if the output is too big. But I agree it is ambiguous. communicate() uses select() on Unix and threads on Windows, so deadlocks should not be possible. > So, what should I use? Use communicate() (on a machine with infinite memory;-)
msg161312 - (view)	Author: Daniel Swanson (weirdink13)	Date: 2012-05-22 01:34
what sort of machine has infinite memory?
msg161322 - (view)	Author: Ross Lagerwall (rosslagerwall)	Date: 2012-05-22 05:16
Well if you're certain that the process is only using one stream, then you can just use read/write on that stream. If not, it probably means you have to use either threads or select/poll. This is a known issue with subprocess; there are a few proposals on the tracker about this. See for example.
msg161325 - (view)	Author: anatoly techtonik (techtonik)	Date: 2012-05-22 05:43
The problem with memory is more actual for machines with SSD where swap is usually turned off and /tmp files are located on memory disk. Hitting memory limit often means hard reset. My process is pretty generic that uses all streams, and I don't know how to use threads/polls crossplatform way. looks interesting.
msg161337 - (view)	Author: Ross Lagerwall (rosslagerwall)	Date: 2012-05-22 08:45
See also . Closing as a duplicate of that.

History
Date	User	Action	Args
2022-04-11 14:57:30	admin	set	github: 59077
2012-05-22 08:45:11	rosslagerwall	set	status: open -> closedsuperseder: subprocess: more general (non-buffering) communicationmessages: + type: enhancementresolution: duplicatestage: resolved
2012-05-22 05:43:46	techtonik	set	messages: +
2012-05-22 05:16:54	rosslagerwall	set	nosy: + rosslagerwallmessages: +
2012-05-22 01:34:11	weirdink13	set	nosy: + weirdink13messages: +
2012-05-22 01:17:21	cvrebert	set	nosy: + cvrebert
2012-05-21 22:24:08	sbt	set	nosy: + sbtmessages: +
2012-05-21 20:33:12	techtonik	create