Issue 43888: GitHub Actions CI/CD Coverage job is broken on master (original) (raw)

Created on 2021-04-19 11:50 by webknjaz, last changed 2022-04-11 14:59 by admin. This issue is now closed.

Pull Requests
URL Status Linked Edit
PR 25471 merged webknjaz,2021-04-19 12:06
PR 25679 merged ammar2,2021-04-28 05:07
Messages (7)
msg391373 - (view) Author: Sviatoslav Sydorenko (webknjaz) * Date: 2021-04-19 11:50
I noticed that https://github.com/python/cpython/runs/2378199636 (a coverage job on the last commit on master at the time of writing) takes suspiciously long to complete. I did some investigation and noticed that this job on the 3.9 branch succeeds (all of the job runs on the first page in the list are green — https://github.com/python/cpython/actions/workflows/coverage.yml?query=branch%3A3.9) But then I took a look at the runs on master and discovered that the last successful run was 4 months ago — https://github.com/python/cpython/actions.html?query=is%3Asuccess+branch%3Amaster&workflow_file_name=coverage.yml. The last success is https://github.com/python/cpython/actions/runs/444323166 and after that, starting with https://github.com/python/cpython/actions/runs/444405699, if fails consistently. Notably, all of the failures are caused by the job timeout after *6 hours* — GitHub platform just kills those, 6h is a default per-job timeout in GHA. It's also important to mention that before every job starting timing out effectively burning 6 hours of GHA time for each merge and producing no useful reports, there were occasional 6h-timeouts but they weren't consistent. Looking into the successful runs from the past, on master and other jobs, I haven't noticed it taking more than 1h35m to complete with a successful outcome. Taking into account this as a baseline, I suggest changing the timeout of the whole job or maybe just one step that actually runs coverage. Action items: * Set job timeout in GHA to 1h40m (allowing a bit of extra time for exceptionally slow jobs) — this will make sure that the failure/timeout is reported sooner than 6h * Figure out why this started happening in the first place. I'm going to send a PR addressing the first point but feel free to pick up the investigation part — I don't expect to have time for this anytime soon. P.S. FTR the last timeout of this type happened two months ago — https://github.com/python/cpython/actions.html?page=4&query=branch%3A3.9&workflow_file_name=coverage.yml.
msg391412 - (view) Author: Brett Cannon (brett.cannon) * (Python committer) Date: 2021-04-19 22:44
It might be time to just kill the coverage report since people are obviously not looking at the results.
msg391455 - (view) Author: miss-islington (miss-islington) Date: 2021-04-20 17:22
New changeset 077a2e76649c2fc49ebb8982b6855bab09a85e8f by Sviatoslav Sydorenko in branch 'master': bpo-43888: Reduce coverage collection timeout to 1h40m in GHA (GH-25471) https://github.com/python/cpython/commit/077a2e76649c2fc49ebb8982b6855bab09a85e8f
msg391836 - (view) Author: Batuhan Taskaya (BTaskaya) * (Python committer) Date: 2021-04-25 04:33
Coverage runs are still failing on the master, and I think at least we should do something like allow failure or other wise github will send notifications for this flaky run.
msg391837 - (view) Author: Ammar Askar (ammar2) * (Python committer) Date: 2021-04-25 04:36
For what it's worth I think Brett's suggestion of just removing the coverage build entirely is good too since it seems like no one actually looks at the results and they take up valuable CI time.
msg392514 - (view) Author: Brett Cannon (brett.cannon) * (Python committer) Date: 2021-04-30 21:04
New changeset 726c931b3896dc73fd156e2340b5ef0b8f55cfb7 by Ammar Askar in branch 'master': bpo-43888: Remove coverage builds from CI (GH-25679) https://github.com/python/cpython/commit/726c931b3896dc73fd156e2340b5ef0b8f55cfb7
msg392527 - (view) Author: Ammar Askar (ammar2) * (Python committer) Date: 2021-04-30 22:41
Coverage builds have been removed from the CI, closing this now.
History
Date User Action Args
2022-04-11 14:59:44 admin set github: 88054
2021-04-30 22:41:40 ammar2 set status: open -> closedresolution: fixedmessages: + stage: patch review -> resolved
2021-04-30 21:04:47 brett.cannon set messages: +
2021-04-28 05:07:12 ammar2 set pull_requests: + <pull%5Frequest24370>
2021-04-25 04:36:37 ammar2 set nosy: + ammar2messages: +
2021-04-25 04:33:19 BTaskaya set nosy: + BTaskayamessages: +
2021-04-20 17:22:30 miss-islington set nosy: + miss-islingtonmessages: +
2021-04-19 22:44:11 brett.cannon set messages: +
2021-04-19 19:46:02 ned.deily set nosy: + brett.cannon, pablogsal
2021-04-19 12:06:18 webknjaz set keywords: + patchstage: patch reviewpull_requests: + <pull%5Frequest24196>
2021-04-19 11:51:07 webknjaz set type: crash ->
2021-04-19 11:50:46 webknjaz create