[MacPorts] #54247: buildbot buildmaster is sometimes very slow to respond
MacPorts
noreply at macports.org
Mon May 29 03:50:48 UTC 2017
#54247: buildbot buildmaster is sometimes very slow to respond
----------------------------+---------------------
Reporter: ryandesign | Owner: admin@…
Type: defect | Status: new
Priority: Normal | Milestone:
Component: server/hosting | Version:
Keywords: | Port:
----------------------------+---------------------
The [https://build.macports.org buildbot buildmaster] is sometimes
extremely slow to respond. We use nginx as a reverse proxy in front of the
buildmaster, and buildmaster's slowness sometimes exceeds nginx's default
proxy timeouts, causing nginx to respond with a `502 Bad Gateway` or `504
Gateway Time-out` error. Not only does this cause developers to be unable
to use the buildbot web interface at times, it also prevents GitHub from
successfully delivering push notifications, so some ports might not be
getting built.
Not sure whether the notifications aren't getting to the buildmaster, or
whether the buildmaster just isn't responding in time for GitHub to
consider it a success, but I suspect the latter.
Not sure whether it's the `proxy_connect_timeout` or `proxy_read_timeout`
or `proxy_send_timeout` or a combination of these but I've increased them
all, so now we shouldn't see any more 502 or 504 errors; the requests
should complete, if given enough time. This hasn't helped GitHub
successfully send its push notifications. GitHub seems to have its own
built-in timeout for sending these notifications.
I've gone through the entire history of GitHub push notifications up to
now and redelivered those that failed, so any ports that previously
weren't built because of this should now be built.
I suspect the cause is that the RAID the buildmaster is on is too slow. It
seems quick enough when tested with Blackmagic Disk Speed Test, but I have
a feeling that the buildmaster workload plus the periodic syncing via
rsync makes random disk accesses slow. When the web interface is
responding slowly, accessing the server over ssh also feels sluggish.
The server does have an unused Apple SSD. I could try moving the
buildmaster folder to the SSD; that would be fairly easy to do and would
let us know whether disk speed is a factor. Another option is to move the
entire OS and everything except the rsync directory to the SSD. That's
more difficult to do and will involve more downtime, and the reason I
hadn't done so initially is that I wanted the assurance of a RAID for our
critical infrastructure. Some parts of macOS Server, such as the Caching
Server, also seem particularly unhappy about storing their data on a disk
that is not the startup volume, and default back to storing their data on
the startup volume after a restart. But the caching service uses less disk
space than I had anticipated so there should be room to store that data on
the SSD.
--
Ticket URL: <https://trac.macports.org/ticket/54247>
MacPorts <https://www.macports.org/>
Ports system for macOS
More information about the macports-tickets
mailing list