path: root/guix-build-coordinator/agent.scm
Commit message | Author | Age

* Fix for guile-lib v0.2.8 | Christopher Baines | 2024-04-23
  I'm not sure quite what's changed, but it seems like the way the formatter is being called has changed in a backwards incompatible way. To maintain compatibility with both older and newer versions of guile-lib, accept the formatter being called with extra arguments, and just look at the first three.
* Remove mutexes from the upload monitoring | Christopher Baines | 2024-03-13
  Instead, store the state in the work queue.
* Don't pass time to wait-condition-variable | Christopher Baines | 2023-12-02
  My intent here was to avoid threads waiting indefinitely for a signal. Unfortunately, it seems that specifying a timeout can result in wait-condition-variable entering an endless loop internally (#67194). Hopefully avoid this by not passing a time to wait-condition-variable. Maybe some new threads/fibers will need adding that periodically signal these condition variables to replicate the timeout behaviour.
* Actually use a separate store connection | Christopher Baines | 2023-11-23
  Fix the obvious bug in 3711cd8f1d901b424b2073dc54be4eec83c95482.
* Remove set-store-connection-timeout | Christopher Baines | 2023-11-20
  As non blocking connections are being used now.
* Use a temporary store connection for substituting derivations | Christopher Baines | 2023-11-20
* Use port timeouts when checking substitutes | Christopher Baines | 2023-11-20
  As I think this is better than the alarm syscall.
* Switch to (mostly) non blocking store connections in the agent | Christopher Baines | 2023-11-20
  Pending #67245.
* Pass the store in to delete-outputs | Christopher Baines | 2023-11-20
* Pass the store in to get-output-details | Christopher Baines | 2023-11-20
* Don't shadow store in the agent | Christopher Baines | 2023-11-20
* Pass the store in to substitute-derivation | Christopher Baines | 2023-08-20
  To avoid an additional store connection.
* Keep less derivation files open when reading derivations | Christopher Baines | 2023-08-03
  This probably isn't the main problem with having too many open files, but it might help avoid bursts of open files.
* Remove crude fetching inputs timeout | Christopher Baines | 2023-08-02
  As this breaks builds that rely on fetching large inputs.
* Fix not updating about uploads based on time | Christopher Baines | 2023-05-22
* Clean up some handling of uploads for agents | Christopher Baines | 2023-05-11
  This commit should correct the progress reporting on partial uploads.
* Have agents report on the progress of the coordinator hashing outputs | Christopher Baines | 2023-05-11
  Otherwise it looks like the upload should finish, but hasn't.
* Tweak retrying for status update requests | Christopher Baines | 2023-05-10
  Don't retry status updates many times, since the information will be more out of date each time.
* Include system uptime in the agent status information | Christopher Baines | 2023-05-05
  As I've found this useful in spotting systems which have problems.
* Enable submitting regular status updates for the hurd | Christopher Baines | 2023-05-05
* Log which inputs are missing | Christopher Baines | 2023-05-05
  As this can be useful.
* Fix using with-upload-monitoring in submit-one-output | Christopher Baines | 2023-05-03
* Remove the gbc prefix from the thread names | Christopher Baines | 2023-05-02
  As this shouldn't be needed to help identify them.
* Stop monitoring uploads through the chunked output port | Christopher Baines | 2023-04-30
  Use the dump-port* progress reporter instead.
* Deallocate canceled builds from agents when they startup | Christopher Baines | 2023-04-21
* More reliably delete the compressed outputs | Christopher Baines | 2023-04-20
* Remove redundant append | Christopher Baines | 2023-04-20
* Log more about temporary files in the agent | Christopher Baines | 2023-04-18
* Prioritise post build actions | Christopher Baines | 2023-04-11
  By the priority of the build, and then by the bytes that need uploading. This should help ensure that priority builds get handled first when there's congestion getting data back to the coordinator. Prioritising builds with less data to upload should also keep things moving when uploads are slow as well.
* Change - to _ to fix listing post build job derivations | Christopher Baines | 2023-04-11
  Signed-off-by: Christopher Baines <mail@cbaines.net>
* Use underscores for derivation_name | Christopher Baines | 2023-04-11
  As this is more consistent in the JSON responses.
  Signed-off-by: Christopher Baines <mail@cbaines.net>
* Expose the derived priority to agents | Christopher Baines | 2023-04-11
  Rather than the priority, as it's the derived priority that they should be using for decision making.
* Reduce logging on build failures | Christopher Baines | 2023-04-11
* Have agents send their status every 30 seconds | Christopher Baines | 2023-03-22
* Print a backtrace when an exception happens for build-things | Christopher Baines | 2023-03-21
* Log the derivation substitute urls when failing to fetch derivations | Christopher Baines | 2023-03-13
  This might make it clearer when there's a configuration issue.
* Try to improve protections against getting stuck when substituting | Christopher Baines | 2023-01-30
  I think the agent and coordinator can get stuck when talking to the guix-daemon to substitute things, potentially because the Guile code involved on the daemon side isn't guarding against getting stuck. Try to work around this here by setting timeouts on the connection to the daemon.
* Use port timeouts when fetching build inputs | Christopher Baines | 2022-11-10
  As this seems to hang sometimes, and maybe port timeouts will help with this.
* Switch to using = for testing numerical equality | Christopher Baines | 2022-10-22
  As I think I've been abusing eq?.
* Reset the upload progress when appropriate | Christopher Baines | 2022-10-14
* Add some more logging when agents submit outputs | Christopher Baines | 2022-10-14
* Allow specifying the maximum number of allocated builds | Christopher Baines | 2022-10-10
  For an agent.
* Properly handle build failure statuses | Christopher Baines | 2022-09-02
* Don't record the build as failed if the exception is unknown | Christopher Baines | 2022-09-02
  As this can cause spurious failures.
* Fix adding GC roots for built outputs | Christopher Baines | 2022-09-02
* Allow changing the agent parallel uploads | Christopher Baines | 2022-07-06
* Move getting output details soon after the outputs are generated | Christopher Baines | 2022-07-05
  Like compressing the outputs, it's worth doing this sooner rather than later.
* Add temp roots for built outputs | Christopher Baines | 2022-07-05
  In case these need sending to the coordinator.
* Set thread names to help with debugging | Christopher Baines | 2022-06-15
* Fix spacing in a logging message | Christopher Baines | 2022-05-21
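
Note on "Prioritise post build actions" (2023-04-11): the ordering that commit describes, by build priority first and then by bytes still to upload, can be sketched in Guile roughly as follows. This is an illustration only, not the code from agent.scm. The <pending-action> record, its field names, and the helper names are hypothetical, and the sketch assumes that a larger priority number means a more urgent build.

```scheme
(use-modules (srfi srfi-9))

;; Hypothetical record for a post build action waiting to run; the
;; field names are illustrative and not taken from agent.scm.
(define-record-type <pending-action>
  (make-pending-action name priority bytes)
  pending-action?
  (name     pending-action-name)
  (priority pending-action-priority)   ; assumed: larger means more urgent
  (bytes    pending-action-bytes))     ; bytes still to upload

;; Return #t when action A should be handled before action B:
;; higher priority first, and for equal priorities, fewer bytes first.
(define (pending-action<? a b)
  (let ((pa (pending-action-priority a))
        (pb (pending-action-priority b)))
    (if (= pa pb)
        (< (pending-action-bytes a) (pending-action-bytes b))
        (> pa pb))))

;; Return a new list of ACTIONS in the order they should be handled.
(define (order-pending-actions actions)
  (sort actions pending-action<?))
```

For example, given actions "a" (priority 0, 100 bytes), "b" (priority 5, 9000 bytes) and "c" (priority 0, 10 bytes), order-pending-actions puts "b" first because of its priority, then "c" ahead of "a" because it has less to upload.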