guix/build-coordinator

	Commit message	Author	Age
*	Revert "Remove redundant sqlite-reset calls"	Christopher Baines	2020-11-13
\| \| \| \| \| \| \| \| \|	I do not understand why, but I think the removal of the "redundant" reset statements has broken the database in some way I haven't been able to pin down. Either writes aren't working, or some reads are returning stale data. I'm also seeing errors about locked tables :( This reverts commit 049334e423241426c0eef526eda8818e96f5b3ca.
*	Handle agent requested systems in the basic allocation strategy	Christopher Baines	2020-11-13
\| \| \| \| \|	Previously, the derivation system was ignored, but this now takes it into account. The implementation is copied from the derivation ordered allocator.
*	Cache a couple of SQLite statements that should be cached	Christopher Baines	2020-11-13
\|
*	Remove redundant sqlite-reset calls	Christopher Baines	2020-11-13
\| \| \| \| \|	Which is most of them. Not sure why I started resetting statements after their use, but it's unnecessary.
*	Fix build-started hook processing prompt	Christopher Baines	2020-11-09
\|
*	Make call-with-time-logging write output in a thread safe way	Christopher Baines	2020-11-09
\|
*	Make hook processing a bit more efficient	Christopher Baines	2020-11-09
\| \| \| \| \|	Rather than polling the database every second, use some condition variables to wake threads when there's probably an event.
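A minimal sketch of the idea in the commit above (waking a worker with a condition variable rather than polling every second), written in Python rather than the project's Guile. The class and method names are hypothetical and not taken from the coordinator's code.

```python
import threading
from collections import deque


class HookEventQueue:
    """Hypothetical stand-in for the coordinator's pending hook events."""

    def __init__(self):
        self._events = deque()
        self._cond = threading.Condition()

    def publish(self, event):
        # Called by whichever thread records a new event.
        with self._cond:
            self._events.append(event)
            self._cond.notify()  # wake one waiting hook thread

    def wait_for_event(self, timeout=60.0):
        # Sleeps until an event is queued or the timeout expires,
        # instead of re-querying on a fixed one second interval.
        with self._cond:
            if self._cond.wait_for(lambda: self._events, timeout=timeout):
                return self._events.popleft()
            return None  # timed out with nothing to process
```

The timeout keeps a fallback check in place, which fits the "probably an event" wording: a missed wakeup only delays processing rather than stalling it.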
*	Add logging around hook processing	Christopher Baines	2020-11-08
\| \| \| \|	This might help work out why it gets stuck.
*	Use the build coordinator logger in the agent messaging server	Christopher Baines	2020-11-07
\|
*	Use the logger module to add times to the log output	Christopher Baines	2020-11-07
\| \| \| \| \|	Just for the request processing at the moment, but with a plan for more things in the future.
*	Fix the unprocessed_builds table sticking around	Christopher Baines	2020-11-06
\|
*	Speed up populating the unbuilt_outputs table	Christopher Baines	2020-11-06
\|
*	Rework how the derivation ordered allocator gets builds	Christopher Baines	2020-11-06
\| \| \| \| \| \|	Use a temporary table to avoid computing the priorities for all builds. This speeds up the allocation to only take a few seconds on the database I'm testing against.
*	Handle multiple values in call-with-time-logging	Christopher Baines	2020-11-06
\|
*	Use the unbuilt_outputs table in the derivation ordered allocator	Christopher Baines	2020-11-06
\| \| \| \|	As this speeds the query up substantially.
*	Add an unbuilt_outputs table	Christopher Baines	2020-11-06
\| \| \| \| \| \| \|	One of the slow things in the derivation ordered allocator is working out what outputs are unbuilt, as this requires looking at all the derivation outputs (of which there are lots), and checking if any build exists which has succeeded.
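To illustrate why a dedicated table helps here, the sketch below keeps a materialised list of unbuilt outputs up to date so the allocator does not have to rescan every derivation output. It uses Python's sqlite3 module and a deliberately simplified schema: the table and column names other than unbuilt_outputs are assumptions, not the coordinator's real schema.

```python
import sqlite3


def make_db():
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        -- Hypothetical, simplified schema for illustration only.
        CREATE TABLE derivation_outputs (derivation TEXT, output TEXT);
        CREATE TABLE builds (id INTEGER PRIMARY KEY, derivation TEXT, result TEXT);
        CREATE TABLE unbuilt_outputs (output TEXT PRIMARY KEY);
    """)
    return conn


def populate_unbuilt_outputs(conn):
    # The expensive scan: every output with no successful build.
    # Run once, rather than inside each allocation query.
    conn.execute("""
        INSERT INTO unbuilt_outputs (output)
        SELECT DISTINCT output FROM derivation_outputs
        WHERE NOT EXISTS (
            SELECT 1 FROM builds
            WHERE builds.derivation = derivation_outputs.derivation
              AND builds.result = 'success')
    """)


def record_successful_build(conn, derivation):
    # Keep the table current incrementally when a build succeeds.
    conn.execute(
        "INSERT INTO builds (derivation, result) VALUES (?, 'success')",
        (derivation,))
    conn.execute("""
        DELETE FROM unbuilt_outputs WHERE output IN (
            SELECT output FROM derivation_outputs WHERE derivation = ?)
    """, (derivation,))


def unbuilt_outputs(conn):
    # The cheap lookup the allocator can use instead of the full scan.
    rows = conn.execute("SELECT output FROM unbuilt_outputs ORDER BY output")
    return [row[0] for row in rows]
```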
*	Improve SQLite statement handling	Christopher Baines	2020-11-04
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The Guix Build Coordinator would segfault, and this seemed to come when preparing statements. I think this is happening because the (sqlite3) bindings finalize statements when they're out of scope, and this happens in the garbage collector thread. SQLite is running in multi-threaded mode, which means actions relating to one database connection shouldn't happen concurrently in different threads, hence I think this is leading to a segfault. To work around this behaviour, pass #:cache? #t to sqlite-prepare so statements are long lived where possible, or in the few cases where the SQL is dynamic, make sure to finalize it before the garbage collector gets a chance. This'll hopefully mean that there are fewer segfaults...
*	Include the Guile internal real and run times as metrics	Christopher Baines	2020-11-02
\| \| \| \|	This will help track CPU time, as well as restarts/crashes.
*	Attempt to more gracefully handle the problem of missing derivations	Christopher Baines	2020-11-02
\| \| \| \|	In the agent and allocator.
*	Remove some left in debugging output	Christopher Baines	2020-11-02
\|
*	Only consider builds created in the last two weeks	Christopher Baines	2020-10-29
\| \| \| \| \| \|	For the derivation ordered allocator. This is a quick alternative to having some kind of archival mechanism for builds. It should reduce the time it takes the allocator to run.
*	Only consider unprocessed builds for prioritisation	Christopher Baines	2020-10-29
\| \| \| \|	As there's no need to consider processed builds in this part of the query.
*	Don't assume the missing input to a build is a direct input	Christopher Baines	2020-10-24
\| \| \| \| \| \| \| \| \| \| \| \|	Substitutes could be available for all direct inputs, but be missing for things they reference. This could happen if those builds happened on a machine with the store items available for example. Therefore, search the entire graph for the relevant derivation when looking for the derivation to build to provide the missing input. This change matches up with the similar improvement around handling fetching substitutes.
*	Improve missing inputs behaviour	Christopher Baines	2020-10-24
\| \| \| \| \| \| \| \|	When a substitute is found for a direct input, but it can't be fetched, this is probably because something it referenced isn't available. Therefore, look through the references recursively and collect up the store items that aren't available locally or via a substitute. Send this list to the coordinator so that it can schedule builds.
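The recursive search described in the two commits above can be sketched roughly as follows. This is an illustration in Python, not the agent's actual code: the function name, its parameters, and the use of plain sets and a dict in place of store and substitute queries are all assumptions.

```python
def collect_missing_items(item, references, available_locally, has_substitute):
    """Walk the reference graph from ITEM and return the store items
    that are neither present locally nor available as a substitute.

    references: dict mapping a store item to the items it references
    available_locally, has_substitute: sets of store items
    (all three are hypothetical stand-ins for real store queries).
    """
    missing = []
    seen = set()
    stack = [item]
    while stack:
        current = stack.pop()
        if current in seen:
            continue
        seen.add(current)
        if current in available_locally:
            continue  # already valid in the local store
        if current not in has_substitute:
            missing.append(current)  # needs building by the coordinator
            continue
        # A substitute exists but could not be fetched, so the problem
        # is probably further down: check what this item references.
        stack.extend(references.get(current, []))
    return sorted(missing)
```

The returned list corresponds to what the agent would send to the coordinator so that builds can be scheduled for the missing items.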
*	Add missing newline to failed to fetch substitutes message	Christopher Baines	2020-10-24
\|
*	Use valid-path? rather than file exists for testing store items	Christopher Baines	2020-10-24
\| \| \| \| \|	As the file might exist, but be ignored because the daemon is treating it as invalid.
*	Have the agent handle errors from the coordinator	Christopher Baines	2020-10-24
\| \| \| \| \|	When submitting builds. The agent will now retry the relevant thing, like uploading the log file if the coordinator says that still needs doing.
*	Better handle agent errors on the coordinator side	Christopher Baines	2020-10-24
\| \| \| \| \|	Things like the agent not having the log file, or an output. This will allow the agent to actually retry the relevant thing.
*	Extract out agents submitting log files	Christopher Baines	2020-10-24
\| \| \| \|	So that this code can be retried if submitting the build result fails.
*	Add the ability to ignore errors when retrying	Christopher Baines	2020-10-24
\| \| \| \|	As this will enable responding to some exceptions at a higher level.
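As a rough illustration of retrying while letting selected errors through to the caller, here is a small Python sketch. The function name and the times, delay and ignore parameters are hypothetical; the project's own retry procedure may look quite different.

```python
import time


def retry_on_error(thunk, times=3, delay=0.0, ignore=()):
    """Call THUNK, retrying on failure up to TIMES attempts.

    Exceptions whose type is listed in IGNORE are not retried; they
    propagate immediately so a higher level can respond to them.
    """
    for attempt in range(1, times + 1):
        try:
            return thunk()
        except ignore:
            raise  # let the caller handle this one
        except Exception:
            if attempt == times:
                raise  # out of attempts
            time.sleep(delay)
```

With this shape, a caller submitting a build result could list a "log file missing" error in ignore, catch it, upload the log, and then retry the submission itself.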
*	Improve the line length for the receiving outputs code	Christopher Baines	2020-10-24
\|
*	Allow configuring the s3-publish-hook with an aws command	Christopher Baines	2020-10-24
\| \| \| \|	So that an absolute filename can be used.
*	Add some validation for hooks	Christopher Baines	2020-10-24
\|
*	Make the s3 utils command configurable	Christopher Baines	2020-10-24
\| \| \| \|	In case you want to use the absolute location of the binary.
*	Remove unnecessary underscore	Christopher Baines	2020-10-23
\| \| \| \|	This matches a change in the guile prometheus library.
*	Move the post-publish-behaviour inside the s3 publish hook	Christopher Baines	2020-10-22
\| \| \| \| \| \|	So that it runs only when the narinfo on S3 actually changes. This should prevent it from running when the hook leaves S3 untouched because a narinfo already exists.
*	client-communication: Do not use a hard-coded uri.	Mathieu Othacehe	2020-10-20
\| \| \| \| \|	* guix-build-coordinator/client-communication.scm (send-request): Use coordinator-uri instead of the hard-coded localhost uri.
*	Display exception details prior to backtrace	Christopher Baines	2020-10-20
\| \| \| \| \|	To make sure some useful information makes it out, because (backtrace) can raise an exception.
*	Support extending the S3 publish hook	Christopher Baines	2020-10-19
\| \| \| \|	To allow doing things with the nar/narinfo files before they're deleted.
*	Improve .tmp build log file handling	Christopher Baines	2020-10-11
\| \| \| \|	Make more of an effort to ignore the .tmp files.
*	Show backtrace on agent exceptions	Christopher Baines	2020-10-11
\|
*	Move the registry file to a clearer name	Christopher Baines	2020-10-11
\| \| \| \| \|	This will be a breaking change for existing deployments, as the old sqitch.db file will need to be moved manually.
*	Add a hook to recompress build log files	Christopher Baines	2020-10-11
\|
*	Move around the code for build log file locations	Christopher Baines	2020-10-11
\| \| \| \| \| \|	build-log-file-location replaces build-log-file-exists?, as it doesn't always return a boolean. It now returns an absolute filepath for the log file if it exists, as this will be easier to use.
*	Exclude .tmp files when checking for build logs	Christopher Baines	2020-10-10
\|
*	Guard against receiving parts of build log files	Christopher Baines	2020-10-10
\|
*	Fix missing bad-request procedure	Christopher Baines	2020-10-07
\|
*	Separate the agent messaging server and client code	Christopher Baines	2020-10-07
\| \| \| \|	So that the client part doesn't depend on fibers.
*	Split the fibers utils from the main utils module	Christopher Baines	2020-10-07
\| \| \| \| \|	To start making it possible to use the agent, without having to load anything related to fibers (as it doesn't work on the hurd yet).
*	Guard against Guix Data Service requests hanging	Christopher Baines	2020-10-02
\| \| \| \| \|	I don't know if this is happening, but the hooks are getting stuck, and this might be a cause.