[PATCH v2 3/6] test: fence: Fix race condition
Dan Scally
dan.scally at ideasonboard.com
Sun Jun 2 00:13:49 CEST 2024
Hi Laurent - thanks for the patch
On 29/05/2024 16:43, Laurent Pinchart wrote:
> The fence test is racy, as it relies on the main loop being executed
> between completion of request signalledRequestId_ and
> signalledRequestId_ + 1. This usually happens, but is not guaranteed.
>
> To fix the race condition, change the request identification logic by
> replacing usage of the cookie value, which is zero-based and wraps
> around at nbuffers_ - 1, with a completed request counter that is
> one-based and doesn't wrap. The completedRequestId_, expiredRequestId_
> and signalledRequestId_ variables now track the identifier of the last
> request that has completed, the request whose fence will time out, and
> the request whose fence will be signalled.
>
> Signed-off-by: Laurent Pinchart <laurent.pinchart at ideasonboard.com>
Took me a while to wrap my head around it, but I think it's fine:
Daniel Scally <dan.scally at ideasonboard.com>
> ---
> Changes since v1:
>
> - Add and update comments
> - Fix typo in commit message
> ---
> test/fence.cpp | 57 +++++++++++++++++++++++++++++++-------------------
> 1 file changed, 36 insertions(+), 21 deletions(-)
>
> diff --git a/test/fence.cpp b/test/fence.cpp
> index c282886287aa..a8fba7284d82 100644
> --- a/test/fence.cpp
> +++ b/test/fence.cpp
> @@ -57,8 +57,11 @@ private:
> bool expectedCompletionResult_ = true;
> bool setFence_ = true;
>
> - unsigned int completedRequest_;
> -
> + /*
> + * Request IDs track the number of requests that have completed. They
> + * are one-based, and don't wrap.
> + */
> + unsigned int completedRequestId_;
> unsigned int signalledRequestId_;
> unsigned int expiredRequestId_;
> unsigned int nbuffers_;
> @@ -126,8 +129,19 @@ int FenceTest::init()
> return TestFail;
> }
>
> - signalledRequestId_ = nbuffers_ - 2;
> - expiredRequestId_ = nbuffers_ - 1;
> + completedRequestId_ = 0;
> +
> + /*
> + * All but two requests are queued without a fence. Request
> + * expiredRequestId_ will be queued with a fence that we won't signal
> + * (which is then expected to expire), and request signalledRequestId_
> + * will be queued with a fence that gets signalled. Select nbuffers_
> + * and nbuffers_ * 2 for those two requests, to space them by a few
> + * frames while still not requiring a long time for the test to
> + * complete.
> + */
> + expiredRequestId_ = nbuffers_;
> + signalledRequestId_ = nbuffers_ * 2;
>
> return TestPass;
> }
> @@ -189,16 +203,16 @@ void FenceTest::requestRequeue(Request *request)
> const Request::BufferMap &buffers = request->buffers();
> const Stream *stream = buffers.begin()->first;
> FrameBuffer *buffer = buffers.begin()->second;
> - uint64_t cookie = request->cookie();
>
> request->reuse();
>
> - if (cookie == signalledRequestId_ && setFence_) {
> + if (completedRequestId_ == signalledRequestId_ - nbuffers_ && setFence_) {
> /*
> - * The second time this request is queued add a fence to it.
> - *
> - * The main loop signals it by using a timer to write to the
> - * efd2_ file descriptor before the fence expires.
> + * This is the request that will be used to test fence
> + * signalling when it completes next time. Add a fence to it,
> + * using efd2_. The main loop will signal the fence by using a
> + * timer to write to the efd2_ file descriptor before the fence
> + * expires.
> */
> std::unique_ptr<Fence> fence =
> std::make_unique<Fence>(std::move(eventFd2_));
> @@ -213,16 +227,15 @@ void FenceTest::requestRequeue(Request *request)
>
> void FenceTest::requestComplete(Request *request)
> {
> - uint64_t cookie = request->cookie();
> - completedRequest_ = cookie;
> + completedRequestId_++;
>
> /*
> - * The last request is expected to fail as its fence has not been
> - * signaled.
> + * Request expiredRequestId_ is expected to fail as its fence has not
> + * been signalled.
> *
> * Validate the fence status but do not re-queue it.
> */
> - if (cookie == expiredRequestId_) {
> + if (completedRequestId_ == expiredRequestId_) {
> if (validateExpiredRequest(request) != TestPass)
> expectedCompletionResult_ = false;
>
> @@ -230,7 +243,7 @@ void FenceTest::requestComplete(Request *request)
> return;
> }
>
> - /* Validate all requests but the last. */
> + /* Validate all other requests. */
> if (validateRequest(request) != TestPass) {
> expectedCompletionResult_ = false;
>
> @@ -271,7 +284,7 @@ int FenceTest::run()
> }
>
> int ret;
> - if (i == expiredRequestId_) {
> + if (i == expiredRequestId_ - 1) {
> /* This request will have a fence, and it will expire. */
> std::unique_ptr<Fence> fence =
> std::make_unique<Fence>(std::move(eventFd_));
> @@ -318,11 +331,13 @@ int FenceTest::run()
> Timer timer;
> timer.start(1000ms);
> while (timer.isRunning() && expectedCompletionResult_) {
> - if (completedRequest_ == signalledRequestId_ && setFence_)
> + if (completedRequestId_ == signalledRequestId_ - 1 && setFence_)
> /*
> - * signalledRequestId_ has just completed and it has
> - * been re-queued with a fence. Start the timer to
> - * signal the fence in 10 msec.
> + * The request just before signalledRequestId_ has just
> + * completed. Request signalledRequestId_ has been
> + * queued with a fence, and libcamera is likely already
> + * waiting on the fence, or will soon. Start the timer
> + * to signal the fence in 10 msec.
> */
> fenceTimer.start(10ms);
>
More information about the libcamera-devel
mailing list