[libcamera-devel] [RFC PATCH v2 2/4] ipa: raspberrypi: Introduce an autofocus algorithm

Laurent Pinchart laurent.pinchart at ideasonboard.com
Sun Mar 27 23:35:12 CEST 2022


On Thu, Mar 24, 2022 at 10:54:21AM +0100, Jean-Michel Hautbois via libcamera-devel wrote:
> On 24/03/2022 00:49, Kieran Bingham wrote:
> > Quoting Jean-Michel Hautbois via libcamera-devel (2022-03-23 16:01:43)
> >> Now that the ancillary links are plumbed and we can set the lens
> >> position, implement a contrast-based algorithm for RPi. This algorithm
> >> is adapted from the one proposed for the IPU3 IPA.
> >>
> >> It is currently taking all the regions and tries to make the focus on
> >> the global scene in a first attempt.
> >>
> >> Signed-off-by: Jean-Michel Hautbois <jeanmichel.hautbois at ideasonboard.com>
> >> ---
> >>   .../raspberrypi/controller/af_algorithm.hpp   |  20 ++
> >>   src/ipa/raspberrypi/controller/af_status.h    |  31 +++
> >>   src/ipa/raspberrypi/controller/focus_status.h |   3 +
> >>   src/ipa/raspberrypi/controller/iob/af.cpp     | 231 ++++++++++++++++++
> >>   src/ipa/raspberrypi/controller/iob/af.h       |  55 +++++
> >>   src/ipa/raspberrypi/meson.build               |   1 +
> >>   6 files changed, 341 insertions(+)
> >>   create mode 100644 src/ipa/raspberrypi/controller/af_algorithm.hpp
> >>   create mode 100644 src/ipa/raspberrypi/controller/af_status.h
> >>   create mode 100644 src/ipa/raspberrypi/controller/iob/af.cpp
> >>   create mode 100644 src/ipa/raspberrypi/controller/iob/af.h
> >>
> >> diff --git a/src/ipa/raspberrypi/controller/af_algorithm.hpp b/src/ipa/raspberrypi/controller/af_algorithm.hpp
> >> new file mode 100644
> >> index 00000000..553a37e1
> >> --- /dev/null
> >> +++ b/src/ipa/raspberrypi/controller/af_algorithm.hpp
> >> @@ -0,0 +1,20 @@
> >> +/* SPDX-License-Identifier: BSD-2-Clause */
> >> +/*
> >> + * Copyright (C) 2019, Raspberry Pi (Trading) Limited
> >> + *
> >> + * af_algorithm.hpp - autofocus control algorithm interface
> >> + */
> >> +#pragma once
> >> +
> >> +#include "algorithm.hpp"
> >> +
> >> +namespace RPiController {
> >> +
> >> +class AfAlgorithm : public Algorithm
> >> +{
> >> +public:
> >> +       AfAlgorithm(Controller *controller) : Algorithm(controller) {}
> >> +       // An af algorithm must provide the following:
> >> +};
> >> +
> >> +} // namespace RPiController
> >> diff --git a/src/ipa/raspberrypi/controller/af_status.h b/src/ipa/raspberrypi/controller/af_status.h
> >> new file mode 100644
> >> index 00000000..835e1e2f
> >> --- /dev/null
> >> +++ b/src/ipa/raspberrypi/controller/af_status.h
> >> @@ -0,0 +1,31 @@
> >> +/* SPDX-License-Identifier: BSD-2-Clause */
> >> +/*
> >> + * Copyright (C) 2020, Raspberry Pi (Trading) Limited
> >> + * Copyright (C) 2022, Ideas On Board
> >> + *
> >> + * af_status.h - autofocus measurement status
> >> + */
> >> +#pragma once
> >> +
> >> +#include <linux/bcm2835-isp.h>
> >> +
> >> +/*
> >> + * The focus algorithm should post the following structure into the image's
> >> + * "af.status" metadata.
> >> + */
> >> +
> >> +#ifdef __cplusplus
> >> +extern "C" {
> >> +#endif
> > 
> > I don't understand why this extern C is required?
> 
> To be honest: it comes from a copy/paste of focus_state.hpp and I did 
> not think about it at all. All the _status files have this extern "C", I 
> suppose there is a reason, David ? Naushir ?
> 
> >> +
> >> +struct AfStatus {
> >> +       unsigned int num;
> >> +       uint32_t focus_measures[FOCUS_REGIONS];
> >> +       bool stable;
> >> +       uint32_t focus;
> >> +       double maxVariance;
> >> +};
> >> +
> >> +#ifdef __cplusplus
> >> +}
> >> +#endif
> >> diff --git a/src/ipa/raspberrypi/controller/focus_status.h b/src/ipa/raspberrypi/controller/focus_status.h
> >> index ace2fe2c..8122df4b 100644
> >> --- a/src/ipa/raspberrypi/controller/focus_status.h
> >> +++ b/src/ipa/raspberrypi/controller/focus_status.h
> >> @@ -19,6 +19,9 @@ extern "C" {
> >>   struct FocusStatus {
> >>          unsigned int num;
> >>          uint32_t focus_measures[FOCUS_REGIONS];
> >> +       bool stable;
> > 
> > I assume there will be different states for this later, depending on
> > what needs to get reported back in metadata, but that can be handled
> > when the AF controls get plumbed.
> 
> It is really to introduce it, I did not change the structure, as I think 
> it should be done at the same time as the controls introduction.
> 
> >> +       uint32_t focus;
> >> +       double maxVariance;
> >>   };
> >>   
> >>   #ifdef __cplusplus
> >> diff --git a/src/ipa/raspberrypi/controller/iob/af.cpp b/src/ipa/raspberrypi/controller/iob/af.cpp
> >> new file mode 100644
> >> index 00000000..dc5258ba
> >> --- /dev/null
> >> +++ b/src/ipa/raspberrypi/controller/iob/af.cpp
> >> @@ -0,0 +1,231 @@
> >> +/* SPDX-License-Identifier: LGPL-2.1-or-later */
> >> +/*
> >> + * Copyright (C) 2021, Red Hat
> >> + * Copyright (C) 2022, Ideas On Board
> >> + *
> >> + * af.cpp - automatic contrast-based focus algorithm
> >> + */
> >> +#include <cmath>
> >> +
> >> +#include <stdint.h>
> >> +
> >> +#include <libcamera/base/log.h>
> >> +
> >> +#include "af.h"
> >> +
> >> +using namespace RPiController;
> >> +using namespace libcamera;
> >> +
> >> +LOG_DEFINE_CATEGORY(IoBAf)
> >> +
> >> +#define NAME "iob.af"
> >> +
> >> +/*
> >> + * Maximum focus steps of the VCM control
> >> + * \todo should be obtained from the VCM driver
> > 
> > I think we're in a position where we should be doing this now.
> 
> Yes :-).
> 
> >> + */
> >> +static constexpr uint32_t kMaxFocusSteps = 1023;
> > 
> > pi at earth:~ $ v4l2-ctl -d /dev/v4l-subdev1 -l
> > 
> > Camera Controls
> > 
> >                   focus_absolute 0x009a090a (int)    : min=0 max=4095 step=1 default=0 value=1017
> > 
> >  From an Arducam IMX519 with an AK7375.

Note that the value gives the absolute maximum supported by the VCM
controller driver. It most likely won't match the maximum value that a
particular camera module should use. This should come from the tuning
data.

> >> +
> >> +/* Minimum focus step for searching appropriate focus */
> >> +static constexpr uint32_t kCoarseSearchStep = 30;
> >> +static constexpr uint32_t kFineSearchStep = 1;
> > 
> > The course step is probably going to have to be relative to either the
> > FineSearchStep or a division of the max by how long we want to take to
> > search?
> 
> Again, I think it should be done with the AF controls ?

Not sure to follow you here.

> >> +
> >> +/* Max ratio of variance change, 0.0 < kMaxChange < 1.0 */
> >> +static constexpr double kMaxChange = 0.5;
> >> +
> >> +/* The numbers of frame to be ignored, before performing focus scan. */
> >> +static constexpr uint32_t kIgnoreFrame = 10;
> > 
> > Why is this? Is that to let the AE/AGC settle?
> > 
> > Is there already a means in the RPi IPA to 'know' when that has settled
> > ?
> 
> Indeed, there is a mistrustCount_ which is used in 
> IPARPI::signalStatReady() so I think there is not need to add another 
> drop here... :-).
> 
> >> +
> >> +/* Fine scan range 0 < kFineRange < 1 */
> >> +static constexpr double kFineRange = 0.05;
> >> +
> >> +Af::Af(Controller *controller)
> >> +       : AfAlgorithm(controller), focus_(0), bestFocus_(0), ignoreCounter_(0),
> >> +         currentVariance_(0.0), previousVariance_(0.0), maxStep_(0),
> >> +         coarseCompleted_(false), fineCompleted_(false)
> >> +{
> >> +}
> >> +
> >> +char const *Af::Name() const
> >> +{
> >> +       return NAME;
> >> +}
> >> +
> >> +void Af::Initialise()
> >> +{
> >> +       status_.focus = 0.0;
> >> +       status_.maxVariance = 0.0;
> >> +       status_.stable = false;
> >> +}
> >> +
> >> +void Af::Prepare(Metadata *image_metadata)
> >> +{
> >> +       image_metadata->Set("af.status", status_);
> >> +}
> >> +
> >> +double Af::estimateVariance()
> >> +{
> >> +       unsigned int i;
> >> +       double mean;
> >> +       uint64_t total = 0;
> >> +       double var_sum = 0.0;
> >> +
> >> +       /* Compute the mean value. */
> >> +       for (i = 0; i < FOCUS_REGIONS; i++)
> >> +               total += status_.focus_measures[i];
> >> +       mean = total / FOCUS_REGIONS;
> >> +
> >> +       /* Compute the sum of the squared variance. */
> >> +       for (i = 0; i < FOCUS_REGIONS; i++)
> >> +               var_sum += std::pow(status_.focus_measures[i] - mean, 2);
> >> +
> >> +       return var_sum / FOCUS_REGIONS;
> >> +}
> >> +
> >> +bool Af::afNeedIgnoreFrame()
> >> +{
> >> +       if (ignoreCounter_ == 0)
> >> +               return false;
> >> +       else
> >> +               ignoreCounter_--;
> >> +       return true;
> >> +}
> >> +
> >> +void Af::afCoarseScan()
> >> +{
> >> +       if (coarseCompleted_)
> >> +               return;
> >> +
> >> +       if (afNeedIgnoreFrame())
> >> +               return;
> >> +
> >> +       if (afScan(kCoarseSearchStep)) {
> >> +               coarseCompleted_ = true;
> >> +               status_.maxVariance = 0;
> >> +               focus_ = status_.focus - (status_.focus * kFineRange);
> >> +               status_.focus = focus_;
> >> +               previousVariance_ = 0;
> >> +               maxStep_ = std::clamp(focus_ + static_cast<uint32_t>((focus_ * kFineRange)),
> >> +                                     0U, kMaxFocusSteps);
> >> +       }
> >> +}
> >> +
> >> +void Af::afFineScan()
> >> +{
> >> +       if (!coarseCompleted_)
> >> +               return;
> >> +
> >> +       if (afNeedIgnoreFrame())
> >> +               return;
> >> +
> >> +       if (afScan(kFineSearchStep)) {
> >> +               status_.stable = true;
> >> +               fineCompleted_ = true;
> >> +       }
> >> +}
> >> +
> >> +bool Af::afScan(uint32_t minSteps)
> >> +{
> >> +       if (focus_ > maxStep_) {
> >> +               /* If the max step is reached, move lens to the position. */
> >> +               status_.focus = bestFocus_;
> >> +               return true;
> >> +       } else {
> >> +               /*
> >> +                * Find the maximum of the variance by estimating its
> >> +                * derivative. If the direction changes, it means we have passed
> >> +                * a maximum one step before.
> >> +                */
> >> +               if ((currentVariance_ - status_.maxVariance) >=
> >> +                   -(status_.maxVariance * 0.1)) {
> >> +                       /*
> >> +                        * Positive and zero derivative:
> >> +                        * The variance is still increasing. The focus could be
> >> +                        * increased for the next comparison. Also, the max
> >> +                        * variance and previous focus value are updated.
> >> +                        */
> >> +                       bestFocus_ = focus_;
> >> +                       focus_ += minSteps;
> >> +                       status_.focus = focus_;
> >> +                       status_.maxVariance = currentVariance_;
> >> +               } else {
> >> +                       /*
> >> +                        * Negative derivative:
> >> +                        * The variance starts to decrease which means the maximum
> >> +                        * variance is found. Set focus step to previous good one
> >> +                        * then return immediately.
> >> +                        */
> >> +                       status_.focus = bestFocus_;
> >> +                       return true;
> >> +               }
> >> +       }
> >> +
> >> +       previousVariance_ = currentVariance_;
> >> +       LOG(IoBAf, Debug) << " Previous step is "
> >> +                         << bestFocus_
> >> +                         << " Current step is "
> >> +                         << focus_;
> >> +       return false;
> >> +}
> > 
> > This 'hill climb' stops at the first hill. Is it ever possible or
> > expected that there could be more than one hill? (I.e. could there be
> > another bigger hill after a small peak?)
> 
> Theretically we could have a local maximum lower than global maximum, yes.
> 
> > I wonder if the over all algorithm would need to sweep, and store the
> > values in a table - to then choose the best position - but that's quite
> > a rework from the current model - so it's just an open question for now,
> > I suspect keeping this as is for now is enough to get started, but it's
> > probably a research topic.
> 
> Considering a vcm with 4096 positions, and a step of 30, it would take 
> ~137 frames to sweep. So, something like 5 seconds at 30fps. We could 
> also say we want it to sweep for one second, and calculate 
> kCoarseSearchStep based on this, but I suspect it could pass the maximum 
> without noticing it...
> 
> /*
> 
> variance
>   ^
>   |               max        max'
>   |      |      |  |   |      |      |      |      |
>   |      |      |  |   |      |      |      |      |
>   |      |      |  |   |      |      |      |      |
>   |      |      |  |   |      |      |      |      |
>   |      |      |  v   |      |      |      |      |
>   |      |      |  xx  |      |      |      |      |
>   |      |      | xx x |  xxxx|      |      |      |
>   |      |      |xx   x|xx    |xx    |      |      |
>   |      |      |x     |      | x    |      |      |
>   |      |      |      |      |  x   |      |      |
>   |      |     x|      |      |  xx  |      |    xx|
>   |      |    xx|      |      |   x  |      |   xx |
>   |      |   x  |      |      |    x |      |  xx  |
>   |     x|xxx   |      |      |    xx|      |  x   |
>   |    xx|      |      |      |     x|      |xx    |
>   |    x |      |      |      |      |x     |x     |
>   |  xxx |      |      |      |      |xxxxxx|      |
>   |xxx   |      |      |      |      |      |      |
> -+------+------+------+------+------+------+------+--> position
>   |     step
>   */
> 
> Here, we sampled a maximum (max') which is not the real one (max).
> I suppose we have room for improvements, it is a first algorithm.
> 
> >> +
> >> +void Af::afReset()
> >> +{
> >> +       if (afNeedIgnoreFrame())
> >> +               return;
> >> +
> >> +       status_.maxVariance = 0;
> >> +       status_.focus = 0;
> >> +       focus_ = 0;
> >> +       status_.stable = false;
> >> +       ignoreCounter_ = kIgnoreFrame;
> >> +       previousVariance_ = 0.0;
> >> +       coarseCompleted_ = false;
> >> +       fineCompleted_ = false;
> >> +       maxStep_ = kMaxFocusSteps;
> >> +}
> >> +
> >> +bool Af::afIsOutOfFocus()
> >> +{
> >> +       const uint32_t diff_var = std::abs(currentVariance_ -
> >> +                                          status_.maxVariance);
> >> +       const double var_ratio = diff_var / status_.maxVariance;
> >> +       LOG(IoBAf, Debug) << "Variance change rate: "
> >> +                         << var_ratio
> >> +                         << " Current VCM step: "
> >> +                         << status_.focus;
> >> +       if (var_ratio > kMaxChange)
> >> +               return true;
> >> +       else
> >> +               return false;
> >> +}
> >> +
> >> +void Af::Process(StatisticsPtr &stats, Metadata *image_metadata)
> >> +{
> >> +       unsigned int i;
> >> +       image_metadata->Get("af.status", status_);
> >> +
> >> +       /* Use the second filter results only, and cache those. */
> >> +       for (i = 0; i < FOCUS_REGIONS; i++)
> >> +               status_.focus_measures[i] = stats->focus_stats[i].contrast_val[1][1]
> >> +                                         / stats->focus_stats[i].contrast_val_num[1][1];
> >> +       status_.num = i;
> >> +
> >> +       currentVariance_ = estimateVariance();
> >> +
> >> +       if (!status_.stable) {
> >> +               afCoarseScan();
> >> +               afFineScan();
> >> +       } else {
> >> +               if (afIsOutOfFocus())
> >> +                       afReset();
> >> +               else
> >> +                       ignoreCounter_ = kIgnoreFrame;
> >> +       }
> >> +}
> >> +
> >> +/* Register algorithm with the system. */
> >> +static Algorithm *Create(Controller *controller)
> >> +{
> >> +       return new Af(controller);
> >> +}
> >> +static RegisterAlgorithm reg(NAME, &Create);
> >> diff --git a/src/ipa/raspberrypi/controller/iob/af.h b/src/ipa/raspberrypi/controller/iob/af.h
> >> new file mode 100644
> >> index 00000000..45c9711f
> >> --- /dev/null
> >> +++ b/src/ipa/raspberrypi/controller/iob/af.h
> >> @@ -0,0 +1,55 @@
> >> +/* SPDX-License-Identifier: LGPL-2.1-or-later */
> >> +/*
> >> + * Copyright (C) 2021, Red Hat
> >> + * Copyright (C) 2022, Ideas On Board
> >> + *
> >> + * af.h - automatic contrast-based focus algorithm
> >> + */
> >> +#pragma once
> >> +
> >> +#include "../af_algorithm.hpp"
> >> +#include "../af_status.h"
> >> +#include "../metadata.hpp"
> >> +
> >> +namespace RPiController {
> >> +
> >> +class Af : public AfAlgorithm
> >> +{
> >> +public:
> >> +       Af(Controller *controller);
> >> +       char const *Name() const override;
> >> +       void Initialise() override;
> >> +       void Prepare(Metadata *image_metadata) override;
> >> +       void Process(StatisticsPtr &stats, Metadata *image_metadata) override;
> >> +private:
> >> +       double estimateVariance();
> >> +       bool afNeedIgnoreFrame();
> >> +       void afCoarseScan();
> >> +       void afFineScan();
> >> +       bool afScan(uint32_t minSteps);
> >> +       void afReset();
> >> +       bool afIsOutOfFocus();
> >> +
> >> +       AfStatus status_;
> >> +
> >> +       /* VCM step configuration. It is the current setting of the VCM step. */
> >> +       uint32_t focus_;
> >> +       /* The best VCM step. It is a local optimum VCM step during scanning. */
> >> +       uint32_t bestFocus_;
> >> +
> >> +       /* The frames ignored before starting measuring. */
> >> +       uint32_t ignoreCounter_;
> >> +
> >> +       /* Current AF statistic variance. */
> >> +       double currentVariance_;
> >> +       /* It is used to determine the derivative during scanning */
> >> +       double previousVariance_;
> >> +       /* The designated maximum range of focus scanning. */
> >> +       uint32_t maxStep_;
> >> +       /* If the coarse scan completes, it is set to true. */
> >> +       bool coarseCompleted_;
> >> +       /* If the fine scan completes, it is set to true. */
> >> +       bool fineCompleted_;
> >> +};
> >> +
> >> +} /* namespace RPiController */
> >> diff --git a/src/ipa/raspberrypi/meson.build b/src/ipa/raspberrypi/meson.build
> >> index 32897e07..37068ecc 100644
> >> --- a/src/ipa/raspberrypi/meson.build
> >> +++ b/src/ipa/raspberrypi/meson.build
> >> @@ -28,6 +28,7 @@ rpi_ipa_sources = files([
> >>       'controller/controller.cpp',
> >>       'controller/histogram.cpp',
> >>       'controller/algorithm.cpp',
> >> +    'controller/iob/af.cpp',
> >>       'controller/rpi/alsc.cpp',
> >>       'controller/rpi/awb.cpp',
> >>       'controller/rpi/sharpen.cpp',

-- 
Regards,

Laurent Pinchart


More information about the libcamera-devel mailing list