Evaluating the suitability of the consumer low-cost Parrot Flower Power soil moisture sensor for scientific environmental applications

Citizen science, scientific work and data collection conducted by or with non-experts, is rapidly growing. Although the potential of citizen science activities to generate enormous amounts of data otherwise not feasible is widely recognized, the obtained data are often treated with caution and scepticism. Their quality and reliability is not fully trusted since they are obtained by non-experts using low-cost instruments or scientifically non-verified methods. In this study, we evaluate the performance of Parrot’s Flower Power soil moisture sensor used within the European citizen science project the GROW Observatory (GROW; https: //growobservatory.org, last access: 30 March 2020). The aim of GROW is to enable scientists to validate satellite-based soil moisture products at an unprecedented high spatial resolution through crowdsourced data. To this end, it has mobilized thousands of citizens across Europe in science and climate actions, including hundreds who have been empowered to monitor soil moisture and other environmental variables within 24 high-density clusters around Europe covering different climate and soil conditions. Clearly, to serve as reference dataset, the quality of ground observations is crucial, especially if obtained from low-cost sensors. To investigate the accuracy of such measurements, the Flower Power sensors were evaluated in the lab and field. For the field trials, they were installed alongside professional soil moisture probes in the Hydrological Open Air Laboratory (HOAL) in Petzenkirchen, Austria. We assessed the skill of the low-cost sensors against the professional probes using various methods. Apart from common statistical metrics like correlation, bias, and root-mean-square difference, we investigated and compared the temporal stability, soil moisture memory, and the flagging statistics based on the International Soil Moisture Network (ISMN) quality indicators. We found a low intersensor variation in the lab and a high temporal agreement with the professional sensors in the field. The results of soil moisture memory and the ISMN quality flags analysis are in a comparable range for the low-cost and professional probes; only the temporal stability analysis shows a contrasting outcome. We demonstrate that low-cost sensors can be used to generate a dataset valuable for environmental monitoring and satellite validation and thus provide the basis for citizen-based soil moisture science.

Abstract. Citizen science, scientific work and data collection conducted by or with non-experts, is rapidly growing. Although the potential of citizen science activities to generate enormous amounts of data otherwise not feasible is widely recognized, the obtained data are often treated with caution and scepticism. Their quality and reliability is not fully trusted since they are obtained by non-experts using low-cost instruments or scientifically non-verified methods. In this study, we evaluate the performance of Parrot's Flower Power soil moisture sensor used within the European citizen science project the GROW Observatory (GROW; https: //growobservatory.org, last access: 30 March 2020). The aim of GROW is to enable scientists to validate satellite-based soil moisture products at an unprecedented high spatial resolution through crowdsourced data. To this end, it has mobilized thousands of citizens across Europe in science and climate actions, including hundreds who have been empowered to monitor soil moisture and other environmental variables within 24 high-density clusters around Europe covering different climate and soil conditions. Clearly, to serve as reference dataset, the quality of ground observations is crucial, especially if obtained from low-cost sensors. To investigate the accuracy of such measurements, the Flower Power sensors were evaluated in the lab and field. For the field trials, they were installed alongside professional soil moisture probes in the Hydrological Open Air Laboratory (HOAL) in Petzenkirchen, Austria. We assessed the skill of the low-cost sensors against the professional probes using various methods. Apart from common statistical metrics like correlation, bias, and root-mean-square difference, we investigated and compared the temporal stability, soil moisture memory, and the flagging statistics based on the International Soil Moisture Network (ISMN) quality indicators. We found a low intersensor variation in the lab and a high temporal agreement with the professional sensors in the field. The results of soil moisture memory and the ISMN quality flags analysis are in a comparable range for the low-cost and professional probes; only the temporal stability analysis shows a contrasting outcome. We demonstrate that low-cost sensors can be used to generate a dataset valuable for environmental monitoring and satellite validation and thus provide the basis for citizen-based soil moisture science.

Introduction
The importance of soil moisture for the hydrological cycle and the climate system is well known (Seneviratne et al., 2010). For decades techniques and instruments to measure and monitor water content in the soil at different spatial and temporal scales have been developed and improved. One of the most promising means to observe the state of soil moisture in the top soil layer on a global and long-term scale is microwave remote sensing. A range of active and passive satellite instruments have collected and are still gathering data, forming the basis for various soil moisture products including but not limited to Advanced Microwave Scanning Radiometer (AMRS-E) Land Parameter Retrieval Model (LPRM), Level 2 Passive Soil Moisture Product (L2SMP), Advanced SCATterometer (ASCAT) Soil Moisture Product, Sentinel-1 Surface Soil Moisture, and ESA Climate Change Initiative (CCI) Soil Moisture, (e.g. de Jeu et al., 2008;Chan et al., 2016;Wagner et al., 2013;Bauer-Marschallinger et al., 2019;Dorigo et al., 2017). Ground observations play a crucial role in evaluating and further improving these satellite products, particularly with respect to novel satellite missions that provide data at unprecedented spatial resolution, e.g. Copernicus Sentinel-1. Although several in situ soil moisture monitoring stations and networks have been established around the globe, ranging from short-term campaigns to long-term observations, and many of them provide their valuable data through the International Soil Moisture Network (ISMN; Dorigo et al., 2011), acquisition, installation, and maintenance of the professional equipment are costly and laborious. Thus, the coverage of such stations is mostly limited in space, density, and duration.
A large number of measurement techniques for estimating soil moisture at the point scale are available (Ochsner et al., 2013). Most common are methods based on electromagnetic waves, including time domain reflectometry (TDR) (e.g. Robinson et al., 2008) and capacitance sensors (e.g. Bogena et al., 2007). While TDR sensors are known for their higher accuracy, the advantage of capacitance probes is that they are less expensive while still providing sufficiently reliable readings (e.g. Kojima et al., 2016). For this reason, capacitance sensors have gained popularity and have often been investigated. While commonly being referred to as "low-cost" sensors (e.g. Mittelbach et al., 2011;Bogena et al., 2007;Kizito et al., 2008;Domínguez-Niño et al., 2019;González-Teruel et al., 2019;Matula et al., 2016), deploying them in high numbers and densities is still too expensive. The alternative is to either design and develop a completely new sensor (e.g. González-Teruel et al., 2019;Kojima et al., 2016), which is a time consuming and challenging task, or make use of existing commercial products, which are not designed for scientific use and are in fact low-cost sensors.
The latter idea was pursued by the European citizen science project GROW Observatory (GROW; https:// growobservatory.org/, last access: 30 March 2020). The aim of GROW is to generate a vast in situ soil moisture network across Europe maintained by citizens using the commercial soil moisture sensor Flower Power (FP), produced by the French company Parrot SA, in order to support remote sensing scientists. The involvement of non-experts in collecting scientific data has a long history, especially in the fields of biology, astronomy, and meteorology (Silvertown, 2009). In recent years, citizen science has aimed to go beyond mere data collection to introduce non-scientists to the research design, data interpretation, and application of results, which is also the scope of the GROW Observatory.
In this study, we assess the performance of the low-cost soil moisture sensor Flower Power (FP) from Parrot SA, used within the GROW Observatory. In particular, we evaluate the FP sensor under laboratory conditions and in an extensive field campaign performed in an agricultural catchment in Austria. The main objectives are to (i) assess the agreement of FP readings with gravimetric measurements, (ii) determine the intersensor variability, (iii) test the suitability of the sensors for outdoor usage, (iv) assess their agreement with professional soil moisture probes, and finally (v) conclude if the FP sensors are suitable for scientific environmental applications, in particular satellite validation.

Low-cost sensor
The low-cost sensor used in this study is the Flower Power (FP) sensor (Parrot, 2019c), a consumer product of the French company Parrot SA. It was designed for home use, indoor and outdoor, with the purpose of observing the condition of users' plants. The sensor provides information about soil humidity, air temperature, light intensity, and fertilizer content in the soil (Fig. 1). The soil water content is measured with a capacitance probe, consisting of two flat rods of 10 cm length.
This method, which has been known for many years (Dean et al., 1987), uses an oscillator to propagate an electromagnetic signal through the rods into the soil. The charging time of the electromagnetic field is related to the capacitance of the soil, which in turn is related to its dielectric permittivity. Consequently, as the dielectric permittivity is sensitive to water, the soil moisture content can be estimated. Kizito et al. (2008) summarized that the measurement frequency of capacitance probes is one of the main factors influencing the sensitivity of their measurements to soil texture, electrical conductivity, and temperature. Unfortunately, no information about the measurement frequency of the FP sensor is supplied by the manufacturer.
Two metal domes sitting on top of the FP sensor prongs estimate the fertilizer level by measuring the electric conductivity. When vertically installed in soil as designated, the prongs are completely sunk into the ground, and only a plastic fork of an approximate height of 9 cm with two ends remains above the soil. One end holds the battery compartment for one 1.5 V AAA battery, the other end an air temperature and light sensor, measuring visible light in the wavelength range between 400 and 700 nm. Observations of all four variables are automatically taken every 15 min and stored on the device, where data can be saved for up to 80 d before they get overwritten. One battery provides the sensor with enough energy to continue measuring for 6 to 12 months, depending on the weather conditions. Details about the available variables, their units, range, and accuracy indicated by the manufacturer can be found in Table 1. The observations collected by the FP sensor can only be accessed by the Parrot Flower Power app, which is available for iOS and Android smartphones. When positioned in close vicinity to the sensor, the app automatically connects via the Bluetooth Low Energy (LE) protocol and collects the stored data from the sensor. As soon as an internet connection is available the observations are uploaded to the Parrot Cloud (Parrot, 2019a). After this process the collected data can be viewed in the time series visualization of the app, in addition to the live mode, where only current values are visible. From the Parrot Cloud the data can be obtained in .csv format through an API provided by Parrot SA (Parrot, 2019b). The Parrot Flower Power app and cloud services were integrated as part of the GROW ecosystem of services. GROW furthermore augmented the FP sensor with social and scientific services to address the needs of citizens, scientists, and policy makers participating in the citizen observatory. That wider system architecture is outside the scope of this paper.

Reference sensors
Two conventional sensors were used to evaluate soil water content measurements of the FP sensor in the field: the time domain transmissivity (TDT) SPADE sensor (Qu et al., 2013;Sceme.de GmbH, 2019) and the 5TM (ECH2O 5TM from METER Group, Inc. USA) based on capacitance technology. The SPADE sensor consists of a sensor head of 8 cm length and a flat prong 3 cm wide and 12 cm long containing the closed transmission line of the ring oscillator. The frequency of the ring oscillator depends on the dielectric constant of the surrounding material, i.e. the soil. The determined dielectric constant can range from 1 to 85, with a relative accuracy given as ±4 % by the manufacturer (Sceme.de GmbH, 2019). It also measures soil temperature within a range from −10 to +85 • C, with an accuracy of ±0.5 • C (Qu et al., 2013). Observations are transmitted and stored according to the SoilNet wireless network technology developed at Forschungszentrum Jülich (Bogena et al., 2010).
The 5TM probe consists of three flat rods and has a total length of 10.9 cm and width of 3.4 cm. The mineral soil calibration, supplied by the manufacturer, is used, providing soil water content within a range of 0.0 to 1.0 m 3 m −3 , a resolution of 0.0008 m 3 m −3 , and an accuracy of ±0.03 m 3 m −3 , specified by the producer (METER Group, 2019). Soil temperature measured by the 5TM probe can range from −40 to +60 • C and is given with a resolution of 0.1 • C and an accuracy of ±1 • C. Data are logged with the Em50G Data Logger (METER Group, Inc. USA).
Ambient temperature readings of the low-cost sensor are compared with the conventional Vaisala HUMICAP humidity and temperature probe HMP155 (Vaisala Oyj, Finland), which has a measurement range from −80 to +60 • C (Vaisala, 2019). The CNR4 Net-Radiometer (Kipp & Zonen B.V., the Netherlands), a four-component net radiometer, serves as reference for the FP sensor light readings. In particular in this study, we use the incoming shortwave radiation which is observed within a spectral range from 300 to 2800 nm, with a sensitivity of 10 to 20 µV W −1 m −2 (Kipp and Zonen, 2019). The observed spectral range differs signif- icantly from that of the FP sensor, thus the observations are not comparable in absolute terms. Nevertheless, the relative temporal agreement between the FP sensor and the CNR4 can be determined.

Study area and sensor set-up
The study area is the Hydrological Open Air Laboratory (HOAL; http://hoal.hydrology.at; last access: 30 March 2020; Blöschl et al., 2016) located in Petzenkirchen (48 • 9 N, 15 • 9 E), Austria. The gentle hills of the agricultural catchment, covering an area of 66 ha, are situated at an average elevation of 296 m above sea level. The climate is characterized by a mean annual temperature of 9.5 • C and a mean annual precipitation of 823 mm per year (Blöschl et al., 2016). Blöschl et al. describe the different monitoring stations and devices installed in the HOAL catchment in order to investigate and answer a variety of scientific hypotheses. While four rain gauges were already set up in 2010, the weather station, located in the centre of the catchment, was fully equipped with instruments in 2012. Among other devices, the weather station accommodates the air temperature probe HMP155 and the four-component net radiometer CNR4, installed at heights of 2 and 2.5 m, respectively. Since 2013, the catchment is equipped with 20 permanent and 11 temporary soil moisture stations ( Fig. 2; Vreugdenhil et al., 2013). The permanent stations are located in pasture and forest; the temporary stations are installed in agricultural fields and have to be removed and re-installed on an irregular basis to allow for field management. All except one of the soil moisture stations are equipped with the SPADE TDT sensors. One station, namely "Hoal_D3", located next to the weather station, uses the 5TM sensor to observe soil water content and soil temperature at depths of 0.05 and 0.10 m and serves as a reference for the observations of the SPADE TDT sensors.
Except for this station, all soil moisture monitoring stations are equipped with horizontally installed sensors at four different depths: 0.05, 0.10, 0.20, and 0.50 m. In addition, two stations have a sensor at a depth of 1.00 m available. For nine stations, two probes collect soil moisture readings simultaneously at a depth of 0.05 m, and 11 stations are equipped with two probes at a depth of 0.10 m.
In Spring 2017, 37 FP sensors were placed alongside the 31 professional sensors in the HOAL catchment (Fig. 2), allowing for cross-validation between FP sensors at six locations. Between spring 2017 and spring 2019, 15 additional sensors were installed, seven to replace sensors with a failure and eight to allow for further cross-validation between FP sensors. Thus, in total 52 FP sensors were available to evaluate their performance with respect to 31 professional probes. Fifty sensors were installed as designated in a vertical position, providing information about the water content of the upper 10 cm of soil. Two sensors were buried at a depth of 0.05 m in a horizontal position to allow for a more direct comparison with the horizontally positioned professional probes. The exact horizontal distance between the FP sensors and the professional probes is unknown due to the different set up times. The position of the professional probes could only be estimated based on the wires and data logging devices above the ground. Thus, the maximum possible distance between both sensors is estimated to be 1 m, but it is assumed to be much less in most cases. In May 2018, one FP sensor (TUW90a) was attached to a rod at a height of 1.7 m next to the weather station with the purpose of collecting light intensity.
The observation period of the FP sensors varies due to different installation dates and temporary removal because of field management and ranges from a few months to 1.5 years. Three FP sensors were excluded from the analysis due to sensor malfunction and too short observation periods. For similar reasons six professional sensors at the depth of 0.05 m and 11 sensors at the depth of 0.10 m were neglected during the analysis. In general, only sensor pairs with more than 3 months of simultaneous observations were considered for further analysis. All investigations were performed on hourly values. For convenience all soil moisture values and results are expressed in cubic metres per cubic metre.

Methods
Two different approaches were applied to evaluate the performance of Parrot's Flower Power sensors. First, observations of five FP sensors were compared to gravimetrically measured soil water content in the laboratory (Calibration, see Sect. 3.1). Second, the Parrot FP sensors were installed in the field, and their measurements were compared to those from professional soil moisture probes by using four different metrics: (1) conventional statistical measures, (2) relative mean difference, (3) autocorrelation, and (4) independent quality indicators (Sect. 3.2). All comparisons were performed on hourly observations.

Calibration
The customer of the FP sensor has to use the predefined calibration for potting soil, as this sensor is a commercial product designed to monitor potted plants, and cannot choose between different calibration curves depending on local soil conditions which are typically available for professional sensors. Thus, in summer 2018, five Parrot FP sensors were calibrated for the dominant soil texture class present in the HOAL catchment under laboratory conditions, following a calibration procedure developed at the Institute for Land and Water Management Research (Federal Agency for Water Management Austria, Petzenkirchen, Austria). A homogeneous soil sample from the catchment with particle sizes of less than 2 mm was used for this exercise. The soil sample, characterized as silty clay loam, according to the United States Department of Agriculture (USDA) standards (Soil Science Division Staff, 2017), consists of 27.5 wt % clay, 65 wt % silt, and 7.5 wt % sand. The air-dry sample was divided into five equal parts in order to investigate sensor performance for five different amounts of soil water content: air dry, 0.08, 0.20, 0.30 m 3 m −3 , and fully saturated. The respective amount of water was added to the air-dry soil samples, manually mixed, and left to rest under protected conditions over night to ensure a homogeneous water content throughout the entire sample. A Boyle-Mariotte bottle was used to obtain the fully saturated soil sample. The gravimetric water content of the five soil samples was determined in order to establish the respective soil volume needed to fill five cylinders (15 cm diameter, 30 cm height) with a bulk density of 1.3 g cm −3 . The five cylinders were filled with the five soil samples following the same technique: one-fifth of the soil sample was added to the cylinder and manually compacted with a proctor plate until the soil specimen reached a height of 5 cm in the cylinder and thus the desired density. The soil surface was roughened, and the same procedure was repeated for the remaining four-fifths until the cylinder was filled up to a height of 25 cm. After all five cylinders were filled, the five FP sensors were subsequently installed in undisturbed spots of the five soil samples. The according time stamp and soil moisture readings in live mode of the Parrot App were recorded immediately and again before the FP sensor was removed. The actual soil moisture readings were extracted afterwards from the Parrot cloud. The FP sensors were left in each soil sample for at least 20 min to ensure that at the minimum two data records per soil sample were automatically stored on the sensor. As soon as the sensor measurements were finished small soil samples were extracted from the five cylinders, and their water content was derived gravimetrically to serve as reference for the sensor readings.

Sensor-to-sensor comparison
The measurements collected by the FP sensors in the field were compared to the sensor readings taken with professional probes described in Sect. 2.2.

Temporal agreement
Commonly known statistical measures including Pearson correlation coefficient (R), standard deviation, bias, rootmean-squared difference (RMSD), and unbiased root-meansquared difference (uRMSD) were used to determine the agreement between the in situ soil moisture recordings of the FP and the professional sensors. Observations from the SPADE and 5TM sensors at depths of 5 and 10 cm served as a reference. Thirty-three sensor pairs with an average observation period of 10 months were used for comparison between FP sensors and professional probes at a depth of 0.05 m (Table A1 in Appendix). For the comparison with the professional sensors at a depth of 0.10 m, 27 sensor pairs were analysed.
In addition to the soil water content, the air temperature measurements of five FP sensors were evaluated with respect to the HMP155 air temperature and relative humidity sensor positioned at a height of 2 m at the weather station of the HOAL catchment. Four of the five FP sensors investigated were conventionally installed in soil and encompass all four land cover classes of the catchment (grassland, cropland, sparse forest, and forest). The fifth sensor is located at the weather station 1.7 m above the soil and was also used to evaluate the light intensity observations against the downwelling shortwave radiation obtained by the CNR4 Net-Radiometer.

Temporal stability
The concept of spatial soil moisture patterns being persistent over time is called temporal stability; it was introduced by Vachaud et al. (1985) and is often investigated (e.g. Cosh, 2004;Brocca et al., 2010Brocca et al., , 2012Caldwell et al., 2018). Temporally stable and representative sensors for an entire area or catchment with respect to the average conditions in that area can be identified through this method, even as locations that are consistently wetter or drier than the overall average. For this purpose the mean relative difference (MRD) was used, according to the following definition where θ i,j is the water content at hour j and sensor i, and θ j refers to the average over all sensors at hour j . Consequently, a negative (positive) MRD i means that sensor i represents a drier (wetter) location with respect to the catchment average. The sensor with the lowest absolute MRD is the best estimate 122 A. Xaver et al.: Consumer low-cost sensor for scientific applications for the catchment average. The according variance σ 2 i results from the equation and measures the temporal stability of the relation between a sensor i and the overall mean.
Our goal was to compare the soil moisture temporal stability of both sensor types, the low-cost and the professional sensors. In particular, we were interested in whether equal sites were identified as dry and wet locations based on the both sensor sets. A total of 26 sensor pairs (low-cost and professional) formed the basis for the temporal stability analysis; only one sensor pair per location was used. In the case of two suitable sensors of the same type (FP sensors or professional sensors) being available at the same location, the one with the longer observation period was chosen.

Soil moisture memory
The capability of soil to store information of an anomalous condition (e.g. rainfall, drought) long after its occurrence is commonly referred to as soil moisture memory (Delworth and Manabe, 1988). Memory timescales of root-zone soil moisture are typically on the order of a week to a few months, depending on soil properties and meteorological variables (Ghannam et al., 2016).
Soil moisture memory is commonly derived by the timelagged autocorrelation given as where τ describes the time lag and λ refers to the decay timescale or e-folding time, at which the autocorrelation r reduces to 1/e (Delworth and Manabe, 1988). It represents a measure of how long a soil moisture anomaly can be detected and, more importantly, influences the atmosphere. For this reason, soil moisture memory is being well investigated and implemented by land-climate modellers (e.g. Maurer et al., 2001;Koster and Suarez, 2001;Sörensson and Berbery, 2015). The importance and potential of soil moisture memory are also recognized in other research areas. For example, Rebel et al. (2012) and Piles et al. (2018) used an autocorrelation analysis to compare the temporal dynamics captured by different soil moisture products, i.e. in situ, modelled, and satellite-based. We applied this technique to the catchment average soil moisture derived from both sensor types, using the same number of FP and professional sensors, and expect that any arising differences are mainly driven by the individual sensor characteristics since the temporal dynamics, i.e. anomalies, of the catchment are the same. Considering that less than 2 years of observations were available for the FP sensors, the soil moisture climatology and consequently the anomalies were derived based on moving averages of 35 d (Dorigo et al., 2015;Albergel et al., 2012). To ensure comparability the same method for deriving the climatology and anomalies was applied to the observations from the professional sensors. Dorigo et al. (2013) developed automated quality control procedures that were specifically designed for in situ soil moisture observations and which have been implemented in the International Soil Moisture Network (ISMN; https: //ismn.geo.tuwien.ac.at/; last access: 30 March 2020 Dorigo et al., 2011). The ISMN is an international initiative to make in situ soil moisture observations available in a harmonized format. The automated quality control procedures of the ISMN comprise 13 different quality identifiers (C01-C03; D01-D10; see Table 3) based on three different approaches. First, a simple threshold-based technique identifies values that are outside of a reasonable geophysical range (C01-C03). Second, a geophysical consistency test investigates the plausibility of soil moisture observations in connection with additional environmental variables such as soil temperature and precipitation (D01-D05). Third, suspicious measurements are detected based solely on the spectrum of the soil moisture time series (D06-D10). To every soil moisture observation one or more quality identifiers are attached. In the case that none of the three approaches apply to the underlying observation, the identifier G (good) is assigned. For more details we refer to Dorigo et al. (2013).

Automated quality control
In this study, we applied the ISMN quality flags to both the data from the low-cost sensors and from the professional probes, with the purpose of comparing the flagging statistics and investigating if the percentage of flagged observations differs substantially. In order to make the results comparable, the ISMN quality identifiers were applied to the same number of FP and professional devices, i.e. one FP sensor per location.

Calibration results
The gravimetrically determined and the observed soil water content obtained by using the five FP sensors are shown in Fig. 3 and Table A2. For low soil water content from 0.03 m 3 m −3 (air dry) to 0.20 m 3 m −3 , the FP sensors consistently measure higher values than the actual water content. All five sensors show the highest (positive) deviation of 0.06 to 0.09 m 3 m −3 at the gravimetrically derived water content of 0.08 m 3 m −3 . For the higher soil water content the deviation is much lower and ranges from 0.01 to 0.02 m 3 m −3 . The measurements of the five low-cost sensors are consistent; only sensor 196 shows consistently higher values than the other four sensors. The overall intersensor variation is very low with only 0.01 m 3 m −3 , which is better than the sensor accuracy provided by the producer. Our results for the silty clay loam soil in the HOAL catchment are consistent with the findings from Kovács et al. (2019) who evaluated the performance of 28 FP sensors in the laboratory using a different approach for four other soil types: sandy loam, clay loam, loam, and loamy sand. Although differences were observed for the four soil types the authors report the following similar behaviour of the FP sensors: positive deviation from actual water content in dry conditions, with highest deviation for very dry soils, which coincides with our findings. The authors further report a negative deviation for water content above 0.40 m 3 m −3 . We cannot observe this negative deviation from the gravimetric water content as the high clay content of the silty clay loam soil from the HOAL catchment prohibits a manual preparation of a water content higher than 0.30 m 3 m −3 , and full saturation is reached at the water content of 0.48 m 3 m −3 .
In order to establish a calibration function for the FP sensor observations in the HOAL catchment, a linear function was fitted through the observations obtained in the laboratory experiment based on the weighted least squares method (Fig. 3). Following this method, the weights of the linear regression for the different water content levels were calculated as the inverse variances of the five FP sensors (Table A2). Only the regression weight of the water content level of 0.30 m 3 m −3 was divided in half and set to 0.3 as we assume that the sensor readings are not fully reliable. At this specific water level the clay particles started to aggregate, due to the high clay content in the soil, and small gaps of air developed during the process of manually mixing the soil. As a consequence, it is likely that the FP sensors underestimated the actual water content. Thus, the lower weight was given to the observations at this gravimetric water content. The resulting equation of the fitted function, SM cal = 1.16SM FP − 0.07 (in cubic metres per cubic metre(m 3 m −3 )), where SM FP represents the FP sensor soil moisture observation and SM cal stands for the calibrated soil water content, is valid for silty clay loam and was applied to all observations obtained by the FP sensors in the HOAL catchment.  device is heavily warmed by incoming solar radiation, heating up the plastic cover of the sensor and falsifying the reading of the temperature sensor which sits right below the plastic cover. This temperature effect is apparent from recorded temperature observations exceeding plausible values measured with professional devices, especially during the summer months, when the sun reaches its maximum strength at this latitude (Fig. 4a). Figure 4b compares air temperature observations of the FP sensor TUW46a, located in the sparse forest, with the professional air temperature sensor, installed in grassland cover at the weather station. The protective effect of the vegetation canopy during the summer months is clearly visible. In the beginning of May, when the leaves in the sparse forest are not fully grown, the sensor is still partly reached and warmed by direct sun light, thus leading to an overestimation of the air temperature. As soon as the canopy cover of the deciduous trees is fully developed and the FP sensor is shaded, the air temperature of the FP sensors corresponds well with the measurements of the professional probe. When leaf fall starts in October and the FP sensor is not protected from the incoming radiation anymore, the deviation from the professional probe is apparent again. Figure 5a illustrates the daily temperature ranges of the professional air temperature sensor HMP155 and the FP sensors TUW90a with respect to their daily maximum observation. The warming effect of the FP sensor caused by the incoming solar radiation is clearly reflected in the higher maximum values reached by the low-cost sensor. In Fig. 4, it is visible that sensor TUW90a records lower minimum values than the professional sensor, which can be confirmed by comparing the recorded minimum temperatures of both probes ( Fig. A1a in Appendix). It is unclear if this offset is a general characteristic of the FP temperature probe. Sensor TUW46a shows a contrasting behaviour (Fig. A1b), recording consistently higher daily minimum values than the HMP155 probe, which could be a result of the different sensor location; TUW46a is located in the sparse forest, while HMP155 and TUW90a are installed at the weather station. The variation in daily temperature range (Fig. 5b) is comparable to the daily variation recorded by the conventional probe. Only between daily maximum temperatures of 25 and 30 • C does the daily range of the FP sensor vary more than for the HMP155, which is probably a result of the temperature effect discussed earlier. Overall, a high agreement with the professional air temperature device was reached, indicating that the ambient temperature measurements of the FP sensor can be beneficial for environmental applications. Certainly, the warming effect of the plastic cover of the FP sensor due to incoming solar radiation has to be considered. The light intensity observed by the FP sensor could serve as a reference to filter out untrustworthy temperature observations. If the absolute temperature values are of interest, further investigations are recommended as our initial investigation showed a possible offset from observations obtained with the professional probe. Figure 6 shows the light intensity observed by the FP sensor TUW90a, installed at 1.7 m height, in relation to the downwelling shortwave radiation obtained from the professional sensor CNR4 Net-Radiometer, both located at the weather station of the HOAL catchment. Despite the much smaller range of incoming shortwave radiation observed by the FP sensors than the professional probe, i.e. only the visible domain, a reasonable agreement underpinned by a correlation value of 0.87 is reached. Computing the bias between the observations made by the devices is unfeasible because of the different wavelength ranges observed and the different measurement units. A variation in the incoming radiation throughout the seasons can be observed for both devices. In Fig. 7 this annual cycle is clearly visible in the observations of the CNR4 probe. For the FP sensor TUW46a, located in sparse forest, the effect of fully developed canopy cover during the summer season shading the sensor is evident, resulting in a low correlation value of 0.29 with the professional probe, which is located in the open field.

Light level
The light intensity observations of the FP probes are certainly useful for tracking the development of canopy cover, i.e. when the sensor is shaded, and providing a reliability measure for the temperature measurements. Due to the lack of a professional sensor with the same observational wavelength range we are not able to provide a more detailed analysis. Thus, for any further applications we recommend a comparison of the FP sensor observations with an appropriate professional device.

Soil moisture
For the sensor-to-sensor comparison between the FP sensors and the professional probes at a depth of 0.05 m, an overall correlation coefficient of 0.80 was obtained, with individual correlation coefficients ranging from 0.30 to 0.97 (p < 0.001). The average bias is −0.05 m 3 m −3 , and the overall RMSD is 0.08 m 3 m −3 (uRMSD 0.05 m 3 m −3 ). For the comparison with the professional sensors at a depth of 0.10 m, an overall correlation of 0.79, a bias of −0.06 m 3 m −3 , and a RMSD of 0.10 m 3 m −3 (uRMSD 0.05 m 3 m −3 ) were observed. The results are visualized in Taylor diagrams (Taylor, 2001) in Figs. 8 and A7.
The FP sensors and professional probes agree well in both depths. As the results compared with the professional probes at depths of 0.05 and 0.10 m are in a very similar range and  are driven by similar characteristics; only the comparison with the near-surface layer is discussed in more detail. Figure 9 serves as an example of the overall good agreement between a FP sensor and a professional probe. For this location, precipitation events are captured simultaneously by both sensor types, resulting in synchronous time series shapes and a high correlation value of 0.96. In comparison with the scientific probe the FP sensor shows higher noise (0.005 m 3 m −3 ) and intra-daily variability (0.015 m 3 m −3 ), which are most likely due to a sensitivity to temperature resulting from the measurement frequency; both effects are well below the sensor accuracy provided by the manufacturer. In general, the measurements of the FP sensors are characterized by pronounced responses to precipitation events, while often much weaker or in some cases no reaction is recorded by the professional probes. This different reaction strength causes the standard deviation of the FP sensors to be considerably higher than the standard deviation of the professional probes, in the case of sensor TUW267a even up to 2.2 times as high (Fig. A2). The considerable differ-ence in absolute soil moisture values, visible in the overall results (Fig. 8), can also be explained by the vertical installation of the FP sensors. The low-cost devices record the soil water content from the surface, which is wetting and drying out fast, down to a depth of 10 cm. The horizontally installed professional probes are less affected by the drying surface and in general record higher water content levels. In addition, differences in the absolute water content can be caused by local variations in soil texture and formed soil aggregates. Although the FP devices were calibrated for the dominant soil texture type present in the HOAL catchment, the conditions in the laboratory are ideal (e.g. homogeneous soil sample), and less perfect conditions with local variations in soil texture and formed aggregates (e.g. even macro pores) are to be expected in the field. These local variations in soil can also differ within the small horizontal distance between the locations of the professional and low-cost sensors and thus can also contribute to the deviation in absolute values between both devices. Another reason for observed instances of poor correspondence, represented by low correlation val-A. Xaver et al.: Consumer low-cost sensor for scientific applications Figure 7. Same as Fig. 6, but for FP sensor TUW46a (located in sparse forest). ues, is suspicious behaviour or sensor failure of one of or both sensors, e.g. extreme response to negative temperatures (e.g. Fig. A4).
The soil moisture readings of the two FP sensors buried in a horizontal position at a depth of 0.05 m agree very well with the observations of the horizontally installed professional probes (e.g. Fig. A3). High correlation values of 0.96 and 0.97 are reached. In addition, the deviation in absolute values is low with the bias varying from 0.01 to 0.04 and RMSD from 0.02 to 0.05 m 3 m −3 (uRMSD from 0.02 to 0.03 m 3 m −3 ). This observation supports the hypothesis that part of the discrepancies observed for the other sensors stem from the differences in installation (i.e. vertical vs. horizontal).
So far we have demonstrated the performance of the FP sensors in comparison with the TDT SPADE sensors. Now, we investigate the results with respect to the scientific probe 5TM. The two sensors, both based on the capacitance technology, show a good temporal agreement with a correlation value of 0.89 (Fig. A5). Clearly visible is the broader value domain of the low-cost soil moisture observations, which results in a RMSD of 0.06 m 3 m −3 and a normalized standard deviation of 1.77, and could be a consequence of not only the different sensor position but also the different measurement frequencies used by the two sensor types. The statistical metrics obtained by the second FP sensor installed next to the 5TM probe show almost identical values, confirming the low intersensor variability already observed during calibration. No clear dependencies of the FP sensor performance compared with the professional devices on land cover or topography were found (not shown).
The intercomparison of five pairwise-installed FP sensors with an average observation length of 11 months resulted in an overall correlation of 0.87, a bias of 0.01 m 3 m −3 , and a RMSD of 0.08 m 3 m −3 (uRMSD 0.04 m 3 m −3 ) (Fig. A8). Individual correlation values range from 0.55 to 0.98 (p < 0.001); the individual RMSD values range from 0.04 to 0.13 m 3 m −3 (uRMSD 0.02 to 0.07 m 3 m −3 ), and the bias ranges from −0.07 to 0.13 m 3 m −3 .
The noticeable high bias of 0.13 m 3 m −3 (Fig. A8), which occurs for the sensor pair TUW37a and TUW56a located in the forest, can be explained by their different sensor position. FP sensor TUW56a is installed horizontally at a depth of 0.05 m and shows constantly higher water content levels than the vertically installed sensor TUW37a, which might be due to the faster drying near the soil surface or different texture conditions. Nonetheless, their good temporal agreement is underpinned by a correlation value of 0.95 (Fig. A6). In addition to the difference in the overall soil moisture level and the amplitude during the wetting process, another effect caused by the different sensor position is visible: the sensor installed in a vertical position shows steeper drying curves, reflecting the faster drying process at the soil surface. This characteristic is even more prominent for the sensor pair installed in sparse forest and is responsible for a high RMSD of 0.10 m 3 m −3 (uRMSD 0.06 m 3 m −3 ). The difference in strength of the described characteristics of both horizontally installed FP sensors compared with their vertically installed counterparts is most likely driven by differences in soil texture at the two locations in the forest and sparse forest, respectively.
The lowest correlation result of 0.55, accompanied by a noticeable RMSD of 0.09 m 3 m −3 (uRMSD 0.07 m 3 m −3 ), is obtained for the sensor pair TUW30a and TUW31a, which is located in cropland and seems to be driven by suspicious observations of sensor TUW30a during the winter season. The same FP sensor is also responsible for the lowest correlation value achieved (0.30) in the comparison with the professional probe at a depth of 0.05 m (Fig. 8).
Overall, the high temporal agreement of the FP sensor soil moisture observations with the readings from professional devices is evident, thus allowing the use of the low-cost sensor scientific applications focusing on temporal dynamics. The clear deviation in absolute terms from conventional sensors is mostly a consequence of the different sensor position, as the improved statistics of the buried FP sensors showed. Figure 10 shows the calculated mean relative differences and the correlation between individual sensors and the catchment mean for the FP sensors (Fig. 10a) and the corresponding professional sensors at a depth of 0.05 m (Fig. 10b). The FP sensors and corresponding professional probes are sorted by the mean relative difference values of the FP sensors. The error bars refer to the standard deviation of the mean relative difference between the catchment average and the individual sensor observations. While FP sensor TUW19a was identified as the most representative for the catchment average derived from the lowcost sensors, for the professional probes Hoal_13 was found to be the most representative. Although both sensors are located in the same land cover class (sparse forest) they are not located next to each other. A similar outcome was obtained for the standard deviation of the mean relative difference. The most representative low-cost and professional sensors, TUW34a and Hoal_30, respectively, are both located in cropland but in different parts of the catchment. In general, distinct higher variation in and deviation from the mean relative difference in the FP sensors compared with the professional probes are noticeable and are expected to be a consequence of the high variance in the water content, driven by the vertical sensor position already discussed in Sect. 4.2.1.

Temporal stability
Although known wet locations (area around sensors TUW45a and TUW46a) of the catchment could correctly be identified by the FP sensors as wetter than the catchment average, the known dry area of the catchment, i.e. the forest, is not clearly represented by the FP sensors (TUW35a and TUW37a). Only sensor TUW37a was clearly identified as drier than the network average. In addition, sensor locations identified as wetter than the catchment average based on the professional sensors were identified as drier than the average when considering the low-cost sensors (e.g. TUW24a and Hoal_22) or vice versa (e.g. TUW13a and Hoal_06). The difference in absolute water content values can also result from local differences in soil texture. Although the FP sensors were installed in close vicinity to the professional probes the exact distance is often unknown (due to different installation dates) but is estimated to be no more than 1 m. Consequently, the deviation in absolute values and mean relative differences cannot solely be reduced to the different sensor types. When looking at the relative agreement with the low-cost sensor-based catchment average a considerable high overall correlation of 0.80 was reached, with individual values ranging from 0.48 to 0.94 (p < 0.001) (Figs. 10, A9). The two sensors, TUW19a and TUW34a, previously identified as most representative are in good agreement with the catchment average when considering the correlation. Noticeably, FP sensor TUW30a is according to the derived mean relative difference a close approximation of the catchment average, but the low correlation value reveals the existing problems of this sensor, which we already discussed in the previous section. The correlation results based on the data of the professional sensors are within a comparable range, from 0.46 to 0.96 (p < 0.001) and an overall correlation value of 0.84. Again, for both sensor types, sensors at different locations in the catchment were identified as representing the catchment average the best, as follows: TUW25a of the lowcost sensors, located in grassland, and the professional sensor Hoal_04 in sparse forest, respectively. No clear dependency of the relative agreement with the catchment average on the land cover is visible.
In summary, the temporal stability results of the FP sensors clearly differ from those of the professional probes and are thus unsatisfactory. Although it has to be recognized that the mean relative difference is strongly affected by absolute soil moisture values, which themselves can strongly be influenced by local variation in soil texture. Figure 11 shows the autocorrelation functions and e-folding times, λ, for the catchment average based on the FP sensors and the professional sensors at a depth of 0.05 m, respectively. The resulting e-folding time for the FP sensors is 83 h, reached 7 h earlier than the e-folding time of the professional probes. This is most likely a result of the vertical installation position of the FP sensors, causing pronounced reactions to precipitation events as shown and described in Sect. 4.2.1. While the horizontally installed professional probes show a weaker wetting and drying signal after rain events, causing the autocorrelation curve to be slightly flatter and reaching the e-folding time a few hours delayed. On average an e- folding time of about 3.5 d was estimated by both sensor types, which is in accordance with e-folding times of in situ observations in existing literature (Ghannam et al., 2016).

Soil moisture memory
When investigating the e-folding times with respect to the different land cover classes (see Table 2), we find similar results. For three of the four land cover classes, the obtained e-folding times are reached earlier for the low-cost sensors than for the professional probes. The only exception is the land cover class sparse forest, for which a persistence of soil moisture anomalies 4 h longer was identified for the FP sensor observations. For both sensor types the sparse forest was the land cover class with the shortest soil moisture memory. This can be explained by the known wet locations be- ing situated in sparse forest (area around sensors TUW45a and TUW46a). Rahman et al. (2015) and Orth and Seneviratne (2013) found a faster decay of soil moisture memory in wet regions than in dry areas, explained by usually smaller anomalies in the already wet soil. The same effect can be valid on a catchment scale. This is in agreement with the longer persistence of anomalies in the forest, the known dry area of the catchment. As already visible in the temporal stability results, the forest was identified as dry based on the professional sensors but was less clear for the FP sensors.
Consequently, a stronger memory in the forest is reflected by the professional probes.

Automated quality control
Overall, 24.6 % of all soil moisture observations obtained by the FP sensors were flagged according to the ISMN quality procedures, whereas 19.7 % of the measurements from professional sensors at a depth of 0.05 m were flagged. The flagged observations can be subdivided in different subcategories (Table 3).
The majority of observations identified as suspicious consists of geophysically consistency identifiers (D01-D05), for both the FP and professional sensor observations. Especially the temperature-related quality indicators (D01-D03) are merely a consequence of the climate prevalent at the HOAL catchment, which is characterized by pronounced winter seasons with temperatures below 0 • C, and not a sign for suspicious sensor behaviour. Naturally, air reaches freezing temperatures more frequently than soil. Thus, the percentage of air-temperature-dependent flagged observations is much higher for the FP probes that measure air temperature than for the scientific devices that measure soil temperature. A considerably larger percentage of soil moisture observations during periods with negative Global Land Data Assimilation System (GLDAS) soil temperature (D03) is noticeable for the professional sensors than for the low-cost devices. This difference results from a 20 d data gap for all SPADE sensors in the HOAL catchment in spring 2018, caused by a damaged fibre-optic cable. Both sensor types also recorded a considerable number of values that exceeded the saturation point (identifier C03), derived from the Harmonized World Soil Database (FAO/IIASA/ISRIC/ISSCAS/JRC, 2012). As we are already aware of the high sensitivity of the FP sensors to rain events, driven by their vertical sensor position, we are not surprised about the higher percentage of soil moisture observations above the saturation point. Pronounced differences in the flagging statistics can be observed for the following ISMN quality indicators: D09, D10, and C02. With exception of D09, a higher percentage of flags was identified for the professional sensor observations. The D09 flag identifies a period of measurements with very low variability following a distinct negative break. On the one hand, this flag is detected in FP observations during winter season, when the soil moisture drops due to low temperatures and remains at a low level. Consequently, the D02 identifier is also attached to these observations. On the other hand, it was observed that soil moisture values drop to zero and remain there for some time without any visible physical damage of the FP sensor. The reason is unknown, but we speculate that this is a consequence of corroding contacts.
Measurements detected outside the geophysically plausible range of 0.60 m 3 m −3 capture a problem prevalent for the professional sensors. Soil moisture suddenly rises to unrealistically high values and remains at this level for some time. We speculate that this is caused by an energy supply problem or a connection problem with the data logger. The same suspicious measurements are also responsible for a higher number of D10 identifiers than observed in the FP observations. Although the underlying algorithm for identifying suspicious positive breaks (D08) caught only a small percentage of FP sensor observations, it is worth mentioning that mainly precipitation-induced rises in soil moisture triggered this quality identifier.
Overall, the ISMN quality flagging statistics underpin that the low-cost sensors provide meaningful observations comparable to those from professional probes. The main sensor characteristic, the high sensitivity to precipitation due to the vertical installation position, was captured by the quality identifiers. For any scientific usage the FP sensor flaw of unexplained soil moisture values of zero, revealed by the D09 flag, has to be considered by masking the zero values. Although the number of these zero values can take up a large portion of the time series of a sensor, fortunately this problem occurred only for a very limited number of FP sensors (three out of 52). Thus, this problem can be overcome, either by installing multiple low-cost sensors at the same location or by removing those observations retrospectively, during data analysis .

Example: scientific application
To demonstrate the suitability of the FP sensors for scientific applications, we evaluated the Advanced SCATterometer (ASCAT) remotely sensed soil moisture product (H113, H114) at the HOAL catchment by using the low-cost and the professional sensors. The ASCAT instruments on board the Meteorological Operational (MetOp) satellite series provide  observations with a spatial resolution of 25 km and a revisit time of 1-2 d. The retrieved soil moisture product (Wagner et al., 1999Naeimi et al., 2009) represents soil moisture in the upmost layer of the soil and is given in a relative unit as degree of saturation. To overcome the mismatch of units and spatial resolution between the satellite product and the in situ sensors, the satellite-based observations were rescaled to the mean and standard deviation of the ground data. In order to ensure longest possible observation periods of the ground observations, only permanently installed sensors were used to derive the catchment average. Daily means of water content were calculated and disregarded if air temperature values were below 3 • C. For the FP sensors this filtering was done based on their air temperature records; for the professional probes air temperature observations from the weather station in the middle of the HOAL catchment were applied. In addition to the absolute soil moisture values, we compared the anomalies of the satellite and the ground data, which we derived by using a 35 d moving average. The evaluation results are shown in Table 4. The correlations between the satellite product and the ground observations based on the absolute soil moisture values are high and almost identical with 0.75 and 0.76 for the lowcost and professional sensors, respectively. The same is true for the uRMSD with 0.03 m 3 m −3 for the FP sensors and 0.02 m 3 m −3 for the scientific sensors. When considering the soil moisture anomalies, the uRMSD is identical for both sensor types. A much weaker agreement between the anomalies of the satellite product and the sensors was obtained with correlation values of 0.42 and 0.26, where the higher correlation value was achieved with the low-cost values.
These results herein demonstrate the suitability of the FP sensors for satellite validation. It is expected that many other scientific applications can benefit from the use of low-cost soil moisture sensors, i.e. the FP sensor, and a variety of scientific experiments become possible only due to the use of low-cost devices.

Conclusions
In this study we investigated the performance and scientific suitability of a commercial low-cost soil moisture sensor in the laboratory and in the field, which lead us to the following conclusions: -Laboratory calibration showed that the FP sensor overestimates soil water content in dry conditions of the silty clay loam soil of the Austrian catchment. Similar to pro-fessional sensors a site specific calibration is highly recommended if the absolute amount of soil moisture is of interest. For applications where relative measurements have priority, i.e. satellite validation, FP sensors are very well suited without laboratory calibration.
-In addition, we investigated the intersensor variability in the FP sensor in the laboratory and found very small differences between the sensors, which confirms their reliability.
-The comparison with professional soil moisture sensors in the field shows a high agreement and confirms their ability to capture the wetting and drying process in the soil. Discrepancies were mostly driven by the different sensor position (vertical/horizontal) and local variations in soil texture.
-The strongest weakness of this sensor, of recording zero soil moisture values without any physical damage, was observed only for a very limited number of devices and can be overcome by a clever experiment set-up. The compactness, simplicity, and affordable price of the sensor allows for installation of multiple devices at the same location and easy re-installation or replacement. In general, these characteristics hold the potential for many scientific applications in which a large number and high density of sensors are otherwise not feasible with professional sensors, e.g. Zappa et al. (2020), and Zappa et al. (2019).
-Field testing has proven that the FP sensor can endure outdoor conditions. In fact, the longest surviving FP sensor within the framework of GROW is still recording reasonable data after more than 3 years in the field. This shows that the FP sensor can be deployed for shortand medium-term applications, while long-term usability still has to be investigated.
Overall, we conclude that the FP sensor is suitable for scientific environmental applications, and we expect that many scientific studies will benefit from the use and abilities of the FP sensor data collected within the GROW project. The deployment of such a low-cost sensor within a citizen observatory can provide an effective mechanism to support current Earth observation capabilities.        Data availability. The in situ soil moisture observations of the lowcost sensors are available through University of Dundee and the ISMN (International Soil Moisture Network). The in situ soil moisture observations of the professional sensors will become available in the ISMN.
Author contributions. AX conceived and designed the experiments; AX analysed the data and wrote the paper; AX and LZ collected the FP sensor measurements with support of GR; AX conducted the laboratory calibration with help of IP and LZ; IP and MV provided the soil moisture data of the professional sensors; GR provided data of all other used professional sensors; and DH and WAD contributed with their expertise. All authors participated in the revision of the paper.
Competing interests. The authors declare that they have no conflict of interest.