I appreciate the substantial changes to the manuscript based on the reviewer comments. There are a few remaining issues that I find not yet acceptable, but I see the paper on a good path to be published soon with minor revisions.
## Major
- In general, I appreciate the conservative discussion of the results in the revised manuscript, but some formulations are still inadequate. The muon and gamma data do not show convincing agreement to scientifically expected results (low correlation to Medusa, no sign of FD events). At most, these results could be called "a first indication" that there is something behind it, but they need to be investigated in further studies (as was discussed, thank you). Still, I have to be strict here and point out some remaining inconsistencies that need to be reformulated to avoid misunderstanding and misinterpretation. Please see below.
- Nov 2021 is a clear sign of a Forbush Decrease (FD). This effect has global influence on the neutron counts everywhere and should be corrected for all CRNS data. However, your Figure 8 indicates that the measured muons do not recognize the FD event, and are not very well in sync with the other features from JUNG, too. Hence, to me this is a clear indication that the measured muons are not suited for CRNS correction. You mentioned the reason that "FINAPP3 muon detector has been optimized to follow relative long-term variability (weeks to months)". How was this "optimization" done? Can't you show the original raw hourly muons flux? If the muon data is smoothed over many days, I won't consider it of any use for incoming neutron correction. In my view, this should be communicated as a major flaw of the method.
- The authors have responded to the request for a comparison of the gamma sensor with a traditional gamma sensor by adding section 3.4. Here, the authors mentiond that the correlation is low (R²=0.077) "mainly due to the presence of some outliers". This is not sound science. Please remove the outliers before publishing those results, or restrict the analysis to periods without precipitation. Otherwise readers cannot draw any conclusion from such a comparison.
- In the same section, the authors provide a 6h average of the data, present a correlation of R²=0.29, and conclude "the dynamic is well captured". Again with a super-tiny plot with grey-in-greyish lines (Fig 9, right) and no-transparent super-thick point markers clumping above each other (left). It is impossible to visually identify any details, shifts, biases, lag times, etc. Please update your general presentation of the results (see cosmetics). Please also avoid calling these results "well capturing the dynamics". A rough tendency to follow the Medusa data is visible, but the low correlation and the significant deviations indicate that the data products represent different environmental effects and should be used with care. Please be more clear about this in the text.
- The authors now mention that "air temperature measured at 2 m hight provides a good approximation on the muon effect (de Mendonça et al., 2016)". From a quick browse of the cited study, I see strong complexity in this relationship, with seasonal temperatures in upper and lower athmospheric zones being completely out of phase in some regions of the Earth, while being similar in others. From this the authors should conclude that this is a serious issue that should be investigated in the future, instead of making the impression that taking near-ground temperature is a good and generally applicable approach.
## Minor
- Thanks for adding Fig. 7 to test the relationship of muons on athmosperic data. To me, the large spread in the correlation to air pressure is suspicious, maybe this indicates the influence of factors to the signal other than cosmogenic muons? It may be worth a note.
- Eq 3: Do you mean 1+alpha instead of 1-alpha? Have you corrected for air humidity following Rosolem et al. 2013 properly?
- It is not clear from the methods description nor Figure 3 where the Medusa system was located.
- It is not clear from the methods description in what year the measurements have been taken. The whole presentation appeared a bit complicated to me, as many different types of data have been recorded during many different periods, even at the same sites, but also among different sites. I'd appreciate an overview either in text, plot, or table, to better streamline the readers understanding.
- The authors added a zoom on on the neutron data comparison and I am ackowledging it. While the point cloud in the correlation plot is not insightful (better use smaller points of non-filled points or transparency), the dynamics of the neutron data looks convincing. However, the CRS2000 and Lab-C data seem to be more stable (right panels c and f). Is this due to the higher count rate, or could it be that Finapp sees other influencing factors, too? Maybe the uncertainty bands could be indicated or mentioned in the caption.
- For a more conserverative formulation, I'd suggest to change in the conclusions "Muons were found to be a possible alternative for incoming correction for CRNS application" to "Muons were found to be a potential candidate to support the correction for incoming cosmic rays."
- The sentence in the conclusion is misleading: "On the other hand, the use of gamma-ray spectrometry was identified as an alternative method for non-invasive soil moisture estimation.". I'd assume that you are referring to the cited paper who found this. But it was not found in your study as no spectrometer was used. Make more clear that this has been found by authors which is what motivated you to test a total gamma counter (which led to different findings).
## Cosmetics
- It is very hard to identify dates within the plates for international readers. X tick marks like "03/07" are not very helpful and very confusing. Please use informative tick marks like "Jul 03". Also, most plots do not show a year, not even in the caption. It could be added to the first tick mark or to the caption.
- Most plots are very hard to read due to small panel size and greyish look. Avoid grey background, use clean, thin, black-and-white axis objects to increase the contrast, avoid colors whenever possible, choose meaningful axis ticks (see above), use column headings to highlight which panels correspond to which sites (e.g., Fig 8). Generally, consider using full-width plots for time series data instead of narrow panels that barely span 20% of the page width. |
The authors present a novel set of experiments using a new sensor to simultaneously measure neutrons, muons, and gammas. The new sensor is compared to conventional sensors with satisfactory results for longer time periods of integration (i.e. 6 hrs for neutrons). The technology is lighter and potentially a lower cost, which will open up doors for more applications in science and in practical applications. The manuscript is well written and appropriate for the journal. I have a few suggestions that should be addressed prior to publication. Some instances of English grammar and word choice will need to be addressed.
L33: “Runoff generation”
L 40: “More recently, attention”
L 44: “has shown”
Figure 1: Are events outside of the blue and red ovals not included in the analysis?
L165: For the gammas are there any corrections needed for pressure or air temperature/humidity variations?
L247: I would use the SG filter on the neutron/gamma count time series, not the soil moisture time series that have been transformed by the calibration function.
L 302-305: This sentence is confusing and long. Please rewrite.
Figure 7. Will be interesting to compare the Muon detection with the correction factor being proposed by McJannet and Desilets using cutoff rigidity and atmospheric depth with the NMDB historical data. Unfortunately, that work is in the review process still.
Figure 8. So the soil moisture data is from FINAPP and not depth weighted following the Schron method? If you did have gravimetric surveys how would you depth weight them for the gamma sensitivity? From my understanding they would have a similar sensitivity with depth as the neutrons but be a little shallower (10-15cm?)?
Table 1. Iwema et al. 2015 recommends 3 calibrations for estimating N0. From the variability here I would do at least 3 to estimate some uncertainty on N0. I agree additional gravimetric studies are needed, particularly for establishing the gamma to soil moisture dependence, especially when used in cropping systems with significant temporal variations in vegetation biomass. Unfortunately, for CRNS and GRS studies all roads don’t lead to Rome but to more gravimetric sampling :).
Iwema, J., Rosolem, R., Baatz, R., Wagener, T., & Bogena, H. (2015). Investigating temporal field sampling strategies for site-specific calibration of three soil moisture–neutron intensity parameterisation methods. HESS, 19, 3203–3216. https://doi.org/10.5194/hess-19-3203-2015