Re: [Ifeffit] Question about transform windows and statistical parameters

Matt, On May 13, 2011, at 8:39 AM, Matt Newville wrote:
I'm confused. We Fourier transform k-weighted data. Since Ifeffit uses the high-R amplitude to estimate uncertainty, it seems to me that what matters is signal-to-noise, not just noise in the original unweighted chi(k). Am I wrong in that? I may be misunderstanding how epsilon_r is calculated. And epsilon_r is the relevant epsilon for a fit in R space, right?
I just tried it on the FeC2O4 chi(k) attached to this post. It's a good example of data where it's not immediately clear to me what the "best" value for kmax is, so it would be tempting to use RCS to compare fits over different k-ranges. I used k-weight 3, and Hanning windows with dk = 1. I chose kmin as 2 and stepped kmax by 0.5, recording epsilon_r for each: kmax epsilon_r 7 0.034840105 7.5 0.041843848 8 0.082627337 8.5 0.087550367 9 0.086032007 9.5 0.085996216 10 0.088679339 10.5 0.090364699 11 0.092509939 11.5 0.108103081 There's a general trend of increasing epsilon_r with an increase in k. There's also a jump of a factor of 2 between 7.5 and 8. Why? Because there's a glitch there, and the glitch adds high-R structure. To make sure there wasn't something odd about this particular chi(k), I took one of the data sets included with the horae distribution: the file y300.chi in the ybco folder. I followed the same procedure as before, except I stepped by 1 inverse angstrom each time, because of the greater data range. kmax epsilon_r 7 0.012866125 8 0.073383695 9 0.078255772 10 0.080016040 11 0.091634572 12 0.105419473 13 0.164341701 14 0.195266957 15 0.224727593 16 0.411139882 17 0.480293296 If anything, the trend is more clear here.
I'm not making any claims as to whether, in general, the noise in the data depends on R. I can speculate about circumstances where low R noise is greater (due, for instance, to temperature fluctuations in cooling water, which are likely to be fairly slow), or where high R noise is greater (an example here would be if whatever system is keeping the beam on the sample vertically as the mono scans is tending to overshoot). But Ifeffit's estimation of epsilon_r demonstrably does not depend on the R-range used for fitting, regardless of the distribution of noise in R. That's a very different thing. Thus, changing the R-range of a fit is completely safe as far as comparing RCS goes. --Scott Calvin Sarah Lawrence College

Hi Scott,
Sorry, I read epsilon as "noise in chi(k)". This is the most
meaningful physical/statistical measure: epsilon_r surely depends on
k-weight and can depend on k-range as it samples different portions of
the spectra. Like you say, it will tend to increase as you increase
the k-range.
On Fri, May 13, 2011 at 11:58 AM, Scott Calvin
Well, except for that jump (which I would say is appropriate, as the spike add weights at all frequencies), I'd say epsilon_r is pretty constant, varying by 10% (not bad for a crude estimate) up to k=11. |chi(k)| drops by considerably over that range, possibly to well below the noise level by k=10. So the higher end there is clearly not going to help the fit -- all you're adding is noise.
Between 8 and 12 Ang^-1 there is what I would call a small change You're certainly adding more noise and progressively less signal as you increase k, even for a noise level in chi(k) that does not depend of k. There are sharp features that could easily be considered "white noise". But I don't strongly disagree either -- epsilon_r does definitely increase as you increase the k-range.
Ah, OK, I think I see what you were getting at. But I think the epsilon_r and epsilon_k are still roughly good for using reduced chi-square to compare fits of different k- and R-ranges. If anything, the estimate in the number of independent points is a much cruder estimate than the estimate of epsilon. --Matt
participants (2)
-
Matt Newville
-
Scott Calvin