Difference between revisions of "Talk:Circular Error Probable"

From ShotStat
Jump to: navigation, search
(Dependency and Correlation)
(Dependency and Correlation)
Line 105: Line 105:
<br />[[User:Herb|Herb]] ([[User talk:Herb|talk]]) 11:14, 4 June 2015 (EDT)
<br />[[User:Herb|Herb]] ([[User talk:Herb|talk]]) 11:14, 4 June 2015 (EDT)
: ''Regarding the earlier question: We know that in general correlation and dependence are not the same thing.  But as Armadillo noted, it is redundant to say we're using a bivariate normal with independent ''and'' uncorrelated axes. [[User:David|David]] ([[User talk:David|talk]]) 12:26, 4 June 2015 (EDT)''

Revision as of 12:26, 4 June 2015

I think the section on "Systematic Accuracy Bias" misses the point.

Consider two experiments.

(1) Sight weapon then Shoot 10 groups

(2) (a)Sight weapon then shoot group (b) do (a) 10 times

Method (1) has accuracy bias but method (2) does not.

Herb (talk) 18:53, 1 June 2015 (EDT)

Here's my take: "Systematic accuracy bias" on this page means that the true center of the shot distribution is different from the point of aim (POA). The CEP-radius can then be calculated around two possible centers - either a) around the known POA, or b) around the unknown true distribution mean.
a) takes into account systematic accuracy bias because the further away the true center of the shot distribution is from the POA, the larger the CEP around the POA becomes, holding distribution spread constant. b) does not take into account systematic accuracy bias because CEP does not vary with increasing distance from true group center to POA, holding distribution spread constant.
In a) only distribution spread (or the full covariance matrix) has to be estimated from observed data, in b) the true center also has to be estimated.
Any help in getting this issue better across is highly appreciated.

RE: The CEP-radius can then be calculated around two possible centers - either a) around the known POA, or b) around the unknown true distribution mean. from above.

The wiki in general uses a third point, the \(\overline{POI_{sample}}\), from which the measurements are calculated. It is the only point for which an actual measurement can be obtained from the sample of shots.

All in all semantics, but the wiki ought to be consistent unless there is some reason to deviate. If a deviation is required, then it should be explicitly stated.

Herb (talk) 16:34, 2 June 2015 (EDT)

True, we can sample the POI, but there exists a true POI. We can use the true POI in our models regardless of whether we can sample or estimate it in practice. That's true of many of the variables and measures discussed throughout the site: There is a true value we may (or may not) be able to estimate, and then there are sample values (which may be used to estimate the true value). David (talk) 16:39, 2 June 2015 (EDT)
David, I fully agree. My explanation was concerned with the true CEP - which is unfortunately lacking its own distinct symbol. The true CEP has an estimator \(\widehat{CEP}\) which uses \(\overline{POI_{sample}}\) as an estimate for the true group center (when systematic accuracy bias is not taken into account).
How about using * as a superscript to denote the value for the population? So \(CEP^*\) for the population \(CEP\) measure taken from the true \(COI^*\)? I had a mental fart using \(\overline{POI_{sample}}\). It is already well defined in the literature as \(COI\). So \(COI\) (sample of shots) as opposed to \(COI^*\) (the population of shots).
I never doubted that the population values existed it is just that the typical frame of reference is the \(COI\) (from the shot sample), not \(COI^*\) (the population of shots). Since you hop around in a wiki, I think that the wiki has to be more diligent in using a consistent nomenclature than you might use in a book which is read more linearly.
So I would change first paragraph to be:
Rayleigh: When the true center of the shot distribution, \(COI^*\), coincides with the true point of aim, \(POA^*\), then radial error around the \(COI^*\) for a bivariate distribution with the horizontal and vertical measurements uncorrelated, independent, and normally distributed with equal variances, follows a Rayleigh distribution. This distribution is described in the Closed Form Precision section. In three dimensions (spherical error probable, SEP), the radial error follows a Maxwell-Boltzmann distribution which is the basis for the ideal gas laws in chemistry.
So when we measure \(CEP\) it is from \(COI\). Thus due to sampling there are biases floating around as \(COI^*\) and \(POI^*\) and a bias between \(COI\) and \(COI^*\).
Herb (talk) 11:34, 3 June 2015 (EDT)
So * denotes the true ("population") value, and the absence of a * denotes the sample value? I thought it usually went the other way around – i.e., asterisk/hash/bar for sample value, but as long as it's consistent here I'll live. So you're proposing:
  1. Markup variables with "*" whenever they are a true value and the variable may also denote a sample value. (Therefore, there would be no POA* because there is no such thing as a sample POA, right?)
  2. Use "COI" (Center of Impact) everywhere instead of "POI" (Point of Impact).
David (talk) 17:16, 3 June 2015 (EDT)
No, I'm proposing "*" just for some variables (measures):
 \(COI\) - center of impact measured for n shots on a target
 \(COI^*\) - "true" center of impact, ie if COI could be measured for an infinite number of shots (sighting error essentially).
 \(POA\) - position where you were really aiming for a shot
 \(\overline{POA}\) - position where you were really aiming averaged over one target
 \(\overline{POA}_m\) - position where you were really aiming averaged over m targets
 \(POA^*\) - "true" point of aim, ie as if measured for an infinite number of shots
The point here is process control. For instance if you could pluck out \(\sigma_{POA}\) then you could work to improve that aspect of your shooting. As it is \(\sigma_{POA}^2\) is buried in \(\sigma_{System}^2\) and virtually impossible to detect - if you're shooting close to decently. Think of a bad check weld when using sights.
My thought was that \(\overline{COI}_m\) would be averrage COI over \(m\) targets.
On second though how about \(\overline{COI}_\infty\) instead of \(COI^*\)??
For CEP calculation that takes into account systematic accuracy, the main point about POA is that it needs to be known, i.e. there is no uncertainty or need for estimation. Therefore "POA" has the meaning of "this is where the bullets should go". In the literature on this (eg. Grubbs), POA is typically taken as coordinates (0,0). I agree that "POA" can have a different meaning along the lines of "this is where the sight was actually pointing", but as you say, it's not clear how that could be estimated independently.
In the above suggested paragraph uncorrelated and equal variances are actually probably the most redundant, but I think it necessary to make both points clearly. In other words, I'm sure only about 1% of the readers would understand why the two are redundant. Also "uncorrelated" and "independent" are two different concepts.
Herb (talk) 12:43, 3 June 2015 (EDT)
I haven't verified it, but I think the Rayleigh model may only require that the orthogonal variables have equal variance and be uncorrelated. I.e., I don't think independence is a requirement. David (talk) 17:16, 3 June 2015 (EDT)
I have. Look at the figure from the wikipedia article Correlation and dependence. The Rayleigh Distribution is shown in top row center. All the shot patterns on the bottom row have 0 correlation. The shots are obviously not independent in H-V. The hollow circle would have equal variance too. Obviously a hollow circle is a contrived example, but paying attention to such examples is the way to keep statistics from biting your ass. :-(
Herb (talk) 20:44, 3 June 2015 (EDT)
For the Rayleigh model, we are assuming bivariate normality of h and v coordinates. For bivariate normal variables, "independence" and "0 correlation" are equivalent. "0 correlation" and "equal variances" are not redundant as you can have 0 correlation with unequal variances as well as correlated variables with equal variances.

Correlation and Variances

If the variances are not equal you get an ellipse.

Correlation is based on probabilistic expectation, not because of an causality interaction between the two variables ( h and v). So if the shot pattern is an ellipse instead of a circle there will be correlation. Think of fitting a straight line to the data with linear least squares. There is a correlation of the data to the long axis of the ellipse. The rub here is that correlation does not prove causation. ie. just because we find an ellipse doesn't mean that h = Function(v) or that v = Function(h).
Herb (talk) 11:14, 4 June 2015 (EDT)

Dependency and Correlation

H and V are both \(\mathcal{N}(0,\,\sigma^2)\)

shoot a target.

Thus there is no correlation. ie, no straight line to fit with linear least squares. There is just a random radial pattern.

I calculate CEP(50) and make a circular steel plate that big. Say the plate is 10 inches.

I put the center of the plate over the center of the target and shoot. I take plate off and look at holes in target The holes form an annulus. There is no linear correlation because there is no line upon which the data bunches.

But given H=1, is it possible that V=2? No that shot would have hit the steel plate and was stopped. So H and V are not independent now. In other words, can you tell which target was shot with the plate and which without?

Yes this shot pattern seems unrealistic in the real world, but that isn't how statistics works. We can only use assumptions that we started with. For example given the starting normal distributions above, let's assume that I'm shooting a handgun. I ask the computer "What is the probability of my pistol shot going 5 miles high along the vertical axis?" The computer will spit out a number. With no external ballistics in probability model there is nothing to tell computer that such a situation is impossible.

See wikipedia article Correlation and dependence

Herb (talk) 11:14, 4 June 2015 (EDT)

Regarding the earlier question: We know that in general correlation and dependence are not the same thing. But as Armadillo noted, it is redundant to say we're using a bivariate normal with independent and uncorrelated axes. David (talk) 12:26, 4 June 2015 (EDT)