A7.2 Statistical testing of 2 D position error and TTFF for A-GPS and A-GNSS Minimum Performance test cases

3GPP51.010-1Mobile Station (MS) conformance specificationPart 1: Conformance specificationTS

Tools: ARFCN - Frequency Conversion for 5G NR/LTE/UMTS/GSM

A7.2.1 Test Method

Each test is performed in the following manner:

a) Setup the required test conditions.

b) Measure the 2D position and Time to First Fix repeated times. Start each repetition after having applied the message ‘RESET MS POSITIONING STORED INFORMATION’. This ensures that each result is independent from the previous one. The results, measured, are simplified to:

good result, if the 2D position and TTFF are ≤ limit.

bad result, if the 2D position or TTFF or both are > limit

c) Record the number of results (ns) and the number of bad results (ne)

d) Stop the test at a pass or an fail event.

e) Once the test is stopped, decide according to the pass fail decision rules ( A7.2.4.2)

A7.2.2 Error Ratio (ER)

The Error Ratio (ER) is defined as the ratio of bad results (ne) to all results (ns).

(1-ER is the success ratio)

A7.2.3 Test Design

A statistical test is characterised by:

Test-time, Selectivity and Confidence level

A7.2.3.1 Confidence level

The outcome of a statistical test is a decision. This decision may be correct or in-correct. The Confidence Level CL describes the probability that the decision is a correct one. The complement is the wrong decision probability (risk) D = 1-CL

A7.2.3.2 Introduction: Supplier Risk versus Customer Risk

There are two targets of decision:

a) A measurement on the pass-limit shows, that the DUT has the specified quality or is better with probability CL (CL e.g.95%) This shall lead to a “pass decision”

The pass-limit is on the good side of the specified DUT-quality. A more stringent CL (CL e.g.99%) shifts the pass-limit further into the good direction. Given that the quality of the DUTs is distributed, a greater CL passes less and better DUTs.

A measurement on the bad side of the pass-limit is simply “not pass” (undecided)

aa) Complementary:

A measurement on the fail-limit shows, that the DUT is worse than the specified quality with probability CL.

The fail-limit is on the bad side of the specified DUT-quality. A more stringent CL shifts the fail-limit further into the bad direction. Given that the quality of the DUTs is distributed, a greater CL fails less and worse DUTs.

A measurement on the good side of the fail-limit is simply “not fail”.

b) A DUT, known to have the specified quality, shall be measured and decided pass with probability CL. This leads to the pass limit.

For CL e.g. 95%, the pass limit is on the bad side of the specified DUT-quality. CL e.g.99% shifts the pass-limit further into the bad direction. Given that the DUT-quality is distributed, a greater CL passes more and worse DUTs.

bb) A DUT, known to be an (ε0) beyond the specified quality, shall be measured and decided fail with probability CL.

For CL e.g.95%, the fail limit is on the good side of the specified DUT-quality.

NOTE: the different sense for CL in (a), (aa) versus (b), (bb).

NOTE: for constant CL in all 4 bullets (a) is equivalent to (bb) and (aa) is equivalent to (b).

A7.2.3.3 Supplier Risk versus Customer Risk

The table below summarizes the different targets of decision.

Table A7.2.3.3 Equivalent statements

	Equivalent statements, using different cause-to-effect-directions, and assuming CL = constant >0.5
cause-to-effect-directions	Known measurement result  estimation of the DUT’s quality	Known DUT’s quality  estimation of the measurement’s outcome
Supplier Risk	A measurement on the pass-limit shows, that the DUT has the specified quality or is better (a)	A DUT, known to have an (ε0) beyond the specified DUT-quality, shall be measured and decided fail (bb)
Customer Risk	A measurement on the fail-limit shall shows, that the DUT is worse than the specified quality (aa)	A DUT, known to have the specified quality, shall be measured and decided pass (b)

NOTE: The bold text shows the obvious interpretation of Supplier Risk and Customer Risk.
The same statements can be based on other DUT-quality-definitions.

A7.2.3.4 Introduction: Standard test versus early decision concept

In standard statistical tests, a certain number of results (ns) is predefined in advance of the test. After ns results the number of bad results (ne) is counted and the error ratio (ER) is calculated as ne/ns.

Applying statistical theory, a decision limit can be designed, against which the calculated ER is compared to derive the decision. Such a limit is one decision point and is characterised by:

– D: the wrong decision probability (a predefined parameter)

– ns: the number of results (a fixed predefined parameter)

– ne: the number of bad results (the limit based on just ns)

In the formula for the limit, D and ns are parameters and ne is the variable. In the standard test ns and D are constant. The property of such a test is: It discriminate between two states only, depending on the test design:

– pass (with CL) / undecided (undecided in the sense: finally undecided)

– fail (with CL) / undecided (undecided in the sense: finally undecided)

– pass(with CL) / fail (with CL) (however against two limits).

In contrast to the standard statistical tests, the early decision concept predefines a set of (ne,ns) co-ordinates, representing the limit-curve for decision. After each result a preliminary ER is calculated and compared against the limit-curve. After each result one may make the decision or not (undecided for later decision) The parameters and variables in the limit-curve for the early decision concept have a similar but not equal meaning:

– D: the wrong decision probability (a predefined parameter)

– ns: the number of results (a variable parameter)

– ne: the number of bad results (the limit. It varies together with ns)

To avoid a “final undecided” in the standard test, a second limit must be introduced and the single decision co-ordinate (ne,ns) needs a high ne, leading to a fixed (high) test time. In the early decision concept, having the same selectivity and the same confidence level an “undecided” does not need to be avoided, as it can be decided later. A perfect DUT will hit the decision coordinate (ne,ns) with ne=0. This test time is short.

A7.2.3.5 Standard test versus early decision concept

For Supplier Risk:
The wrong decision probability D in the standard test is the probability, to decide a DUT in-correctly in the single decision point. In the early decision concept there is a probability of in-correct decisions d at each point of the limit-curve. The sum of all those wrong decision probabilities accumulate to D. Hence d<D

For Customer Risk:
The correct decision probability CL in the standard test is the probability, to decide a DUT correctly in the single decision point. In the early decision concept there is a probability of correct decisions cl at each point of the limit-curve. The sum of all those correct decision probabilities accumulate to CL. Hence cl<CL or d>D

A7.2.3.6 Selectivity

There is no statistical test which can discriminate between a limit-DUT-quality and a DUT-quality which is an (ε0) apart from the limit in finite time and confidence level CL>1/2. Either the test discriminates against one limit with the results pass (with CL)/undecided or fail (with CL)/undecided, or the test ends in a result pass (with CL)/fail (with CL) but this requires a second limit.

For CL>0.5, a (measurement-result = specified-DUT-quality), generates undecided in test “supplier risk against pass limit” (a in clause A7.2.3.2) and also in the equivalent test against the fail limit (aa in clause A7.2.3.2)

For CL>0.5, a DUT, known to be on the limit, will be decided pass for the test “customer risk against pass limit” (b in clause A7.2.3.2) and also in the equivalent test against fail limit (bb in clause A7.2.3.2).

This overlap or undecided area is not a fault or a contradiction, however it can be avoided by introducing a Bad or a Good DUT quality according to:

– Bad DUT quality: specified DUT-quality * M (M>1)

– Good DUT quality: specified DUT-quality * m (m<1)

Using e.g M>1 and CL=95% the test for different DUT qualities yield different pass probabilities:

Figure A7.2.3.6: Pass probability versus DUT quality

A7.2.3.7 Design of the test

The test is defined according to the following design principles:

1. The early decision concept is applied.

2. A second limit is introduced: Bad DUT factor M>1

3. To decide the test pass:

Supplier risk is applied based on the Bad DUT quality

To decide the test fail

Cusomer Risk is applied based on the specified DUT quality

The test is defined using the following parameters:

1. Specified DUT quality: ER = 0.05

2. Bad DUT quality: M=1.5 (selectivity)

3. Confidence level CL = 95% (for specified DUT and Bad DUT-quality)

This has the following consequences:

a) A measurement on the fail limit is connected with 2 equivalent statements:

A measurement on the fail-limit shows, that the DUT is worse than the specified DUT-quality

A DUT, known to have the specified quality, shall be measured and decided pass

A measurement on the pass limit is connected with the complementary statements:

A measurement on the pass limit shows, that the DUT is better than the Bad DUT-quality.

A DUT, known to have the Bad DUT quality, shall be measured and decided fail

The left column is used to decide the measurement.

The right column is used to verify the design of the test by simulation.

The simulation is based on the two fulcrums A and B only in Figure A7.2.3.6. There is freedom to shape the remainder of the function.

b) Test time

1. The minimum and maximum test time is fixed.

2. The average test time is a function of the DUT’s quality.

3. The individual test time is not predictable (except ideal DUT).

c) The number of decision co-ordinates (ne,ns) in the early decision concept is responsible for the selectivity of the test and the maximum test time. Having fixed the number of decision co-ordinates there is still freedom to select the individual decision co-ordinates in many combinations, all leading to the same confidence level.

A7.2.4 Pass fail decision

A7.2.4.1 Numerical definition of the pass fail limits

ne	nsp	nsf	ne	nsp	nsf	ne	nsp	nsf	ne	nsp	nsf
0	77	NA	43	855	576	86	1525	1297	129	2173	2050
1	106	NA	44	871	592	87	1540	1314	130	2188	2067
2	131	NA	45	887	608	88	1556	1331	131	2203	2085
3	154	NA	46	903	625	89	1571	1349	132	2218	2103
4	176	NA	47	919	641	90	1586	1366	133	2233	2121
5	197	NA	48	935	657	91	1601	1383	134	2248	2139
6	218	42	49	951	674	92	1617	1401	135	2263	2156
7	238	52	50	967	690	93	1632	1418	136	2277	2174
8	257	64	51	982	706	94	1647	1435	137	2292	2192
9	277	75	52	998	723	95	1662	1453	138	2307	2210
10	295	87	53	1014	739	96	1677	1470	139	2322	2227
11	314	100	54	1030	756	97	1692	1487	140	2337	2245
12	333	112	55	1046	772	98	1708	1505	141	2352	2263
13	351	125	56	1061	789	99	1723	1522	142	2367	2281
14	369	139	57	1077	805	100	1738	1540	143	2381	2299
15	387	152	58	1093	822	101	1753	1557	144	2396	2317
16	405	166	59	1108	839	102	1768	1574	145	2411	2335
17	422	180	60	1124	855	103	1783	1592	146	2426	2352
18	440	194	61	1140	872	104	1798	1609	147	2441	2370
19	457	208	62	1155	889	105	1813	1627	148	2456	2388
20	474	222	63	1171	906	106	1828	1644	149	2470	2406
21	492	237	64	1186	922	107	1844	1662	150	2485	2424
22	509	251	65	1202	939	108	1859	1679	151	2500	2442
23	526	266	66	1217	956	109	1874	1697	152	2515	2460
24	543	281	67	1233	973	110	1889	1714	153	2530	2478
25	560	295	68	1248	990	111	1904	1732	154	2544	2496
26	577	310	69	1264	1007	112	1919	1750	155	2559	2513
27	593	325	70	1279	1024	113	1934	1767	156	2574	2531
28	610	341	71	1295	1040	114	1949	1785	157	2589	2549
29	627	356	72	1310	1057	115	1964	1802	158	2603	2567
30	643	371	73	1326	1074	116	1979	1820	159	2618	2585
31	660	387	74	1341	1091	117	1994	1838	160	2633	2603
32	676	402	75	1357	1108	118	2009	1855	161	2648	2621
33	693	418	76	1372	1126	119	2024	1873	162	2662	2639
34	709	433	77	1387	1143	120	2039	1890	163	2677	2657
35	725	449	78	1403	1160	121	2054	1908	164	2692	2675
36	742	465	79	1418	1177	122	2069	1926	165	2707	2693
37	758	480	80	1433	1194	123	2084	1943	166	2721	2711
38	774	496	81	1449	1211	124	2099	1961	167	2736	2729
39	790	512	82	1464	1228	125	2114	1979	168	2751	2747
40	807	528	83	1479	1245	126	2128	1997	169	2765	NA
41	823	544	84	1495	1263	127	2143	2014
42	839	560	85	1510	1280	128	2158	2032

NOTE: The first column is the number of bad results (ne)
The second column is the number of results for the pass limit (ns_p)
The third column is the number of results for the fail limit (ns_f)

A7.2.4.2 Pass fail decision rules

Having observed 0 bad results, pass the test at ≥77 results, otherwise continue

Having observed 1 bad result, pass the test at ≥106 results, otherwise continue

Having observed 2 bad results, pass the test at ≥131 results, otherwise continue

etc. until

Having observed 6 bad results, pass the test at ≥218 results, fail the test at ≤ 42 results, otherwise continue

Having observed 7 bad results, pass the test at ≥238 results, fail the test at ≤ 52 results, otherwise continue

etc. until

Having observed 168 bad results, pass the test at ≥2751 results, fail the test at ≤2747 results, otherwise continue

Having observed 169 bad results, pass the test at ≥2765 results, otherwise fail

NOTE: an ideal DUT passes after 77 results. The maximum test time is 2765 results.

A7.2.4.3 Background information to the pass fail limits

There is freedom to design the decision co-ordinates (ne,ns).

The binomial distribution and its inverse is used to design the pass and fail limits. Note that this method is not unique and that other methods exist.

Where

fail(..) is the error ratio for the fail limit

pass(..) is the error ratio for the pass limit

ER is the specified error ratio 0.05

ne is the number of bad results. This is the variable in both equations

M is the Bad DUT factor M=1.5

d_f is the wrong decision probability of a single (ne,ns) co-ordinate for the fail limit.
It is found by simulation to be d_f = 0.004

cl_p is the confidence level of a single (ne,ns) co-ordinate for the pass limit.
It is found by simulation to be cl_p = 0.9975

qnbinom(..): The inverse cumulative function of the negative binomial distribution

The simulation works as follows:

A large population of limit DUTs with true ER = 0.05 is decided against the pass and fail limits.

cl_pand d_f are tuned such that CL (95%) of the population passes and D (5%) of the population fails.

A population of Bad DUTs with true ER = M*0.05 is decided against the same pass and fail limits.

cl_pand d_f are tuned such that CL (95%) of the population fails and D (5%) of the population passes.

This procedure and the relationship to the measurement is justified in clause A7.2.3.7. The number of DUTs decrease during the simulation, as the decided DUTs leave the population. That number decreases with an approximately exponential characteristics. After 169 bad results all DUTs of the population are decided.

NOTE: The exponential decrease of the population is an optimal design goal for the decision co-ordinates (ne,ns), which can be achieved with other formulas or methods as well.

Annex 8:
Void

Annex 9 (normative):
GAN certificate