Archives

  • 2018-07
  • 2018-10
  • 2018-11
  • 2019-04
  • 2019-05
  • 2019-06
  • 2019-07
  • 2019-08
  • 2019-09
  • 2019-10
  • 2019-11
  • 2019-12
  • 2020-01
  • 2020-02
  • 2020-03
  • 2020-04
  • 2020-05
  • 2020-06
  • 2020-07
  • 2020-08
  • 2020-09
  • 2020-10
  • 2020-11
  • 2020-12
  • 2021-01
  • 2021-02
  • 2021-03
  • 2021-04
  • 2021-05
  • 2021-06
  • 2021-07
  • 2021-08
  • 2021-09
  • 2021-10
  • 2021-11
  • 2021-12
  • 2022-01
  • 2022-02
  • 2022-03
  • 2022-04
  • 2022-05
  • 2022-06
  • 2022-07
  • 2022-08
  • 2022-09
  • 2022-10
  • 2022-11
  • 2022-12
  • 2023-01
  • 2023-02
  • 2023-03
  • 2023-04
  • 2023-05
  • 2023-06
  • 2023-08
  • 2023-09
  • 2023-10
  • 2023-11
  • 2023-12
  • 2024-01
  • 2024-02
  • 2024-03
  • 2024-04
  • br It is assumed that each sample j is generated

    2018-10-24


    It is assumed that each sample , j=1, 2, …, is generated by a source with an explicitly unknown conditional distribution density (8). Let us represent the quantity dF(w) in the following form: where N(w) is the information capacity of the source w; is the total information capacity of all sources; g(w) is the weight taking into account the rest of the factors (weight selection is described below). If we take into account that the relative information capacity of a source can be estimated as , then we can obtain the following estimate for the quantity (11): If we take the unit weights g(w)=1, then the Formula (13) is reduced to the average value of the squared regression deviation over buy Caspase-3/7 Inhibitor the whole aggregate sample , and we obtain the buy Caspase-3/7 Inhibitor coinciding with Eq. (12). The described scheme involving dividing the aggregate sample into groups is typical for the so-called variance analysis developed by Fisher [13] to take into account the influence of the systematic error of the group and the random error within the group. This scheme is convenient for describing the situation with heterogeneous information sources based on both the theoretical and the experimental methods for obtaining cross-sections. Within this scheme, it is also convenient to formulate and test statistical hypotheses involving clustering. Testing the condition under which the systematic error of a partial source exceeds certain bounds is of practical interest, in particular, for the case of electron-atom excitation we are examining. Let us assume that the parameters for approximating the cross-section have been found from minimizing the expectation of the squared deviation (13). Let us formulate for the chosen information source the following hypothesis about the magnitude of the conditional expectation (10) of the squared regression deviation:
    It is known [11–13] that for testing this hypothesis, the statistic can be used, where is the sample mean of a random variable of the squared regression deviation δ=δ() of the information source ; is the respective squared sample variance. The Neyman–Pearson model bases the criterion for rejecting the hypothesis (14) at the regression level α on the critical region for the statistic (15). If this statistic falls into the critical region, then the hypothesis is rejected. In this case, the probability of rejecting a true hypothesis (type I error) does not exceed the quantity where φ(y) is the distribution of the statistic (15). Let us turn our attention to the distribution of the statistic (15). In principle, it can be obtained from the distributions of the conditional deviations that, as stated above, are unknown to us. The problem can be significantly simplified if we examine the case when the sample number is large enough. Then, according to the central limit theorem, the distribution of the numerator of the Expression (15) can be approximated by a normal distribution independent of the Distribution (17). The squared sample variance in the denominator of the Expression (15) tends to the variance. Thus, the Distribution (15) can be approximated by a standard normal distribution with the density
    Let us note that in case of a normal distribution of the squared deviation quantities , the Statistic (15) has a Student\'s t-distribution with m= 1 degrees of freedom which at large values of m tends to a normal distribution. However, the same as for logarithmic deviations, the assumption that the distribution of the squared deviation quantities is normal may turn out to be unjustified. In our studies, this choice is made on the following principle. The data are taken from peer-reviewed journals with a unit weight =1. The other sources are taken with a zero weight, =0, i.e., they are not taken into account. Such a model is justified because the respective results have passed an expert test. Attempting to make a more clear-cut distinction between information sources inevitably introduces more subjectivity, and, due to the above-described considerations, applying statistical methods does not eliminate this subjectivity.