The following statistics and distributions are included in this option:
These statistics are discussed in this section. The selection procedure of data is dealt with in next section.
For data vector Xi, (i = 1, N), the basic statistics are determined as follows:
As can be seen, the relative cumulative frequency is plotted with the Chegodayev function
For the mean and the variance also the 95% confidence intervals will be computed. The Student-t distribution is to be applied and the percentage points tn,α /2 and tn,1-α /2 are computed, where n = N - 1 is the number of degrees of freedom. The confidence limits for the mean then read:
Given an estimate of the sample variance the true variance σY2 will be contained within the following confidence interval with a probability of 100(1-α) %:
The values for χ2n,α/2 and χ2n,1-α/2 are read from the tables of the Chi-square distribution for given aand n.
The 95% confidence limits for the median, 25% quantile and 75% quantile are computed with:
Data for statistical analysis is be read from the hymosdatabase.
Series can be selected by clicking the series in the 'series codes' list box. Only one series may be selected at a time.
The following type of data can be considered:
In case of actual values a threshold selection menu is displayed from which no threshold, minimum, maximum, both, peaks over threshold or peaks under threshold can be selected.
The annual minimum/maximum values may be the minimum/maximum of full years or of a part of years, like seasonally or monthly extremes.
The computation period can be set to 'Full years' or 'Part of years'. When 'Full years' is selected all data values of the complete year will be taken into the analysis, make sure the start date and end date of the processing period are form the first of January to the first of January. When 'Part of years' is selected a start date and end date for the sub-period must be entered. When data for only the month of March must be selected, the start date of the sub-period must be set to "01-03" and the end date of the sub-period to "01-04".
The number of classes and the lower and upper class limits for the cumulative frequency distribution and histogram can be entered. When no values are entered for the lower and upper class, HYMOS will compute the lower and upper class levels from the data.
Note that the time series that are investigated should not contain missing values!
The basic statistics and fitting distribution functions of HYMOS can use the POT method for selecting data. The Basic Statistics method also has an option of selecting data by a threshold. How these data selection options work will be explained here.
In case of actual values a threshold selection menu is displayed from which no threshold, minimum, maximum, both, Peaks over Threshold or Peaks under Threshold can be selected. If peaks over threshold is selected also a value for the 'Horizon' can be entered (a period in time interval units of the original series, e.g. days, which is used to skip lower peaks within that period before or after a peak). Default value = 1 (no horizon). A maximum of 1000 values/peaks of a maximum of 50000 input values will enter analysis.
The selection of data for threshold method is very straightforward:
For use of the Peaks Over Threshold (POT) and Peaks Under Threshold (PUT)-methods actual values have to be selected. For these methods a threshold value and a horizon are entered. For the POT option all values below the threshold will be excluded from computation, for the PUT option all values above the threshold will be excluded. The data used for further computation are all peaks between successive up-crossings and
Down-crossings taking into account the given horizon. The default value of the horizon is 1 (no horizon).
In the picture you see four peaks on time steps t1 to t4, a given horizon and a threshold. First the highest peaks between an up-crossing and down-crossing are computed. Because peak at t4 is higher than t3 within the same crossing period, peak t3 is not seen as a real peak. When the horizon is set to 1 the POT method will return three peaks for the selected period, namely t1, t2 and t4. When a value for the horizon is entered larger than 1, HYMOS will skip all the lower peaks within the horizon period. In this example, peak t2 will be skipped and the POT method will return peak t1 and t4.