Allocation of Sample Sizes in bi-objective Stratified
Transcription
Allocation of Sample Sizes in bi-objective Stratified
Allocation of Sample Sizes in bi-objective Stratified Sampling Using Lexicographic Goal Programming 6.1 Introduction The allocation in stratified sampling is a procedure of assigning some portion of the total samples into each stratum according to the prescribed criterion. The usual criterion is to minimize the variance of the estimator of the population parameter under the constraints for the total survey cost. Neyman (1934) suggested the concept of optimum allocation, Mahalanobis (1944) introduced the cost function, and Stuart (1954) proved the optimum allocation using Cauchy inequality. It is well known that the Neyman allocation yields an estimate of the population parameter with maximum precision when all the strata have the same value for the survey cost. In practice, however, there are certain limitations to the use of the Neyman allocation. In multivariate stratified sampling, the Neyman allocation on the basis of any one variable may lead to loss of precision on the other variables. The problem of allocation in multivariate stratified sampling has been studied by Melaku and Sadasivan (1987), Kreienbrock (1993), and Khan et al. (1997). Neverthless, if the variables are correlated and certain variable is more important than the others it can be said that the efficiency of the estimator using the Neyman allocation may be secured generally. On the contrary, the absence of the strata standard deviations causes the situation that the efficiency of the estimator from the Neyman allocation is severely reduced. The variance of an estimator based on Neman allocation using the estimated strata standard deviations has been calculated by Evans (1951). Lower bound of the size of the preliminary samples to estimate the strata standard deviations for the Neyman allocation was derived by Sukhatme and Sukhatme (1970). Alternative 151 methods using other measures, such as the range of strata and the strata total of survey variable, instead of the standard deviation, have been considered by Bankier (1988). In real world problem situation, several conflicting objectives arises in multi-objective problems such as cost function, total sample size etc. So goal programming problem was introduced by Charns and Cooper (1961) to deal with the problem of achieving a set of conflicting goals. The objective function searches to minimize deviations from the set of pre-assigned goals. Narula and Wellington (2002) introduce some of the single sample statistical problems; those can be formulated and solved as multiple optimization problems. Decision makers sometimes set such goals, even when they are attainable within the available resources. Such problems are tackled with the help of the techniques of nonlinear goal programming, where the objective function is stated in such a way that the all goals must be attained. In this chapter, we have discussed the problem of bi-objective stratified sampling formulated as non-linear goal programming problem (NLGP) and discussed how the proper priority structure of non-linear goal programming (NLGP) model can be selected for obtaining the allocation of sample size in stratified random sampling where the two objectives are to be attained. In the solution process priorities are given to each objective one after other and a set of solution is obtained. From the solutions, the ideal solution is identified. The -distances of different solutions from the ideal solution are calculated. The solution corresponding to the minimum -distance gives the best compromise solution. 6.2 Formulation of the problem Many authors have considered the general problem of optimal design in stratified and multistage sampling (see, for example, Hartley, 1965; Folks and Antle, 1965; Kokan 152 and Khan, 1967; Chatterjee, 1968, 1972; Bethal, 1985, 1989; and Megerson, Clark and Fenley, 1986). In stratified sampling the total population is first partitioned into several sub-populations (called strata). Population characteristics can be inferred with samples from each stratum, exploiting the gain in precision in the estimates, administrative convenience, and the flexibility of using different sampling procedures in the different sub-populations. Let be the number of units in the number of strata into which the drawn from the stratum and units are divided. Let , where is the be the size of the sample stratum. Assume that the samples are drawn independently from different strata. The problem of optimally choosing the is known as the optimal allocation problem. The objectives in this problem are to minimize the variance of the estimate of the population characteristic under study and minimize the total cost of sampling. Let denote an unbiased estimator of the population mean, where characteristic under study. Let be an unbiased estimate of the stratum mean is the ; i.e., then is an unbiased estimate of the population mean . The variance of is given by (6.2.1) where Let and , be the cost of measuring one unit in the stratum. Then the cost function is considered as (6.2.2) where is the overhead cost and is the total cost for the survey. 153 (6.2.3) where and is the prefixed variance of the estimator of the population mean. So, the approach is to minimize cost as well as sample size for fixed variance. 6.3 Solution procedure In conventional priority based non linear goal programming, the solution under the decision makers imposed priority structure is considered as the optimal. But in different complex decision making situation, desired solution may not be acceptable underweight structure. Thus a better solution is always expected for which alternative priority under the given weight structure may be considered. To select the proper priority structure the following procedure is performed. If in the formulation, priorities for objective functions are considered. So involvement of priorities indicates that (Factorial) different solutions can be obtained from problems arises for number of different priority structures. Let , solution obtained by giving priorities to and be the number of objective functions. If priority is given to cost function then we have to solve the following lexicographic programming problem: (6.3.1) 154 and solution of lexicographic programming problem (6.3.1) is given by . Similarly if priority is given to total sample then, we have to solve the following lexicographic programming problem: (6.3.2) and solution of lexicographic programming problem (6.3.2) is given by . Then the ideal solution is defined as: (say). But in practice ideal solution can never be achieved. The solution, which is closest to the ideal solution, is accepted as the best compromise solution, and the corresponding priority structure is identified as most appropriate priority structure in the planning context. To obtain the best compromise solution, following goal programming problem is to be solved: (6.3.3) where are the under and over deviational variable respectively. Now, (6.3.4) 155 Eqn. (6.3.4) is defined as the the solution -distance from the ideal solution , to . Therefore, (6.3.5) Hence, is the best compromise solution. 6.4 Lexicographic goal programming in allocation problem Preemptive goal programming is a mathematical programming method developed to solve problems with conflicting linear or non linear objectives and linear and non linear constraints. The user is able to provide levels, or targets, of achievement for each objective and priorities the order in which goals are to be achieved. Since the different objectives have their own importance in the problem. The use of lexicographic goal programming problem is considered where the goals are arranged in the lexicographic order. In this chapter there are only two objective functions (cost, sample size). If first priority is given to cost function, the lexicographic goal programming problem is defined as (6.4.1) 156 Now, we solve the problem (6.4.1) using lexicographic goal programming and find the allocation of sample sizes as and Similarly, if first priority is given to total sample size function, thus the lexicographic programming problem is defined as: (6.4.2) Now, we solve the problem (6.4.2) using lexicographic goal programming and find the allocation of sample sizes as 6.5 and Numerical illustration Consider the following data of Jessen (1942) for illustrating the proposed method to find out the optimum values of allocations. For this purpose, the data of one characteristic are used which are tabulated in the following Table 1. Table 1 157 Solution of the problem when priority given to cost function The lexicographic goal programming problem (6.4.1) for the data of Table 1 is written as: (6.5.1) Now, we solve the non linear integer lexicographic goal programming problem (6.5.1) by LINGO Software, we obtained the following solution: with and Solution of the problem when priority given to sample size The lexicographic goal programming problem (6.4.2) for the data of Table 1 is written as: (6.5.2) 158 Now, we solve the non linear integer lexicographic goal programming problem (6.5.2) by LINGO Software, we obtained the following solution with and Table-2 In table- the -distances of all possible solutions from the ideal solution are calculated. From table- it is found that the minimum of the solutions from the ideal solution is -distances of possible . Table- which corresponds to the priority sample size. Therefore the best compromise solution of the problem is with . 159 6.6 Conclusion This paper provided a profound study of allocation of sample size in stratified sampling using -distances. The solution corresponding to the minimum distance is the best compromise solution. 160