Abstract

This paper proposes a dynamic Markov model for estimating binary state-to-state transition probabilities from a sequence of independent cross-sectional samples. It discusses parameter estimation and inference using maximum likelihood (ML) methodology. The model is illustrated with data from a three-wave panel study on pupils’ interest in learning physics. These data contain more information than is used to estimate the model, and this surplus information allows us to assess the accuracy and precision of the transition estimates. Bootstrap and Bayesian simulations are used to evaluate the accuracy and precision of the ML estimates. To mimic genuine cross-sectional data, samples of independent observations randomly drawn from the panel are also analyzed.
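The idea of recovering transitions from cross-sectional margins can be illustrated with a deliberately simplified sketch. It is not the paper's model: it assumes constant transition probabilities without covariates, fixes the first-wave margin at its sample proportion, and replaces a proper ML optimizer with a coarse grid search. The function names (`marginal_path`, `log_likelihood`, `grid_ml`) and the example counts are hypothetical.

```python
import math


def marginal_path(p1, mu, lam, waves):
    """Marginal P(Y_t = 1) for t = 1..waves in a two-state Markov chain.

    mu  = P(Y_t = 1 | Y_{t-1} = 0), the entry probability (assumed constant)
    lam = P(Y_t = 0 | Y_{t-1} = 1), the exit probability (assumed constant)
    """
    path = [p1]
    for _ in range(waves - 1):
        prev = path[-1]
        # Stayers in state 1 plus entrants from state 0.
        path.append(prev * (1.0 - lam) + (1.0 - prev) * mu)
    return path


def log_likelihood(counts, sizes, mu, lam):
    """Binomial log-likelihood of waves 2..T (constant term dropped)."""
    # Assumption: the wave-1 margin is fixed at its sample proportion.
    p1 = counts[0] / sizes[0]
    probs = marginal_path(p1, mu, lam, len(counts))
    total = 0.0
    for y, n, p in zip(counts[1:], sizes[1:], probs[1:]):
        total += y * math.log(p) + (n - y) * math.log(1.0 - p)
    return total


def grid_ml(counts, sizes, steps=99):
    """Grid search over (mu, lam) on an interior grid of (0, 1)."""
    grid = [i / (steps + 1) for i in range(1, steps + 1)]
    best = max(
        (log_likelihood(counts, sizes, m, l), m, l)
        for m in grid
        for l in grid
    )
    return best[1], best[2]


# Hypothetical counts of "1" responses in three independent samples of 1000.
mu_hat, lam_hat = grid_ml([500, 550, 585], [1000, 1000, 1000])
print(mu_hat, lam_hat)
```

With only three waves and two free parameters this toy version is exactly identified, which is why the margins alone pin down the transitions here; it says nothing about the precision of such estimates, which is what the bootstrap and Bayesian simulations in the paper address.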

References

Abowd, J. M., Zellner, A. (1985). Estimating gross labor-force flows. Journal of Business and Economic Statistics, 3, 254–283.
Achen, C. H., Shively, W. P. (1995). Cross-level inference. Chicago, IL: University of Chicago Press.
Agresti, A. (1992). A survey of exact inference for contingency tables. Statistical Science, 7, 131–153.
Barnard, G. A. (1947). Significance tests for 2 × 2 tables. Biometrika, 34, 123–138.
Bishop, Y. M. M., Fienberg, S. E., Holland, P. W. (1975). Discrete multivariate analysis: Theory and practice. Cambridge, MA: MIT Press.
Böckenholt, U., Dillon, W. R. (2000). Inferring latent brand dependencies. Journal of Marketing Research, 37, 72–87.
Chambers, R. L., Steel, D. G. (2001). Simple methods for ecological inference in 2 × 2 tables. Journal of the Royal Statistical Society, Series A, 164, 175–192.
Cross, P. J., Manski, C. F. (2002). Regressions, short and long. Econometrica, 70, 357–368.
Davison, A. C., Hinkley, D. V. (1997). Bootstrap methods and their application. Cambridge, MA: Cambridge University Press.
Dobra, A., Tebaldi, C., West, M. (2003). Bayesian inference in incomplete multi-way tables. Durham, NC: Institute of Statistics and Decision Sciences, Duke University.
Efron, B., Tibshirani, R. J. (1993). An introduction to the bootstrap. New York: Chapman & Hall.
Eisinga, R. (2008). Information loss for 2 × 2 tables with missing cell counts: Binomial case. Statistica Neerlandica, 62, 239–254.
Fienberg, S. (1997). Confidentiality and disclosure limitation methodology. Challenges for national statistics and statistical research. Pittsburgh, PA: Department of Statistics, Carnegie Mellon University.
Fingleton, B. (1997). Specification and testing of Markov chain models: An application to convergence in the European Union. Oxford Bulletin of Economics and Statistics, 59, 385–403.
Fisher, R. A. (1935). The logic of inductive inference (with discussion). Journal of the Royal Statistical Society, 98, 39–82.
Golan, A., Judge, G., Miller, D. (1996). Maximum entropy econometrics. Robust estimation with limited data. New York: Wiley.
Golan, A., Judge, G., Robinson, S. (1994). Recovering information from incomplete or partial multisectoral economic data. Review of Economics and Statistics, 76, 541–549.
Haber, M. (1989). Do the marginal totals of a contingency table contain information regarding the table proportions? Communications in Statistics – Theory and Methods, 18, 147–156.
Hamdan, M. A., Nasro, M. O. (1986). Maximum likelihood estimation of the parameters of the bivariate binomial distribution. Communications in Statistics – Theory and Methods, 15, 747–754.
Hawkins, D. L., Han, C. P., Eisenfeld, J. (1996). Estimating transition probabilities from aggregate samples augmented by haphazard recaptures. Biometrics, 52, 625–638.
Judge, G., Miller, D., Tam Cho, W. K. (2003). An information theoretic approach to ecological estimation and inference. In G. King, O. Rosen, M. Tanner (Eds.), Ecological inference. New methodological strategies (pp. 162–187). New York: Cambridge University Press.
Kalbfleisch, J. D., Lawless, J. F. (1984). Least squares estimation of transition probabilities from aggregate data. Canadian Journal of Statistics, 12, 169–182.
Kalbfleisch, J. D., Lawless, J. F. (1985). The analysis of panel data under a Markovian assumption. Journal of the American Statistical Association, 80, 863–871.
Karantininis, K. (2002). Information-based estimators for the non-stationary transition probability matrix: An application to the Danish pork industry. Journal of Econometrics, 107, 275–290.
Kelton, C. M. L. (1981). Estimation of time-independent Markov processes with aggregate data: A comparison of techniques. Econometrica, 49, 517–518.
Kelton, W. D., Kelton, C. M. L. (1984). Hypothesis tests for Markov process models estimated from aggregate frequency data. Journal of the American Statistical Association, 79, 922–928.
King, G. (1997). A solution to the ecological inference problem. Reconstructing individual behavior from aggregate data. Cambridge, MA: Cambridge University Press.
King, G., Rosen, O., Tanner, M. (2003). Ecological inference. New methodological strategies. New York: Cambridge University Press.
Kocherlakota, S., Kocherlakota, K. (1992). Bivariate discrete distributions. New York: Marcel Dekker.
Lawless, J. F., McLeish, D. (1984). The information in aggregate data from Markov chains. Biometrika, 71, 419–430.
Lee, T. C., Judge, G. G., Zellner, A. (1970). Estimating the parameters of the Markov probability model from aggregate time series data. Amsterdam: North-Holland.
Li, W. K., Kwok, M. C. O. (1990). Some results on the estimation of a higher order Markov chain. Communications in Statistics. Part B. Simulation and Computation, 19, 363–380.
MacRae, E. C. (1977). Estimation of time-varying Markov processes with aggregate data. Econometrica, 45, 183–198.
McCue, K. F. (1995). Individual choice and ecological analysis. Pasadena, CA: California Institute of Technology.
McCullagh, P., Nelder, J. A. (1992). Generalized linear models (2nd ed.). London: Chapman & Hall.
Moffitt, R. (1990). The effect of the U.S. welfare system on marital status. Journal of Public Economics, 41, 101–124.
Moffitt, R. (1993). Identification and estimation of dynamic models with a time series of repeated cross-sections. Journal of Econometrics, 59, 99–123.
Pelzer, B., Eisinga, R. (2002). Bayesian estimation of transition probabilities from repeated cross sections. Statistica Neerlandica, 56, 23–33.
Pelzer, B., Eisinga, R., Franses, P. H. (2001). Estimating transition probabilities from a time series of repeated cross sections. Statistica Neerlandica, 55, 248–261.
Pelzer, B., Eisinga, R., Franses, P. H. (2002). Inferring transition probabilities from repeated cross sections. Political Analysis, 10, 113–133.
Pelzer, B., Eisinga, R., Franses, P. H. (2003). Ecological panel inference from repeated cross sections. In G. King, O. Rosen, M. Tanner (Eds.), Ecological inference. New methodological strategies (pp. 188–205). New York: Cambridge University Press.
Plackett, R. L. (1977). The marginal totals of a 2 × 2 table. Biometrika, 64, 37–42.
Richardson, S., Montfort, C. (2000). Ecological correlation studies. In P. Elliott, J. C. Wakefield, N. G. Best, D. J. Briggs (Eds.), Spatial epidemiology. Methods and applications (pp. 205–220). Oxford: Oxford University Press.
Tebaldi, C., West, M. (1998). Reconstruction of contingency tables with missing data. Durham, NC: Institute of Statistics and Decision Sciences, Duke University.
Vermunt, J. K., Langeheine, R., Böckenholt, U. (1999). Discrete-time discrete-state latent Markov models with time-constant and time-varying covariates. Journal of Educational and Behavioral Statistics, 24, 179–207.
Wakefield, J. (2003). Prior and likelihood choices in the analysis of ecological inference. In G. King, O. Rosen, M. Tanner (Eds.), Ecological inference. New methodological strategies (pp. 13–50). New York: Cambridge University Press.
Woodward, J. A., Palmer, C. G. S. (1997). On the exact convolution of discrete random variables. Applied Mathematics and Computation, 83, 69–77.
Zepeda, L. (1995). Technical change and the structure of production. A nonstationary Markov analysis. European Review of Agricultural Economics, 22, 41–60.

Volume 4, Issue 4, January 2008

ISSN: 1614-1881, eISSN: 1614-2241

Acknowledgments:

This research was supported by a grant from the Netherlands Organisation for Scientific Research (NWO), Division for Social Sciences (# 480-04-009). The data on physics education used in this paper were collected by the Institute for Science Education in Kiel (Germany) and reprinted in part from Vermunt, Langeheine, and Böckenholt (1999). We thank Rolf Langeheine for permission to reproduce the data.

Recovering Transitions From Repeated Cross-Sectional Samples