Probability, Dependency, and Frequency Are Not All Equally Involved in Statistical Learning
Abstract
Abstract. The ability to learn sequences depends on different factors governing sequence structure, such as transitional probability (TP, probability of a stimulus given a previous stimulus), adjacent or nonadjacent dependency, and frequency. Current evidence indicates that adjacent and nonadjacent pairs are not equally learnable; the same applies to second-order and first-order TPs and to the frequency of the sequences. However, the relative importance of these factors and interactive effects on learning remain poorly understood. The first experiment tested the effects of TPs and dependency separately on the learning of nonlinguistic visual sequences, and the second experiment used the factors of the first experiment and added a frequency factor to test their interactive effects with verbal sequences of stimuli (pseudo-words). The results of both experiments showed higher performance during online learning for first-order TPs in adjacent pairs. Moreover, Experiment 2 indicated poorer performance during offline recall for nonadjacent dependencies and low-frequency sequences. We discuss the results that different factors are not used equally in prediction and memorization.
References
1987). Factor analysis and AIC. In E. Parzen, K. Tanabe, & G. Kitagawa (Eds.), Selected papers of Hirotugu Akaike (pp. 371–386). Springer.
(2019). Linguistic processes do not beat visuo-motor constraints, but they modulate where the eyes move regardless of word boundaries: Evidence against top-down word-based eye-movement control during reading. PLoS One, 14(7), e0219666. 10.1371/journal.pone.0219666
(2017). Statistical learning: A powerful mechanism that operates by mere exposure. Wiley Interdisciplinary Reviews: Cognitive Science, 8(1-2), e1373. 10.1002/wcs.1373
(1998). Computation of conditional probability statistics by 8-month-old infants. Psychological Science, 9(4), 321–324. 10.1111/1467-9280.00063
(2008). Intact implicit learning of spatial context and temporal sequences in childhood autism spectrum disorder. Neuropsychology, 22(5), 563–570. 10.1037/0894-4105.22.5.563
(2009). Semantic priming in a cortical network model. Journal of Cognitive Neuroscience, 21(12), 2300–2319. 10.1162/jocn.2008.21156
(2005). Modality-constrained statistical learning of tactile, visual, and auditory sequences. Journal of Experimental Psychology: Learning, Memory, and Cognition, 31(1), 24–39. 10.1037/0278-7393.31.1.24
(2013). What is learning? On the nature and merits of a functional definition of learning. Psychonomic bulletin & Review, 20(4), 631–642. 10.3758/s13423-013-0386-3
(2001). Can sequence learning be implicit? New evidence with the process dissociation procedure. Psychonomic Bulletin & Review, 8(2), 343–350. 10.3758/bf03196171
(2017). Transitional probabilities count more than frequency, but might not be used for memorization. Cognitive Psychology, 92, 37–64. 10.1016/j.cogpsych.2016.11.004
(2002). Statistical learning of higher-order temporal structure from visual shape sequences. Journal of Experimental Psychology: Learning, Memory, and Cognition, 28(3), 458–467. 10.1037//0278-7393.28.3.458
(1997). Transfer and complexity in artificial grammar learning. Cognitive Psychology, 33(2), 154–207. 10.1006/cogp.1997.0654
(2002). Variability and detection of invariant structure. Psychological Science, 13(5), 431–436. 10.1111/1467-9280.00476
(2012). The best time to acquire new skills: Age-related differences in implicit sequence learning across the human lifespan. Developmental science, 15(4), 496–505. 10.1111/j.1467-7687.2012.01150.x
(2002). Visual statistical learning in infancy: Evidence for a domain general learning mechanism. Cognition, 83(2), B35–B42. 10.1016/s0010-0277(02)00004-5
(2014). Inter-synaptic learning of combination rules in a cortical network model. Frontiers in Psychology, 5, Article 842. 10.3389/fpsyg.2014.00842
(2011). Determinants of multiple semantic priming: A meta-analysis and spike frequency adaptive model of a cortical network. Journal of Cognitive Neuroscience, 23(6), 1447–1474. 10.1162/jocn.2010.21504
(2016). Semantic integration by pattern priming: Experiment and cortical network model. Cognitive Neurodynamics, 10(6), 513–533. 10.1007/s11571-016-9410-4
(2000). The influence of semantic context on initial eye landing sites in words. Acta Psychologica, 104(2), 191–214. 10.1016/s0001-6918(00)00020-2
(2022). Data for "Probability, Dependency and Frequency are not all Equally Involved in Statistical Learning”. https://osf.io/hjy9k/
(2021). Statistical learning of unbalanced exclusive-or temporal sequences in humans. PLoS One, 16(2), e0246826. 10.1371/journal.pone.0246826
(2010). Computational modeling of statistical learning: Effects of transitional probability versus frequency and links to word learning. Infancy, 15(5), 471–486. 10.1111/j.1532-7078.2009.00023.x
(2011). Interference between sentence processing and probabilistic implicit sequence learning. PLoS One, 6(3), e17577. 10.1371/journal.pone.0017577
(2004). Learning at a distance I. Statistical learning of non-adjacent dependencies. Cognitive Psychology, 48(2), 127–162. 10.1016/s0010-0285(03)00128-2
(2015). Is an attention-based associative account of adjacent and nonadjacent dependency learning valid? Acta Psychologica, 157, 195–199. 10.1016/j.actpsy.2015.03.002
(2007). PsychoPy—Psychophysics software in Python. Journal of Neuroscience methods, 162(1-2), 8–13. 10.1016/j.jneumeth.2006.11.017
(2009a). Learning in reverse: Eight-month-old infants track backward transitional probabilities. Cognition, 113(2), 244–247. 10.1016/j.cognition.2009.07.011
(2009b). Statistical learning in a natural language by 8-month-old infants. Child Development, 80(3), 674–685. 10.1111/j.1467-8624.2009.01290.x
(2006). Implicit learning and statistical learning: One phenomenon, two approaches. Trends in Cognitive Sciences, 10(5), 233–238. 10.1016/j.tics.2006.03.006
(2012). Beyond transitional probability computations: Extracting word-like units when only statistical information is available. Journal of Memory and Language, 66(4), 807–818
(2005). ERP effects of the processing of syntactic long-distance dependencies. Brain Research Cognitive Brain Research, 22(3), 407–428. 10.1016/j.cogbrainres.2004.09.012
(2013). R: A language and environment for statistical computing [Computer software manual]. R Core Team. https://www.R-project.org
. (2022). Learning higher-order transitional probabilities in nonhuman primates. Cognitive Science, 46(4), e13121. 10.1111/cogs.13121
(2019). Regularity extraction across species: Associative learning mechanisms shared by human and non‐human primates. Topics in Cognitive Science, 11(3), 573–586. 10.1111/tops.12343
(2010). Statistical learning and language acquisition. Wiley Interdisciplinary Reviews: Cognitive Science, 1(6), 906–914. 10.1002/wcs.78
(1996). Statistical learning by 8-month-old infants. Science, 274(5294), 1926–1928. 10.1126/science.274.5294.1926
(1996). Word segmentation: The role of distributional cues. Journal of Memory and Language, 35(4), 606–621. 10.1006/jmla.1996.0032
(2003). Pattern induction by infant language learners. Developmental Psychology, 39(3), 484–494. 10.1037/0012-1649.39.3.484
(2011). The learning of exclusive-or categories by monkeys (Macaca mulatta) and humans (Homo sapiens). Journal of Experimental Psychology: Animal Behavior Processes, 37(1), 20–29. 10.1037/a0019497
(2018). The developing infant creates a curriculum for statistical learning. Trends in Cognitive Sciences, 22(4), 325–336. 10.1016/j.tics.2018.02.004
(2013). The extraction and integration framework: A two-process account of statistical learning. Psychological Bulletin, 139(4), 792–814. 10.1037/a0030801
(2007). Statistical learning of syntax: The role of transitional probability. Language Learning and Development, 3(1), 1–42. 10.1207/s15473341lld0301_1
(2021). Regularity detection under stress: Faster extraction of probability-based regularities. PLoS ONE, 16(6), e0253123. 10.1371/journal.pone.0253123
(2020). Non-adjacent dependency learning in humans and other animals. Topics in Cognitive Science, 12(3), 843–858. 10.1111/tops.12381
(