Skip to main content
Open AccessResearch Article

When the Action to Be Performed at the Stage of Retrieval Enacts Memory of Action Verbs

Published Online:https://doi.org/10.1027/1618-3169/a000507

Abstract

Abstract. According to the embodied approach of language, concepts are grounded in sensorimotor mental states, and when we process language, the brain simulates some of the perceptions and actions that are involved when interacting with real objects. Moreover, several studies have highlighted that cognitive performances are dependent on the overlap between the motor action simulated and the motor action required by the task. On the other hand, in the field of memory, the role of action is under debate. The aim of this work was to show that performing an action at the stage of retrieval influences memory performance in a recognition task (experiment 1) and a cued recall task (experiment 2), even if the participants were never instructed to consider the implied action. The results highlighted an action-based memory effect at the retrieval stage. These findings contribute to the debate about the implication of motor system in action verb processing and its role for memory.

According to the embodied approach of cognition, knowledge relies on the sensorimotor and contextual dimensions of an experience. Moreover, the cognitive system is able to reenact sensorimotor states even in the absence of a stimulus through simulation (Barsalou, 2010).

Regarding embodied approaches of language, a growing number of researches have highlighted that concepts are grounded in sensorimotor mental states. In other words, when we read a word, the process of language comprehension recruits the same perception and action systems that are involved when interacting with real objects represented by the word. Moreover, a significant number of studies have shown a large overlap between the brain areas involved in language and action (for more details, read the following books: Borghi & Binkofski, 2014; Brouillet, 2019; De Vega et al., 2012; Fincher-Kiefer, 2019; Fischer & Coello, 2015, 2016; Pecher & Zwaan, 2005; Shapiro, 2014, 2019). Finally, several researches have shown a bidirectional link between action verb processing and the action performed: Processing an action verb affects action production (Andres et al., 2015; Bidet-Ildei et al., 2017; Boulenger et al., 2009; Glenberg & Kaschak, 2002; Springer & Prinz, 2010), and conversely performing an action affects action verb processing (Beauprez & Bidet-Ildei, 2017; Liepelt et al., 2012). Importantly, Aravena et al. (2010) highlighted the existence of this link at the neuronal level: Action-language comprehension and motor processes thus share neural resources and cooperate.

Although there is a growing body of research supporting the involvement of sensorimotor systems of the brain in the representation of conceptual knowledge, this is still a controversial issue (Dove, 2011; Meteyard et al., 2012; Michel, 2020; Miller et al., 2018; Wurm & Caramazza, 2019). The debate focuses mostly on the subordination between the sensorimotor system and the conceptual knowledge. If sensorimotor representations can contribute to the conceptual knowledge, it is argued that symbolic representations are necessary to apprehend the broad spectrum of this knowledge (Zwaan, 2016). To this end, Dove (2011, p. 1) suggested that our concepts are represented in at least two ways: (i) through sensorimotor simulations of our interactions with objects and events and (ii) through linguistic representations that are “dis-embodied.” That is to say, it involves sensorimotor simulations of experiences that are not associated with its semantic content (p. 6).

Regardless of this controversy, the role of the sensorimotor system in language is now broadly acknowledged, at least for some conceptual domains, which is not the case regarding memory processes. If the importance of performing an action to recover memories about actions is well documented (for review, see Engelkamp, 1998; Zimmer et al., 2001), the role of action for memory is still debated. For example, Mecklinger et al. (2004) showed that left ventral premotor cortex is activated during maintenance of manipulable objects in working memory. Shebani and Pulvermüller (2013) and Montero-Melis et al. (2019) showed that carrying out a rhythmic task with the hands led to selective impairment of working memory for hand-related words (e.g., clap), while carrying out the same task with the feet led to selective memory impairment for foot-related words (e.g., kick).

However, Pecher et al. (Pecher, 2013; Pecher et al., 2013; Quak et al., 2014), using interference methodology in behavioral studies, failed to highlight the role of motor program in working memory. Nevertheless, Gimenez and Brouillet (2020) showed that the procedure used by Pecher et al. (2013) could be challenged by changing the mode of response and the interference task, which led to an effect of the action performed on working memory.

In the same vein, Pecher et al. failed to replicate previous results showing that participants, in a short-term memory task on action verbs, made more errors when a motor interference task was congruent with the type of action verb than when the motor interference task was noncongruent (for review, see Zeelenberg & Pecher, 2016). However, Richardson et al. (2001), in a short-term visual memory task, showed that object affordance is stored at the stage of encoding and can be reactivated during a word recognition task even if the motor components were not involved. In an extension of this work, Brouillet et al. (2015) showed that if the motor components of the objects were perceptually present at the time of recognition, memory judgments were associated with the activation of the motor components of the objects learned.

In long-term memory, the results are also debated. Canits et al. (2018) investigated the influence of compatibility between the typical grasp size of an object and response grasp type on semantic categorization, on delayed free recall, and on a recognition memory task. In the categorization task, the compatibility effect was present, but it was not observed in the long-term memory tasks. These findings contrast with those of van Dam et al. (2013). In their study, participants performed an action between the learning and retrieval phases that was compatible with some learned objects (twist for paper mill, or press for doorbell) and incompatible with other learned objects (twist for piano or press for screw driver). They found that participants’ retrieval performance was better in the compatible conditions than incompatible conditions. The results obtained by Dutriaux et al. (2019) also contrast with those of Canits et al. (2018). In their study, participants’ hand posture was manipulated to interfere with motor simulation during language processing. In a learning phase, participants were exposed to object–verb sentences (e.g., action sentences: “to take a cup” vs. attentional sentences: “to see a cup”) while holding their hands behind their back or in front of them. Results showed that the “hand in the back” posture interfered with recall of object nouns when nouns appeared in the context of action sentences, and not when they appeared in the context of attentional sentences. Brouillet et al. (2018) also showed an effect of motor resonance in long-term memory: They observed, in a categorization task, that when right-oriented tools were primed by an injured hand, response times with the right hand were slower than when the prime was a healthy hand. This response delay was accentuated in a handgrip condition. According to them, these results support the idea that participants activated the corresponding motor program and the pain associated with the consequences of the simulated grasping action.

In addition to those studies, other findings showed that the neural structures governing the selection and execution of actions are activated when participants have to process verbs that represent these actions (Mecklinger et al., 2004; Pulvermüller & Fadiga, 2010; Pulvermüller et al., 2005; van Elk et al., 2010; Willems et al., 2010, 2011). Recently, Vitale et al. (2020) used transcranial direct current stimulation to test whether exogenous excitatory manipulation of the left primary motor cortex (M1) would improve memory performance with action sentences rather than with attention sentences (as in Dutriaux et al., 2019). Results showed strong evidence of a causal link between the activation of the motor system and memory for action sentences.

In sum, all these studies suggest a role of sensorimotor processes in memory activity. This idea is at the core of multiple traces memory models (e.g., MINERVA 2 – Hintzman, 1984, 1988; VISA – Whittlesea, 1989; Act-In – Versace et al., 2014; ATHENA – Briglia et al., 2018). In these models, the memory system stores the experiences of our interactions with the environment (i.e., episodic traces) in a multisensory and distributed way over the entire brain. In that sense, the content of new memory traces is closely related to former sensorimotor activities. Moreover, in these models, accessing a knowledge is the result of a coupling between the recovery situation and the set of activated episodic traces, and this activation depends on their similarities with the recovery situation, which is called the synergistic ecphory process (Tulving, 1976, 1982). In that sense, knowledge emerges when, in the current situation, there are cues that share various components of the memory traces, such as motor components (Brouillet & Versace, 2019). The aim of the present work was to test this assumption through the use of action verbs. Indeed, due to the role of sensorimotor processes in the processing and acquisition of action verbs, the sensorimotor system should be active also at the time of retrieval in a memory task.

To test this assumption, we carried out two experiments in which the participants had to perform an action to give their answer: The first was a recognition task, and the second a cue recall task. We supposed that the motor components of the action performed to give the responses should work as cues, even if participants were not instructed to consider the implied action. Therefore, our hypothesis is that when the action performed is consistent with the action associated with the verb, memory performance will be enhanced in various retrieval context: recognition and cued recall.

Experiment 1

The aim of this experiment was to show that performing an action congruent with action verbs increases participants’ performance in a recognition task, even when participants are not instructed to consider the action. We also predict that action verbs should be less recognized with an inconsistent action and that memory performance will be higher with a consistent action than with a neutral gesture (i.e., pressing a key).

As the action verbs used involve action toward or away from the body, we expect that action verbs involving a movement away from the body will be more easily and more quickly recognized when the participant has to perform an action away rather than toward the body. We expect the opposite pattern of performance for action verbs involving a movement toward the body. In other words, we expected an interaction between the movement associated with the verbs and the action performed to provide the response.

Method

Participants

Ninety students (Mage = 21.3, SD = 1.65) from the Department of Psychology at the University of Montpellier, France, and Nice, Italy, took part in this experiment for extra course credit. All participants were French-native speakers and right-handed. Their vision was normal or corrected to normal. All participants gave their informed consent to take part in this experiment and signed the Laboratory’s Charter of Ethics. We checked post hoc power analysis with G*Power software (Faul et al., 2007): For an effect size of 0.25, a probability of .05 and a total sample size of 90, G*Power indicates a power of 0.99.

Material

We selected 60 action verbs from Duscherer and Mounoud’s (2006) French norm. In a pre-experiment, the type of action associated with each verb and the certainty of the judgment made (1 = uncertain to 6 = absolutely certain) was tested with 50 other subjects. The verb selection criteria used were a consensus between participants greater than 80% on the action associated with the verb and a mean response certainty score greater than 4. These constraints allowed us to obtain 16 verbs associated with an action involving a movement away from the body (e.g., throw, push, and pour) and 16 verbs associated with an action involving a motion toward the body (e.g., pull, extract, and raise). The characteristics of the 32 verbs selected are shown in the Appendix. The verbs were presented in Times New Roman size 20 in black on a white screen.

Procedure

Participants were randomly assigned to one of three groups (see the section test phase) and tested individually. The experiment was performed on a Pentium IV computer with a 17-in monitor, using E-Prime software (Schneider et al., 2012). Participants were seated at a table in front of the computer screen, and the screen was set at a distance of about 60 cm from the participants’ eyes. For one group of participants, a keyboard was placed on the table, 50 cm away from the seat, so that they could rest their forearms on the table. For the two other groups of participants, a lever was placed on the right-hand side (we used a similar material as in Brouillet et al., 2010). This lever had to be either pushed or pulled to provide a response.

Learning Phase

In the learning phase, participants were instructed to remember the verbs that they were about to see for later recognition. Sixteen action verbs were then presented on the computer screen: Eight verbs were associated with movement away from the body, and eight others with a movement toward the body. On each trial, a fixation cross was presented in the center of the white screen for 500 ms. Then, a verb appeared in the center of the screen for 300 ms, and a white screen was presented for 500 ms before the beginning of the next trial. The order of the verbs was randomized.

Test Phase

In the test phase, 32 verbs were presented: the 16 verbs from the learning phase – namely, the “old” verbs – and 16 others that were not presented during the learning phase – namely, the “new” verbs. The new verbs included eight verbs associated with a movement away from the body, and eight verbs associated with a movement toward the body. For each trial, a fixation cross was presented in the center of the gray screen for 500 ms. Just after the cross disappeared, the verbs appeared in the center of the screen until participants gave their response. After their response, a blank screen was presented for 500 ms before the beginning of a new trial. To avoid a potential effect of the verb, the old verbs for a participant became the new ones for another, and reciprocally, the new verbs for a participant became the old ones for another. The order of presentation of the verbs was randomized between participants.

Participants were asked to indicate if the verbs presented were previously in the learning list.1 To respond, the first group of 30 participants had to use the index finger of their right hand to press the correct key on the keyboard (called “press” group). After each answer, participants were asked to put their forefinger back on the white key, which was located between the “yes” (indicating that the participants recognized the word as previously presented during the learning phase) and the “no” keys (i.e., the “yes” key corresponded to “V,” the “no” key to “N,” and the white key to “B”). The positions of the “yes” and “no” keys were counterbalanced between participants.

In the second and third groups, participants had to push or to pull the lever with their right hand. The second group of 30 participants had to push the lever to answer “yes” and pull it to answer “no” (called ““push” group). After each answer, participants were asked to place the lever back in the middle position, which was indicated by a sound click. The third group of 30 participants had to push the lever to answer “no” and pull it to answer “yes” (called “pull” group). Participants were told to respond as quickly and accurately as possible. The test phase took place about 3 min after the end of the learning phase, which was the time needed to read the instructions.

In summary, there were three sources of variation:

  1. 1.
    nature of the verbs (i.e., old vs. new) manipulated within participants,
  2. 2.
    verb category (i.e., away from the body vs. toward the body) manipulated within participants, and
  3. 3.
    response mode (i.e., press, push, and pull) manipulated between participants.

Analysis was conducted on the “yes” response.

Results

Statistics were carried out using JASP software (Wagenmakers, Love, et al., 2018; Wagenmakers, Marsman et al., 2018). We performed frequentist repeated-measures ANOVA followed by a Bayesian repeated-measures ANOVA with the category of verbs (i.e., away vs. toward) and nature of verbs (i.e., old vs. new) as a within-participant factor and the mode of response (i.e., pull, push, and press) as a between-participant factor. The Bayes factor (BF10) is the ratio of p (D∣H1), the probability of observing the data under the alternative hypothesis, and p (D∣H0), the probability of observing the data under the null hypothesis. So, the Bayes factor is a measure of the probability that the data match the alternative hypothesis rather than the null hypothesis (Rouder et al., 2009). In accordance with the recommendations from Schönbrodt and Wagenmakers (2018), a BF10 ≥ 10 will be interpreted as strong evidence for the alternative hypothesis, 3 ≤ BF10 < 10 as moderate evidence for the alternative hypothesis, and a BF10 < 3 as anecdotal evidence for the alternative hypothesis, with a BF10 close to 1 considered as no evidence.

Responses Analysis

The mean number of “yes” responses (Table 1) was calculated across participants for each experimental condition (i.e., nature and category of the verbs) and for each group (i.e., pull, push, and press). Note that answering “yes” to “old” verbs is an adequate answer (HIT: signal present and subject says “yes”) while answering “yes” to “new” verbs is an incorrect answer (FA, False Alarm: signal absent and subject says “yes”). The notation HIT and FA is the one used in signal detection theory (see below). We ran ANOVAs with the nature of the items (i.e., old vs. new) and verb category (i.e., away vs. toward verbs) as within-participant factors, and the mode of response (i.e., pull, push, and press) as a between-participant factor.

Table 1 Mean number of “Yes” responses (max. 8) for old (i.e., HIT) and new (i.e., FA) items according to verb category (toward vs. away from the body) and mode of response (pull, push, or press)

The three-way interaction (Nature of the verbs × Verb category × Response mode) was not significant, F(2,87) = 0.73, p = .48, , BF10 = 0.07. The interaction between the mode of response and the nature of the items was significant, F(2, 87) = 7.96, p < .01, =.15, BF10 = 2.17+56, as the interaction between the mode of response and the verb category, F(2, 87) = 256.19, p < .01, , BF10 = 8.39+19.

The analysis revealed a significant main effect of response mode, F(2, 87) = 3.86, p < .05, , BF10 = 0.06. Participants produced more “yes” responses (i.e., average HIT + FA) when performing an action with the lever compared to pressing a key on the keyboard, F(1, 87) = 7.28, p < .01, , BF10 = 5.07. However, there was no difference between the “push” group and the “pull” group, F < 1.

The analysis showed no effect of verb category, F(1, 87) = 0.01, p = .91. Verbs “toward” were as well recognized as verbs “away.”

The analysis revealed a classic effect of the nature of the items, F(1, 87) = 925.94, p < .01, , BF10 = 1.29+57. The old verbs were more rated as old than the new verbs for each response mode and each category of verbs (Table 2). So, we will first present the results for the old words and then for the new words (the Holm–Bonferroni correction is applied on partial analysis).

Table 2 Comparisons of old vs. new according to the nature of the verbs (toward and away) and to the mode of response (pulling, pushing, and pressing)
Old Verbs

The interaction “Nature of the verbs × Mode of response” was significant, F(2,87) = 96.59, p < .001, , BF10 = 1.52+26. However, neither the factor “nature of the verbs,” F(1,87) = 0.07, p = .78, nor the factor “mode of response” was significant, F(2,87) = 0.40, p = .66, . The comparisons (Table 3) showed that (a) for verbs “toward,” pulling responses improved correct recognitions compared to pushing and pressing responses, and pushing responses reduced correct recognitions compared to pressing; (b) for verbs “away,” pushing responses improved correct recognitions compared to pulling and pressing responses, and pulling reduced correct recognitions compared to pressing.

Table 3 Statistics according to the nature of the verbs (toward and away) and to the mode of response (pulling, pushing, and pressing)
New Verbs

The interaction “Nature of the verbs × Mode of response” was significant, F(2,87) = 161.06, p < .001, , BF10 = 3 × 10+19. The factor “nature of the verbs” was not significant, F(1,87) = 0.02, p = .87. But the factor “mode of response” was significant, F(2,87) = 13.09, p < .001, . The comparison showed that less false recognitions were made in the “press” group, compared to the “pull” group, t(58) = 4.72, pholm < .001, and to the “push” group, t(58) = 4.07, pholm < .001. There was no difference between the “pull” and “push” groups, t(58) = 0.64, pholm = .51.

The post hoc analysis of the interaction (Table 4) showed that (a) for verbs “toward,” pulling responses increased false recognitions compared to pushing and pressing responses, and pushing reduced false recognitions compared to pressing; (b) for verbs “away,” pushing responses increased false recognitions compared to pulling and pressing responses, and pulling reduced false recognitions compared to pressing.

Table 4 Statistics according to the nature of the verbs (toward and away) and to the mode of response (“pull,” “push,” and “press” groups)

Discrimination Accuracy

According to the accuracy-oriented approach (for review, see Koriat et al., 2000), memory performance does not depend only on the probability of recovering a mere event but also on reconstructing an event or an experience corresponding to the environmental constraints (i.e., constraint of the task, nature, and richness of the cue, fluency misattribution). In other words, memory accuracy preferentially refers to the ratio between the probability for the memory systems to produce an adequate response (HIT) and the probability to produce an inadequate response (FA). The Signal Detection Theory (Green & Swets, 1966) provides a model to measure participants' discrimination accuracy in terms of the distance between the means of the target and lure response distribution, and this distance is called dʹ (dʹ = z(FA) – z(H)). In other words, the calculation of dʹ allowed us to know the ability of the subject to distinguished old words (i.e., targets) from new words (i.e., lures). Specifically, the higher the dʹ value, the more the subject was able to discriminate old from new words, while the lower the dʹ value, the lesser the subject was able to distinguish them.

The proportions of responses were calculated for each condition for each participant. Hits (responses “yes” to old words) and false alarms (responses “yes” to new words) were used to calculate dʹ values. We replaced scores of 0 by 0.5/total and scores of 1 by (total − 0.5)/total, where total was the total number of items in the condition (Quak et al., 2014). Table 5 shows mean average dʹ values.

Table 5 Mean average dʹ according to the response mode (pulling, pushing, and pressing) and the category of verbs (toward and away)

The interaction between response mode and category of verbs was significant, F(2,87) = 3.86, p = .02, , BF10 = 2.25+4. No effect of verb category was observed, F(1,87) = 0.09, p = .76, , BF10 = 0.16. The analysis also revealed a significant main effect of response mode, F(2,87) = 20.85, p < .001, ; BF10 = 4.24+4. Participants better discriminated old and new verbs when they pressed the keyboard than when they moved the lever: “pull” vs. “press” group, t(58) = 5.45, pholm < .001, ; “push” vs. “press” group, t(58) = 5.51, pholm < .001, . The difference between “push” and “pull” groups was not significant, t(58) = .06, pholm = .95.

The post hoc analysis of the interaction (Table 6) showed that regardless of the verb category, participants' discrimination performance between old and new verbs were better in the “press” than “pull” and “push” groups, but the difference between “push” and “pull” groups was not significant.

Table 6 Statistics according to the nature of the verbs (toward and away) and to the mode of response (pulling, pushing, and pressing)

Response Times

The response times for the “press” group were not included in the analysis as the required action was not comparable to the response in the “push” and “pull” groups. In situations requiring pushing or pulling, the range of motion is greater than in situations requiring pressing, and therefore induce longer response times. In the “press” group, the mean response time for old items was 494.56 ms for toward verbs and 492.01 ms for away verbs. For new items, the mean response time was 569.80 ms for toward verbs and 588.31 ms for away verbs. The difference between these two categories of verbs was not significant (F < 1).

We conducted analyses of response times only for old items (Table 7). Indeed, for the new items, 12 participants in the “pull” group did not make false recognitions (FA) for verbs away and 9 participants of the “push” group did not make false recognitions for verbs toward.

Table 7 Mean RT (in ms) for old (i.e., HIT) for “old” responses according to verb category (toward vs. away from the body) and mode of response (pull vs. push)

For old verbs, analysis showed no effect of response mode, F(1, 58) = 1.33, p = .25, , BF10 = 0.32, and no effect of verb category, F(1,58) = 0.23, p = .62, , BF10 = 0.20. However, the analysis revealed a significant interaction between mode of response and verb category, F(1, 58) = 26.21, p < .01, , BF10 = 5,637.13. The comparisons showed that pulling for verbs toward induced faster response times than pushing, t(58) = 4.45, pholm <.001, , and the pulling for verbs away induced slower response times than pushing, t(58) = 2.83, pholm = .01, .

Conclusion

Our results showed a robust effect of overlapping between the movement associated with a verb and the action performed to render a memory judgment on that verb, even if the participants were not instructed to pay attention to the action performed to answer. Participants considered verbs as old items more often when congruent consistent action was required to produce the judgment: push for verbs away and pull for verbs toward, and the opposite pattern appeared with inconsistent actions: push for verbs toward and pull for verbs away. The analyses of the recognition times confirmed this overlapping effect: participants recognized more rapidly the verbs “toward” when they had to pull to respond “yes” than to push. The same was true for the verbs “away” when they had to push to respond “yes” than to pull. Finally, the ease, indicated by dʹ values, in discriminating new verbs from old verbs when the action to produce the judgment was congruent with the action suggested by the verbs reinforces the idea that the action performed to answer and the action linked with the verb resonate.

Experiment 2

The aim of this second experiment was to replicate the findings of the first experiment in a cued recall task. We expected that the recall rate would depend on the action performed: After a pushing action, participants' performance at recalling action verbs from the “away” category should be better; after a pulling action, participants should recall more verbs from the “toward” category; and after a pressing action, participants should recall “away” verbs and “toward” verbs in similar proportions.

Method

Participants

Sixty students (Mage = 20.6, SD = 1.92) from the Department of Psychology at the University of Montpellier who did not participate in Experiment 1 took part in this experiment for extra course credit. All participants were French-native speakers and right-handed. Their vision was normal or corrected to normal. All participants gave their informed consent to take part in this experiment and signed the Laboratory’s Charter of Ethics. We checked post hoc power analysis with G*Power software (Faul et al., 2007): For an effect size of 0.25, a probability of .05, and a sample size of 60, G*Power indicates a power of 0.93.

Material

We used 20 verbs from Experiment 1: 10 verbs associated with an action involving a movement away from the body (i.e., “away”), and 10 verbs associated with an action involving a motion toward the body (i.e., “toward”). The verbs were presented in Times New Roman size 20 in black on a white screen.

Procedure

The procedure was the same as Experiment 1 except that 20 action verbs were presented instead of 32, and the test phase consisted of a recall task.

Test Phase

Participants were asked to recall the verbs from the learning phase, irrespective of their order. A question mark appeared on the computer screen, and participants had to press a key (“press” group, 20 participants), pull a lever (“pull” group, 20 participants), or push it (push group, 20 participants) to make the question mark disappear. As soon as the question mark disappeared, participants had to recall one verb.2 It was stated that performing the action allowed them to produce verbs at their own rhythm.

The test phase took place about 5 min after the learning phase, which was the time needed to read the instructions and to learn how to use the response device to make the question mark disappear (six trials without recall were performed by the participants).

Results

The mean number of words recalled was calculated for each participant for each experimental condition (i.e., verbs) and for each group (i.e., pull, push, and press). We distinguished (Table 8) the verbs correctly recalled (i.e., present in the learning phase) from those falsely recalled (i.e., not present in the learning phase). For verbs falsely recalled, we distinguished between verbs that evoke an action away from the body (i.e., “away”), from those that evoke an action toward the body (i.e., “toward”), and from verbs that do not evoke one of these two actions (i.e., “others”). To do this, the classification was carried out by three independent judges.

Table 8 Mean number of items correctly recalled (max. 8) for old (i.e., HIT) and incorrectly recalled according to verb category (toward vs. away from the body) and mode of response (pull, push, or press)

We ran two distinct repeated-measures ANOVAs followed a Bayesian repeated-measures ANOVA on the number of words correctly recalled and the number of words falsely recalled, with the category of verbs (i.e., away vs. toward verbs) as a within-participant factor and the mode of response (i.e., pull, push, and press) as a between-participant factor.

Verbs Correctly Recalled

The analysis revealed no main effect of action, F(2, 57) = 1.44, p = .24, , BF10 = 0.15, nor effect of verb category, F(1,57) = 0.20, p = .65, , BF10 = 0.20. However, there was a significant interaction between the mode of response and the verbs, F(2, 57) = 86.57, p < .01, , BF10 = 6.89+19. The comparisons (Table 9) showed that (a) for verbs “toward,” the pulling mode of response increased the number of items correctly recalled compared to pushing and pressing, and the pushing mode of response reduced the number of items correctly recalled compared to pressing; (b) for verbs “away,” pushing increased correct recalls compared to pulling and pressing, and pulling reduced correct recalls compared to pressing.

Table 9 Statistics according to the nature of the verbs (toward and away) and to the mode of response (pulling, pushing, and pressing)

Verbs Falsely Recalled

The interaction between action and verbs was significant, F(2, 57) = 16.20, p < .01, , BF10 = 1.48+4. The analysis revealed no effect of verbs, F(1, 57) = 0.02, p = .88, , BF10 = 0.19. But there was a main effect of action, F(2, 57) = 7.18, p < .01, , BF10 = 3.79. Comparisons showed that pressing induced less false recalls than pulling, t(38) = 3.58, pholm = .002, , and pushing, t(38) = 2.38, pholm = .012, . But there was no difference between pulling and pushing, t(38) = 0.71, pholm = .477, . The post hoc analysis of the interaction (Table 9) showed that for verbs “toward,” the pulling mode of response induced more false recalls than pressing and pushing, but no difference was found between pressing and pushing; for verbs “away,” pushing produced more recalls than pulling and pushing, but no difference was found between pressing and pulling.

Conclusion

The results show a robust overlapping effect between the nature of the action performed and the category of the verb recalled. In other words, participants recalled more verbs “away from the body” after a pushing action and more verbs “toward the body” after a pulling action. Moreover, it should be noted that performing a pressing action did not induce the recall of a specific verb category and that performing an action incongruent with the movement associated with a verb resulted in lower levels of recall compared to performing a pressing action.

Even if the number of false recalls was low (i.e., maximum of three items per participant), the results highlighted that the action performed induced the production of nonlearned verbs for which the associated motor action was congruent with this action (i.e., pushing action induced production of “away” verbs, and pulling action induced the production of “toward” verbs). Interestingly, the action “press the keyboard” did not induce the production of a particular type of verb.

General Discussion

For a long time, the sensorimotor system was not considered as a contributing system in conceptual processing of words or sentences evoking an action. Over the last 20 years, behavioral and neuroimaging data suggested that the sensorimotor system is active during the processing of action words or sentences. However, the question is far from being definitely closed.

Among the researchers subscribing to the embodied cognition framework, there is a debate about the dependence of the conceptual system on the sensorimotor system (Dove, 2011; Meteyard et al., 2012; Michel, 2020; Wurm & Caramazza, 2019). Researchers like Mahon and Caramazza (2008) consider that the activation of sensorimotor areas observed with neuroimagery is the consequence of conceptual activation (see also Mahon, 2015, response to Glenberg, 2015) while others (e.g., Barsalou, 2016) consider that the sensorimotor system is not enough to account for conceptual abstraction, though it can facilitate abstraction in some cases. Another view (e.g., Glenberg, 2015) considering the activation of sensorimotor areas gives rise to conceptual abstraction. Finally, the pluralist view proposed by Zwaan (2014, 2016) suggests that if the sensorimotor system contributes to the conceptual system, it cannot account for the entirety of the conceptual knowledge, as symbolic representations remain necessary.

As can be seen, the role of the sensorimotor system in representing conceptual knowledge is still under hot debate.

In the field of memory, the multiple traces memory models (e.g., MINERVA 2 – Hintzman, 1984, 1988; VISA – Whittlesea, 1989; Act-In – Versace et al., 2014; ATHENA – Briglia, et al., 2018) consider that the content of memory traces is closely related to former sensorimotor activities and that knowledge emerges when, in the current situation, some cues share various sensorimotor components of the memory traces. The aim of the present work was to test this assumption for action verbs by assessing how the congruence with the verb of an action performed at the stage of retrieval could influence memory performance, even when participants were not instructed to consider the action implied by the verb.

The results of the first experiment showed an overlap effect between the movement associated with the verbs to be recognized and the action performed to provide the memory judgment. Participants recognized verbs more often and more quickly with a consistent action than with an inconsistent action. Moreover, discriminating new verbs from old verbs was more difficult when the action to produce the answer was congruent with the action suggested by the verb presented.

The results of the second experiment also showed an overlap effect between the action performed, which was used as a cue for the recall, and the ability to recall the verbs learned. Thus, performing an action congruent with the motor action associated with a verb resulted in higher levels of recall compared to performing an action incongruent with the motor action associated with the verbs.

Moreover, in the two experiments, in comparison with a neutral action (i.e., pressing a key), the retrieval of learned verbs was higher when the action was consistent with the motor action associated with them and weaker when the action was not consistent. These results support the idea that the memory performances of the participants are the product of the overlap between the simulated motor action induced when reading a verb and the motor action performed to provide a judgment. The results observed on the nonlearned verbs (false alarms in Experiment 1) and on the verbs wrongly produced (false recalls in Experiment 2) also support this idea. Therefore, those results highlight an action-based memory effect at the retrieval stage.

The first interest of this work is to enrich the literature on the links already observed between action and memory. According to the “enactment paradigm,” an item studied with an associated action is better memorized than an item studied only verbally (for review, see Zimmer et al., 2001). This finding was observed both in recall and recognition tasks, as well as when the action was performed at the time of encoding (e.g., Denis et al., 1991; Engelkamp et al., 1993; Knopf, 1991; Kubik et al., 2014; Lagacé & Guérard, 2015; Mohr et al., 1989; Zimmer, 1991). However, there were some discussions on the power of enactment in situations where the action was only performed at the time of retrieval (e.g., Brooks & Gardiner, 1994; Engelkamp et al., 1994; Kormi-Nouri et al., 1994; Mulligan & Hornstein, 2003; Norris & West, 1993). Our findings show an effect of the congruency of an action performed only at the stage of retrieval, after a verbal encoding task, in both recognition and recall tasks. Moreover, in contrast with previous studies, where participants were explicitly asked to perform the action associated with the item to be retrieved, in our experiments, the action is part of the experimental device, which means that participants did not focus on the action per se, but on the judgment to be made.

The second contribution of this work is that it supports the way in which memory recovery is conceived by multiple traces memory models. Indeed, it seems that, in our work, the feeling of memory emerges from the coupling between the present sensorimotor experience and the trace of past sensorimotor experiences associated to the verbs. As suggested by the ATHENA model (Briglia et al., 2018), what need to be memorized are the covariances (the notion of covariance is borrowed from O'Regan & Noë, 2001) between the present sensorimotor information processing and past sensorimotor information processing. These covariances are sufficient to account for cognitive information (Hutto & Myin, 2012) because if two processes are covariant, they are probably linked. Moreover, the intensity of the covariance (i.e., fluency) is a cue to infer the link between present and past information processing that will trigger the emergence of a feeling of pastness, which is then expressed in a memory judgment. In other words, this judgment would be modulated by the congruency generated by the overlapping between the motor programs associated with the action performed and the motor simulations associated with the verbs. Indeed, several works showed that motor compatibilities generate fluency (Beilock & Holt, 2007; Brouillet et al., 2011; Cannon et al., 2010; Jasmin & Casasanto, 2012; Yang et al., 2009) and motor fluency enhance memory (Brouillet et al., 2017).

Finally, the third interest of this work is associated with the value of dʹ (i.e., ability to discriminate old verbs and new verbs) in Experiment 1 and false recalls in experiment 2 (i.e., the production of new action verbs in accordance with the movement of the lever). Indeed, these can provide useful information on the mechanism at work at the recovery stage and, consequently, can contribute to the debate on the implication of the motor system in action verb processing. Overall, the results show that participants have more difficulty in discriminating old from new verbs when they move the lever than when they press a key on the keyboard. Moreover, this difficulty is higher when the action carried out with the lever is not congruent with the action associated with the verb. So, the action performed with the lever interferes with the access to memory traces. This suggests that it is the action performed with the lever that guides memory judgment by driving the activation of the sensorimotor dimensions associated with a verb. The false recalls in the second experiment support this interpretation. Thus, and in accordance with multitraces memory models, the coupling between the sensorimotor experiences would not be from past sensorimotor experiences to present sensorimotor experience but from present sensorimotor experience to past sensorimotor experiences. In other words, the present situation shapes what will be remembered, especially through the actions taken. In that way, memory is enacted (Brouillet, 2020).

We think that this finding could also be related with what is called “action reappraisal” by Federico and Brandimonte (2019, 2020) in the field of understanding tool use. In contrast with dominant conceptions (see Buxbaum, 2010), they consider that when agents have to use a tool they do not automatically activate the sensorimotor representations associated with the previous use of the tool, but first they reason about the “possibility to act” within a particular context and particular intention (see also Osiurak & Badets, 2016, 2017). Only in a second step, and according to what is needed, will the sensorimotor representations be activated. The paradigm called “technical reasoning hypothesis” has now received a significant number of experimental validations (Federico et al., 2021), even if it is still questionable (Osiurak et al., 2020).

All these findings suggest that the involvement of sensorimotor processes in knowledge is context-dependent. This has been well shown by Tousignant and Pexman (2012): Altering the task instructions given to participants changes the semantic information used to categorize words. Tousignant and Pexman (2012) compared the reaction times and accuracy in semantic decision for words that were high in body–object interaction (BOI) and for words low in BOI according to the given instruction (i.e., “is it an entity or not” vs. “is it an action or not”). BOI assesses perceptions of the ease with which a human body can physically interact with a word’s referent (Siakaluk, Pexman, Aguilera et al., 2008; Siakaluk, Pexman, Sears et al., 2008). When participants were told to judge entity, the semantic decision and reaction times were better for high BOI words than for low BOI words (the BOI effect). In contrast, when participants were told to judge action features, no BOI effect was observed. But the two intermediate conditions “entity vs. action” and “action vs. entity” showed moderate BOI effects. These results suggested that body-based semantic information might only be accessed when they are relevant to the task.

If it seems that the access to sensorimotor representation is context-dependent, this issue is also controversial. While Yee and Thompson-Schill (2016) consider that all semantic processing is driven by the context, Dove (2016) argues that this is not always the case.

Clearly, more researches are needed to clarify the role of sensorimotor processes in the processes underlying conceptual knowledge.

References

Appendix Characteristics of the 32 Verbs Selected: MA, C (max. 7), LF, NL

Table A1 Verbs implying action away from the body
Table A2 Verbs implying action toward the body

1The instruction was, “You will see a verb appearing in the center of the screen. You will have to say if this verb was present or not in the learning phase. The verb will remain displayed until you have given your answer” (common to all groups). “To do this, you have to tap the “yes” key or the “no” key with your forefinger of your right hand. Once you have given your answer you have to put it on the white key” (group “press”). “To do this, you have to touch the “yes” switch or the “no” switch with the lever. You must move the lever with your right hand. Once you have given the answer you must move the lever back to the center. You will hear a click at this point” (“pull” and “push” groups).

2The instruction was, “You will have to recall the verbs you have learned regardless of the order in which they were learned.” To allow your answer to be recorded, “you will have to produce the verbs you remember once you have removed the question mark displayed on the screen” (common to all groups). “To remove the question mark, you must press the space bar with the forefinger of your right hand” (“press” group). “To remove the question mark, you must move the lever with your right hand to the switch (group “pull” and “push”). Before you start, you will have to remove the question mark six times to familiarize with the device” (common to all groups).