Skip to main content
Open AccessShort Research Article

Correlational Evidence for the Role of Spatial Perspective-Taking Ability in the Mental Rotation of Human-Like Objects

Published Online:https://doi.org/10.1027/1618-3169/a000505

Abstract

Abstract. People can mentally rotate objects that resemble human bodies more efficiently than nonsense objects in the same/different judgment task. Previous studies proposed that this human-body advantage in mental rotation is mediated by one's projections of body axes onto a human-like object, implying that human-like objects elicit a strategy shift, from an object-based to an egocentric mental rotation. To test this idea, we investigated whether mental rotation performance involving a human-like object had a stronger association with spatial perspective-taking, which entails egocentric mental rotation, than a nonsense object. In the present study, female participants completed a chronometric mental rotation task with nonsense and human-like objects. Their spatial perspective-taking ability was then assessed using the Road Map Test and the Spatial Orientation Test. Mental rotation response times (RTs) were shorter for human-like than for nonsense objects, replicating previous research. More importantly, spatial perspective-taking had a stronger negative correlation with RTs for human-like than for nonsense objects. These findings suggest that human-like stimuli in the same/different mental rotation task induce a strategy shift toward efficient egocentric mental rotation.

Human-Body Advantage in Mental Rotation

Object-based mental rotation refers to the ability to create a mental image of an object and then mentally rotate it. In a classic experiment, Shepard and Metzler (1971) presented participants with a pair of three-dimensional objects, consisting of cubes, and required them to judge whether the two objects were the same or mirror images of each other regardless of the objects' orientations. The typical results showed a linear increase in mean response time (RT) with the angular disparity between the two objects, suggesting that people can mentally rotate objects at a constant rate as they physically rotate real objects (for a recent review, see Searle & Hamm, 2017).

People can perform a same/different mental rotation task more efficiently when the stimuli are human bodies (Amorim et al., 2006; Jansen et al., 2012; Voyer & Jansen, 2016) or cube objects with human-like features (e.g., head, face) that are analogous to human bodies (Amorim et al., 2006; Krüger et al., 2014; Makinae et al., 2015; Makinae & Kasai, 2017; Muto et al., 2020; Muto & Nagai, 2020; Sayeki, 1981; Voyer & Jansen, 2016) compared with nonsense cube objects. Amorim et al. (2006) proposed two underlying processes in the human-body advantage in mental rotation, called “spatial embodiment” and “motoric embodiment.” The spatial embodiment describes the process of projecting one's body axes onto human bodies or human-like objects with intrinsic axes (e.g., the top–bottom axis can be defined by the head position), while the motoric embodiment describes the process of the mental emulation of the posture of stimuli.

Empirical support for spatial and motoric embodiment remains limited. Accordingly, mental rotation performance did not improve when the stimuli were impossible postures (Amorim et al., 2006; Krüger et al., 2014; Sayeki, 1981) or desk lamps, which are familiar but not related to human bodies (Amorim et al., 2006). In addition, Jansen et al. (2012) reported that compared with non-athletes, male soccer players showed shorter overall RTs (but not mental rotation speeds) only for human bodies, which is at least consistent with a motoric embodiment. However, contrary to the motoric embodiment (and consistent with the spatial embodiment), Muto and Nagai (2020) demonstrated that snake-like objects, which had intrinsic axes but were difficult to imitate, increased mental rotation speeds by the same rate as the human-body advantage. Given the limited research and mixed results, more empirical investigations are needed to determine the mechanisms underlying the human-body advantage. In particular, the present study focused on the spatial embodiment.

Spatial Embodiment and Egocentric Mental Rotation

Spatial embodiment raises the possibility that participants not only execute object-based mental rotation but also mentally transform their perspectives to align their imagined orientation with the stimulus orientation in a same/different judgment task with human-like stimuli, which was claimed by some researchers (Amorim et al., 2006; Jansen et al., 2012; Voyer & Jansen, 2016). Such a perspective transformation has been considered as the mental rotation of the self, or egocentric mental rotation, which has a well-established distinction from object-based mental rotation (e.g., Habacha et al., 2018; Hegarty & Waller, 2004; Huttenlocher & Presson, 1973; Kessler & Thomson, 2010; Kozhevnikov & Hegarty, 2001; Muto et al., 2019, 2018; Wraga et al., 2000; Zacks et al., 2000; Zacks & Michelon, 2005; Zacks & Tversky, 2005). In most situations, compared with object-based mental rotation, egocentric mental rotation is more efficient and shows shallower RT slopes as a function of angular disparity (e.g., Habacha et al., 2018; Huttenlocher & Presson, 1973; Voyer et al., 2017; Wraga et al., 2000; Zacks et al., 2000; Zacks & Michelon, 2005; Zacks & Tversky, 2005). Hence, one may partially attribute the human-body advantage to a strategy shift from the object-based mental rotation to the more efficient egocentric mental rotation. Nonetheless, to the best of our knowledge, no studies have directly investigated the role of egocentric mental rotation in a same/different judgment task involving human-like stimuli.

The Present Study

The present study first investigated whether the egocentric mental rotation is adopted for the same/different comparison of human-like objects. Although egocentric mental rotation was shown to be adopted for left/right judgments of a rotated human body (e.g., Habacha et al., 2018; Zacks et al., 2000; Zacks & Michelon, 2005; Zacks & Tversky, 2005), it is controversial whether the adoption of egocentric mental rotation is determined by stimulus type (i.e., human-likeness) or task type (i.e., left/right identification; e.g., Voyer et al., 2017). In other words, it is still unclear whether the egocentric mental rotation is elicited for a same/different judgment of human-like objects. Thus, we examined whether the human-body advantage can be explained by egocentric mental rotation efficiency using an individual-differences approach as a first step.

Spatial perspective-taking, which refers to one's ability to understand an object's direction from a perspective other than their own, is known to entail egocentric mental rotation (e.g., Hegarty & Waller, 2004; Kessler & Thomson, 2010; Kozhevnikov & Hegarty, 2001; Muto et al., 2018, 2019). Consistent with this, Zacks et al. (2000) showed that performance on left/right judgments of human bodies (requiring egocentric mental rotation) was more strongly associated with the spatial perspective-taking ability, assessed by psychometric tests, than object-based mental rotation. Hence, we hypothesized that spatial perspective-taking ability has a stronger association with performance on the same/different judgment of human-like objects as opposed to nonsense ones.

To minimize the stimulus difference between the control and human-body conditions, the present study used cube objects as stimuli to which a human-like or nonsense pattern was added to appear or not appear as whole human bodies, respectively, following some previous studies (Makinae et al., 2015; Makinae & Kasai, 2017; Muto & Nagai, 2020; Sayeki, 1981), rather than using real human bodies (Amorim et al., 2006; Jansen et al., 2012; Voyer & Jansen, 2016). In addition, we recruited only female participants given the previous findings of sex differences observed in both object-based and egocentric mental rotation (e.g., Makinae et al., 2015; Makinae & Kasai, 2017; Tarampi et al., 2016; Voyer et al., 1995; Voyer et al., 2017; Voyer & Jansen, 2016) and larger human-body advantage among females than males in chronometric mental rotation experiments with cube stimuli (Makinae & Kasai, 2017; Makinae et al., 2015; but also see Voyer & Jansen, 2016, who reported the opposite result in an experiment using real human bodies). To assess spatial perspective-taking ability, we used the Road Map Test (Money et al., 1965, modified by Zacks et al., 2000) and the Spatial Orientation Test (Kozhevnikov & Hegarty, 2001, modified by Hegarty & Waller, 2004), which are commonly used as psychometric measures (e.g., Tarampi et al., 2016; Zacks et al., 2000). The main goal of the present study was to test whether human-like stimuli induce a strategy shift toward efficient egocentric mental rotation via spatial embodiment. If this hypothesis is true, spatial perspective-taking ability should have a stronger negative correlation with mean RT for the same/different judgment of human-like than nonsense objects.

Methods

Participants

Since the effect size was unknown, we patterned our sample size to that of Zacks et al. (2000), whose measures were similar to those of the present study and where data from 48 participants were analyzed. We planned to collect data from 50 women enrolled in an introductory psychology course (as part of a requirement). However, two did not participate in the experiment, one completed only the mental rotation task, and four had near-chance performance in at least one condition of the mental rotation task. Thus, only data from 43 women (mean age = 19.1 years, ranging from 18 to 31, two left-handed) were fit for analysis. All had a normal or corrected-to-normal vision and were naïve to the purpose of the study. This experiment was approved by the ethics board of the School of Human Sciences of Osaka University. All participants gave written informed consent before participating.

Materials and Procedure

All participants performed a computer-based same/different mental rotation task consisting of two blocks for nonsense and human-like conditions and then completed two timed paper-and-pencil tests for assessing spatial perspective-taking ability—the Road Map Test (Money et al., 1965, modified by Zacks et al., 2000) and the Spatial Orientation Test (Kozhevnikov & Hegarty, 2001, modified by Hegarty & Waller, 2004), in this order.

Mental Rotation Task

Stimuli were images of a three-dimensional object consisting of ten cubes. For the human-like condition, a human head was drawn on the object's uppermost cube so that it resembled a human body (Figure 1, right). For the control condition, we created a nonsense object by replacing the head of the human-like object with a nonsense pattern (Figure 1, left). For same–different comparisons, we also created mirror-reversed versions of the nonsense and human-like objects. By rotating these four images by 0°, ±60°, ±120°, and 180° in the picture plane, we generated 24 stimulus images (https://doi.org/10.17605/osf.io/gxnwv). All stimuli were displayed on a white background on a 21.5” LCD monitor with a resolution of 1,920 × 1,080 pixels. The participants were free to choose their most comfortable viewing distance (the possible range was approximately 50–70 cm).

Figure 1 Stimulus images of nonsense (left) and human-like (right) objects used in the mental rotation task. The frames surrounding the images were not shown in the actual experiment.

Participants underwent the nonsense and human-like conditions in separate blocks in a counterbalanced order. In each trial, a pair of stimulus images were presented side-by-side. One of the two images was a reference object, which was always presented at 0° (i.e., orientation in Figure 1), and the other was identical to or the mirror-reversed version of the object, presented either at 0°, ±60°, ±120°, or 180°. Participants had to press the “J” or “F” key as accurately and quickly as possible, according to whether the two images showed identical or different objects, respectively. The inter-trial interval was 500 ms. Each block consisted of 192 trials: six angles × two versions of the reference object (original or mirror-reversed) × two presentation positions of the reference object (left or right) × two pair types (same or different) × four repetitions. The trial order was randomized, with no feedback given. Participants were given the opportunity to take a break after every 24 trials. Before the experimental trials, the participants completed 96 practice trials that used objects with no patterns.

Psychometric Test for Assessing Spatial Perspective-Taking Ability

The Road Map Test (Money et al., 1965, modified by Zacks et al., 2000) shows a bird's-eye diagram of a city on a page. The participants were asked to imagine walking along a route on the map and indicate whether they should turn right or left at each of the 36 corners by writing either “み” or “ひ,” respectively (instead of “R” and “L,” we chose the initial letters of the Japanese words for “right” and “left” because all participants were Japanese). They were not allowed to rotate the page. The Road Map Test score was the number of corners that were answered correctly within 30 s.

Meanwhile, the Spatial Orientation Test (Kozhevnikov & Hegarty, 2001, modified by Hegarty & Waller, 2004) consists of 12 items (each printed on a page). Each item presented the participants with a picture of an array of objects. The participants were instructed to imagine standing in the location of one object, facing another object, and indicate the direction to a third object by drawing an arrow in a circle printed at the bottom following a sentence below the array (e.g., “Imagine you are standing at the car and facing the traffic light. Point to the stop sign.”). For the present study, we translated the original sentences into Japanese. We prohibited the participants from rotating the page or making any marks on the array. They were given 5 min to complete the test. We then calculated the absolute angular deviation from the correct answer (ranging from 0° to 180°) per item and then averaged the angular deviations across items. An angular deviation of 90° (chance level) was assigned to an unanswered item (only for one item of one participant).

Data Processing

We calculated the mean RTs of the mental rotation task for each combination of participant, condition (nonsense or human-like), and angular disparity (0°, 60°, 120°, or 180°) using data from same-pair trials. Before calculating the means, we excluded data from incorrect responses (7.28% of the same-pair trials) and responses slower than the mean RT plus three SDs per participant (2.00% of same-pair trials). As an index of spatial perspective-taking ability, we calculated a composite score by averaging the z-transformed Road Map Test score and the angular deviation of the Spatial Orientation Test that was also z-transformed and then negated so that higher values represent greater spatial perspective-taking ability. Raw data and scripts for analysis are available at https://doi.org/10.17605/osf.io/gxnwv.

Results

Preliminary Analyses

The mean RTs of the mental rotation task are shown in Figure 2. A repeated measures ANOVA,1 whose factors were condition (nonsense or human-like) and angular disparity (0°, 60°, 120°, or 180°), revealed faster mean RTs for the human-like object (M = 1,088 ms) than for the nonsense object (M = 1,280 ms), F(1, 42) = 28.48, ηp2 = .404, p < .001, which ensured the presence of the human-body advantage. The analysis also revealed a significant main effect of angular disparity, F(1.44, 60.43) = 139.63, ηp2 = .769, p < .001, and a significant interaction between condition and angular disparity, F(2.30, 96.40) = 13.10, ηp2 = .238, p < .001. These results reflected linear RT-angle functions and a shallower RT slope for the human-like object (5.20 ms/degree) than for the nonsense objects (6.57 ms/degree), consistent with previous studies (e.g., Amorim et al., 2006). For the following correlation analyses, we used the mean RTs of the nonsense and human-like conditions that collapsed across angular disparities. These RTs were positively correlated with each other, r = .767, t(41) = 7.66, p < .001 (two-tailed).

Figure 2 Mean response times of the mental rotation task (N = 43). Error bars represent 95% CIs of means.

Road Map Test scores (M = 13.3, SD = 4.9) were negatively correlated with angular deviations of the Spatial Orientation Test (M = 23.4°, SD = 8.4), r = −.388, t(41) = −2.70, p = .010 (two-tailed). This implies that the two measures commonly used spatial perspective-taking ability and justified using a composite score as a more reliable and valid measure of such an ability.

Relationship Between Mental Rotation and Spatial Perspective-Taking Ability

Figure 3 shows the relationship between spatial perspective-taking score (a composite score of the Road Map Test and the Spatial Orientation Test) and mean RT for mental rotation. Correlation analyses showed that higher spatial perspective-taking scores were significantly associated with shorter RTs for both the nonsense, r = −.307, t(41) = −2.07, p = .045 (two-tailed), and the human-like condition, r = −.470, t(41) = −3.41, p = .001 (two-tailed), of the mental rotation task. To test our directional hypothesis that RTs for the human-like condition are more strongly associated with spatial perspective-taking ability than RTs for the nonsense condition, we compared the two correlation coefficients using Meng et al.'s (1992) method. Consistent with our hypothesis, the analysis showed that spatial perspective-taking scores had a stronger association with RTs for human-like than for nonsense conditions, t(40) = 1.72, p = .046 (one-tailed).

Figure 3 Relations between the spatial perspective-taking score to the mean response times for the mental rotation of the nonsense (left) or human-like (right) object (N = 43). Solid lines and error bands represent regression lines and their 95% CIs. Dashed lines represent means of measures.

Discussion

To explore whether the process of mapping one's body axes onto stimuli contributes to the human-body advantage in mental rotation, we investigated the relationship between spatial perspective-taking ability and RTs for the same/different judgment of nonsense and human-like objects. First, the present study replicated the human-body advantage reported in the literature (Amorim et al., 2006; Jansen et al., 2012; Krüger et al., 2014; Makinae & Kasai, 2017; Makinae, et al., 2015; Muto & Nagai, 2020; Sayeki, 1981; Voyer & Jansen, 2016). As a novel contribution, we found that the human-body advantage occurred even when differences between the human-body and control conditions were much smaller than the previous studies. Second, and more importantly, correlation analyses revealed that spatial perspective-taking ability had a stronger association with mental rotation performance for human-like objects than for nonsense ones. These findings conform to the notion that human-like objects elicit an efficient egocentric rotation strategy in the same/different mental rotation task, leading to the human-body advantage. Although our correlational results were only a necessary condition for spatial embodiment and should be cautiously interpreted, they provided the rationale for more detailed investigations.

Mediation by Egocentric Mental Rotation

The present study provided the first evidence for the adoption of egocentric mental rotation in the same/different judgment of human-like objects. The idea that egocentric mental rotation is elicited, even for a same/different mental rotation task, seems inconsistent with previous evidence suggesting that object-based and egocentric mental rotation strategies are adopted selectively for the same/different judgment of a pair of rotated human bodies and the left/right judgment of a single rotated human body, respectively (e.g., Habacha et al., 2018; Voyer et al., 2017; Zacks et al., 2000; Zacks & Tversky, 2005). For example, RTs for left/right judgment were shorter and less dependent on stimulus orientation (e.g., Habacha et al., 2018; Voyer et al., 2017; Zacks et al., 2000; Zacks & Tversky, 2005), more associated with spatial perspective-taking ability, and less associated with object-based mental rotation ability, as measured by Vandenberg and Kuse's (1978) Mental Rotation Test (Zacks et al., 2000), and differently affected by stimulus size (Habacha et al., 2018) compared to the same/different judgment, all of which supported the task dependence of the mental rotation strategy. More critically, Zacks and Tversky (2005) demonstrated how performance on the same/different judgment of human bodies deteriorated when participants were instructed to use egocentric mental rotation as opposed to when they were instructed to use object-based mental rotation or when no instruction was given. Nonetheless, people may flexibly switch strategies (e.g., they could execute egocentric mental rotation only in trials in which it is effective) or combine both object-based and egocentric mental rotation within a trial rather than rely on a single strategy. Also, only people who are good at egocentric mental rotation may benefit from spatial embodiment, while others may persist in object-based mental rotation. Although these accounts are still speculative and premature, the present study suggests that human likeness of stimuli may elicit egocentric mental rotation even in the same/different task.

An alternative account is that general spatial ability, rather than spatial perspective-taking ability, contributes to the human-body advantage, given correlations among different spatial abilities (e.g., Hegarty & Waller, 2004; Kozhevnikov & Hegarty, 2001). The present findings alone cannot rule out this possibility because we measured only spatial perspective-taking ability using the Road Map Test and the Spatial Orientation Test. However, this alternative seems less plausible because the object-based mental rotation score (measured by Vandenberg and Kuse's Mental Rotation Test) and the spatial perspective-taking score (measured by the Road Map Test and the Spatial Orientation Test) distinctively accounted for performance on the same/different and left/right judgment tasks (Zacks et al., 2000) and because lower object-based mental rotation performance was associated with a larger human-body advantage (Makinae & Kasai, 2017). Thus, the general spatial ability proposition may not adequately explain the present results.

Limitations

The present study has three limitations. First, since the present experiment used only cube stimuli to which a nonsense or human-like pattern was added, it is still unclear to which extent the present findings can be generalized to other stimuli, including more realistic human bodies and familiar objects other than human bodies. Recently, Muto and Nagai (2020) found that not only human-like objects but also snake-shaped cubes with a snake face can be rotated quickly (seemingly via spatial embodiment), implying that the facilitation effect is not limited to human bodies. Given that the bodily projections are feasible for any object with at least two asymmetrical intrinsic axes (Muto et al., 2019), the spatial perspective-taking ability could contribute to a mental rotation task with other kinds of stimuli.

Second, since the present study targeted only female participants, potential sex differences in spatial embodiment were unexplored. In particular, given sex differences in mental rotation strategies (e.g., Heil & Jansen-Osmann, 2008), and their potential effects on the human-body advantage (e.g., Makinae et al., 2015; Makinae & Kasai, 2017; Voyer & Jansen, 2016), we cannot rule out that the mechanisms of the human-body advantage differ across sexes, necessitating further investigations including both sexes. However, recent evidence indicates that observed sex differences in mental rotation tasks can be explained largely by psychosocial and methodological factors such as gender stereotypes, motivation, task and item structures, and participants' academic specialization (e.g., Boone & Hegarty, 2017; Moè, 2016; Sanchis-Segura et al., 2018). Therefore, the present findings seem likely to hold for both sexes as long as such factors are properly controlled.

Third, the extent to which spatial embodiment contributes to an overall human-body advantage remains unexplained. Given the involvement of whole-body motor simulation in bodily projections entailed by spatial perspective-taking (e.g., Kessler & Thomson, 2010; Muto et al., 2018), a spatial embodiment might not be independent of (or interact with) motoric embodiment or mental emulation of the posture of stimuli. To address this issue, future research should conduct more sophisticated experiments isolating the spatial and motoric embodiment components.

Implications for Neurocognitive Research

The present findings are expected to play an important role in deepening our understanding of the neural substrates involved in visuospatial transformations and body representations. In a recent event-related potential study, Jansen et al. (2020) found time-dependent distinctions of brain activations in frontal, central, and parietal regions between real human bodies and abstract cube objects, using the same stimuli as used by Amorim et al. (2006). Although their findings are interesting, it is still unclear which feature of stimuli is critical for the human-body advantage and distinctive brain activations (e.g., contour smoothness, luminance contrast, presence of spatial cues, and/or human-likeness). This limitation may be overcome by the present minimal stimulus manipulation (cf. Perruchoud et al., 2016, who showed that small visual differences can trigger different brain activations in another mental rotation task). More specifically, considering the role of egocentric mental rotation in the human-body advantage, neural activities for same/different judgments of human-like and nonsense objects are predicted to overlap with the contrasts between object-based and egocentric mental rotation, respectively. While previous studies on neural distinctions between object-based and egocentric mental rotation yielded somewhat mixed results, probably due to methodological differences, candidate regions responsible for egocentric mental rotation are the left parietal–temporal–occipital junction and the right parieto-occipital sulcus extending into the retrosplenial cortex (e.g., Lambrey et al., 2012; Wraga et al., 2005; for a review, see also Zacks & Michelon, 2005). Larger activation in such regions can be expected to be associated with shorter overall RT and/or a shallower slope for same/different judgments of human-like objects.

From an applied perspective, the present findings are potentially useful for clinical assessments and interventions, because mental rotation is a helpful tool to assess neuropsychological deficits and evaluate interventions (e.g., Lee et al., 1998; Scandola et al., 2019). Specifically, the simple inclusion of objects with human-like features would allow us to ameliorate visuospatial assessments by considering a patient's spatial perspective-taking ability and to establish customized therapeutic approaches according to each patient's mental spatial ability. Relatedly, our recent work (Muto et al., 2020) administered a paper-and-pencil mental rotation test with almost the same objects as the present study to elderly people aged 86–97 years and revealed that performance for human-like objects was better and more preserved with age than for nonsense objects. This suggests a neural basis for the mental rotation of human-like objects that is partially different from that of the mental rotation of nonsense objects in a way that compensates for the age-related decline of the object-based mental rotation ability. On the basis of the present findings, further studies are needed to establish the neuropsychological significance of the human-body advantage.

References

1We have reported degrees of freedom and p values corrected by Chi-Muller's ε to guard against sphericity violations.