On the Conditions Necessary for Obtaining Argument Structure Complexity Effects



Contents

By: A. René Schmauder

Shelia M. Kennison

Charles Clifton

Method

Subjects

Procedure

Materials

Results and Discussion

Conclusion

Footnotes

References:

By: A. René Schmauder

University of Massachusetts at Amherst

Shelia M. Kennison

University of Massachusetts at Amherst

Charles Clifton

University of Massachusetts at Amherst

The research reported here was supported by National Institutes of Health Grants HD-18708 and HD-07327 to the University of Massachusetts and by a Mount Holyoke College Alumnae Fellowship to A. René Schmauder.

Correspondence may be addressed to: A. René Schmauder, Department of Psychology, University of Massachusetts at Amherst, Amherst, Massachusetts 01003. Electronic mail may be sent to rene.schmauder@psych.umass.edu.

Shapiro and colleagues (Shapiro & Levine, 1990; Shapiro, Zurif, & Grimshaw, 1987, 1989) have recently reported that a cross-modal lexical decision (CMLD) task is sensitive to the amount of thematic information associated with the lexical entries of verbs. They consistently found longer lexical decision (LD) times in a CMLD task when an LD probe was presented visually at the offset of an auditorily presented verb that had more argument structure frames (thematic information) associated with its lexical entry compared with verbs with fewer argument structure frames. Shapiro et al. (1987, 1989) called this the "argument structure complexity effect. "

In an attempt to determine whether the argument structure complexity effect was generalizable to other on-line paradigms, Schmauder (1991) ran four experiments, including one using the CMLD paradigm of Shapiro et al. (1987). In none of these experiments were argument structure complexity effects evident, so Schmauder concluded that the effect might not generalize easily.

More recently, Shapiro, Brookins, Gordon, and Nagel (1991) reported results replicating the argument structure complexity effect reported by Shapiro et al. (1987, 1989) and Shapiro and Levine (1990). Furthermore, Shapiro et al. (1991) provided evidence that Schmauder's (1991) failure to find the argument structure effect with the CMLD paradigm was due to the nature of the secondary LD probe items used by Schmauder: Schmauder's probes were monosyllabic and relatively uniform in frequency and length compared with Shapiro and colleague's probes, which were multisyllabic and less uniform in frequency and length. Shapiro et al. (1991) concluded that Schmauder's probes permitted "subjects to perform the secondary task with little effort" (p. 993) and that the LD task "must be made difficult enough to yield significant interference, allowing thematic effects to be exposed" (p. 993).

The present observation reports the results of an experiment that failed to exhibit an effect of verbs' argument structure complexity (thematic complexity) with secondary LD probes used by Shapiro et al. (1987). First, we present a description of differences in materials across experiments. As noted above, the first to report the argument structure complexity effect were Shapiro et al. (1987) using a CMLD paradigm. Five verb categories were used: (a) transitives, (b) nonalternating datives, (c) alternating datives, (d) two-complements, and (e) four-complements (see Shapiro et al. , 1987, or Schmauder, 1991, for a detailed description of differences between the verb categories). An argument structure complexity effect would result in LD times to targets presented after four-complement verbs being longest and LD times to targets presented after transitive verbs being shortest. Sentences in which these verbs were embedded consisted of a subject noun phrase (NP) modified either by a relative clause or by a prepositional phrase (PP), followed first by the critical verb and then by an object NP (and some continuation).

Schmauder's (1991) experiments used three paradigms: (a) eye movement recording as subjects read sentences, (b) cross-modal naming, and (c) CMLD. In all three paradigms, Shapiro et al. 's (1987) verbs were used, with one extra verb for each verb category added for counterbalancing purposes. Schmauder created new sentences, similar to Shapiro et al. 's in form, in which the verbs were embedded. Also, Schmauder used new secondary LD probes, as described above.

Shapiro et al. (1991) used a subset of Schmauder's (1991) sentences (those with a subject NP modified by a relative clause) in a CMLD task. In their Experiments 1A and 1B, Shapiro et al. used probe items with similar characteristics to those used by Shapiro et al. (1987) and found the argument structure complexity effect. In Experiments 2A and 2B, Shapiro et al. used Schmauder's probes, presented either all in uppercase (Experiment 2A) or in alternating uppercase and lowercase (Experiment 2B).

xlm1761188tbl1a.jpg

Table 1 summarizes the results of the CMLD experiments presented in Shapiro et al. (1987, 1989, 1991) and Schmauder (1991).

The experiment reported in this observation used all of Schmauder's (1991) sentence materials. The tapes used in the present experiment were the tapes used in Schmauder's (1991) CMLD experiment, Experiment 4. The secondary LD probe set was varied as a between-subject variable. One probe set contained Schmauder's probes. The second probe set contained Shapiro et al. 's (1987) probes. 1 As was noted in Shapiro et al. (1991), the probes from Shapiro et al. (1987) were multisyllabic and came from a wider frequency range than did the single-syllable probes used by Schmauder. Schmauder's LD targets were 4-6 letters long, with frequency between 50 and 110 words per million (Francis & Kuçera, 1982). 2 Shapiro et al. 's (1987) LD targets were 5-9 letters long, with frequency between 56 and 312 words per million (Francis & Kuçera, 1982). We predicted that if Schmauder's failure to replicate Shapiro et al. (1987) resulted from using secondary probes that were less complex than those used by Shapiro et al. , we would find an effect of probe set in the present experiment when the Shapiro et al. probes were used.

Method

Subjects

Eighty members of the University of Massachusetts community were given course credit to participate in the experiment. All were native English speakers and reported normal vision and hearing.

Procedure

Subjects were instructed extensively before the experiment. They were told that sentences would be presented over the headphones and that at some point during each presentation, a word would appear on the computer screen in front of them while the sentence continued to play over the headphones. The task was to decide as quickly and as accurately as possible whether the target word was an English word or a nonsense word. Subjects pulled one of two triggers with their index fingers to respond to the LD target, and they were told to respond as quickly as possible to the LD target. Particularly slow subjects were informed of their performance and encouraged to respond faster. Furthermore, subjects were told that a true-false comprehension question would appear on the screen after each sentence and that accuracy rather than speed was important in responding to these questions, which was done by pulling one of the two triggers. The right trigger was used for yes and true responses, and the left trigger was used for no and false responses. At breaks between blocks, feedback was given on percentage correct for LD responses and comprehension questions.

Details of the experimental design were identical to those described in Schmauder (1991), the only difference being the addition of the probe-set variable. Type of LD target (either Schmauder's, 1991, set or Shapiro et al. 's, 1987, set) was between-subjects factor. Half of the subjects saw only monosyllabic LD targets. Half of the subjects saw only multisyllabic LD targets. Two within-subject factors were used: verb category and sentence frame. There were five levels of verb category (transitive, nonalternating dative, alternating dative, two-complement, and four-complement) and two levels of sentence frame (prepositional phrase and relative clause). Five pairings of verb to LD target were used. LD targets did not form plausible completions with the sentential verbs and did not relate semantically to the sentences. Four randomized orders of presentation were used, for a total of 20 testing conditions.

Materials

The sentence materials and LD probe sets used are described above. Nonwords for the LD targets were created by taking words similar to the LD probes for each probe set and altering the words by changing one letter. The resulting nonwords conformed to rules of English orthography.

Results and Discussion

As an initial step in the analysis of results, data points greater than two standard deviations from a subject's mean were eliminated. This procedure is equivalent to that used by Shapiro and colleagues (1987, 1989, 1991), in which data points lying more than two standard deviations from a subject's mean were replaced with that subject's mean score. This manipulation affected approximately 5% of each set of data. Note that Schmauder (1991) used reaction time (RT) cutoffs of 1,000 ms, 200 ms, and three standard deviations. The present pattern of results is not altered by using these stricter cutoffs.

Mean LD latencies and error rates are provided in

xlm1761188tbl2a.jpg

Table 2. Subjects' accuracy in the LD task and on the comprehension questions was always above 90%, so we are confident that subjects followed instructions and comprehended sentences adequately.

Analyses of variance (ANOVAs) on both accuracy of LD responses and RTs to LD probes were run with subjects and with items treated as random variables. We first present and discuss ANOVAs for the RT data. The ANOVAs of LD RTs showed that the main effect of LD target group was non-significant, F1(1,78) = 0. 629, p > . 5, MSe = 108,923, F2(1,50) = 0. 503, p > . 5, MSe = 2,028. The verb category main effect also was not significant, F1(4,312) and F2(4,50) < 1 F1 MSe = 4,778, F2 MSe = 2,028). Furthermore, there was no main effect of sentence frame, F1(1,78) = 0. 284, p < . 60, MSe = 5,634, F2(1,50) = 0. 9825, p < . 67, MSe = 1,315. The Verb Category × Sentence Frame interaction was significant by subjects but not by items, F1(4,312) = 3. 05, p < . 017, MSe = 5,698, F2(4,50) = 1. 67, p < . 17, MSe = 1,315. Finally, the LD Target Set × Verb Category × Sentence Frame interaction failed to reach significance, F1(4,312) < 1, MSe = 5,698, F2(4,50) = 1. 4, MSe = 1,315. 3()()

Although the only hint of any effect in the overall analysis was the interaction of verb category with sentence frame, because Shapiro et al. (1991) found probe-set differences, we ran separate ANOVAs for the two LD target sets. These analyses yielded results like those from the overall analysis. In Experiment Set 1, LD targets were those previously used by Schmauder (1991), and ANOVAs yielded the following results. The verb category main effect was nonsignificant, F1(4,156) and F2(4,25) < 1 F1 MSe = 4,274, F2 MSe = 909). There was no main effect of sentence frame, F1( 1, 39) and F2(1,25) < 1 F1 MSe = 5,135, F2 MSe = 1,202). The Verb Category × Sentence Frame interaction approached significance by subjects but not by items, F1(4,156) = 2. 35, p < . 056, MSe = 4,594, F2(4,25) = 1. 82, p < . 16, MSe = 1,202.

In Experiment Set 2, LD targets were those previously used by Shapiro et al. (1987). The verb category main effect was nonsignificant, F1(4,156) and F2(4,25) < 1 F1 MSe = 5,283, F2 MSe = 3,147). The main effect of sentence frame by subjects also failed to reach significance, F1(1,39) = 0. 4201, p < . 53, MSe = 6,131, F2(1,25) = 1. 27, p < . 27, MSe = 1,428. The Verb Category × Sentence Frame interaction was not significant, F1(4,156) = 1. 42, p < . 23, MSe = 6,802, F2(4,25) = 1. 23, p < . 32, MSe = 1,428.

In the analyses of accuracy data, the only effect that reached significance was the Sentence Frame × Probe Set interaction, which was significant by subjects, F1(1,78) = 5. 085, p < . 03, MSe = 0. 0079, and neared significance by items, F2(1,50) = 3. 246, p < 0. 07, MSe = 0. 0018. Subjects seemed to respond more accurately to the Schmauder (1991) LD probes when they occurred as subjects heard relative clause sentences (98% accuracy) than when these probe words occurred as subjects heard prepositional phrase sentences (96% accuracy), whereas subjects responded more accurately to the Shapiro et al. (1987) LD probes during prepositional phrase sentences (97% accuracy) than during relative clause sentences (96% accuracy). We believe that this resulted from idiosyncratic relations between LD probes and sentence contexts.

In the accuracy data, the Verb Category × Sentence Frame interaction approached significance by subjects, F1(4,312) = 2. 07, p < . 08, MSe = . 0081, but not by items, F2(4,50) = 1. 46, p > . 2, MSe = . 0018. This small, nonsignificant tendency reflected slightly higher accuracy to probes after transitives and two-complements in prepositional phrase contexts and a lack of such a tendency in relative clause contexts. Note that for all of these accuracy effects, the range in errors is from 2% to 5%.

The RT results suggest that regardless of the type of LD target, verb category did not consistently influence time to perform the LD task. Furthermore, both types of LD targets showed similar patterns of results for the two within-subject factors. The only significant effect found in the three sets of analyses of RT data was the Verb Category × Sentence Frame interaction, which was found to be nearly significant in the subjects' analysis for the monosyllabic LD target group and to be significant for the combined group of both types of LD targets. The significant interaction appears to be due to fast responses to LD targets following transitive verbs that were presented in prepositional phrase sentence contexts.

Calculating the 95% confidence interval based on the MSe from the subjects' analysis yielded a value of 23. 4. This indicates that, in the prepositional phrase sentences, RTs for LD targets following the transitive verbs were significantly faster than those for targets following nonaltenating dative verbs, two-complement verbs, and four-complement verbs. For the relative clause sentences, there were no significant differences between verb categories. Although faster responses to targets presented after transitives in prepositional phrase contexts is remotely suggestive of an argument structure complexity effect, the overall results of this experiment do not support such a conclusion. No differences appeared between RTs for probes that follow two-complement verbs and RTs for probes that follow four-complement verbs. No such pattern was evident in the relative clause materials. Results for the Shapiro et al. (1991) probes did not reflect argument structure complexity effects despite the fact that we had ample power in the present experiment to detect an effect the size of that reported by Shapiro et al. A power calculation showed that

1 − β was = . 95 to detect an argument structure complexity effect the size of Shapiro et al. 's (φ = 1. 94).

Conclusion

Clearly, something other than differences between secondary LD probes alone is responsible for the difference between the results reported by Shapiro et al. (1991) and those of the present experiment. We did not find argument structure complexity effects with probes used by Shapiro et al. (1987, 1989), which had the characteristics suggested by Shapiro et al. (1991) to be necessary for detecting verb complexity. This was true even though the Shapiro et al. (1987, 1989) probes were similar on relevant dimensions to the Shapiro et al. (1991) probe items that enabled detection of verb complexity.

Furthermore, even if some results are attributable to characteristics of the secondary probes, the effects did not generalize to eyetracking (Schmauder, 1991). This shows that complexity caused by the number of argument structure frames does not result in a processing cost evident during normal, silent reading, whereas effects of factors like length, frequency, and syntactic complexity are evident during silent reading (Henderson & Ferreira, 1990; Rayner, Sereno, Morris, Schmauder, & Clifton, 1989).

From the data reported by Schmauder (1991) and the results of the present study we can draw some conclusions. The argument structure frame complexity of verbs that are being read does not influence eye movements. Argument structure complexity effects are not consistently detectable with the one task, the CMLD task, which has been reported to be sensitive to the effect. Shapiro et al. 's (1991) proposal that secondary probes must be complex enough to place a significant processing load on subjects for the effect to be observed does not explain the disparity between Shapiro et al. 's and Schmauder's results in a straightforward way. Also, although it is possible that a verb's argument structure is accessed at some point during sentence processing (see Canseco-Gonzalez, Shapiro, Zurif, and Baker, 1990, for evidence of use of argument structure in an off-line task), the data reported by Schmauder (1991) and the results of the present study suggest that at least one of the following may be the case: (a) that immediate access of argument structure does not always impact verb processing, (b) that argument structure is not accessed immediately when a verb is processed, or (c) that the only on-line task that is sensitive to the effect interacts with characteristics of materials, instructions, or subjects in some way that has not yet been discovered. In summary, we feel that our results imply that a more complex argument structure does not necessarily increase sentence processing difficulty and that the technique that Shapiro and his colleagues developed for exploring argument structure has some limitations on its generality that are not yet understood.

Footnotes

1

We thank L. Shapiro for providing the set of lexical decision probes used in Shapiro, Zurif, and Grimshaw (1987).

2

Actually, because of an error, one probe was of frequency 12 words per million, although its verb sense was within the specified frequency range.

3

Shapiro, Brookins, Gordon, and Nagel (1991) grouped transitive, nonalternating dative, and alternating dative verbs into one subexperiment and two-complement and four-complement verbs into another subexperiment for theoretical reasons. As the focus of the present experiment is on the existence of any consistent differences among verb categories and not on the theoretical shift articulated by Shapiro et al. , we analyzed all verb categories together in our primary analysis. However, we also analyzed our data separately for the two groups of verb categories. Analyses with subjects and items treated as random factors showed no effect of probe set, verb category, or sentence frame for the transitive, nonalternating dative, and alternating dative verbs. Verb category (transitive, nonalternating dative, alternating dative) interacted with sentence frame (prepositional phrase, relative clause), F12, 156 = 1. 67, p < . 001, F22, 15 = 4. 4, p < . 03

, because LD responses in prepositional phrase contexts were fastest for probes following transitive verbs but in relative clause contexts were slowest following transitive verbs. For the two-complement and four-complement verbs, all Fs were less than one. In short, there were no consistent differences among verb categories attributable to argument structure complexity.

References:

1. Canseco-Gonzalez, E., Shapiro, L. P., Zurif, E. B. & Baker, E. (1990). Predicate-argument structure as a link between linguistic and non-linguistic representations. Brain and Language, 39, 391-404.

2. Francis, W. N. & Kuçera, H. (1982). Frequency analysis of English usage. Boston, MA: Houghton-Mifflin.

3. Henderson, J. M. & Ferreira, F. (1990). Effects of foveal processing difficulty on the perceptual span in reading: Implications for attention and eye movement control. Journal of Experimental Psychology: Learning, Memory, and Cognition, 16, 417-429.

4. Rayner, K., Sereno, S. C., Morris, R. K., Schmauder, A. R. & Clifton, C. (1989). Eye movements and on-line language comprehension processes. Language and Cognitive Processes, 4, SI 21-49.

5. Schmauder, A. R. (1991). Argument structure frames: A lexical complexity metric? Journal of Experimental Psychology: Learning, Memory, and Cognition, 17, 49-65.

6. Shapiro, L. P., Brookins, B., Gordon, B. & Nagel, N. (1991). Verb effects during sentence processing. Journal of Experimental Psychology: Learning, Memory, and Cognition, 17, 983-996.

7. Shapiro, L. P. & Levine, B. A. (1990). Verb processing during sentence comprehension in aphasia. Brain and Language, 38, 21-47.

8. Shapiro, L. P., Zurif, E. & Grimshaw, J. (1987). Sentence processing and the mental representation of verbs. Cognition, 27, 219-246.

9. Shapiro, L. P., Zurif, E. & Grimshaw, J. (1989). Verb processing during sentence comprehension: Contextual impenetrability. Journal of Psycholoinguistic Research, 18, 223-243.

Received: January 18, 1991. Revised: April 8, 1991. Accepted: April 23, 1991.


Copyright 1991 by the American Psychological Association, Inc.
For personal use only--not for distribution
This publication is protected by US and international copyright laws and its content may not be copied without the copyright holder's express written permission except for the print or download capabilities of the retrieval software used for access. This content is intended solely for the use of the individual user.
Source: Journal of Experimental Psychology: Learning, Memory, and Cognition. Vol. 17 (6) November 1991, pp. 1188-1192
Accession Number: xlm1761188 Digital Object Identifier: 10.1037//0278-7393.17.6.1188

 

Top of Page


BackBack