{"id":123,"date":"2007-09-04T15:05:07","date_gmt":"2007-09-04T13:05:07","guid":{"rendered":"http:\/\/www.dan.sperber.fr\/?p=123"},"modified":"2018-04-24T20:57:48","modified_gmt":"2018-04-24T19:57:48","slug":"the-why-and-how-of-experimental-pragmatics-the-case-of-%e2%80%98scalar-inferences%e2%80%99","status":"publish","type":"post","link":"https:\/\/www.dan.sperber.fr\/?p=123","title":{"rendered":"Ira Noveck &#038; Dan Sperber (2007) <b>The why and how of experimental pragmatics: The case of \u2018scalar inferences\u2019<\/b>. In N. Burton-Roberts (ed.), <i>Advances in Pragmatics<\/i>. (Palgrave)"},"content":{"rendered":"<p style=\"padding-left: 60px;\">&#8220;Although a few pioneers in psycholinguistics had, for more than twenty years, approached various pragmatic issues experimentally, it is only in the past few years that investigators have begun employing the experimental method in testing pragmatic hypotheses (see Noveck &amp; Sperber 2004). We see this emergence of a proper experimental pragmatics as an important advance with a great potential for further development. In this chapter we want to illustrate what can be done with experimental approaches to pragmatic issues by presenting one case, that of so-called \u2018scalar inferences\u2019, where the experimental method has helped sharpen a theoretical debate and has provided uniquely relevant evidence&#8230;&#8221; [<a href=\"https:\/\/www.dan.sperber.fr\/wp-content\/uploads\/2007_noveck_the-why-and-how-of-experimental-pragmatics.pdf\">PDF version<\/a>]<!--more--><\/p>\n<p style=\"text-align: center;\"><strong>Ira Noveck &amp; Dan Sperber<br \/>\n<strong><em>The why and how of experimental pragmatics: The case of \u2018scalar inferences\u2019<\/em><\/strong>.<br \/>\n<span style=\"font-weight: normal;\">In Noel Burton-Roberts (ed) <em>Advances in Pragmatics<\/em>. Palgrave. 2007<\/span><\/strong><\/p>\n<p style=\"text-align: center;\"><strong><br \/>\n<\/strong><\/p>\n<p>Although a few pioneers in psycholinguistics had, for more than twenty years, approached various pragmatic issues experimentally, it is only in the past few years that investigators have begun employing the experimental method in testing pragmatic hypotheses (see Noveck &amp; Sperber 2004). We see this emergence of a proper experimental pragmatics as an important advance with a great potential for further development. In this chapter we want to illustrate what can be done with experimental approaches to pragmatic issues by presenting one case, that of so-called \u2018scalar inferences\u2019, where the experimental method has helped sharpen a theoretical debate and has provided uniquely relevant evidence. We will focus on work done by the first author and his collaborators or work closely related to theirs, but other authors have also made important contributions to the topic (e.g. Papafragou and Musolino, 2003; Guasti, Chierchia, Crain, Foppolo, Gualmini, &amp; Meroni, 2005; De Neys &amp; Schaeken, in press).<\/p>\n<p><strong>Methodological background: the limits of pragmatic intuitions as evidence<\/strong><\/p>\n<p>Theoretical work in pragmatics relies heavily \u2013 often exclusively \u2013 on pragmatic intuitions. These are rarely complemented with observational data of a kind more common in sociologically oriented pragmatics. The use of statistical data from corpuses and from experiments is even less common. This situation results partly from the fact that most theoretical pragmatists have been trained in departments of linguistics where, quite often, linguistic intuitions are the only kind of data considered. Optimally, of course, one would want pragmatists to use whatever kind of data that may significantly confirm or disconfirm hypotheses. Moreover, a sensible methodological pluralism is not the only reason to diversify the types of evidence used in pragmatics. There are also principled limits to the use of pragmatic intuitions.<\/p>\n<p>It makes sense (even if it is not uncontroversial) to judge a semantic description by its ability to account for semantic intuitions. Of course, the use of semantic intuitions and of linguistic intuitions generally, raises methodological problems and calls for methodological caution. For instance, a linguist\u2019s intuitions may be biased by prior theoretical commitments. Also, one may mistake what are in fact pragmatic intuitions for semantic ones (as, Grice argued, ordinary language philosophers systematically did). Still, there are good reasons why semantic intuitions are so central to semantics. Semantic intuitions are not just about semantic facts; they are semantic facts themselves. For instance, the intuition that sentence (1) entails (2) is not about some semantic property that this sentence would have anyhow, regardless of its accessibility to speakers\u2019 intuitions.<\/p>\n<p>(1) John knows that it is raining<\/p>\n<p>(2) it is raining<\/p>\n<p>Rather, for (1) to have the meaning it has is, among other aspects, to be intuitively understood as entailing (2). A semantic analysis of linguistic expressions that accounts for all the speaker-hearer\u2019s semantic intuitions about these expressions may not be the best possible analysis, but it is descriptively adequate (in Chomsky\u2019s sense \u2013 an explanatory adequate description of the semantics of a given language, on the other hand, involves hypotheses about the capacities that make the acquisition of this semantics possible, and here observational and experimental evidence should be of relevance).<\/p>\n<p>The use of pragmatic intuitions raises the same methodological problems as does the use of semantic intuitions and then some. It is a mistake to believe that pragmatic intuitions of the kind used in pragmatics are data of the same kind as semantic intuitions used in semantics. Genuine pragmatic intuitions are those that addressees have about the intended meaning of an utterance addressed to them. Quite generally, pragmatic intuitions invoked in theoretical pragmatics are not about actual utterances addressed to the reader of a pragmatic article, but about hypothetical cases involving imaginary or generic interlocutors. Pragmatic intuitions on hypothetical utterances have proved useful in a variety of ways, but it is important to keep in mind that these are not about how an utterance is interpreted, but about how an utterance would be interpreted if it were produced in a specific situation by a speaker addressing a listener, with referring expressions having actual referents, and so on. These intuitions are educated guesses \u2013 and, no doubt, generally good ones \u2013 about hypothetical pragmatic facts, but are not themselves pragmatic facts and they may well in error. That is, we may be wrong about how, in fact, we would interpret a given utterance in a given context.<\/p>\n<p>Besides helping compensate for the inherent limits of pragmatic intuitions, an experimental approach can provide crucial evidence when deciding between alternative theories that may agree on the content of the interpretations of utterances, but that have different implications regarding the cognitive mechanisms through which these interpretations are arrived at. Of course, for their contribution to be of value, experimentalists must conform to fairly strict methodological criteria and measure just what they are intent on measuring\u2014typically the effect of one \u2018independent\u2019 variable on another \u2018dependent\u2019 variable without other uncontrolled variables affecting the results. We will show how this plays out in the study of \u2018scalar inferences\u2019.<\/p>\n<p>Theoretical background: Scalar implicatures as Generalised Conversational Implicatures (GCIs)<\/p>\n<p>The experiments we will present are relevant to the study of so-called \u2018scalar implicatures.\u2019 Here we just remind readers of the main features of the Gricean and neo-Gricean account of scalar implicatures, and focus on the claim that scalar implicatures are Generalized Conversational Implicatures, or GCIs. Scalar implicatures are illustrated by cases such as (3a) which is said to implicate (3c), or (4a) said to implicate (4c):<\/p>\n<p>(3) (a) It is possible that Hillary will win<br \/>\n(b) It is certain that Hillary will win<br \/>\n(c) It is not certain that Hillary will win<\/p>\n<p>(4) (a) Some of the guests have arrived<br \/>\n(b) All of the guests have arrived<br \/>\n(c) Not all of the guests have arrived<\/p>\n<p>Proposition (3b) is more informative than (3a), which it entails. If the more informative proposition would make a greater contribution to the common purpose of the conversation, then, a speaker obeying Grice\u2019s first Maxim of Quantity (\u201cMake your contribution as informative as is required\u201d) would be expected to express it unless she could not do so without violating the Supermaxim of Quality (\u201cTry to make your contribution one that is true\u201d). Hence, on a Gricean account, a speaker stating (3a) typically implicates (3c) (i.e., the negation of (3b)). For the same reasons, a speaker stating (4a) typically implicates (4c) (i.e., the negation of (4b)).<\/p>\n<p>Such implicatures are described as \u2018scalar\u2019 because, according to an account developed by neo-Griceans and in particular Lawrence Horn (1972), the derivation of these implicatures draws on pre-existing linguistic scales consisting in a set of alternate terms or expressions ranked by order of informativeness; &lt;possible, certain&gt; and &lt;some, all&gt; are examples of such scales. When a less informative term is used in an utterance in a way that does not satisfy the first maxim of quantity, the speaker can be taken to implicate that the proposition that would have been expressed by the use of a stronger term in the scale is false. This account of implicatures such as those carried by (3a) or (4a) extend to a wide variety of cases and has some intuitive appeal. It should not be seen however as obviously correct or without alternatives. In particular, its implications for processing are less attractive. According to such an account, the inference from the utterance to its scalar implicature goes through a consideration not just of what the speaker said and the context but also of what the speaker might have said but did not. It is this type of onerous inference that makes the Gricean account of implicature derivation seem implausible from a cognitive and developmental point of view.<\/p>\n<p>Levinson draws on another idea of Grice, that of Generalized Conversational Implicatures, to offer an account that might provide a solution to the problem posed by the derivational complexity of scalar implicatures. Grice noted that some implicatures are generally valid (from a pragmatic rather than logical point of view, of course) and therefore could be inferred without consideration of the context, except in cases where the context happens to make them invalid. Grice contrasted these Generalized Conversational Implicatures with Particular Conversational Implicatures, which are valid only in specific contexts. In his book Presumptive Meanings: The Theory of Generalized Conversational Implicatures (Levinson 2000), Levinson elaborates Grice\u2019s original and somewhat vague notion. For Levinson, GCIs are default inferences, that, is, inferences that are automatically generated and that can be cancelled if there are contextual reasons to do so. Levinson treats scalar implicatures as paradigmatic cases of GCIs (whereas Grice\u2019s own examples of GCIs don\u2019t include scalar implicatures). This has the advantage of making the inference of these implicatures a relatively light one-step process, which needs to access neither contextual premises nor the full Gricean rationale for their derivation.<\/p>\n<p>Levinson\u2019s own rationale for GCIs so conceived has to do with the optimization of processing. The existence of GCIs speeds up the process of communication that is slowed down, Levinson argues, by the need for phonetic articulation: some unencoded aspects of the speaker\u2019s meaning can be inferred from metalinguistic properties of the utterance such as the choice of a given word from among a set of closely related alternatives. For instance the speaker\u2019s choice of \u201csome\u201d rather than the stronger \u201call\u201d in (4a) (\u201cSome of the guests have arrived\u201d) justifies inferring that (4c) is part of her meaning. These are non-demonstrative inferences, or course. There are cases where these inferences are invalid. For instance, if it were contextually established that the speaker of (4a) has only partial information about the arrival of the guests, then (4c) would not be part of her meaning. Still, given that GCIs are valid in most contexts (or so it is assumed), the overall speeding up of communication made possible by the automaticity of GCIs is not compromised by the rare cases where contextual considerations force the hearer to countermand them.<\/p>\n<p>The theory of scalar implicatures as default GCIs combines four claims:<\/p>\n<p>(a)\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 These inferences are made by default, irrespective of the context, and cancelled when the context demands<\/p>\n<p>(b)\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 The fact that these inferences are made by default adds to the speed and efficiency of communication<\/p>\n<p>(c)\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 These inferences contribute implicatures to the interpretation of the utterance, as opposed to contributing enrichments of its explicit content (\u2018what is said\u2019 in Grice\u2019s terms or \u2018explicatures\u2019 in relevance theory\u2019s terms)<\/p>\n<p>(d)\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 These inferences are scalar: they exploit pre-existing scales such as &lt;some, all&gt;, &lt;or, and&gt;, &lt;possible, necessary&gt;<\/p>\n<p>We doubt all four claims. The bulk of this chapter will be devoted to explaining how experimental evidence has cast strong doubts on claim (a). First, however, we briefly present an argument that also casts doubt on (b), and we outline the relevance-theoretic approach, which is in contradiction with all four claims.<\/p>\n<p>This idea that default implicatures or GCIs would permit more efficient and speedier communication may seem sensible and capable of lending support to the whole theory. It raises however the following empirical issue. If the frequency of GCI cancellations were too high, their cost would offset the benefit of deriving GCIs by default.\u00a0 Suppose for instance that a given type of GCI had to be cancelled a third of the time. The cost of the use of such a GCI would be that of deriving it by default in all cases plus the cost of cancelling it in one third of the cases. This would have to be compared with the cost of deriving the implicature as a \u2018particularized conversational implicature,\u2019 that is, in a contextually sensitive and therefore more costly way, in two thirds of the cases, but without any cost of default derivation followed by cancellation in the other third of the cases. It is not clear that, with such frequencies, the rationale given for GCIs in term of economy would make much sense.<\/p>\n<p>To show that this kind of calculus is not unrealistic, consider the example of \u201cP or Q\u201d and its alleged GCI not (P and Q). We are not aware of any statistical data regarding the frequency of exclusive uses of \u201cor\u201d and we share the common intuition that quite often, when people utter a sentence of the form \u201cP or Q\u201d they can be taken to consider that P and Q is excluded. This exclusion however need not be part of their meaning. In most cases this exclusion follows from real world knowledge and not from the interpretation of \u201cor\u201d, as illustrated in (5)-(7):<\/p>\n<p>(5) He is a bachelor or he is divorced<\/p>\n<p>(6) Jane is in Paris or in Madrid<\/p>\n<p>(7) Bill will arrive Monday or Tuesday<\/p>\n<p>If \u201cP or Q\u201d implicates by default that not (P and Q), then, in all cases such as (5)-(7) where the two disjuncts cannot both be true for commonsense reasons, people automatically compute a GCI that causes the speaker\u2019s meaning to redundantly implicate what is already part of the common ground, and surely, this is a cost without associated benefit. Moreover, if one carefully excludes cases where mutual exclusivity of the disjuncts is self-evident and need not be communicated, and looks at cases such as (8)-(10) where neither the inclusive nor the exclusive interpretation is a priori ruled out, it is not at all obvious that the exclusive interpretation of \u201cor\u201d is dominant:<\/p>\n<p>(8)\u00a0\u00a0 She wears sunglasses or a cap<\/p>\n<p>(9)\u00a0\u00a0 Our employees speak French or Spanish<\/p>\n<p>(10)\u00a0 Bill will sing or play the piano<\/p>\n<p>We have no hard statistical data to present, but it seems less than obvious that a disposition to understand by default utterances of the form \u201cP or Q\u201d as implicating not (P and Q)\u00a0 would render communication speedier or more efficient. More generally, the effect that GCIs would have on the efficiency of communication should be investigated rather than assumed.<\/p>\n<p>Relevance theory\u2019s approach<\/p>\n<p>We will assume that the basic tenets of relevance theory are familiar (see Wilson &amp; Sperber 2004 for a recent restatement), and focus on how it applies to what neo-Griceans describe as \u2018scalar implicatures.\u2019 Two basic ideas play a crucial role here:<\/p>\n<p>(a)\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 Linguistic expressions serve not to encode the speaker\u2019s meaning but to indicate it. The speaker\u2019s meaning is inferred from the linguistic meaning of the words and expressions used taken together with the context.<\/p>\n<p>(b)\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 Inferring the speaker\u2019s explicit and implicit meaning (her explicatures and implicatures) is not done sequentially but in parallel. The final overall interpretation of an utterance results from a mutual adjustment of implicatures and explicatures guided by expectations of relevance.<\/p>\n<p>Here is a simple illustration of these two points:<\/p>\n<p>(11) Henry: Do you want to go on working, or shall we go to the cinema?<br \/>\nJane: I am tired. Let\u2019s go to the cinema.<\/p>\n<p>Jane\u2019s describing herself as \u201ctired\u201d achieves relevance as an explanation of her acceptance of Henry\u2019s suggestion. For this it must be understood that she is not just tired, but too tired to go on working, and at the same time not too tired to go to the cinema. Her used of \u201ctired\u201d serves to indicate an ad hoc concept TIRED* with an extension narrower than that of the linguistically encoded concept TIRED. Whereas TIRED extends from a minimal level of tiredness to complete exhaustion, TIRED* extends just over those levels of tiredness that explain why Jane would rather go to the cinema than work. Henry correctly understands Jane explicature to be (12) and her implicature to be (13), yielding an optimally relevant interpretation:<\/p>\n<p>(12) I am TIRED*<\/p>\n<p>(13) The reason why I would rather go to the cinema than work is that I am TIRED*<\/p>\n<p>Note that explicature (12), and in particular the interpretation of \u201ctired\u201d as indicating TIRED* is calibrated so as to justify implicature (13). The explicature therefore could only be inferred once the implicature had been tentatively assumed to be part of Jane\u2019s meaning. The overall interpretation results from a process of mutual adjustment between explicature and implicature.<\/p>\n<p>Consider now an expression typically supposed to give rise to \u2018scalar implicatures\u2019 such as \u201csome of the Xs\u201d. From a semantic point of view, \u201csome of the Xs\u201d has as its extension the set of subsets of n Xs where n is at least 2 and at most the total number of the Xs. From a relevance-theoretic pragmatic point of view, the use of an expression of the form \u201csome of the Xs\u201d, just as that of any linguistic expression, serves not to encode the speaker\u2019s meaning, but to indicate it. In particular the denotation of the concept indicated by a given use of \u201csome of the Xs\u201d may be an ad hoc concept SOME OF THE Xs* with a denotation different from that of the literal SOME OF THE Xs. Rather than ranging over all subsets of Xs between 2 and the total number of the Xs, the extension of SOME OF THE Xs* may be narrowed down at either end, or it may be extended so as to include subsets of one.<\/p>\n<p>Imagine (14) uttered in a discussion of the spread of scientific knowledge in America:<\/p>\n<p>(14) Most Americans are creationists and some even believe that the Earth is flat<\/p>\n<p>Clearly, the speaker is understood as meaning that a number of Americans much greater than two believe that the earth is flat. Two Americans with such a belief\u2014say two inmates in a psychiatric hospital\u2014would be enough to make her utterance literally true, but not, and by a wide margin, to make it relevant. Given that the speaker can be assumed to know that it is common knowledge that not all Americans believe that the earth is flat, there is no ground to assume that this is a part of her meaning (inferring it would not add any cognitive effect and it would involve a processing cost, hence it would detract from relevance). On the other hand, the speaker\u2019s contrastive use of \u201cmost\u201d and \u201csome\u201d and her use of \u201ceven\u201d make it part of her meaning that the Americans who believe the earth to be flat are fewer than those who believe in creationism (this, of course, entails that not all Americans believe that the earth is flat, but not every entailment of a speaker\u2019s meaning is part of that meaning). So the denotation indicated by \u201csome\u201d in (14) is narrower than its literal denotation at both ends: the subsets of Americans in the denotation of this occurrence of \u201csome\u201d are large enough to be relevant and hence much larger than sets of two Americans, and are smaller than the set of American creationists.<\/p>\n<p>Let us now go back to a version of example (4). Jane and Henry have invited a few friends to a dinner party. Suppose first that it was agreed that Henry would go and get the desert from the pastry shop as soon as the guests started arriving. Henry is in the garage, he hears the bell ring and then Jane shouting (15) to him:<\/p>\n<p>(15) Jane to Henry: Some of the guests have arrived<\/p>\n<p>Henry does not know whether one, many, or all the guests have arrived, or, for that matter, whether Jane has already opened the door and seen how many of them there are, and the question need not even come to his mind. What makes Jane\u2019s utterance relevant is that it implies that he should go now and this does not depend on the number of guests at the door. Henry\u2019s construal of \u201csome\u201d is compatible with any number of guests having arrived, even just one, and hence is an extended construal of \u201csome\u201d.<\/p>\n<p>Consider now a different scenario. Henry is alone in the kitchen cooking. Jane comes in and tells him (15). The consequences that Henry considers are that he should come and greet the guests and bring the finger food he has prepared as an appetizer. The value of \u201csome\u201d is taken to be a value for which these are the main consequences. If all the guests had come, what he should do would be not just to greet the guest and bring the finger food, but also and even more importantly, to put the fish in the oven and make the ultimate preparations for the meal itself. The fact that Jane\u2019s utterance achieves relevance without bringing to mind consequences more typical of the arrival of all the guests causes Henry to construe \u201csome\u201d with some vague cardinality above one and below all. Henry need not actively exclude all, he may just not even consider it. If however Henry is wondering whether all the guests have arrived, then he will take Jane\u2019s utterance to licence the inference that not all of them have. If moreover Henry had asked Jane whether all the guests had arrived, or if he knew that she knew that it was particularly relevant to him at this point in time, he would take that inference to be intended. He would also do so if she had put a contrastive stress on \u201csome\u201d, causing an extra effort and suggesting an extra effect. In other words, if there is some mutually manifest, actively represented reason to wonder whether all the guests have arrived, then (15) can be taken to implicate that not all of them have.<\/p>\n<p>From a\u00a0 relevance theory point of view, (11), (14), and (15) are just ordinary illustrations of the fact that linguistic expressions serve to indicate rather than encode the speaker\u2019s meaning and that the speaker\u2019s meanings are quite often a narrowing down or broadening of the linguistic meaning. Taking \u201csome\u201d to indicate not at least two and possibly all but at least two and fewer than all is a common narrowing down of the literal meaning of \u201csome\u201d at the level of the explicature of the utterance. It is not automatic but takes place when the consequences that render the utterance relevant as expected are characteristically carried by this narrowed down meaning.<\/p>\n<p>We are not denying that a statement of the form \u201c\u2026some\u2026\u201d may, in some cases carry an implicature of the form \u2026not all\u2026 (or, in other cases we will not discuss here, an implicature of the form \u2026some\u2026not\u2026). This occurs when the \u201c\u2026some\u2026\u201d\u00a0 utterance achieves relevance by answering a tacit or explicit question as to whether all items satisfy the predicate.The fact that it does not answer it positively implicates a negative answer and therefore a narrowed down construal of \u201csome\u201d as excluding all. Standard accounts of \u2018scalar implicatures\u2019 fail to distinguish the cases where the explicature merely entails \u2026not all\u2026 and the much less frequent cases where, moreover, the utterance implicates \u2026not all\u2026 .<\/p>\n<p>In all the cases where the meaning of \u201csome\u201d in an utterance is narrowed down so as to exclude all, this is the result of an inferential process that looks at consequences that might make the utterance relevant as expected, and that adjusts the meaning indicated by \u201csome\u201d to these consequences. In particular if what may make the utterance relevant is an implication that is true of some Xs but not of all Xs, then the meaning of \u201csome\u201d is adjusted so as to exclude all. These inferential processes result from the automatic attempt by the hearer to find an interpretation of the utterance that meets his expectation of relevance and they all follow the same heuristics. There is nothing distinctive in the way \u2018scalar\u2019 inferences are drawn. Moreover, the class of cases described in the literature as scalar inferences is characterised by an enrichment at the level of the explicature (where, for instance, \u201csome\u201d is reinterpreted in a way that excludes all) and only in a small sub-class of these is the exclusion of the more informative concept not just entailed but also implicated.<\/p>\n<p>According to relevance theory, then, so called \u2018scalar implicatures\u2019 are not scalar, nor necessarily implicatures. Of course, the notion of \u2018scalar implicature\u2019 could be redefined to fit just cases where there is an explicit or implicit question as to whether the use of a more informative expression than the one employed by the speaker (e.g. \u201call\u2019 instead of \u201csome\u201d) would have been warranted, and in such cases, a denial of a more informative claim can indeed be implicated by the use of the less informative expression. However, \u2018scalar implicatures\u2019 in this restricted sense depend on contextual premises (linked to the fact that the stronger claim was being entertained as a relevant possibility) rather than on a context-independent scale, and are not candidates therefore for the status of GCI.<\/p>\n<p>From the point of view of relevance theory then, the classical neo-Gricean theory of scalar implicatures can be seen as a mistaken generalisation of the relatively rare case where a weaker claim genuinely implicates the denial of a stronger claim that is contextually under consideration to the much more common case where the denotation of an expression is narrowed down so as to exclude marginal or limiting instances carrying untypical implications. For instance \u201cpossible\u201d as in (3a) (\u201cIt is possible that Hillary will win\u201d) is often construed as excluding, on one side, mere metaphysical possibility with very low empirical probability, and, on the other side, certainty and quasi-certainty. The trimming of \u201cpossible\u201d at both ends results in an enriched and generally more relevant meaning. Since the trimming at the very high probability end is not different from that at the very low probability end, both should be explained in the same way, ruling out the scalar aspect of the \u2018scalar implicature\u2019 account, which works, if at all, only at the upper end. On the other hand, if (3a) were uttered in reply to the question: \u201cIs it certain that Hillary will win?\u201d, then it would indeed implicate (3c) (\u201cIt is not certain that Hillary will win\u201d) because it would achieve relevance by implicitly answering in the negative a question that had been asked. From a relevance theory point of view, the two cases should be distinguished.<\/p>\n<p>This is not the place to compare in detail the GCI and the relevance-theoretic approaches. We focus rather on a testable difference in prediction between them. Levinson writes: \u201cGCI theory clearly ought to make predictions about process. But here the predictions have not yet been worked out in any detail\u201d (Levinson 2000:370). There is however one prediction about process that follows quite directly from GCI theory since it is hardly more than a restatement of some of the tenets of the theory. According to the theory, GCIs are computed by default and are contextually cancelled when needed. Both the computation of GCIs and their cancellation are processes and therefore should take each some time and effort (even if the default character of GCI should make their computation quite easy and rapid). Everything else being equal, less effort should be expended and less time taken in the normal case where a GCI is computed and not cancelled than in the exceptional case where a GCI is first computed and then cancelled. Relevance theory predicts just the opposite pattern.<\/p>\n<p>From a relevance-theoretic perspective, the speaker\u2019s meaning is always inferred, even when it consists in a literal interpretation of the linguistic expressions used. The inferences involved, however, differ in the time and effort they require. Both the sentence meaning and the context contribute to making some interpretations more easily derived than others. If only sentence meaning were involved, one should predict that the smaller the distance between it and the speaker\u2019s meaning it serves to indicate, the lesser would be the time and effort required to infer the speaker\u2019s meaning. Contextual factors, however, must be taken into account. For instance, an enriched interpretation may be primed by the context and, as a result may be easier to infer than a literal interpretation. Consider a variation of example (11):<\/p>\n<p>(16) Henry: You look tired, let\u2019s go to the cinema\u201d<br \/>\nJane:\u00a0\u00a0 I am tired, but not too tired to go on working<\/p>\n<p>A natural interpretation of Henry\u2019s utterance involves the ad hoc concept TIRED* such that being TIRED* is a sufficient reason to stop working and not a sufficient reason to stay at home. Jane could have answered, \u201cNo, I am not tired, I\u2019ll go on working\u201d meaning that she was not TIRED*. When Jane, rather, asserts that she is tired, Henry is primed to interpret \u201ctired\u201d as TIRED*. A relevant interpretation of Jane\u2019s whole utterance, however, imposes a broader, more literal and, in this situation, more effortful construal of the term.<\/p>\n<p>Even when an enriched interpretation of an utterance is not primed, it may require less processing effort than would the literal interpretation because the contextual implications that render relevant the enriched interpretation are more easily arrived at than those that would render relevant the literal interpretation. This typically occurs with metaphorical utterances: a relevant literal interpretation is often hard or even impossible to construct.<\/p>\n<p>In the absence of contextual factors that make an enriched interpretation of an utterance easier to arrive at, relevance theory predicts that a literal interpretation\u2014which involves just the attribution to the speaker of a meaning already provided by linguistic decoding\u2014should involve shallower processing and take less time than an enriched one\u2014which involves a process of meaning construction. Such is the case in particular in the experiments we describe below.<\/p>\n<p>The difference in prediction between GCI theory and relevance theory can be presented in table form:<\/p>\n<p><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"page-break-after: avoid;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<table class=\"MsoTableGrid\" style=\"border: medium none; margin-left: 50.4pt; border-collapse: collapse;\" border=\"1\" cellspacing=\"0\" cellpadding=\"0\">\n<tbody>\n<tr>\n<td style=\"border: medium none; padding: 0cm 5.4pt; width: 27pt;\" rowspan=\"3\" width=\"36\" valign=\"top\">\n<p class=\"MsoNormal\" style=\"page-break-after: avoid;\"><strong><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"page-break-after: avoid;\"><strong><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"page-break-after: avoid;\"><strong><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"page-break-after: avoid;\"><strong><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"page-break-after: avoid;\"><strong><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"page-break-after: avoid;\"><strong><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"page-break-after: avoid;\"><strong><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"page-break-after: avoid;\"><strong><span style=\"font-family: Arial;\" lang=\"EN-GB\">Interpretation of the scalar   term<\/span><\/strong><\/p>\n<\/td>\n<td style=\"padding: 0cm 5.4pt; width: 108pt;\" width=\"144\" valign=\"top\">\n<p class=\"MsoNormal\" style=\"page-break-after: avoid;\"><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<\/td>\n<td style=\"padding: 0cm 5.4pt; width: 162pt;\" width=\"216\" valign=\"top\">\n<p class=\"MsoNormal\" style=\"margin: 6pt 0cm; text-align: center; page-break-after: avoid;\" align=\"center\"><strong><span style=\"font-family: Arial;\" lang=\"EN-GB\">GCI theory<\/span><\/strong><\/p>\n<\/td>\n<td style=\"padding: 0cm 5.4pt; width: 117pt;\" width=\"156\" valign=\"top\">\n<p class=\"MsoNormal\" style=\"margin: 6pt 0cm; text-align: center; page-break-after: avoid;\" align=\"center\"><strong><span style=\"font-family: Arial;\" lang=\"EN-GB\"><span> <\/span>relevance theory<\/span><\/strong><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 0cm 5.4pt; width: 108pt;\" width=\"144\" valign=\"top\">\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><strong><span style=\"font-family: Arial;\" lang=\"EN-GB\">literal<\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<\/td>\n<td style=\"padding: 0cm 5.4pt; width: 162pt;\" width=\"216\" valign=\"top\">\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\">default enrichment + context-sensitive cancellation,<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><strong><em><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\">hence slower<\/span><\/em><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><strong><em><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/em><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><em><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/em><\/p>\n<\/td>\n<td style=\"padding: 0cm 5.4pt; width: 117pt;\" width=\"156\" valign=\"top\">\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\">no enrichment,<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><strong><em><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/em><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><strong><em><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\">hence faster<\/span><\/em><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 0cm 5.4pt; width: 108pt;\" width=\"144\" valign=\"top\">\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><strong><span style=\"font-family: Arial;\" lang=\"EN-GB\">enriched<\/span><\/strong><\/p>\n<\/td>\n<td style=\"padding: 0cm 5.4pt; width: 162pt;\" width=\"216\" valign=\"top\">\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\">default enrichment,<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><strong><em><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\">hence faster<\/span><\/em><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><strong><em><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/em><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<\/td>\n<td style=\"padding: 0cm 5.4pt; width: 117pt;\" width=\"156\" valign=\"top\">\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\">context-sensitive enrichment,<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"><span> <\/span><strong><em>hence slower<\/em><\/strong><\/span><\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%; page-break-after: avoid;\"><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p>Table 1: contrasting predictions of GCI Theory and\u00a0 relevance theory regarding the speed of interpretation of scalar term (when an enriched construal is not contextually primed)<\/p>\n<p>This difference in prediction between the two theories is of a type that lends itself to experimental investigation.<\/p>\n<p>Methodological considerations in experimental approaches to \u2018scalar inferences\u2019<\/p>\n<p>In the experimental study of case of scalar inferences,[1] one has to keep four methodological considerations in mind. To begin with, one wants to be sure that a given result (whether it be the rate of responses that indicate a pragmatic enrichment or the mean reaction time associated with an enrichment) is a consequence of the experiment\u2019s intended target and not of other contextual variables. For example, one would want to be sure that the understanding of a disjunctive statement of the form P or Q as excluding P and Q is due to the pragmatic enrichment of the term \u201cor\u201d (from an inclusive to an exclusive interpretation) and not to some other feature. Thus, one would avoid investigating utterances that invite an exclusive understanding of the situation described rather than of the description itself. In example (6) (\u201cJane is in Paris or in Madrid\u201d) above, the exclusive understanding is based on our knowledge that a person cannot be in two places at the same time and need not involve any pragmatic enrichment of the meaning of the word \u201cor\u201d. In devising experimental material, it thus becomes important to invent examples where an enriched interpretation is not imposed by extra-pragmatic considerations. One can do this by using either examples where participants knowledge is equally compatible with a literal or an enriched interpretation of a scalar term, or examples where knowledge considerations might bias participants in favour of a literal interpretation: in both cases, if one finds evidence of enrichment, one will be confident that it comes from a pragmatic inference about what the utterance meant, rather than from a mere understanding of how the world is.<\/p>\n<p>Second, one would want a paradigm that allows for two identifiable outcomes so that the presence of an enrichment can be indicated by a unique sort of response while a non-enrichment can be indicated by a different response. This is why most of the experiments on scalars described here involve a scenario that could be described by means of a more informative utterance than the test utterance (uttered by a puppet or some other interlocutor).\u00a0 Imagine for example being shown five boxes each containing a token and then being told, \u201cSome boxes contain a token.\u201d If one interprets \u201csome\u201d literally (i.e. as compatible with all), one would agree with the statement; if one enriches \u201csome\u201d so as to make it incompatible with all, one would have to disagree. In such conditions, a participant\u2019s response (agrees or disagrees) is revealing of a particular interpretation.<\/p>\n<p>Third, one wants every assurance that an effect is robust. That is, one wants to see the same result over and over again and across a variety of comparable tasks. When two similar studies (for instance two studies investigating different scalar terms but in an equivalent manner) present comparable outcomes, each strengthens the findings of the other. On the other hand, if two very similar experiments fail to produce the same general effects, something is wrong. This does not mean that negative results are necessarily fatal for an experimental paradigm . If one carefully modifies an experiment and it prompts a different sort of outcome than previous ones (and in a predictable manner), it helps determine the factors that underlie an effect. This occurs with the developmental findings to be described below, which have generally shown that children are more likely than adults to agree with a weak statement (for instance the statement \u201cSome horses jumped over a fence\u201d) when a stronger one would be pragmatically justified (because, in fact, all the horses jumped over a fence). All sorts of follow-up studies have aimed to put this effect to the test. In general, the effect has been resilient; a few studies, however, show that one can get children to appear more adult-like through specific sorts of modifications. For example, experimenters have aimed to verify the effect under conditions where participants are given training or where scenarios are modified to highlight the contrast between the weak utterance and the stronger scenario. The net result is that the outcomes of these tests collectively help identify the factors that can encourage scalar inference-making.<\/p>\n<p>Fourth, it is important for any experiment to include as many reasonable controls as possible. These are test questions that are similar to the main items of interest, but aim basically to confirm that there is nothing bizarre in the task. For example, if one finds that participants\u2019 responses indicate that they enrich \u201csome\u201d but also that the same participants endorse the use of the word \u201csome\u201d to describe a scene where \u201cnone\u201d would be appropriate, then there is something questionable about the experiment. This rarely happens (the above example is presented for illustrative purposes only), but one needs to provide assurances to oneself and to readers that such bizarreness can be ruled out. Any decent task will include several controls that lead to uncontroversial responses in order to, in effect, contextualize the critical findings. The studies we will discuss exemplify the four methodological considerations we have just discussed.<\/p>\n<p>Developmental studies<\/p>\n<p>The experimental study of scalar inferences started within the framework of developmental studies on reasoning. Noveck (2001) investigated the way children responded (by agreeing or disagreeing) to a puppet who presented several statements, including one that could ultimately lead to a pragmatic enrichment. All statements, even those that served as controls to confirm that the participants understood the task, were about the contents of a covered box and were presented by a puppet (handled by the experimenter). Participants were told that the contents of the covered box resembled those of one or the other of two other boxes both which were open and with their contents in full view. One open box contained a parrot and the other contained a parrot and a bear. The participants then heard the puppet say[2]:<\/p>\n<p>(17) A friend of mine gave me this (covered) box and said, \u201cAll I know is that whatever is inside this box (the covered one) looks like what is inside this box (the one with a parrot and bear) or what is inside this box (the one with just a parrot).\u201d<\/p>\n<p>The participant\u2019s task was to say whether or not he agreed with further statements of the puppet. The key item was ultimately the puppet\u2019s \u201cunderinformative\u201d statement:<\/p>\n<p>(18) There might be a parrot in the box.<\/p>\n<p>Given that the covered box necessarily contained a parrot, the statement in (18) can be answered in one of two ways. The participant can \u201cagree\u201d if she interprets \u201cmight\u201d literally (so that &#8230;might\u2026 is compatible with \u2026must\u2026) or she can \u201cdisagree\u201d if she interprets might in an enriched way (where &#8230;might\u2026 is incompatible with \u2026must\u2026).\u00a0 Adults tended to be equivocal with respect to these two interpretations (35% agreed with the statement) while children (5-, 7- and 9-year-olds) tended to interpret this statement in a minimal way, i.e. literally. Collectively, 74% of the children responded by agreeing with the statement in (18). However, not all children were alike.<\/p>\n<p>The five year olds agreed with (18) at a rate of 72% (a percentage that is unlikely to occur by chance, which would yield 50% in such agree\/disagree contexts). Nevertheless, they failed to answer many control questions at such convincing rates. For example, when asked to agree or disagree with statements about the bear (\u201cThere has to be a bear,\u201d \u201cThere might be a bear,\u201d \u201cThere does not have to be a bear,\u201d \u201cThere cannot be a bear\u201d) they answered at levels that were comparable to those predicted by chance (55% correct across the four questions). Seven-year-olds, on the other hand, did manage to answer practically all seven control problems at rates that indicated they understood the task overall (77%). This is why Noveck (2001) reported that seven-year-olds were the youngest to demonstrate competence with this task while at the same time revealing that they preferred the literal interpretation of \u201cmight\u201d (at a rate, 80%, that is statistically distinguishable from expectations based on chance). The seven-year-olds thus provided the strongest evidence showing that those linguistically competent children who performed well on the task overall still interpreted \u201cmight\u201d in an unenriched way. As one might expect, the nine-year-olds also answered control problems satisfactorily. Response rates indicating unenriched interpretations of \u201cmight\u201d were high (69%) and much higher than the adults\u2019 but nevertheless were statistically indistinguishable from predictions based on chance suggesting that these children were beginning to appear adult-like with respect to (18).\u00a0 Overall, these results were rather surprising for a reasoning study because they indicated that children were more likely than adults to produce a logically correct evaluation of the underinformative modal statement. This sort of response is surprising and rare, but thanks to a pragmatic analysis\u2014where pragmatic enriched interpretations are viewed as likely to result from a richer inferential process than minimal interpretations that add nothing to semantic decoding\u2014these results had a ready interpretation.<\/p>\n<p>Despite taking every precaution (having numerous control items and sampling many children), one can never exclude that such effects might be the result of some subtle factor beyond the experimenter\u2019s intention or control. That is why\u2014especially when encountering counterintuitive results like these\u2014it pays to do follow-ups. There have been essentially two sorts.<\/p>\n<p>The first sort aims to verify the effect. In one follow-up (Noveck, 2001, Exp. 2), the same task as the one above was given to 5-year-olds and 7-year-olds as well as adults, but all participants were given more thorough training to ensure that they understood the parameters of the task. This was done through training on an identical scenario (one box containing a horse and a fish and another just a horse) where pointed questions were asked about the covered box (e.g. Could there be a fish by itself in the box?). Overall, the training increased rates of minimal interpretations of \u201cmight\u201d across all three ages when it came to the task of Experiment 1. Agreement with a statement like the one in (18) was now 81% for five-year-olds, 94% for seven-year-olds, and 75% for adults. Although rates of such minimal interpretations were statistically comparable across ages, one finds the same trends as in the first Experiment. Seven-year-olds again demonstrated (through performance with the control problems) that they were the youngest to demonstrate overall competence with the task while tending to be more likely than adults in retaining a literal interpretation of the weak scalar term. The data also revealed that the extra training encourages adults to behave more \u201clogically\u201d (to stick to the literal meaning of \u201cmight\u201d), like the children.<\/p>\n<p>In an effort to establish the developmental effect\u2019s reliability and robustness, Noveck (2001, Exp. 3) took advantage of an older study that (a) unintentionally investigated weak scalar expressions among 4- to 7-year-old children and that (b) also failed to show evidence of pragmatic enrichment. Smith (1980) presented statements such as \u201cSome giraffes have long necks\u201d to children and reported that it was surprising to find the children accepting these as true. In a third experiment, therefore, Noveck (2001) essentially continued from where Smith left off. The experiment adopted the same technique as Smith (which included pragmatically felicitous statements such as \u201cSome birds live in cages\u201d as well as statements with \u201call\u201d) in order to verify that the developmental findings of the first two experiments were not flukes. The only differences in this third experiment were that the children were slightly older (8- and 10-years-old) than in the first two studies and that the experimenter was as \u201cblind\u201d to the intention of the study as the participants (the student who served as experimenter thought that unusual control items such as \u201cSome crows have radios\u201d or \u201cAll birds have telephones\u201d were the items of interest). The results showed that roughly 87% of children accepted statements like \u201cSome giraffes have long necks\u201d whereas only 41% of adults did. Again, adults were more likely than children to enrich the interpretation of the underinformative statements (understanding \u2026some\u2026 as excluding \u2026all\u2026) and thus tended to reject them (since all giraffes have long necks). All participants answered the five sorts of control items (25 items altogether) as one would expect.<\/p>\n<p>These data prompted Noveck (2001) to revisit other classic studies that serendipitously contained similar scenarios (ones where a stronger statement would be appropriate but a weaker one is made) to determine whether they tell the same story as \u201cmight\u201d and \u201csome\u201d. In fact, three studies concerning \u201cor\u201d (Paris, 1973, Braine and Rumain, 1981, Sternberg, 1983), where a conjunctive situation is described with a weaker disjunction, provides further confirming evidence. The authors of these studies also reported counter-intuitive findings showing younger children being, in effect, more logical than adults (children tend to treat \u201cor\u201d inclusively more often than adults). None of these authors, lacking a proper pragmatic perspective, knew how to make sense of these data at the time. All told, this effect appeared robust.<\/p>\n<p>Other follow-up studies have actually taken issue with Noveck\u2019s interpretation of the findings. In fact, Noveck (2001, p.184) insisted that his data show that\u00a0 children are ultimately less likely than adults to pragmatically enrich underinformative items across tasks; this did not amount to a claim that children lacked pragmatic competence. Still, much work has been aimed at showing that young children are more competent than it might appear. These studies usually take issue with Noveck\u2019s Experiment 3 (the one borrowed from Smith, 1980) because it concerns the quantifier \u201csome\u201d (which is of more general interest than \u201cmight\u201d) and because the items used in that task are admittedly unusual (see Papafragou and Musolino, 2003; Chierchia, Guasti, Gualmini, Meroni, Crain &amp; Foppolo, 2004; Guasti, Chierchia, Crain, Foppolo, Gualmini and Meroni, 2005; Feeney, Scrafton, Duckworth, &amp; Handley, 2005).<\/p>\n<p>We highlight here the main advances of these studies. In two sets of studies, Papafragou and colleagues (Papafragou and Musolino, 2003; Papafragou and Tantalou, 2004) aimed to show that children as young as five are generally able to produce implicatures if the circumstances are right. Actually, Papafragou and Musolino (2003, Experiment 1) first confirmed the developmental effect summarized above by showing that 5-year-olds are less likely than adults to produce enrichments with \u201csome\u201d, \u201cstart\u201d, and \u201cthree\u201d, in cases where a stronger term was called for (namely, \u201call\u201d, \u201cfinished\u201d, and a \u201clarger number,\u201d respectively). They then modified the experimental setup in two ways in order to prepare their second experiment. First, before they were tested, participants received training aimed at enhancing their awareness to pragmatic anomalies. Specifically, children were told that the puppet would say \u201csilly things\u201d and that the point of the game was to help the puppet say it better (e.g. they would be asked whether a puppet described a dog appropriately by saying \u201cthis is a little animal with 4 legs\u201d). In the event that the child did not correct the puppet, the experimenter did. Second, the paradigm put the focal point on a protagonist\u2019s performance. Unlike in their Experiment 1, where participants were asked to evaluate a quantified statement like \u201cSome horses jumped over the fence\u201d (when in fact all the horses did), the paradigm in Experiment 2 raises the expectations that the stronger statement (with \u201call\u201d) might be true. Participants would hear a test statement like, \u201cMickey put some of the hoops around the pole\u201d (after having his been shown to succeed with all of the hoops), and they were also told how Mickey claims to be especially good at this game and that this is why another character challenges him to get all three around the pole. With these changes, 5-year-olds were more likely to produce enrichments than they were in the first experiment. Nevertheless, the five year olds, even in the second experiment, still produced enrichments less often than did adults. This indicates that \u2013 even with training and with a focus on a stronger contrast \u2013 pragmatic enrichments require effortful processing among children. [3]<\/p>\n<p>Guasti et al. (2005) argue that pragmatic enrichments ought to be as common among five year olds as they are among adults and further investigated the findings of Noveck (2001) and Papafragou and Musolino (2003).\u00a0 In their first experiment, they replicated the finding of Noveck (2001, Experiment 3) with \u201csome\u201d with 7-year-olds and used this as a baseline to study independently the role of the two factors manipulated by Papafragou and Musolino (2003). One factor was the role of training and how it affects children\u2019s proficiency at computing implicatures (Experiments 2 and 3) and the other was the role of placing emphasis on the outcome of a scalar implicature (Experiment 4). Their Experiments 1 through 3 showed that training young participants to give the most specific description of a given situation can indeed have a major effect on performance. While their initial experiment showed that 7-year-olds accept statements such as \u201cSome giraffes have long necks\u201d 88% of the time (against 50% for adults), when trained in this manner their acceptance rate drops to 52%, becoming adult-like. Nonetheless, this effect is short lasting, i.e. it does not persist when the same participants are tested a week later (Experiment 3). In the last experiment, the authors rendered the all alternative more salient in context. This was achieved, for instance, by presenting participants with a story where several characters have to decide whether the best way to go collect a treasure was to drive a motorbike or ride a horse. After some discussion, all of them choose to ride a horse. In this way it is made clearer that the statement subjects have to judge, \u201csome of the characters chose to ride horse,\u201d is underinformative. The results indicated that children are more likely to infer an enriched interpretation in an adult-like manner when the context makes this enrichment highly relevant.<\/p>\n<p>This last finding shows that one can create situations that encourage children to pragmatically enrich weak-sounding statements and to do so in an adult-like manner. It does not alter the fact that in less elaborate scenarios where cues to enrichment are less abundant, seven-year olds do not behave in this manner and it does not tell us what younger children do. Overall, the developmental effect shows that pragmatic enrichments are somewhat effortful. In experimental settings, the required effort can be somewhat lowered or the motivation to perform it may be heightened, but in the absence of such contextual encouragements, younger children faced with a weak scalar term are more likely to stick with its linguistically encoded meaning.<\/p>\n<p>If children had been found to perform scalar inferences by default, this would have been strong evidence in favour of the GCI theory approach. However, taken together, developmental data suggest that, for children, enriched interpretations of scalar terms are not default interpretations. This data is not knock down evidence against GCI theory, because it is compatible with two hypotheses: 1) scalar inferences are not default interpretations for adults either (even if adults are more likely to derive them because they can do so with relatively less effort and because they are more inclined to invest effort in the interpretation of an utterance given their greater ability to derive from it cognitive effects). Or, 2) in the course of development, children become capable and disposed to perform scalar inferences by default. The first hypothesis is consistent with the relevance theory approach while the second is consistent with the GCI approach. To find out which approach has more support, further work had to be done with adults.<\/p>\n<p><strong>Time course of comprehension among adults<\/strong><\/p>\n<p>As we mentioned before, GCI theory implies that a literal interpretation of a scalar, resulting from the cancellation of default enrichment, should take longer than an enriched interpretation, whereas\u00a0 relevance theory, denying that enrichment takes place by default, implies that an enriched interpretation, being computed when needed to meet contextual expectations of relevance, should take longer than a literal one. What is needed to test these contrasting predictions are experiments manipulating and measuring the time course of the interpretation of statements with weak scalar terms.<\/p>\n<p>As in the developmental tasks, one wants to make sure that enriched interpretations are clearly identifiable through specific responses, that the tasks used includes a variety of controls, and that the effect is reliable and robust. One way to identify enriched vs. literal interpretations is provided by earlier studies where participants were asked to judge true or false statements (such as \u201csome elephants are mammals\u201d) that could either be construed as literally true but underinformative, or in an enriched manner (as implying \u2026not all\u2026) and false. Hence participants\u2019 truth-value judgements reflect their literal or enriched interpretation.<\/p>\n<p>As we indicated, prior work was critical to developing the appropriate measures. In fact, Rips (1975) unintentionally included the right sort of cases when looking at other issues of categorization and with materials such as \u201csome congressmen are politicians.\u201d He examined the effect of the interpretation of the quantifier by running two studies, one in which participants were asked to treat \u201csome\u201d as meaning some and possibly all and another where they were asked to treat \u201csome\u201d as meaning some but not all. This comparison demonstrated that participants given the some but not all instructions in one experiment responded more slowly than those given the some and possibly all instructions in another. Despite these indications, Rips modestly hedged when he concluded that \u201cof the two meanings of Some, the informal meaning may be the more difficult to compute\u201d (italics added). To make sure that Rips\u2019s data were indeed indicative of a slowdown related to Some but not all readings, Bott &amp; Noveck (2004) ran a series of four experiments that followed up on Rips (1975) and essentially verified that enriched interpretations take longer than literal ones.<\/p>\n<p>Bott and Noveck\u2019s categorization task involved the use of underinformative items (e.g. \u201cSome cows are mammals\u201d) and five controls that varied the quantifier (Some and All), the category-subcategory order, as well as proper membership. The 6 types of statements are illustrated with the six possible ways one can employ the subcategory elephants below, but it should be pointed out that the paradigm was set up so that the computer randomly paired a given subcategory with a given category while verifying that, at the end of each experimental session, there were nine instances of each type:<\/p>\n<p>(19) (a) Some elephants are mammals (Underinformative).<br \/>\n(b) Some mammals are elephants.<br \/>\n(c) Some elephants are insects<br \/>\n(d) All elephants are mammals.<br \/>\n(e) All mammals are elephants.<br \/>\n(f) All elephants are insects.<\/p>\n<p>In the first Experiment, a sample of 22 participants was presented with the same task twice, once with instructions to treat \u201csome\u201d as meaning Some and possibly all and once with instructions to treat \u201csome\u201d as meaning Some but not all (and, of course, the order of presentation was varied). When participants were under instruction, in effect, to engage the scalar inference, they were shown to be less accurate and take significantly longer to respond to the underinformative items (like those in (19a)). Specifically, when instructions called for a Some but not all interpretation, rates of correct responses to the Underinformative item (i.e. judging the statement \u201cfalse\u201d) were roughly 60%; when instructions called for a Some and possibly all interpretation rates of correct responses to the Underinformative item (i.e. judging the statement \u201ctrue\u201d) were roughly 90%. For the control items, rates of correct responses were always above 80% and sometimes above 90%. One can see that the Underinformative case in the Some but not all condition provides exceptional data.<\/p>\n<p>The reaction time data showed that the correct responses to the Underinformative item in the Some but not all condition were exceptionally slow. It took roughly 1.4 seconds to correctly evaluate the underinformative statements in the Some but not all condition and around .8 seconds in the Some and possibly all conditions. To answer the control items\u2014across both sorts of instructions\u2014took at most 1.1 seconds but more often around .8 to .9 seconds. Thus,\u00a0 the underinformative statement in the Some but not all condition is the one most affected by the instructions. All this confirms Rips\u2019s initial findings. More importantly, there is not a single indication that interpreting \u201csome\u201d to mean some but not all is an effortless or quasi effortless step. Again, a default view of scalar inference would predict that under Some but not all instructions, responses to Underinformative statements would require less time than responses under Some and possibly all instructions. According to an account based on\u00a0 relevance theory, one should find the opposite. The data more readily support the relevance-theoretic account.<\/p>\n<p>A potential criticism of this Experiment is that the lower accuracy and the slowdown might be due to a response bias in favour of positive rather than negative response, given that the correct response to the Underinformative statement with the Some and possibly all instructions is to say \u201cTrue\u201d while the correct response to the Underinformative statement with the Some but not all instructions is to say \u201cFalse.\u201d To allay concerns regarding such a potential response bias, Bott and Noveck demonstrated experimentally that the effects linked to pragmatic effort are not simply due to hitting the \u201cFalse\u201d key.<\/p>\n<p>In a second experiment, the paradigm was modified so that the same overt response could be compared across both sorts of instructions; this way, participants\u2019 response choice (True vs. False) could not explain the observed effects. In order to arrive at this comparison, participants were not asked to agree or disagree with first-order statements such as those in (19), but with second order statements made about these first-order statements. For example, participants were presented with the two statements: \u201cMary says the following sentence is false\u201d \/ \u201cSome elephants are mammals.\u201d They were then asked to agree or disagree with Mary\u2019s second-order statement. In such a case, participants instructed to treat \u201csome\u201d as meaning Some but not all should agree, whereas participants instructed to treat \u201csome\u201d as meaning Some and possibly all should disagree, reversing the pattern of positive and negative response of the previous experiment.<\/p>\n<p>The results from this second experiment were nevertheless remarkably similar to those of the first one. Here, when participants were under instruction to, in effect, draw the scalar inference, they were less accurate and took significantly longer to respond correctly to the underinformative item. When \u201cagree\u201d was linked with instructions for a Some but not all interpretation, rates of correct responses were roughly 70%; when \u201cagree\u201d was linked with instructions for a Some and possibly all interpretation, rates of correct responses were roughly 90%. For all control items, rates of correct responses were always above 85% and often above 90%. One can see that, once again, the Underinformative case in the Some but not all condition provides exceptional data. The reaction time data also showed that the correct \u201cagree\u201d responses to the Underinformative item in the Some but not all condition were exceptionally slow. It took nearly 6 seconds to evaluate the underinformative statements correctly when \u201cagree\u201d was linked with instructions for a Some but not all interpretation and around 4 seconds when \u201cagree\u201d was linked with instructions in the Some and possibly all condition (all reaction times were longer than in the previous experiment due to the Mary says statement). The control items across both sorts of instructions took on average around 4.5 seconds and never more than 5 seconds. Again, the experiment demonstrated that any response that requires a pragmatic enrichment implies extra effort.<\/p>\n<p>Both of these experiments, though inspired by previous work, are arguably unnatural. It is unusual to instruct participants in a conversation, as was done in Experiment 1, as to how they should interpret the word \u201csome\u201d; the second experiment doubles the complexity by compelling participants to make metalinguistic judgments from statements like Mary says the following is false. Bott and Noveck\u2019s third experiment simplified matters by asking participants to make true\/false judgments about the categorical statements themselves and without prior instruction. With this sort of presentation, there is no useful sense in which a response is \u201ccorrect\u201d or not. Rather, responses reveal the participants literal or enriched interpretation and can be compared in terms of reaction time.<\/p>\n<p>Roughly 40% of participants responded \u201ctrue\u201d to Underinformative items and 60% \u201cfalse\u201d. This corresponds to the rates found among adults in Noveck\u2019s developmental studies (also see Noveck &amp; Posada, 2003; Guasti et al, 2005). The main finding was that mean reaction times were longer when participants responded \u201cfalse\u201d to the underinformative statements than when they responded \u201ctrue\u201d (3.3 seconds versus 2.7, respectively). Furthermore, \u201cfalse\u201d responses to the underinformative statements appear to be slower than responses to all of the control statements (including three, (18c), (18e), and (18f), that require a \u201cfalse\u201d response). The \u201ctrue\u201d response was made at a speed that was comparable to all of the control items.<\/p>\n<p>In their last experiment, Bott and Noveck\u2019s varied the time available to participants to respond to the statements. The rationale for this design was that, if as implied by GCI theory, literal interpretations of weak scalar terms take longer than the default enriched interpretations, then limiting the time available should decrease the rate of literal interpretations and increase the rate of enriched ones. On the other hand, if as implied by relevance theory, enriched interpretations take longer, then limiting the time should have the opposite effect. While following the same general procedure as the prior experiments (asking participants to judge the veracity of categorical statements), the paradigm manipulated the time available for the response. In one condition, participants had a relatively long time to respond (3 seconds), while in the other they had a relatively short time to respond (.9 seconds). Only the time to respond was manipulated. To control for uptake, participants were presented with the text one word at a time and at the same rate in both conditions, thus there is no possibility that participants in the Short-lag condition spend less time reading the statements than those in the Long-lag condition.<\/p>\n<p>Bott and Noveck reported that when a shorter period of time was available for participants to respond, they were more likely to respond \u201cTrue\u201d to Underinformative statements (indicating a literal interpretation). 72% of participants responded true in the \u201cShort-lag\u201d condition and 56% did so in the \u201cLong-lag\u201d condition. This strongly implies that they were less likely to derive the scalar inference when they were under time pressure than when they were relatively pressure-free. As in all the prior experiments, control statements provide a context in which to appreciate the differences found among Underinformative statements. These showed that performance among control statements in the Short Lag condition was quite good overall (rates of correct responses ranged from 75%-88%) and that, as one would expect, rates of correct performance among the control items increased with added time (by 5% on average). The contrast between a percentage that drops with extra time (as is the case for the Underinformative statements) and percentages that increase provide a unique sort of interaction confirming that time is necessary to provoke scalar inferences.<\/p>\n<p>The experiments we have described so far take into account the four methodological considerations we discussed earlier and allow well-controlled measure of a dependent variable: the rate or the speed of literal vs. enriched interpretations of weak scalar terms. Together, they provide strong evidence that an enriched interpretation of a weak scalar term requires more processing time than an unenriched, literal interpretation, as predicted by relevance theory and contrary to the prediction implied by GCI theory.<\/p>\n<p>Still, one might argue that the categorization tasks used, even if methodologically sound from an experimental psychology point of view, are too artificial to test pragmatic hypotheses. If the argument were that laboratory tasks are somehow irrelevant to pragmatics, we would argue that the onus of the proof is on the critics: after all, participants bring to bear on experimental verbal tasks their ordinary pragmatic abilities, just as they do in any uncommon form of verbal exchange. In particular, if it is part of adult pragmatic competence to make scalar inferences by default, it would take some arguing to make it plausible that an experimental setting somehow inhibits this basic disposition. On the other hand, if the argument is that fairly artificial laboratory experiments are not enough and that they should be complemented with more ecologically valid designs, we agree. Happily, Breheny, Katsos and Williams (2006) have provided just this kind of welcome complement.<\/p>\n<p>Following up on a procedure from Bezuidenhout &amp; Cutting (2004), Breheny et al. presented disjunctive phrases (such as \u201cthe class notes or the summary\u201d) in two kinds of contexts: Lower-bound contexts (where the literal reading of a scalar term is more appropriate as in (20) below), and Upper-Bound contexts (where the enriched reading of the scalar is more appropriate as in (21) below).\u00a0 These were presented as part of short vignettes (along with many \u201cfiller\u201d items to conceal the purpose of the study) and participants\u2019 reading times were measured. More specifically, participants were asked to read on a computer screen short texts that were presented one fragment at a time, and to advance in their reading by hitting the space bar (the slashes in (20) and (21) delimit fragments).<\/p>\n<p>(20) Lower-bound context<br \/>\nJohn heard that \/ the textbook for Geophysics \/ was very advanced. \/ Nobody understood it properly.\/ He heard that \/ if he wanted to pass the course \/ he should read \/ the class notes or the summary.<\/p>\n<p>(21) Upper-bound context<br \/>\nJohn was taking a university course \/ and working at the same time. \/ For the exams \/ he had to study \/ from short and comprehensive sources.\/ Depending on the course, \/ he decided to read \/ the class notes or the summary.<\/p>\n<p>If, in such a task, one found shorter reading times in the Upper-bound contexts that call for scalar inferences than in the Lower-bound contexts where the literal interpretation is more appropriate, this would support the GCI claim that scalar inferences are made by default. Findings in the opposite direction would support the\u00a0 relevance theory account. What Breheny et al. found is that phrases like the class notes or the summary took significantly longer to process in Upper-bound contexts than in Lower-bound contexts, a result consistent with findings reported above.<\/p>\n<p><strong>Conclusions<\/strong><\/p>\n<p>The experimental work we have summarized here verifies predictions derived from\u00a0 relevance theory, and falsifies predictions derived from GCI theory. Does this mean that\u00a0 relevance theory is true and GCI theory is false? Of course not. Nevertheless, these results should present a serious problem for GCI theorists. It is quite possible however that they will find a creative solution to the problem. They might for instance show that, in spite of the methodological precautions we have outlined, the studies reported failed to eliminate some uncontrolled factor, and that better studies provide evidence pointing in the opposite direction. They might, more plausibly, revise their theory so as to accommodate these results. One line of revision would be to reconsider the idea that GCI are default inferences (or to water down the notion of default to the point where it does not anymore have implications for processing time). After all, not all neo-Griceans agree with Levinson\u2019s account of GCI (see in particular Horn 2004, 2006). Still, it is worth noting that, if scalar inferences are not truly default inferences and involve each and every time paying attention to what the speaker chose not to say, then we are back to the worry that such inferences are excessively cumbersome. Generally speaking, experimental findings such as those we have summarised here should encourage neo-Griceans to work out precise and plausible implications of their approach at the level of cognitive processing.<\/p>\n<p>Relevance theorists are not challenged in the same way by the work we have described\u2014after all, their prediction is confirmed\u2014, but they should be aware that this prediction could be made from quite different theoretical points of view: it follows from\u00a0 relevance theory, but\u00a0 relevance theory does not follow from it. They might then try to develop aspects of these experiments that could give positive support to more specific aspects of the theory. For instance, according to the theory, hearers aim at an interpretation that satisfies their expectations of relevance and the relevance of an interpretation varies inversely with the effort needed to derive it. It should then be possible to cause participants to choose a more or a less parsimonious interpretation by increasing or decreasing the cognitive resources available to participants for the process of interpretation. The fourth experiment of Bott and Noveck (2004) can be seen as a first suggestive step in this direction.[4]<\/p>\n<p>As we have just explained, we do not expect readers to form a final judgement on the respective merits of GCI theory and\u00a0 Relevance theory on the basis of the experimental evidence presented. What we do hope is to have convinced you that, alongside other kinds of data, properly devised experimental evidence can be highly pertinent to the discussion of pragmatic issues, and that pragmatists\u2014and in particular students of pragmatics\u2014might greatly benefit from becoming familiar with relevant experimental work and from contributing to it (possibly in interdisciplinary ventures).<\/p>\n<p>Notes<\/p>\n<p>[1] From now on, for ease of exposition, we will use the term \u201cscalar\u201d without quotes to refer to the phenomena so described in the neo-Gricean approach. This use, of course, implies no theoretical commitment on our part.<\/p>\n<p>[2] The contents of parentheses were not said, but indicated.<\/p>\n<p>[3]\u00a0 Papafragou and Tantalou (2004) aim to show that five-year-olds can be encouraged to produce scalar inferences and at adult levels. However, we do not discuss their results here because their data are based on a non-standard paradigm in which participants are given no justifiable reason to accept the \u2018minimal\u2019 interpretation of a term such as \u2018some\u2019. In other words, the paradigm does not provide participants with two clear options. Moreover, much of the study\u2019s claims are based on children\u2019s self-reports and even these lead to the conclusion that at most 56% of Papafragou and Tantalou\u2019s participants derived scalar inferences.<\/p>\n<p>[4] For other experimental explorations based on relevance theory, see Van der Henst &amp; Sperber 2004.<\/p>\n<p>References<\/p>\n<p>Bezuidenhout, A. &amp; Cutting, J.C. (2002).\u00a0 Literal meaning, minimal propositions, and pragmatic processing.\u00a0 Journal of Pragmatics, 34, 433-456.<\/p>\n<p>Bott, L., &amp; Noveck, I. A. (2004). Some utterances are underinformative: The onset and time course of scalar inferences. Journal of Memory and Language, 51, 437-457.<\/p>\n<p>Braine, M. &amp; Rumain, B. (1981). Children\u2019s comprehension of &#8220;or&#8221;: Evidence for a sequence of competencies. Journal of Experimental Child Psychology, 31, 46-70.Breheny, R., Katsos, N., &amp; Williams, J. (2006). Are scalar implicatures generated on-line by default? Cognition, 100 (3), 434-463.<\/p>\n<p>Breheny, R., Katsos, N., &amp; Williams, J. (2006).\u00a0 Are generalized scalar implicatures generated by default?\u00a0 An on-line investigation into the role of context in generating pragmatic inferences.\u00a0 Cognition 100(3), 434-463.<\/p>\n<p>Chierchia, G., Guasti, T., Gualmini, A., Meroni, L., Crain, S. &amp; Foppolo, F. (2004). Adults and children&#8217;s semantic and pragmatic competence in interaction. In Experimental Pragmatics, I. A. Noveck &amp; D. Sperber (Eds.). Basingstoke: Palgrave Macmillan.<\/p>\n<p>Feeney, A., Scrafton, S., Duckworth, A. and Handley, S. J. 2004: The Story of Some: Everyday Pragmatic Inference by Children and Adults. Canadian Journal of Experimental Psychology, 58, 2, 121-132.<\/p>\n<p>Guasti, M. T., Chierchia, G.,Crain, S., Foppolo, F., Gualmini, A. &amp; Meroni. L. (2005). Why children and adults sometimes (but not always) compute implicatures. Language and Cognitive Processes, 20 (5), 667\u2013696<\/p>\n<p>Horn, L. R. (1972). On the Semantic Properties of Logical Operators in English.\u00a0 Ph. D. Dissertation, UCLA.<\/p>\n<p>Horn, L. R. (2004). Implicature. In Horn and Ward (eds.), 3-28.<\/p>\n<p>Horn, L. R. (2006). The Border Wars: A neo-Gricean perspective. In Turner and von Heusinger (eds.).<\/p>\n<p>Levinson, S. (2000). Presumptive Meanings. MIT Press.<\/p>\n<p>Noveck, I. A. (2001). When children are more logical than adults: Experimental investigations of scalar implicature. Cognition, 78(2), 165-188.<\/p>\n<p>Noveck, I. A. (2004). Pragmatic inferences related to logical terms. In I. A. Noveck &amp; D. Sperber (Eds.), Experimental Pragmatics. Basingstoke: Palgrave.<\/p>\n<p>Noveck, I. A. &amp; Posada, A. (2003). Characterizing the time course of an implicature: An evoked potentials study. Brain and Language, 85, 203-210.<\/p>\n<p>Noveck, I. A. &amp; Sperber, D.\u00a0 (2004), Experimental Pragmatics. Basingstoke: Palgrave.<\/p>\n<p>Papafragou, A., &amp; Musolino, J. (2003). Scalar implicatures: experiments at the semantics-pragmatics interface. Cognition, 86(3), 253-282.<\/p>\n<p>Papafragou, A; &amp; Tantalou, N. (2004). Children&#8217;s computation of implicatures. Language Acquisition 12(1), 71-82.<\/p>\n<p>Paris, S. G. (1973). Comprehension of language connectives and propositional logical relationships. Journal of Experimental Child Psychology, 16(2), 278-291.<\/p>\n<p>Rips, L. J. (1975). Quantification and semantic memory. Cognitive Psychology, 7(3), 307-340.<\/p>\n<p>Smith, C. L. (1980). Quantifiers and question answering in young children. Journal of Experimental Child Psychology, 30(2), 191-205.<\/p>\n<p>Sperber, D., &amp; Wilson, D. (1995). Relevance: Communication and Cognition. Oxford: Basil Blackwell.<\/p>\n<p>Sternberg, R. J. (1979). Developmental patterns in the encoding and combination of logical connectives. Journal of Experimental Child Psychology, 28(3), 469-498.<\/p>\n<p>Van der Henst, J.B.,\u00a0 Sperber, D. (2004). Some experimentally testable implications of relevance theory. In I. A. Noveck &amp; D. Sperber (Eds.), Experimental Pragmatics. Basingstoke: Palgrave.<\/p>\n<p>Wilson, D. &amp; Sperber, D., (2004). Relevance Theory, in G. Ward et L. Horn (Eds.), Handbook of Pragmatics. Blackwell.<\/p>\n<p><strong> <\/strong><\/p>\n<div id=\"_mcePaste\" style=\"overflow: hidden; position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px;\">\n<div class=\"Section1\">\n<p class=\"MsoNormal\" style=\"text-align: center; text-indent: 1cm; line-height: 150%;\" align=\"center\"><span style=\"font-size: 14pt; line-height: 150%;\" lang=\"EN-GB\">To appear in Noel Burton-Roberts (ed) <em>Advances in Pragmatics<\/em>. Palgrave<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; text-indent: 1cm; line-height: 150%;\" align=\"center\"><strong><span style=\"font-size: 14pt; line-height: 150%;\" lang=\"EN-GB\"> <\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; text-indent: 1cm; line-height: 150%;\" align=\"center\"><strong><span style=\"font-size: 14pt; line-height: 150%;\" lang=\"EN-GB\"> <\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; text-indent: 1cm; line-height: 150%;\" align=\"center\"><strong><span style=\"font-size: 14pt; line-height: 150%;\" lang=\"EN-GB\">The why and how of experimental pragmatics: <\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; text-indent: 1cm; line-height: 150%;\" align=\"center\"><strong><span style=\"font-size: 14pt; line-height: 150%;\" lang=\"EN-GB\">The case of \u2018scalar inferences\u2019<\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; text-indent: 1cm; line-height: 150%;\" align=\"center\"><strong><span style=\"font-size: 14pt; line-height: 150%;\" lang=\"EN-GB\"> <\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; text-indent: 1cm; line-height: 150%;\" align=\"center\"><span style=\"font-size: 14pt; line-height: 150%;\" lang=\"EN-GB\">Ira Noveck &amp; Dan Sperber<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 1cm; line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">Although a few pioneers in psycholinguistics had, for more than twenty years, approached various pragmatic issues experimentally, it is only in the past few years that investigators have begun employing the experimental method in testing pragmatic hypotheses (see Noveck &amp; Sperber 2004). We see this emergence of a proper experimental pragmatics as an important advance with a great potential for further development. In this chapter we want to illustrate what can be done with experimental approaches to pragmatic issues by presenting one case, that of so-called \u2018scalar inferences\u2019, where the experimental method has helped sharpen a theoretical debate and has provided uniquely relevant evidence. We will focus on work done by the first author and his collaborators or work closely related to theirs, but other authors have also made important contributions to the topic (e.g. Papafragou and Musolino, 2003; Guasti, Chierchia, Crain, Foppolo, Gualmini, &amp; Meroni, 2005; De Neys &amp; Schaeken, in press).<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><strong><span lang=\"EN-GB\">Methodological background: the limits of pragmatic intuitions as evidence<\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 1cm; line-height: 150%;\"><span lang=\"EN-GB\">Theoretical work in pragmatics relies heavily \u2013 often exclusively \u2013 on pragmatic intuitions. These are rarely complemented with observational data of a kind more common in sociologically oriented pragmatics. The use of statistical data from corpuses and from experiments is even less common. This situation results partly from the fact that most theoretical pragmatists have been trained in departments of linguistics where, quite often, linguistic intuitions are the only kind of data considered. Optimally, of course, one would want pragmatists to use whatever kind of data that may significantly confirm or disconfirm hypotheses. Moreover, a sensible methodological pluralism is not the only reason to diversify the types of evidence used in pragmatics. There are also principled limits to the use of pragmatic intuitions.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 1cm; line-height: 150%;\"><span lang=\"EN-GB\">It makes sense (even if it is not uncontroversial) to judge a semantic description by its ability to account for semantic intuitions. Of course, the use of semantic intuitions and of linguistic intuitions generally, raises methodological problems and calls for methodological caution. For instance, a linguist\u2019s intuitions may be biased by prior theoretical commitments. Also, one may mistake what are in fact pragmatic intuitions for semantic ones (as, Grice argued, ordinary language philosophers systematically did). Still, there are good reasons why semantic intuitions are so central to semantics. Semantic intuitions are not just <em>about<\/em> semantic facts; they are semantic facts themselves. For instance, the intuition that sentence (1) entails (2) is not <em>about<\/em> some semantic property that this sentence would have anyhow, regardless of its accessibility to speakers\u2019 intuitions.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 1cm; line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 63pt; text-indent: -18pt; line-height: 150%;\"><span lang=\"EN-GB\">(1) John knows that it is raining<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 63pt; text-indent: -18pt; line-height: 150%;\"><span lang=\"EN-GB\">(2) it is raining<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 63pt; text-indent: -18pt; line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><span lang=\"EN-GB\">Rather, for (1) to have the meaning it has <em>is<\/em>, among other aspects, to be intuitively understood as entailing (2). A semantic analysis of linguistic expressions that accounts for all the speaker-hearer\u2019s semantic intuitions about these expressions may not be the best possible analysis, but it is descriptively adequate (in Chomsky\u2019s sense \u2013 an explanatory adequate description of the semantics of a given language, on the other hand, involves hypotheses about the capacities that make the acquisition of this semantics possible, and here observational and experimental evidence should be of relevance). <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 1cm; line-height: 150%;\"><span lang=\"EN-GB\">The use of pragmatic intuitions raises the same methodological problems as does the use of semantic intuitions and then some. It is a mistake to believe that pragmatic intuitions of the kind used in pragmatics are data of the same kind as semantic intuitions used in semantics. Genuine pragmatic intuitions are those that addressees have about the intended meaning of an utterance addressed to them. Quite generally, pragmatic intuitions invoked in theoretical pragmatics are not about actual utterances addressed to the reader of a pragmatic article, but about hypothetical cases involving imaginary or generic interlocutors. Pragmatic intuitions on hypothetical utterances have proved useful in a variety of ways, but it is important to keep in mind that these are not about how an utterance is interpreted, but about how an utterance <em>would be<\/em> interpreted if it were produced in a specific situation by a speaker addressing a listener, with referring expressions having actual referents, and so on. These intuitions are educated guesses \u2013 and, no doubt, generally good ones \u2013 about hypothetical pragmatic facts, but are not themselves pragmatic facts and they may well in error. That is, we may be wrong about how, in fact, we would interpret a given utterance in a given context.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">Besides helping compensate for the inherent limits of pragmatic intuitions, an experimental approach can provide crucial evidence when deciding between alternative theories that may agree on the content of the interpretations of utterances, but that have different implications regarding the cognitive mechanisms through which these interpretations are arrived at. Of course, for their contribution to be of value, experimentalists must conform to fairly strict methodological criteria and measure just what they are intent on measuring\u2014typically the effect of one \u2018independent\u2019 variable on another \u2018dependent\u2019 variable without other uncontrolled variables affecting the results. We will show how this plays out in the study of \u2018scalar inferences\u2019.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><strong><span lang=\"EN-GB\">Theoretical background: Scalar implicatures as Generalised Conversational Implicatures (GCIs)<\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 1cm; line-height: 150%;\"><span lang=\"EN-GB\">The experiments we will present are relevant to the study of so-called \u2018scalar implicatures.\u2019 Here we just remind readers of the main features of the Gricean and neo-Gricean account of scalar implicatures, and focus on the claim that scalar implicatures are Generalized Conversational Implicatures, or GCIs. Scalar implicatures are illustrated by cases such as (3a) which is said to implicate (3c), or (4a) said to implicate (4c):<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 1cm; line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 63pt; text-indent: -16.65pt; line-height: 150%;\"><span lang=\"EN-GB\">(3) (a) It is possible that Hillary will win<br \/>\n(b) It is certain that Hillary will win<br \/>\n(c) It is not certain that Hillary will win<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 63pt; text-indent: -16.65pt; line-height: 150%;\"><span lang=\"EN-GB\">(4) (a) Some of the guests have arrived<br \/>\n(b) All of the guests have arrived<br \/>\n(c) Not all of the guests have arrived<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 63pt; text-indent: -16.65pt; line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><span lang=\"EN-GB\">Proposition (3b) is more informative than (3a), which it entails. If the more informative proposition would make a greater contribution to the common purpose of the conversation, then, a speaker obeying Grice\u2019s first Maxim of Quantity (\u201cMake your contribution as informative as is required\u201d) would be expected to express it unless she could not do so without violating the Supermaxim of Quality (\u201cTry to make your contribution one that is true\u201d). Hence, on a Gricean account, a speaker stating (3a) typically implicates (3c) (i.e., the negation of (3b)). For the same reasons, a speaker stating (4a) typically implicates (4c) (i.e., the negation of (4b)).<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\">Such implicatures are described as \u2018scalar\u2019 because, according to an account developed by neo-Griceans and in particular Lawrence Horn (1972), the derivation of these implicatures draws on pre-existing linguistic scales consisting in a set of alternate terms or expressions ranked by order of informativeness; &lt;<em>possible, certain<\/em>&gt; and &lt;<em>some, all<\/em>&gt; are examples of such scales. When a less informative term is used in an utterance in a way that does not satisfy the first maxim of quantity, the speaker can be taken to implicate that the proposition that would have been expressed by the use of a stronger term in the scale is false. This account of implicatures such as those carried by (3a) or (4a) extend to a wide variety of cases and has some intuitive appeal. It should not be seen however as obviously correct or without alternatives. In particular, its implications for processing are less attractive. According to such an account, the inference from the utterance to its scalar implicature goes through a consideration not just of what the speaker said and the context but also of what the speaker might have said but did not. It is this type of onerous inference that makes the Gricean account of implicature derivation seem implausible from a cognitive and developmental point of view.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\">Levinson draws on another idea of Grice, that of Generalized Conversational Implicatures, to offer an account that might provide a solution to the problem posed by the derivational complexity of scalar implicatures. Grice noted that some implicatures are generally valid (from a pragmatic rather than logical point of view, of course) and therefore could be inferred without consideration of the context, except in cases where the context happens to make them invalid. Grice contrasted these Generalized Conversational Implicatures with Particular Conversational Implicatures, which are valid only in specific contexts. In his book <em>Presumptive Meanings: The Theory of Generalized Conversational Implicatures<\/em> (Levinson 2000), Levinson elaborates Grice\u2019s original and somewhat vague notion. For Levinson, GCIs are <em>default inferences<\/em>, that, is, inferences that are automatically generated and that can be cancelled if there are contextual reasons to do so. Levinson treats scalar implicatures as paradigmatic cases of GCIs (whereas Grice\u2019s own examples of GCIs don\u2019t include scalar implicatures). This has the advantage of making the inference of these implicatures a relatively light one-step process, which needs to access neither contextual premises nor the full Gricean rationale for their derivation.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\">Levinson\u2019s own rationale for GCIs so conceived has to do with the optimization of processing. The existence of GCIs speeds up the process of communication that is slowed down, Levinson argues, by the need for phonetic articulation: some unencoded aspects of the speaker\u2019s meaning can be inferred from metalinguistic properties of the utterance such as the choice of a given word from among a set of closely related alternatives. For instance the speaker\u2019s choice of \u201csome\u201d rather than the stronger \u201call\u201d in (4a) (\u201cSome of the guests have arrived\u201d) justifies inferring that (4c) is part of her meaning. These are non-demonstrative inferences, or course. There are cases where these inferences are invalid. For instance, if it were contextually established that the speaker of (4a) has only partial information about the arrival of the guests, then (4c) would not be part of her meaning. Still, given that GCIs are valid in most contexts (or so it is assumed), the overall speeding up of communication made possible by the automaticity of GCIs is not compromised by the rare cases where contextual considerations force the hearer to countermand them.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\">The theory of scalar implicatures as default GCIs combines four claims:<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 54pt; text-indent: -27pt; line-height: 150%;\"><!--[if !supportLists]--><span lang=\"EN-GB\"><span>(a)<span style=\"font-family: &quot;Times New Roman&quot;; font-style: normal; font-variant: normal; font-weight: normal; font-size: 7pt; line-height: normal; font-size-adjust: none; font-stretch: normal;\"> <\/span><\/span><\/span><!--[endif]--><span lang=\"EN-GB\">These inferences are made by default, irrespective of the context, and cancelled when the context demands<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 54pt; text-indent: -27pt; line-height: 150%;\"><!--[if !supportLists]--><span lang=\"EN-GB\"><span>(b)<span style=\"font-family: &quot;Times New Roman&quot;; font-style: normal; font-variant: normal; font-weight: normal; font-size: 7pt; line-height: normal; font-size-adjust: none; font-stretch: normal;\"> <\/span><\/span><\/span><!--[endif]--><span lang=\"EN-GB\">The fact that these inferences are made by default adds to the speed and efficiency of communication<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 54pt; text-indent: -27pt; line-height: 150%;\"><!--[if !supportLists]--><span lang=\"EN-GB\"><span>(c)<span style=\"font-family: &quot;Times New Roman&quot;; font-style: normal; font-variant: normal; font-weight: normal; font-size: 7pt; line-height: normal; font-size-adjust: none; font-stretch: normal;\"> <\/span><\/span><\/span><!--[endif]--><span lang=\"EN-GB\">These inferences contribute implicatures to the interpretation of the utterance, as opposed to contributing enrichments of its explicit content (\u2018what is said\u2019 in Grice\u2019s terms or \u2018explicatures\u2019 in relevance theory\u2019s terms)<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 54pt; text-indent: -27pt; line-height: 150%;\"><!--[if !supportLists]--><span lang=\"EN-GB\"><span>(d)<span style=\"font-family: &quot;Times New Roman&quot;; font-style: normal; font-variant: normal; font-weight: normal; font-size: 7pt; line-height: normal; font-size-adjust: none; font-stretch: normal;\"> <\/span><\/span><\/span><!--[endif]--><span lang=\"EN-GB\">These inferences are scalar: they exploit pre-existing scales such as &lt;<em>some<\/em>, <em>all<\/em>&gt;, &lt;<em>or<\/em>, <em>and<\/em>&gt;, &lt;<em>possible<\/em>, <em>necessary<\/em>&gt;<\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><span lang=\"EN-GB\">We doubt all four claims. The bulk of this chapter will be devoted to explaining how experimental evidence has cast strong doubts on claim (a). First, however, we briefly present an argument that also casts doubt on (b), and we outline the relevance-theoretic approach, which is in contradiction with all four claims.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\">This idea that default implicatures or GCIs would permit more efficient and speedier communication may seem sensible and capable of lending support to the whole theory. It raises however the following empirical issue. If the frequency of GCI cancellations were too high, their cost would offset the benefit of deriving GCIs by default.<span> <\/span>Suppose for instance that a given type of GCI had to be cancelled a third of the time. The cost of the use of such a GCI would be that of deriving it by default in all cases plus the cost of cancelling it in one third of the cases. This would have to be compared with the cost of deriving the implicature as a \u2018particularized conversational implicature,\u2019 that is, in a contextually sensitive and therefore more costly way, in two thirds of the cases, but without any cost of default derivation followed by cancellation in the other third of the cases. It is not clear that, with such frequencies, the rationale given for GCIs in term of economy would make much sense.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\">To show that this kind of calculus is not unrealistic, consider the example of \u201cP or Q\u201d and its alleged GCI <em>not (P and Q)<\/em>. We are not aware of any statistical data regarding the frequency of exclusive uses of \u201cor\u201d and we share the common intuition that quite often, when people utter a sentence of the form \u201cP or Q\u201d they can be taken to consider that <em>P and Q<\/em> is excluded. This exclusion however need not be part of their meaning. In most cases this exclusion follows from real world knowledge and not from the interpretation of \u201cor\u201d, as illustrated in (5)-(7):<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 63pt; text-indent: -16.65pt; line-height: 150%;\"><span lang=\"EN-GB\">(5) He is a bachelor or he is divorced <\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 63pt; text-indent: -16.65pt; line-height: 150%;\"><span lang=\"EN-GB\">(6) Jane is in <\/span><span lang=\"EN-GB\">Paris<\/span><span lang=\"EN-GB\"> or in <\/span><span lang=\"EN-GB\">Madrid<\/span><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 63pt; text-indent: -16.65pt; line-height: 150%;\"><span lang=\"EN-GB\">(7) Bill will arrive Monday or Tuesday<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 63pt; text-indent: -16.65pt; line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><span lang=\"EN-GB\">If \u201cP or Q\u201d implicates by default that <em>not (P and Q)<\/em>, then, in all cases such as (5)-(7) where the two disjuncts cannot both be true for commonsense reasons, people automatically compute a GCI that causes the speaker\u2019s meaning to redundantly implicate what is already part of the common ground, and surely, this is a cost without associated benefit. Moreover, if one carefully excludes cases where mutual exclusivity of the disjuncts is self-evident and need not be communicated, and looks at cases such as (8)-(10) where neither the inclusive nor the exclusive interpretation is a priori ruled out, it is not at all obvious that the exclusive interpretation of \u201cor\u201d is dominant:<\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 63pt; text-indent: -16.65pt; line-height: 150%;\"><span lang=\"EN-GB\">(8)<span> <\/span><span> <\/span>She wears sunglasses or a cap<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 63pt; text-indent: -16.65pt; line-height: 150%;\"><span lang=\"EN-GB\">(9) <span> <\/span>Our employees speak French or Spanish<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 63pt; text-indent: -16.65pt; line-height: 150%;\"><span lang=\"EN-GB\">(10) <span> <\/span>Bill will sing or play the piano<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 63pt; text-indent: -16.65pt; line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><span lang=\"EN-GB\">We have no hard statistical data to present, but it seems less than obvious that a disposition to understand by default utterances of the form \u201cP or Q\u201d as implicating <em>not<\/em> <em>(P and Q) <\/em><span> <\/span>would render communication speedier or more efficient. More generally, the effect that GCIs would have on the efficiency of communication should be investigated rather than assumed.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%; page-break-after: avoid;\"><strong><span lang=\"EN-GB\">Relevance theory\u2019s approach<\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\">We will assume that the basic tenets of relevance theory are familiar (see Wilson &amp; Sperber 2004 for a recent restatement), and focus on how it applies to what neo-Griceans describe as \u2018scalar implicatures.\u2019 Two basic ideas play a crucial role here: <\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 54pt; text-indent: -27pt; line-height: 150%;\"><!--[if !supportLists]--><span lang=\"EN-GB\"><span>(a)<span style=\"font-family: &quot;Times New Roman&quot;; font-style: normal; font-variant: normal; font-weight: normal; font-size: 7pt; line-height: normal; font-size-adjust: none; font-stretch: normal;\"> <\/span><\/span><\/span><!--[endif]--><span lang=\"EN-GB\">Linguistic expressions serve not to <em>encode<\/em> the speaker\u2019s meaning but to <em>indicate<\/em> it. The speaker\u2019s meaning is inferred from the linguistic meaning of the words and expressions used taken together with the context.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 54pt; text-indent: -27pt; line-height: 150%;\"><!--[if !supportLists]--><span lang=\"EN-GB\"><span>(b)<span style=\"font-family: &quot;Times New Roman&quot;; font-style: normal; font-variant: normal; font-weight: normal; font-size: 7pt; line-height: normal; font-size-adjust: none; font-stretch: normal;\"> <\/span><\/span><\/span><!--[endif]--><span lang=\"EN-GB\">Inferring the speaker\u2019s explicit and implicit meaning (her explicatures and implicatures) is not done sequentially but in parallel. The final overall interpretation of an utterance results from a mutual adjustment of implicatures and explicatures guided by expectations of relevance.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\">Here is a simple illustration of these two points:<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 63pt; text-indent: -16.65pt; line-height: 150%;\"><span lang=\"EN-GB\">(11) <em>Henry<\/em>: Do you want to go on working, or shall we go to the cinema?<br \/>\n<span> <\/span><em>Jane<\/em>: I am tired. Let\u2019s go to the cinema.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 63pt; text-indent: -16.65pt; line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><span lang=\"EN-GB\">Jane\u2019s describing herself as \u201ctired\u201d achieves relevance as an explanation of her acceptance of Henry\u2019s suggestion. For this it must be understood that she is not just tired, but too tired to go on working, and at the same time not too tired to go to the cinema. Her used of \u201ctired\u201d serves to indicate an ad hoc concept TIRED* with an extension narrower than that of the linguistically encoded concept TIRED. Whereas TIRED extends from a minimal level of tiredness to complete exhaustion, TIRED* extends just over those levels of tiredness that explain why Jane would rather go to the cinema than work. Henry correctly understands Jane explicature to be (12) and her implicature to be (13), yielding an optimally relevant interpretation:<\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 63pt; text-indent: -16.65pt; line-height: 150%;\"><span lang=\"EN-GB\">(12) I am TIRED*<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 63pt; text-indent: -16.65pt; line-height: 150%;\"><span lang=\"EN-GB\">(13) The reason why I would rather go to the cinema than work is that I am TIRED*<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 63pt; text-indent: -16.65pt; line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><span lang=\"EN-GB\">Note that explicature (12), and in particular the interpretation of \u201ctired\u201d as indicating TIRED* is calibrated so as to justify implicature (13). The explicature therefore could only be inferred once the implicature had been tentatively assumed to be part of Jane\u2019s meaning. The overall interpretation results from a process of mutual adjustment between explicature and implicature.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\">Consider now an expression typically supposed to give rise to \u2018scalar implicatures\u2019 such as \u201csome of the Xs\u201d. From a semantic point of view, \u201csome of the Xs\u201d has as its extension the set of subsets of n Xs where n is at least 2 and at most the total number of the Xs. From a relevance-theoretic pragmatic point of view, the use of an expression of the form \u201csome of the Xs\u201d, just as that of any linguistic expression, serves not to encode the speaker\u2019s meaning, but to indicate it. In particular the denotation of the concept indicated by a given use of \u201csome of the Xs\u201d may be an ad hoc concept SOME OF THE Xs* with a denotation different from that of the literal SOME OF THE Xs. Rather than ranging over all subsets of Xs between 2 and the total number of the Xs, the extension of SOME OF THE Xs* may be narrowed down at either end, or it may be extended so as to include subsets of one. <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\">Imagine (14) uttered in a discussion of the spread of scientific knowledge in <\/span><span lang=\"EN-GB\">America<\/span><span lang=\"EN-GB\">:<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 35.45pt; text-indent: 7.1pt; line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 35.45pt; text-indent: 7.1pt; line-height: 150%;\"><span lang=\"EN-GB\">(14) Most Americans are creationists and some even believe that the Earth is flat<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 35.45pt; text-indent: 7.1pt; line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><span lang=\"EN-GB\">Clearly, the speaker is understood as meaning that a number of Americans much greater than two believe that the earth is flat. Two Americans with such a belief\u2014say two inmates in a psychiatric hospital\u2014would be enough to make her utterance literally true, but not, and by a wide margin, to make it relevant. Given that the speaker can be assumed to know that it is common knowledge that not all Americans believe that the earth is flat, there is no ground to assume that this is a part of her meaning (inferring it would not add any cognitive effect and it would involve a processing cost, hence it would detract from relevance). On the other hand, the speaker\u2019s contrastive use of \u201cmost\u201d and \u201csome\u201d and her use of \u201ceven\u201d make it part of her meaning that the Americans who believe the earth to be flat are fewer than those who believe in creationism (this, of course, entails that not all Americans believe that the earth is flat, but not every entailment of a speaker\u2019s meaning is part of that meaning). So the denotation indicated by \u201csome\u201d in (14) is narrower than its literal denotation at both ends: the subsets of Americans in the denotation of this occurrence of \u201csome\u201d are large enough to be relevant and hence much larger than sets of two Americans, and are smaller than the set of American creationists.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\">Let us now go back to a version of example (4). Jane and Henry have invited a few friends to a dinner party. Suppose first that it was agreed that Henry would go and get the desert from the pastry shop as soon as the guests started arriving. Henry is in the garage, he hears the bell ring and then Jane shouting (15) to him:<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 72pt; text-indent: -18pt; line-height: 150%;\"><span lang=\"EN-GB\">(15) <em>Jane to Henry<\/em>: Some of the guests have arrived<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 72pt; text-indent: -18pt; line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><span lang=\"EN-GB\"><span> <\/span>Henry does not know whether one, many, or all the guests have arrived, or, for that matter, whether Jane has already opened the door and seen how many of them there are, and the question need not even come to his mind. What makes Jane\u2019s utterance relevant is that it implies that he should go now and this does not depend on the number of guests at the door. Henry\u2019s construal of \u201csome\u201d is compatible with any number of guests having arrived, even just one, and hence is an extended construal of \u201csome\u201d.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\">Consider now a different scenario. Henry is alone in the kitchen cooking. Jane comes in and tells him (15). The consequences that Henry considers are that he should come and greet the guests and bring the finger food he has prepared as an appetizer. The value of \u201csome\u201d is taken to be a value for which these are the main consequences. If all the guests had come, what he should do would be not just to greet the guest and bring the finger food, but also and even more importantly, to put the fish in the oven and make the ultimate preparations for the meal itself. The fact that Jane\u2019s utterance achieves relevance without bringing to mind consequences more typical of the arrival of all the guests causes Henry to construe \u201csome\u201d with some vague cardinality above one and below all. Henry need not actively exclude <em>all<\/em>, he may just not even consider it. If however Henry is wondering whether all the guests have arrived, then he will take Jane\u2019s utterance to licence the inference that not all of them have. If moreover Henry had asked Jane whether all the guests had arrived, or if he knew that she knew that it was particularly relevant to him at this point in time, he would take that inference to be intended. He would also do so if she had put a contrastive stress on \u201csome\u201d, causing an extra effort and suggesting an extra effect. In other words, if there is some mutually manifest, actively represented reason to wonder whether all the guests have arrived, then (15) can be taken to implicate that not all of them have.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\">From a <span> <\/span>relevance theory point of view, (11), (14), and (15) are just ordinary illustrations of the fact that linguistic expressions serve to indicate rather than encode the speaker\u2019s meaning and that the speaker\u2019s meanings are quite often a narrowing down or broadening of the linguistic meaning. Taking \u201csome\u201d to indicate not <em>at least two and possibly all <\/em>but <em>at least two and fewer than all<\/em> is a common narrowing down of the literal meaning of \u201csome\u201d at the level of the explicature of the utterance. It is not automatic but takes place when the consequences that render the utterance relevant as expected are characteristically carried by this narrowed down meaning.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\">We are not denying that a statement of the form \u201c\u2026some\u2026\u201d may, in some cases carry an implicature of the form \u2026<em>not all<\/em>\u2026 (or, in other cases we will not discuss here, an implicature of the form <em>\u2026some\u2026not<\/em>\u2026). This occurs when the \u201c\u2026some\u2026\u201d <span> <\/span>utterance achieves relevance by answering a tacit or explicit question as to whether <em>all<\/em> items satisfy the predicate.The fact that it does not answer it positively <em>implicates<\/em> a negative answer and therefore a narrowed down construal of \u201csome\u201d as excluding all. Standard accounts of \u2018scalar implicatures\u2019 fail to distinguish the cases where the explicature merely entails <em>\u2026not all\u2026<\/em> and the much less frequent cases where, moreover, the utterance implicates <em>\u2026not all\u2026<\/em> .<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\">In all the cases where the meaning of \u201csome\u201d in an utterance is narrowed down so as to exclude <em>all<\/em>, this is the result of an inferential process that looks at consequences that might make the utterance relevant as expected, and that adjusts the meaning indicated by \u201csome\u201d to these consequences. In particular if what may make the utterance relevant is an implication that is true of some Xs but not of all Xs, then the meaning of \u201csome\u201d is adjusted so as to exclude<em> all<\/em>. These inferential processes result from the automatic attempt by the hearer to find an interpretation of the utterance that meets his expectation of relevance and they all follow the same heuristics. There is nothing distinctive in the way \u2018scalar\u2019 inferences are drawn. Moreover, the class of cases described in the literature as scalar inferences is characterised by an enrichment at the level of the explicature (where, for instance, \u201csome\u201d is reinterpreted in a way that excludes <em>all<\/em>) and only in a small sub-class of these is the exclusion of the more informative concept not just entailed but also implicated.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\">According to relevance theory, then, so called \u2018scalar implicatures\u2019 are not scalar, nor necessarily implicatures. Of course, the notion of \u2018scalar implicature\u2019 could be redefined to fit just cases where there is an explicit or implicit question as to whether the use of a more informative expression than the one employed by the speaker (e.g. \u201call\u2019 instead of \u201csome\u201d) would have been warranted, and in such cases, a denial of a more informative claim can indeed be implicated by the use of the less informative expression. However, \u2018scalar implicatures\u2019 in this restricted sense depend on contextual premises (linked to the fact that the stronger claim was being entertained as a relevant possibility) rather than on a context-independent scale, and are not candidates therefore for the status of GCI.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\">From the point of view of relevance theory then, the <\/span><span lang=\"EN-US\">classical neo-Gricean theory of scalar implicatures <\/span><span lang=\"EN-GB\">can be seen as a mistaken generalisation of the relatively rare case where a weaker claim genuinely <em>implicates<\/em> the denial of a stronger claim that is contextually under consideration to the much more common case where the denotation of an expression is narrowed down so as to exclude marginal or limiting instances carrying untypical implications. For instance \u201cpossible\u201d as in (3a) (\u201cIt is possible that Hillary will win\u201d) is often construed as excluding, on one side, mere metaphysical possibility with very low empirical probability, and, on the other side, certainty and quasi-certainty. The trimming of \u201cpossible\u201d at both ends results in an enriched and generally more relevant meaning. Since the trimming at the very high probability end is not different from that at the very low probability end, both should be explained in the same way, ruling out the scalar aspect of the \u2018scalar implicature\u2019 account, which works, if at all, only at the upper end. On the other hand, if (3a) were uttered in reply to the question: \u201cIs it certain that Hillary will win?\u201d, then it would indeed implicate<span style=\"color: #333399;\"> <\/span>(3c) (\u201cIt is not certain that Hillary will win\u201d) because it would achieve relevance by implicitly answering in the negative a question that had been asked. From a relevance theory point of view, the two cases should be distinguished.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\">This is not the place to compare in detail the GCI and the relevance-theoretic approaches. We focus rather on a testable difference in prediction between them. Levinson writes: \u201cGCI theory clearly ought to make predictions about process. But here the predictions have not yet been worked out in any detail\u201d (Levinson 2000:370). There is however one prediction about process that follows quite directly from GCI theory since it is hardly more than a restatement of some of the tenets of the theory. According to the theory, GCIs are computed by default and are contextually cancelled when needed. Both the computation of GCIs and their cancellation are processes and therefore should take each some time and effort (even if the default character of GCI should make their computation quite easy and rapid). Everything else being equal, less effort should be expended and less time taken in the normal case where a GCI is computed and not cancelled than in the exceptional case where a GCI is first computed and then cancelled. Relevance theory predicts just the opposite pattern. <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\"><span> <\/span><\/span><span lang=\"EN-US\">From a relevance-theoretic perspective, the speaker\u2019s meaning is always inferred, even when it consists in a literal <\/span><span lang=\"EN-GB\">interpretation<\/span><span lang=\"EN-US\"> of the linguistic expressions used. The inferences involved, however, differ in the time and effort they require. Both the sentence meaning and the context contribute to making some <\/span><span lang=\"EN-GB\">interpretations more easily derived than others. If only sentence meaning were involved, one should predict that the smaller the distance between it and the speaker\u2019s meaning it serves to indicate, the lesser would be the time and effort required to infer the speaker\u2019s meaning. Contextual factors, however, must be taken into account. For instance, an <\/span><span lang=\"EN-US\">enriched interpretation may be primed by the context<\/span><span lang=\"EN-GB\"> and, as a result may be easier to infer than a<\/span><span lang=\"EN-GB\"> <\/span><span lang=\"EN-US\">literal <\/span><span lang=\"EN-GB\">interpretation<\/span><span lang=\"EN-US\">. Consider a variation of example (11):<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-US\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 49.65pt; text-indent: -22.65pt; line-height: 150%;\"><span lang=\"EN-US\">(16) <em>Henry<\/em>: You look tired, let\u2019s go to the cinema\u201d<br \/>\n<em>Jane<\/em>:<span> <\/span>I am tired, but not too tired to go on working<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 49.65pt; text-indent: -22.65pt; line-height: 150%;\"><span lang=\"EN-US\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><span lang=\"EN-US\">A natural <\/span><span lang=\"EN-GB\">interpretation<\/span><span lang=\"EN-US\"> of Henry\u2019s utterance involves the ad hoc concept TIRED* such that being TIRED* is a sufficient reason to stop working and not a sufficient reason to stay at home. Jane could have answered, \u201cNo, I am not tired, I\u2019ll go on working\u201d meaning that she was not TIRED*. When Jane, rather, asserts that she <em>is<\/em> tired, Henry is primed to interpret \u201ctired\u201d as TIRED*. A relevant <\/span><span lang=\"EN-GB\">interpretation<\/span><span lang=\"EN-US\"> of Jane\u2019s whole utterance, however, imposes a broader, more literal and, in this situation, more effortful <\/span><span lang=\"EN-GB\">construal of the term. <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 1cm; line-height: 150%;\"><span lang=\"EN-GB\">Even when an enriched interpretation of an utterance is not primed, it may require less processing effort than would the literal interpretation because the contextual implications that render relevant the enriched interpretation are more easily arrived at than those that would render relevant the literal interpretation. This typically occurs with metaphorical utterances: a relevant literal interpretation is often hard or even impossible to construct.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\">In the absence of contextual factors that make an enriched interpretation of an utterance easier to arrive at, relevance theory predicts that a literal interpretation\u2014which involves just the attribution to the speaker of a meaning already provided by linguistic decoding\u2014should involve shallower processing and take less time than an enriched one\u2014which involves a process of meaning construction. Such is the case in particular in the experiments we describe below. <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\">The difference in prediction between GCI theory and relevance theory can be presented in table form:<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"page-break-after: avoid;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<table class=\"MsoTableGrid\" style=\"border: medium none ; margin-left: 50.4pt; border-collapse: collapse;\" border=\"1\" cellspacing=\"0\" cellpadding=\"0\">\n<tbody>\n<tr>\n<td style=\"border: medium none; padding: 0cm 5.4pt; width: 27pt;\" rowspan=\"3\" width=\"36\" valign=\"top\">\n<p class=\"MsoNormal\" style=\"page-break-after: avoid;\"><strong><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"page-break-after: avoid;\"><strong><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"page-break-after: avoid;\"><strong><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"page-break-after: avoid;\"><strong><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"page-break-after: avoid;\"><strong><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"page-break-after: avoid;\"><strong><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"page-break-after: avoid;\"><strong><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"page-break-after: avoid;\"><strong><span style=\"font-family: Arial;\" lang=\"EN-GB\">Interpretation of the scalar   term<\/span><\/strong><\/p>\n<\/td>\n<td style=\"padding: 0cm 5.4pt; width: 108pt;\" width=\"144\" valign=\"top\">\n<p class=\"MsoNormal\" style=\"page-break-after: avoid;\"><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<\/td>\n<td style=\"padding: 0cm 5.4pt; width: 162pt;\" width=\"216\" valign=\"top\">\n<p class=\"MsoNormal\" style=\"margin: 6pt 0cm; text-align: center; page-break-after: avoid;\" align=\"center\"><strong><span style=\"font-family: Arial;\" lang=\"EN-GB\">GCI theory<\/span><\/strong><\/p>\n<\/td>\n<td style=\"padding: 0cm 5.4pt; width: 117pt;\" width=\"156\" valign=\"top\">\n<p class=\"MsoNormal\" style=\"margin: 6pt 0cm; text-align: center; page-break-after: avoid;\" align=\"center\"><strong><span style=\"font-family: Arial;\" lang=\"EN-GB\"><span> <\/span>relevance theory<\/span><\/strong><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 0cm 5.4pt; width: 108pt;\" width=\"144\" valign=\"top\">\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><strong><span style=\"font-family: Arial;\" lang=\"EN-GB\">literal<\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<\/td>\n<td style=\"padding: 0cm 5.4pt; width: 162pt;\" width=\"216\" valign=\"top\">\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\">default enrichment + context-sensitive cancellation,<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><strong><em><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\">hence slower<\/span><\/em><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><strong><em><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/em><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><em><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/em><\/p>\n<\/td>\n<td style=\"padding: 0cm 5.4pt; width: 117pt;\" width=\"156\" valign=\"top\">\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\">no enrichment,<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><strong><em><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/em><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><strong><em><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\">hence faster<\/span><\/em><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 0cm 5.4pt; width: 108pt;\" width=\"144\" valign=\"top\">\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><strong><span style=\"font-family: Arial;\" lang=\"EN-GB\">enriched<\/span><\/strong><\/p>\n<\/td>\n<td style=\"padding: 0cm 5.4pt; width: 162pt;\" width=\"216\" valign=\"top\">\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\">default enrichment,<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><strong><em><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\">hence faster<\/span><\/em><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><strong><em><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/em><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<\/td>\n<td style=\"padding: 0cm 5.4pt; width: 117pt;\" width=\"156\" valign=\"top\">\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\">context-sensitive enrichment,<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; page-break-after: avoid;\" align=\"center\"><span style=\"font-size: 11pt; font-family: Arial;\" lang=\"EN-GB\"><span> <\/span><strong><em>hence slower<\/em><\/strong><\/span><\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%; page-break-after: avoid;\"><span style=\"font-family: Arial;\" lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 49.65pt; text-indent: -49.65pt; page-break-after: avoid;\"><span style=\"text-decoration: underline;\"><span style=\"font-family: Arial;\" lang=\"EN-GB\">Table 1<\/span><\/span><span style=\"font-family: Arial;\" lang=\"EN-GB\">: contrasting predictions of GCI Theory and <span> <\/span>relevance theory regarding the speed of interpretation of scalar term (when an enriched construal is not contextually primed)<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><span lang=\"EN-GB\">This difference in prediction between the two theories is of a type that lends itself to experimental investigation.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 27pt; line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%; page-break-after: avoid;\"><strong><span lang=\"EN-GB\">Methodological considerations in experimental approaches to \u2018scalar inferences\u2019<\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">In the experimental study of case of scalar inferences,<a name=\"_ftnref1\"><\/a> one has to keep four methodological considerations in mind. To begin with, one wants to be sure that a given result (whether it be the rate of responses that indicate a pragmatic enrichment or the mean reaction time associated with an enrichment) is a consequence of the experiment\u2019s intended target and not of other contextual variables. For example, one would want to be sure that the understanding of a disjunctive statement of the form <em>P or Q<\/em> as excluding <em>P and Q<\/em> is due to the pragmatic enrichment of the term \u201cor\u201d (from an inclusive to an exclusive interpretation) and not to some other feature. Thus, one would avoid investigating utterances that invite an exclusive understanding of the situation described rather than of the description itself. In example (6) (\u201cJane is in Paris or in Madrid\u201d) above, the exclusive understanding is based on our knowledge that a person cannot be in two places at the same time and need not involve any pragmatic enrichment of the meaning of the word \u201cor\u201d. In devising experimental material, it thus becomes important to invent examples where an enriched interpretation is not imposed by extra-pragmatic considerations. One can do this by using either examples where participants knowledge is equally compatible with a literal or an enriched interpretation of a scalar term, or examples where knowledge considerations might bias participants in favour of a literal interpretation: in both cases, if one finds evidence of enrichment, one will be confident that it comes from a pragmatic inference about what the utterance meant, rather than from a mere understanding of how the world is.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">Second, one would want a paradigm that allows for two identifiable outcomes so that the presence of an enrichment can be indicated by a unique sort of response while a non-enrichment can be indicated by a different response. This is why most of the experiments on scalars described here involve a scenario that could be described by means of a more informative utterance than the test utterance (uttered by a puppet or some other interlocutor). <span> <\/span>Imagine for example being shown five boxes each containing a token and then being told, \u201cSome boxes contain a token.\u201d If one interprets <em>\u201c<\/em>some<em>\u201d<\/em> literally (i.e. as compatible with <em>all<\/em>), one would agree with the statement; if one enriches <em>\u201c<\/em>some<em>\u201d<\/em> so as to make it incompatible with <em>all<\/em>, one would have to disagree. In such conditions, a participant\u2019s response (agrees or disagrees) is revealing of a particular interpretation.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">Third, one wants every assurance that an effect is robust. That is, one wants to see the same result over and over again and across a variety of comparable tasks. When two similar studies (for instance two studies investigating different scalar terms but in an equivalent manner) present comparable outcomes, each strengthens the findings of the other. On the other hand, if two very similar experiments fail to produce the same general effects, something is wrong. This does not mean that negative results are necessarily fatal for an experimental paradigm . If one carefully modifies an experiment and it prompts a different sort of outcome than previous ones (and in a predictable manner), it helps determine the factors that underlie an effect. This occurs with the developmental findings to be described below, which have generally shown that children are more likely than adults to <em>agree<\/em> with a weak statement (for instance the statement \u201cSome horses jumped over a fence\u201d) when a stronger one would be pragmatically justified (because, in fact, all the horses jumped over a fence). All sorts of follow-up studies have aimed to put this effect to the test. In general, the effect has been resilient; a few studies, however, show that one can get children to appear more adult-like through specific sorts of modifications. For example, experimenters have aimed to verify the effect under conditions where participants are given training or where scenarios are modified to highlight the contrast between the weak utterance and the stronger scenario. The net result is that the outcomes of these tests collectively help identify the factors that can encourage scalar inference-making.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">Fourth, it is important for any experiment to include as many reasonable controls as possible. These are test questions that are similar to the main items of interest, but aim basically to confirm that there is nothing bizarre in the task. For example, if one finds that participants\u2019 responses indicate that they enrich \u201csome\u201d but also that the same participants endorse the use of the word \u201csome\u201d to describe a scene where \u201cnone\u201d would be appropriate, then there is something questionable about the experiment. This rarely happens (the above example is presented for illustrative purposes only), but one needs to provide assurances to oneself and to readers that such bizarreness can be ruled out. Any decent task will include several controls that lead to uncontroversial responses in order to, in effect, contextualize the critical findings. The studies we will discuss exemplify the four methodological considerations we have just discussed.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><strong><span lang=\"EN-GB\">Developmental studies<\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">The experimental study of scalar inferences started within the framework of developmental studies on reasoning. Noveck (2001) investigated the way children responded (by agreeing or disagreeing) to a puppet who presented several statements, including one that could ultimately lead to a pragmatic enrichment. All statements, even those that served as controls to confirm that the participants understood the task, were about the contents of a covered box and were presented by a puppet (handled by the experimenter). Participants were told that the contents of the covered box resembled those of one or the other of two other boxes both which were open and with their contents in full view. One open box contained a parrot and the other contained a parrot and a bear. The participants then heard the puppet say<a name=\"_ftnref2\"><\/a>:<\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin: 0cm 28.25pt 0.0001pt 78pt; text-indent: -21.25pt; line-height: 150%;\"><span lang=\"EN-GB\">(17) A friend of mine gave me this (covered) box and said, \u201cAll I know is that whatever is inside this box (the covered one) looks like what is inside this box (the one with a parrot and bear) or what is inside this box (the one with just a parrot).\u201d<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><span lang=\"EN-GB\">The participant\u2019s task was to say whether or not he agreed with further statements of the puppet. The key item was ultimately the puppet\u2019s \u201cunderinformative\u201d statement:<\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin: 0cm -0.1pt 0.0001pt 78pt; text-indent: -21.25pt; line-height: 150%;\"><span lang=\"EN-GB\">(18) There might be a parrot in the box. <\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><span lang=\"EN-GB\">Given that the covered box <em>necessarily<\/em> contained a parrot, the statement in (18) can be answered in one of two ways. The participant can \u201cagree\u201d if she interprets \u201cmight\u201d literally (so that &#8230;<em>might<\/em>\u2026<em> <\/em>is compatible with \u2026<em>must<\/em>\u2026) or she can \u201cdisagree\u201d if she interprets <em>might<\/em> in an enriched way (where &#8230;<em>might\u2026 <\/em>is incompatible with \u2026<em>must\u2026<\/em>). <span> <\/span>Adults tended to be equivocal with respect to these two interpretations (35% agreed with the statement) while children (5-, 7- and 9-year-olds) tended to interpret this statement in a minimal way, i.e. literally. Collectively, 74% of the children responded by agreeing with the statement in (18). However, not all children were alike.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">The five year olds agreed with (18) at a rate of 72% (a percentage that is unlikely to occur by chance, which would yield 50% in such agree\/disagree contexts). Nevertheless, they failed to answer many control questions at such convincing rates. For example, when asked to agree or disagree with statements about the bear (\u201cThere has to be a bear,\u201d \u201cThere might be a bear,\u201d \u201cThere does not have to be a bear,\u201d \u201cThere cannot be a bear\u201d) they answered at levels that were comparable to those predicted by chance (55% correct across the four questions). Seven-year-olds, on the other hand, did manage to answer practically all seven control problems at rates that indicated they understood the task overall (77%). This is why Noveck (2001) reported that seven-year-olds were the youngest to demonstrate competence with this task while at the same time revealing that they preferred the literal interpretation of \u201cmight\u201d (at a rate, 80%, that is statistically distinguishable from expectations based on chance). The seven-year-olds thus provided the strongest evidence showing that those linguistically competent children who performed well on the task overall still interpreted \u201cmight\u201d in an unenriched way. As one might expect, the nine-year-olds also answered control problems satisfactorily. Response rates indicating unenriched interpretations of \u201cmight\u201d were high (69%) and much higher than the adults\u2019 but nevertheless were statistically indistinguishable from predictions based on chance suggesting that these children were <em>beginning<\/em> to appear adult-like with respect to (18). <span> <\/span>Overall, these results were rather surprising for a reasoning study because they indicated that children were more likely than adults to produce a logically correct evaluation of the underinformative modal statement. This sort of response is surprising and rare, but thanks to a pragmatic analysis\u2014where pragmatic enriched interpretations are viewed as likely to result from a richer inferential process than minimal interpretations that add nothing to semantic decoding\u2014these results had a ready interpretation.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">Despite taking every precaution (having numerous control items and sampling many children), one can never exclude that such effects might be the result of some subtle factor beyond the experimenter\u2019s intention or control. That is why\u2014especially when encountering counterintuitive results like these\u2014it pays to do follow-ups. There have been essentially two sorts.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">The first sort aims to verify the effect. In one follow-up (Noveck, 2001, Exp. 2), the same task as the one above was given to 5-year-olds and 7-year-olds as well as adults, but all participants were given more thorough training to ensure that they understood the parameters of the task. This was done through training on an identical scenario (one box containing a horse and a fish and another just a horse) where pointed questions were asked about the covered box (e.g. <em>Could there be a fish by itself in the box?<\/em>). Overall, the training increased rates of minimal interpretations of \u201cmight\u201d across all three ages when it came to the task of Experiment 1. Agreement with a statement like the one in (18) was now 81% for five-year-olds, 94% for seven-year-olds, and 75% for adults. Although rates of such minimal interpretations were statistically comparable across ages, one finds the same trends as in the first Experiment. Seven-year-olds again demonstrated (through performance with the control problems) that they were the youngest to demonstrate overall competence with the task while <em>tending<\/em> to be more likely than adults in retaining a literal interpretation of the weak scalar term. The data also revealed that the extra training encourages adults to behave more \u201clogically\u201d (to stick to the literal meaning of \u201cmight\u201d), like the children.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">In an effort to establish the developmental effect\u2019s reliability and robustness, Noveck (2001, Exp. 3) took advantage of an older study that (a) unintentionally investigated weak scalar expressions among 4- to 7-year-old children and that (b) also failed to show evidence of pragmatic enrichment. Smith (1980) presented statements such as \u201cSome giraffes have long necks\u201d to children and reported that it was surprising to find the children accepting these as true. In a third experiment, therefore, Noveck (2001) essentially continued from where Smith left off. The experiment adopted the same technique as Smith (which included pragmatically felicitous statements such as \u201cSome birds live in cages\u201d as well as statements with \u201call\u201d) in order to verify that the developmental findings of the first two experiments were not flukes. The only differences in this third experiment were that the children were slightly older (8- and 10-years-old) than in the first two studies and that the experimenter was as \u201cblind\u201d to the intention of the study as the participants (the student who served as experimenter thought that unusual control items such as \u201cSome crows have radios\u201d or \u201cAll birds have telephones\u201d were the items of interest). The results showed that roughly 87% of children accepted statements like \u201cSome giraffes have long necks\u201d whereas only 41% of adults did. Again, adults were more likely than children to enrich the interpretation of the underinformative statements (understanding \u2026<em>some<\/em>\u2026 as excluding \u2026<em>all<\/em>\u2026) and thus tended to reject them (since<\/span><span style=\"font-family: Times;\" lang=\"EN-GB\"> all giraffes have long necks<\/span><span lang=\"EN-GB\">). All participants answered the five sorts of control items (25 items altogether) as one would expect.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">These data prompted Noveck (2001) to revisit other classic studies that serendipitously contained similar scenarios (ones where a stronger statement would be appropriate but a weaker one is made) to determine whether they tell the same story as \u201cmight\u201d and \u201csome\u201d. In fact, three studies concerning \u201cor\u201d (Paris, 1973, Braine and Rumain, 1981, Sternberg, 1983), where a conjunctive situation is described with a weaker disjunction, provides further confirming evidence. The authors of these studies also reported counter-intuitive findings showing younger children being, in effect, more logical than adults (children tend to treat \u201cor\u201d inclusively more often than adults). None of these authors, lacking a proper pragmatic perspective, knew how to make sense of these data at the time. All told, this effect appeared robust.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">Other follow-up studies have actually taken issue with Noveck\u2019s <em>interpretation<\/em> of the findings. In fact, Noveck (2001, p.184) insisted that his data show that <span> <\/span>children are ultimately less likely than adults to pragmatically enrich underinformative items across tasks; this did not amount to a claim that children lacked pragmatic competence. Still, much work has been aimed at showing that young children are more competent than it might appear. These studies usually take issue with Noveck\u2019s Experiment 3 (the one borrowed from Smith, 1980) because it concerns the quantifier \u201csome\u201d (which is of more general interest than \u201cmight\u201d) and because the items used in that task are admittedly unusual (see Papafragou and Musolino, 2003; Chierchia, Guasti, Gualmini, Meroni, Crain &amp; Foppolo, 2004; Guasti, Chierchia, Crain, Foppolo, Gualmini and Meroni, 2005; Feeney, Scrafton, Duckworth, &amp; Handley, 2005).<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">We highlight here the main advances of these studies. In two sets of studies, Papafragou and colleagues (Papafragou and Musolino, 2003; Papafragou and Tantalou, 2004) aimed to show that children as young as five are generally able to produce implicatures if the circumstances are right. Actually, Papafragou and Musolino (2003, Experiment 1) first confirmed the developmental effect summarized above by showing that 5-year-olds are less likely than adults to produce enrichments with \u201csome\u201d, \u201cstart\u201d, and \u201cthree\u201d, in cases where a stronger term was called for (namely, \u201call\u201d, \u201cfinished\u201d, and a \u201clarger number,\u201d respectively). They then modified the experimental setup in two ways in order to prepare their second experiment. First, before they were tested, participants received training aimed at enhancing their awareness to pragmatic anomalies. Specifically, children were told that the puppet would say \u201csilly things\u201d and that the point of the game was to help the puppet say it better (e.g. they would be asked whether a puppet described a dog appropriately by saying \u201cthis is a little animal with 4 legs\u201d). In the event that the child did not correct the puppet, the experimenter did. Second, the paradigm put the focal point on a protagonist\u2019s performance. Unlike in their Experiment 1, where participants were asked to evaluate a quantified statement like \u201cSome horses jumped over the fence\u201d (when in fact all the horses did), the paradigm in Experiment 2 raises the expectations that the stronger statement (with \u201call\u201d) might be true. Participants would hear a test statement like, \u201cMickey put some of the hoops around the pole\u201d (after having his been shown to succeed with all of the hoops), and they were also told how Mickey claims to be especially good at this game and that this is why another character challenges him to get all three around the pole. With these changes, 5-year-olds were more likely to produce enrichments than they were in the first experiment. Nevertheless, the five year olds, even in the second experiment, still produced enrichments less often than did adults. This indicates that \u2013 even with training and with a focus on a stronger contrast \u2013 pragmatic enrichments require effortful processing among children.<span class=\"MsoFootnoteReference\"> <a name=\"_ftnref3\"><\/a><\/span> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">Guasti et al. (2005) argue that pragmatic enrichments ought to be as common among five year olds as they are among adults and further investigated the findings of Noveck (2001) and Papafragou and Musolino (2003). <span> <\/span>In their first experiment, they replicated the finding of Noveck (2001, Experiment 3) with \u201csome\u201d with 7-year-olds and used this as a baseline to study independently the role of the two factors manipulated by Papafragou and Musolino (2003). One factor was the role of training and how it affects children\u2019s proficiency at computing implicatures (Experiments 2 and 3) and the other was the role of placing emphasis on the outcome of a scalar implicature (Experiment 4). Their Experiments 1 through 3 showed that training young participants to give the most specific description of a given situation can indeed have a major effect on performance. While their initial experiment showed that 7-year-olds accept statements such as \u201cSome giraffes have long necks\u201d 88% of the time (against 50% for adults), when trained in this manner their acceptance rate drops to 52%, becoming adult-like. Nonetheless, this effect is short lasting, i.e. it does not persist when the same participants are tested a week later (Experiment 3). In the last experiment, the authors rendered the <em>all <\/em>alternative more salient in context. This was achieved, for instance, by presenting participants with a story where several characters have to decide whether the best way to go collect a treasure was to drive a motorbike or ride a horse. After some discussion, all of them choose to ride a horse. In this way it is made clearer that the statement subjects have to judge, \u201csome of the characters chose to ride horse,\u201d is underinformative. The results indicated that children are more likely to infer an enriched interpretation in an adult-like manner when the context makes this enrichment highly relevant.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">This last finding shows that one can create situations that encourage children to pragmatically enrich weak-sounding statements and to do so in an adult-like manner. It does not alter the fact that in less elaborate scenarios where cues to enrichment are less abundant, seven-year olds do not behave in this manner and it does not tell us what younger children do. Overall, the developmental effect shows that pragmatic enrichments are somewhat effortful. In experimental settings, the required effort can be somewhat lowered or the motivation to perform it may be heightened, but in the absence of such contextual encouragements, younger children faced with a weak scalar term are more likely to stick with its linguistically encoded meaning. <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">If children had been found to perform scalar inferences by default, this would have been strong evidence in favour of the GCI theory approach. However, taken together, developmental data suggest that, for children, enriched interpretations of scalar terms are not default interpretations. This data is not knock down evidence against GCI theory, because it is compatible with two hypotheses: 1) scalar inferences are not default interpretations for adults either (even if adults are more likely to derive them because they can do so with relatively less effort and because they are more inclined to invest effort in the interpretation of an utterance given their greater ability to derive from it cognitive effects). Or, 2) in the course of development, children become capable and disposed to perform scalar inferences by default. The first hypothesis is consistent with the relevance theory approach while the second is consistent with the GCI approach. To find out which approach has more support, further work had to be done with adults.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><strong><span lang=\"EN-GB\">Time course of comprehension among adults<\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.45pt; line-height: 150%;\"><span lang=\"EN-GB\">As we mentioned before, GCI theory implies that a literal interpretation of a scalar, resulting from the cancellation of default enrichment, should take longer than an enriched interpretation, whereas <span> <\/span>relevance theory, denying that enrichment takes place by default, implies that an enriched interpretation, being computed when needed to meet contextual expectations of relevance, should take longer than a literal one. What is needed to test these contrasting predictions are experiments manipulating and measuring the time course of the interpretation of statements with weak scalar terms.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">As in the developmental tasks, one wants to make sure that enriched interpretations are clearly identifiable through specific responses, that the tasks used includes a variety of controls, and that the effect is reliable and robust. One way to identify enriched vs. literal interpretations is provided by earlier studies where participants were asked to judge true or false statements (such as \u201csome elephants are mammals\u201d) that could either be construed as literally true but underinformative, or in an enriched manner (as implying \u2026<em>not all<\/em>\u2026) and false. Hence participants\u2019 truth-value judgements reflect their literal or enriched interpretation.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">As we indicated, prior work was critical to developing the appropriate measures. In fact, Rips (1975) unintentionally included the right sort of cases when looking at other issues of categorization and with materials such as \u201c<span>some congressmen are politicians<\/span>.\u201d He examined the effect of the interpretation of the quantifier by running two studies, one in which participants were asked to treat \u201csome\u201d as meaning <em>some and possibly all<\/em> and another where they were asked to treat \u201csome\u201d as meaning <em>some but not all<\/em>. This comparison demonstrated that participants given the <em>some but not all<\/em> instructions in one experiment responded more slowly than those given the <em>some and possibly all<\/em> instructions in another. Despite these indications, Rips modestly hedged when he concluded that \u201cof the two meanings of <em>Some<\/em>, the informal meaning <em>may<\/em> be the more difficult to compute\u201d (italics added). To make sure that Rips\u2019s data were indeed indicative of a slowdown related to <em>Some but not all<\/em> readings, Bott &amp; Noveck (2004) ran a series of four experiments that followed up on Rips (1975) and essentially verified that enriched interpretations take longer than literal ones.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">Bott and Noveck\u2019s categorization task involved the use of underinformative items (e.g. \u201cSome cows are mammals\u201d) and five controls that varied the quantifier (<em>Some<\/em> and <em>All<\/em>), the category-subcategory order, as well as proper membership. The 6 types of statements are illustrated with the six possible ways one can employ the subcategory <em>elephants<\/em> below, but it should be pointed out that the paradigm was set up so that the computer randomly paired a given subcategory with a given category while verifying that, at the end of each experimental session, there were nine instances of each type:<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 78pt; text-indent: -21.3pt; line-height: 150%;\"><span lang=\"EN-GB\">(19) (a) Some elephants are mammals (Underinformative).<br \/>\n(b) Some mammals are elephants.<br \/>\n(c) Some elephants are insects<br \/>\n(d) All elephants are mammals.<br \/>\n(e) All mammals are elephants.<br \/>\n(f) All elephants are insects.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\"><span> <\/span><\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 1cm; line-height: 150%;\"><span lang=\"EN-GB\">In the first Experiment, a sample of 22 participants was presented with the same task twice, once with instructions to treat \u201csome\u201d<em> <\/em>as meaning <em>Some and possibly all<\/em> and once with instructions to treat \u201csome\u201d<em> <\/em>as meaning <em>Some but not all<\/em> (and, of course, the order of presentation was varied). When participants were under instruction, in effect, to engage the scalar inference, they were shown to be less accurate and take significantly longer to respond to the underinformative items (like those in (19a)). Specifically, when instructions called for a <em>Some but not all<\/em> interpretation, rates of correct responses to the Underinformative item (i.e. judging the statement \u201cfalse\u201d) were roughly 60%; when instructions called for a <em>Some and possibly all<\/em> interpretation<em> <\/em>rates of correct responses to the Underinformative item (i.e. judging the statement \u201ctrue\u201d) were roughly 90%. For the control items, rates of correct responses were always above 80% and sometimes above 90%. One can see that the Underinformative case in the <em>Some but not all<\/em> condition provides exceptional data.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">The reaction time data showed that the correct responses to the Underinformative item in the <em>Some but not all<\/em> condition were exceptionally slow. It took roughly 1.4 seconds to correctly evaluate the underinformative statements in the <em>Some but not all<\/em> condition and around .8 seconds in the <em>Some and possibly all<\/em> conditions. To answer the control items\u2014across both sorts of instructions\u2014took at most 1.1 seconds but more often around .8 to .9 seconds. Thus, <span> <\/span>the underinformative statement in the <em>Some but not all <\/em>condition is the one most affected by the instructions. All this confirms Rips\u2019s initial findings. More importantly, there is not a single indication that interpreting \u201csome\u201d to mean <em>some but not all<\/em> is an effortless or quasi effortless step. Again, a default view of scalar inference would predict that under <em>Some but not all<\/em> instructions, responses to Underinformative statements would require less time than responses under <em>Some and possibly all <\/em>instructions. According to an account based on <span> <\/span>relevance theory, one should find the opposite. The data more readily support the relevance-theoretic account.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">A potential criticism of this Experiment is that the lower accuracy and the slowdown might be due to a response bias in favour of positive rather than negative response, given that the correct response to the Underinformative statement with the <em>Some and possibly all<\/em> instructions is to say \u201cTrue\u201d while the correct response to the Underinformative statement with the <em>Some but not all<\/em> instructions is to say \u201cFalse.\u201d To allay concerns regarding such a potential response bias, Bott and Noveck demonstrated experimentally that the effects linked to pragmatic effort are not simply due to hitting the \u201cFalse\u201d key.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">In a second experiment, the paradigm was modified so that the same overt response could be compared across both sorts of instructions; this way, participants\u2019 response choice (True vs. False) could not explain the observed effects. In order to arrive at this comparison, participants were not asked to agree or disagree with first-order statements such as those in (19), but with second order statements made about these first-order statements. For example, participants were presented with the two statements: \u201cMary says the following sentence is false\u201d \/ \u201cSome elephants are mammals.\u201d They were then asked to agree or disagree with Mary\u2019s second-order statement. In such a case, participants instructed to treat \u201csome\u201d<em> <\/em>as meaning <em>Some but not all <\/em>should agree, whereas participants instructed to treat \u201csome\u201d<em> <\/em>as meaning <em>Some and possibly all <\/em>should disagree, reversing the pattern of positive and negative response of the previous experiment.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">The results from this second experiment were nevertheless remarkably similar to those of the first one. Here, when participants were under instruction to, in effect, draw the scalar inference, they were less accurate and took significantly longer to respond correctly to the underinformative item. When \u201cagree\u201d was linked with instructions for a <em>Some but not all<\/em> interpretation, rates of correct responses were roughly 70%; when \u201cagree\u201d was linked with instructions for a <em>Some and possibly all<\/em> interpretation,<em> <\/em>rates of correct responses were roughly 90%. For all control items, rates of correct responses were always above 85% and often above 90%. One can see that, once again, the Underinformative case in the <em>Some but not all<\/em> condition provides exceptional data. The reaction time data also showed that the correct \u201cagree\u201d responses to the Underinformative item in the <em>Some but not all<\/em> condition were exceptionally slow. It took nearly 6 seconds to evaluate the underinformative statements correctly when \u201cagree\u201d was linked with instructions for a <em>Some but not all<\/em> interpretation and around 4 seconds when \u201cagree\u201d was linked with instructions in the <em>Some and possibly all<\/em> condition (all reaction times were longer than in the previous experiment due to the <em>Mary says<\/em> statement). The control items across both sorts of instructions took on average around 4.5 seconds and never more than 5 seconds. Again, the experiment demonstrated that any response that requires a pragmatic enrichment implies extra effort.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">Both of these experiments, though inspired by previous work, are arguably unnatural. It is unusual to instruct participants in a conversation, as was done in Experiment 1, as to how they should interpret the word \u201csome\u201d; the second experiment doubles the complexity by compelling participants to make metalinguistic judgments from statements like <em>Mary says the following is false<\/em>. Bott and Noveck\u2019s third experiment simplified matters by asking participants to make true\/false judgments about the categorical statements themselves and without prior instruction. With this sort of presentation, there is no useful sense in which a response is \u201ccorrect\u201d or not. Rather, responses reveal the participants literal or enriched interpretation and can be compared in terms of reaction time.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">Roughly 40% of participants responded \u201ctrue\u201d to Underinformative items and 60% \u201cfalse\u201d. This corresponds to the rates found among adults in Noveck\u2019s developmental studies (also see Noveck &amp; Posada, 2003; Guasti et al, 2005). The main finding was that mean reaction times were longer when participants responded \u201cfalse\u201d to the underinformative statements than when they responded \u201ctrue\u201d (3.3 seconds versus 2.7, respectively). Furthermore, \u201cfalse\u201d responses to the underinformative statements appear to be slower than responses to all of the control statements (including three, (18c), (18e), and (18f), that require a \u201cfalse\u201d response). The \u201ctrue\u201d response was made at a speed that was comparable to all of the control items.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">In their last experiment, Bott and Noveck\u2019s varied the time available to participants to respond to the statements. The rationale for this design was that, if as implied by GCI theory, literal interpretations of weak scalar terms take longer than the default enriched interpretations, then limiting the time available should decrease the rate of literal interpretations and increase the rate of enriched ones. On the other hand, if as implied by relevance theory, enriched interpretations take longer, then limiting the time should have the opposite effect. While following the same general procedure as the prior experiments (asking participants to judge the veracity of categorical statements), the paradigm manipulated the time available for the response. In one condition, participants had a relatively long time to respond (3 seconds), while in the other they had a relatively short time to respond (.9 seconds). Only the time to <em>respond<\/em> was manipulated. To control for uptake, participants were presented with the text one word at a time and at the same rate in both conditions, thus there is no possibility that participants in the Short-lag condition spend less time reading the statements than those in the Long-lag condition.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">Bott and Noveck reported that when a shorter period of time was available for participants to respond, they were more likely to respond \u201cTrue\u201d to Underinformative statements (indicating a literal interpretation). 72% of participants responded true in the \u201cShort-lag\u201d condition and 56% did so in the \u201cLong-lag\u201d condition. This strongly implies that they were less likely to derive the scalar inference when they were under time pressure than when they were relatively pressure-free. As in all the prior experiments, control statements provide a context in which to appreciate the differences found among Underinformative statements. These showed that performance among control statements in the Short Lag condition was quite good overall (rates of correct responses ranged from 75%-88%) and that, as one would expect, rates of correct performance among the control items <em>increased<\/em> with added time (by 5% on average). The contrast between a percentage that drops with extra time (as is the case for the Underinformative statements) and percentages that increase provide a unique sort of interaction confirming that time is necessary to provoke scalar inferences. <\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">The experiments we have described so far take into account the four methodological considerations we discussed earlier and allow well-controlled measure of a dependent variable: the rate or the speed of literal vs. enriched interpretations of weak scalar terms. Together, they provide strong evidence that an enriched interpretation of a weak scalar term requires more processing time than an unenriched, literal interpretation, as predicted by relevance theory and contrary to the prediction implied by GCI theory.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">Still, one might argue that the categorization tasks used, even if methodologically sound from an experimental psychology point of view, are too artificial to test pragmatic hypotheses. If the argument were that laboratory tasks are somehow irrelevant to pragmatics, we would argue that the onus of the proof is on the critics: after all, participants bring to bear on experimental verbal tasks their ordinary pragmatic abilities, just as they do in any uncommon form of verbal exchange. In particular, if it is part of adult pragmatic competence to make scalar inferences by default, it would take some arguing to make it plausible that an experimental setting somehow inhibits this basic disposition. On the other hand, if the argument is that fairly artificial laboratory experiments are not enough and that they should be complemented with more ecologically valid designs, we agree. Happily, Breheny, Katsos and Williams (2006) have provided just this kind of welcome complement.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 35.4pt; line-height: 150%;\"><span lang=\"EN-GB\">Following up on a procedure from Bezuidenhout &amp; Cutting (2004), Breheny et al. presented disjunctive phrases (such as \u201cthe class notes or the summary\u201d) in two kinds of contexts: Lower-bound contexts (where the literal reading of a scalar term is more appropriate as in (20) below), and Upper-Bound contexts (where the enriched reading of the scalar is more appropriate as in (21) below). <span> <\/span>These were presented as part of short vignettes (along with many \u201cfiller\u201d items to conceal the purpose of the study) and participants\u2019 reading times were measured. More specifically, participants were asked to read on a computer screen short texts that were presented one fragment at a time, and to advance in their reading by hitting the space bar (the slashes in (20) and (21) delimit fragments). <span> <\/span><\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p style=\"margin: 0cm 28.25pt 0.0001pt 3cm; text-indent: -1cm; line-height: 150%;\"><span lang=\"EN-GB\">(20) <em>Lower-bound context<\/em><br \/>\nJohn heard that \/ the textbook for Geophysics \/ was very advanced. \/ Nobody understood it properly.\/ He heard that \/ if he wanted to pass the course \/ he should read \/ <em>the class notes or the summary<\/em>.<\/span><\/p>\n<p style=\"margin: 0cm 28.25pt 0.0001pt 3cm; text-indent: -1cm; line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p style=\"margin: 0cm 28.25pt 0.0001pt 3cm; text-indent: -1cm; line-height: 150%;\"><span lang=\"EN-GB\">(21) <em>Upper-bound context<br \/>\n<\/em>John was taking a university course \/ and working at the same time. \/ For the exams \/ he had to study \/ from short and comprehensive sources.\/ Depending on the course, \/ he decided to read \/ <em>the class notes or the summary<\/em>.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><span lang=\"EN-GB\">If, in such a task, one found shorter reading times in the Upper-bound contexts that call for scalar inferences than in the Lower-bound contexts where the literal interpretation is more appropriate, this would support the GCI claim that scalar inferences are made by default. Findings in the opposite direction would support the <span> <\/span>relevance theory account. What Breheny et al. found is that phrases like the <em>class notes or the summary<\/em> took significantly longer to process in Upper-bound contexts than in Lower-bound contexts, a result consistent with findings reported above.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%; page-break-after: avoid;\"><strong><span lang=\"EN-GB\">Conclusions<\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 1cm; line-height: 150%;\"><span lang=\"EN-GB\">The experimental work we have summarized here verifies predictions derived from <span> <\/span>relevance theory, and falsifies predictions derived from GCI theory. Does this mean that <span> <\/span>relevance theory is true and GCI theory is false? Of course not. Nevertheless, these results should present a serious problem for GCI theorists. It is quite possible however that they will find a creative solution to the problem. They might for instance show that, in spite of the methodological precautions we have outlined, the studies reported failed to eliminate some uncontrolled factor, and that better studies provide evidence pointing in the opposite direction. They might, more plausibly, revise their theory so as to accommodate these results. One line of revision would be to reconsider the idea that GCI are default inferences (or to water down the notion of default to the point where it does not anymore have implications for processing time). After all, not all neo-Griceans agree with Levinson\u2019s account of GCI (see in particular Horn 2004, 2006). Still, it is worth noting that, if scalar inferences are not truly default inferences and involve each and every time paying attention to what the speaker chose not to say, then we are back to the worry that such inferences are excessively cumbersome. Generally speaking, experimental findings such as those we have summarised here should encourage neo-Griceans to work out precise and plausible implications of their approach at the level of cognitive processing.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 1cm; line-height: 150%;\"><span lang=\"EN-GB\">Relevance theorists are not challenged in the same way by the work we have described\u2014after all, their prediction is confirmed\u2014, but they should be aware that this prediction could be made from quite different theoretical points of view: it follows from <span> <\/span>relevance theory, but <span> <\/span>relevance theory does not follow from it. They might then try to develop aspects of these experiments that could give positive support to more specific aspects of the theory. For instance, according to the theory, hearers aim at an interpretation that satisfies their expectations of relevance and the relevance of an interpretation varies inversely with the effort needed to derive it. It should then be possible to cause participants to choose a more or a less parsimonious interpretation by increasing or decreasing the cognitive resources available to participants for the process of interpretation. The fourth experiment of Bott and Noveck (2004) can be seen as a first suggestive step in this direction.<a name=\"_ftnref4\"><\/a><\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-indent: 1cm; line-height: 150%;\"><span lang=\"EN-GB\">As we have just explained, we do not expect readers to form a final judgement on the respective merits of GCI theory and <span> <\/span>Relevance theory on the basis of the experimental evidence presented. What we do hope is to have convinced you that, alongside other kinds of data, properly devised experimental evidence can be highly pertinent to the discussion of pragmatic issues, and that pragmatists\u2014and in particular students of pragmatics\u2014might greatly benefit from becoming familiar with relevant experimental work and from contributing to it (possibly in interdisciplinary ventures).<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center;\" align=\"center\"><strong><span lang=\"EN-GB\">Notes<\/span><\/strong><\/p>\n<p class=\"MsoFootnoteText\"><span class=\"MsoFootnoteReference\"><span style=\"font-size: 12pt;\" lang=\"EN-US\"><span><!--[if !supportFootnotes]--><span class=\"MsoFootnoteReference\"><span style=\"font-size: 12pt; font-family: &quot;Times New Roman&quot;;\" lang=\"EN-US\">[1]<\/span><\/span><!--[endif]--><\/span><\/span><\/span><span style=\"font-size: 12pt;\" lang=\"EN-US\"> From now on, for ease of exposition, we will use the term \u201cscalar\u201d without quotes to refer to the phenomena so described in the neo-Gricean approach. This use, of course, implies no theoretical commitment on our part.<\/span><\/p>\n<p class=\"MsoFootnoteText\"><span class=\"MsoFootnoteReference\"><span style=\"font-size: 12pt;\" lang=\"EN-US\">2<\/span><\/span><span style=\"font-size: 12pt;\" lang=\"EN-US\"> The contents of parentheses were not said, but indicated.<\/span><\/p>\n<p class=\"MsoFootnoteText\"><span class=\"MsoFootnoteReference\"><span style=\"font-size: 12pt;\" lang=\"EN-US\">3<\/span><\/span><span style=\"font-size: 12pt;\" lang=\"EN-US\"> Papafragou and Tantalou (2004) aim to show that five-year-olds can be encouraged to produce scalar inferences and at adult levels. However, we do not discuss their results here because their data are based on a non-standard paradigm in which participants are given no justifiable reason to accept the \u2018minimal\u2019 interpretation of a term such as \u2018some\u2019. In other words, the paradigm does not provide participants with two clear options. Moreover, much of the study\u2019s claims are based on children\u2019s self-reports and even these lead to the conclusion that at most 56% of Papafragou and Tantalou\u2019s participants derived scalar inferences.<\/span><\/p>\n<p class=\"MsoFootnoteText\"><span class=\"MsoFootnoteReference\"><span style=\"font-size: 12pt;\" lang=\"EN-US\">4<\/span><\/span><span style=\"font-size: 12pt;\" lang=\"EN-US\"> <\/span><span style=\"font-size: 12pt;\" lang=\"EN-GB\">For other experimental explorations based on relevance theory, see Van der Henst &amp; Sperber 2004.<\/span><\/p>\n<p class=\"MsoNormal\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p><strong><span style=\"font-size: 12pt; font-family: &quot;Times New Roman&quot;;\" lang=\"EN-GB\"><br style=\"page-break-before: always;\" \/> <\/span><\/strong><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center;\" align=\"center\"><strong><span lang=\"EN-GB\">References<\/span><\/strong><\/p>\n<p class=\"MsoNormal\"><span lang=\"EN-GB\"> <\/span><\/p>\n<p style=\"margin: 0cm 0cm 0.0001pt 49.65pt; text-indent: -49.65pt; line-height: 150%;\"><span lang=\"NL\">Bezuidenhout, A. &amp; Cutting, J.C. (2002).<span> <\/span><\/span><span lang=\"EN-GB\">Literal meaning, minimal propositions, and pragmatic processing.<span> <\/span>Journal of Pragmatics, 34, 433-456.<\/span><\/p>\n<p style=\"margin: 0cm 0cm 0.0001pt 49.65pt; text-indent: -49.65pt; line-height: 150%;\"><span lang=\"EN-GB\">Bott, L., &amp; Noveck, <\/span><span lang=\"EN-GB\">I.<\/span><span lang=\"EN-GB\"> A. (2004). Some utterances are underinformative: The onset and time course of scalar inferences. <em>Journal of Memory and Language, 51, <\/em>437-457.<\/span><\/p>\n<p style=\"margin: 0cm 0cm 0.0001pt 49.65pt; text-indent: -49.65pt; line-height: 150%;\"><span lang=\"EN-US\">Braine, M. &amp; Rumain, B. (1981). Children\u2019s comprehension of &#8220;or&#8221;: Evidence for a sequence of competencies. <em>Journal of Experimental Child Psychology, 31<\/em>, 46-70.<\/span><span lang=\"EN-GB\">Breheny, R., Katsos, N., &amp; Williams, J. (2006). <\/span>Are scalar implicatures generated on-line by default? <em><span lang=\"IT\">Cognition, 100 (3), <\/span><\/em><span lang=\"IT\">434-463.<\/span><span lang=\"IT\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 36pt; text-indent: -36pt; line-height: 150%;\"><span lang=\"EN-GB\">Breheny, R., Katsos, N., &amp; Williams, J. (2006).<span> <\/span>Are generalized scalar implicatures generated by default?<span> <\/span>An on-line investigation into the role of context in generating pragmatic inferences.<span> <\/span><\/span><em><span lang=\"IT\">Cognition 100(3)<\/span><\/em><span lang=\"IT\">, 434-463.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 36pt; text-indent: -36pt; line-height: 150%;\"><!--[if supportFields]><span lang=EN-GB style=\"mso-ansi-language:EN-GB\" mce_style=\"mso-ansi-language:EN-GB\"><span style=\"mso-element:field-begin\" mce_style=\"mso-element:field-begin\"><\/span><\/span><span lang=IT style=\"mso-ansi-language: IT\" mce_style=\"mso-ansi-language: IT\"><span style=\"mso-spacerun:yes\" mce_style=\"mso-spacerun:yes\"> <\/span>ADDIN EN.REFLIST <\/span><span lang=EN-GB style=\"mso-ansi-language:EN-GB\" mce_style=\"mso-ansi-language:EN-GB\"><span style=\"mso-element:field-separator\" mce_style=\"mso-element:field-separator\"><\/span><\/span><![endif]--><span lang=\"IT\">Chierchia, G., Guasti, T., Gualmini, A., Meroni, L., Crain, S. &amp; Foppolo, F. (2004). <\/span><span lang=\"EN-GB\">Adults and children&#8217;s semantic and pragmatic competence in interaction. In <em>Experimental Pragmatics<\/em>, I. A. Noveck &amp; D. Sperber (Eds.). Basingstoke: Palgrave Macmillan.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 36pt; text-indent: -36pt; line-height: 150%;\"><span style=\"font-family: Times; color: #221f1f;\" lang=\"EN-US\">Feeney, A., Scrafton, S., Duckworth, A. and Handley, S. J. 2004: The Story of <em>Some<\/em>: Everyday Pragmatic Inference by Children and Adults. <\/span><em><span style=\"font-family: Times; color: #221f1f;\" lang=\"IT\">Canadian Journal of Experimental Psychology, 58,<\/span><\/em><span style=\"font-family: Times; color: #221f1f;\" lang=\"IT\"> 2, 121-132. <\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 36pt; text-indent: -36pt; line-height: 150%;\"><span lang=\"IT\">Guasti, M. T., Chierchia, G.,Crain, S., Foppolo, F., Gualmini, A. &amp; Meroni. <\/span><span lang=\"EN-GB\">L. (2005). Why children and adults sometimes (but not always) compute implicatures.<\/span><span lang=\"EN-GB\"> <\/span><em><span lang=\"EN-US\">Language and Cognitive Processes<\/span><\/em><span lang=\"EN-GB\">, 20 (5), 667\u2013696<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 36pt; text-indent: -36pt; line-height: 150%;\"><span lang=\"EN-GB\">Horn, L. R. (1972). <em>On the Semantic Properties of Logical Operators in English.<span> <\/span><\/em>Ph. D. Dissertation, UCLA.<\/span><\/p>\n<p class=\"bibbles\" style=\"margin: 0cm 9pt 0.0001pt 18pt; text-indent: -18pt; line-height: 150%;\"><span style=\"font-size: 12pt; line-height: 150%;\" lang=\"EN-US\">Horn, L. R. (2004). Implicature. In Horn and Ward (eds.),<em> <\/em>3-28. <\/span><\/p>\n<p class=\"bibbles\" style=\"margin: 0cm 4.5pt 0.0001pt 18pt; text-indent: -18pt; line-height: 150%;\"><span style=\"font-size: 12pt; line-height: 150%;\" lang=\"EN-US\">Horn, L. R. (2006). The Border Wars: A neo-Gricean perspective. In Turner and von Heusinger (eds.).<\/span><\/p>\n<p class=\"bibbles\" style=\"margin: 0cm 4.5pt 0.0001pt 0cm; text-indent: 0cm; line-height: 150%;\"><span style=\"font-size: 12pt; line-height: 150%;\" lang=\"EN-US\">Levinson, S. (2000). <em>Presumptive Meanings<\/em>. MIT Press.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 35.45pt; text-indent: -35.45pt; line-height: 150%;\"><span lang=\"EN-GB\">Noveck, I. A. (2001). When children are more logical than adults: Experimental investigations of scalar implicature. <em>Cognition, 78<\/em>(2), 165-188.<\/span><span lang=\"EN-US\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 36pt; text-indent: -36pt; line-height: 150%;\"><span lang=\"EN-GB\">Noveck, I. A. (2004). Pragmatic inferences related to logical terms. In I. A. Noveck &amp; D. Sperber (Eds.), <em>Experimental Pragmatics<\/em>. Basingstoke: Palgrave. <\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin: 0cm -3.05pt 0.0001pt 1cm; text-indent: -1cm; line-height: 150%;\"><span lang=\"EN-GB\">Noveck, I. A. &amp; Posada, A. (2003). Characterizing the time course of an implicature: An evoked potentials study. <em>Brain and Language, 85, 203-210.<\/em><\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 36pt; text-indent: -36pt; line-height: 150%;\"><span lang=\"EN-GB\">Noveck, I. A. &amp; Sperber, D. <span> <\/span>(2004), <em>Experimental Pragmatics<\/em>. Basingstoke: Palgrave. <\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 36pt; text-indent: -36pt; line-height: 150%;\"><span lang=\"EN-GB\">Papafragou, A., &amp; Musolino, J. (2003). Scalar implicatures: experiments at the semantics-pragmatics interface. <\/span><em><span>Cognition, 86<\/span><\/em><span>(3), 253-282.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 36pt; text-indent: -36pt; line-height: 150%;\"><span>Papafragou, A; &amp; Tantalou, N. <\/span><span style=\"font-size: 11pt; line-height: 150%;\">(2004). <\/span><span style=\"line-height: 150%;\">Children&#8217;s computation of implicatures. <em>Language Acquisitio<\/em><\/span><em><span>n 12(1), <\/span><\/em><span>71-82<\/span><span>.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 36pt; text-indent: -36pt; line-height: 150%;\"><span>Paris, S. G. (1973). <\/span><span lang=\"EN-GB\">Comprehension of language connectives and propositional logical relationships. <em>Journal of Experimental Child Psychology, 16(2)<\/em>, 278-291.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 36pt; text-indent: -36pt; line-height: 150%;\"><span lang=\"EN-US\">Rips, L. J. (1975). Quantification and semantic memory. <em>Cognitive Psychology, 7<\/em>(3), 307-340.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 36pt; text-indent: -36pt; line-height: 150%;\"><span lang=\"EN-GB\">Smith, C. L. (1980). Quantifiers and question answering in young children. <em>Journal of Experimental Child Psychology, 30(2)<\/em>, 191-205.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 36pt; text-indent: -36pt; line-height: 150%;\"><span lang=\"EN-GB\">Sperber, D., &amp; Wilson, D. (1995). <em>Relevance: Communication and Cognition<\/em>. Oxford: Basil Blackwell.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 36pt; text-indent: -36pt; line-height: 150%;\"><span lang=\"EN-GB\">Sternberg, R. J. (1979). Developmental patterns in the encoding and combination of logical connectives. <em>Journal of Experimental Child Psychology, 28<\/em>(3), 469-498.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 36pt; text-indent: -36pt; line-height: 150%;\"><span lang=\"EN-GB\">Van der Henst, J.B.,<span style=\"letter-spacing: -0.1pt;\"> <span> <\/span>Sperber, D. (2004). <\/span><\/span><span lang=\"EN-US\">Some experimentally testable implications of relevance theory.<span style=\"letter-spacing: -0.1pt;\"> <\/span><\/span><span lang=\"EN-GB\">In I. A. Noveck &amp; D. Sperber (Eds.), <em>Experimental Pragmatics<\/em>. Basingstoke: Palgrave.<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: 36pt; text-indent: -36pt; line-height: 150%;\"><span lang=\"EN-US\">Wilson, D. &amp; Sperber, D., (2004). Relevance Theory, in G. Ward et L. Horn (Eds.), <em>Handbook of Pragmatics<\/em>. Blackwell.<\/span><span lang=\"EN-GB\"> <\/span><\/p>\n<p class=\"MsoNormal\" style=\"line-height: 150%;\"><!--[if supportFields]><span lang=EN-US><span style=\"mso-element:field-end\" mce_style=\"mso-element:field-end\"><\/span><\/span><![endif]--><span lang=\"EN-GB\"> <\/span><\/p>\n<\/div>\n<div><!--[if !supportFootnotes]--><\/p>\n<hr size=\"1\" \/><!--[endif]--><\/p>\n<div id=\"ftn1\">\n<p class=\"MsoFootnoteText\"><a name=\"_ftn1\"><\/a><span lang=\"EN-US\"> From now on, for ease of exposition, we will use the term \u201cscalar\u201d without quotes to refer to the phenomena so described in the neo-Gricean approach. This use, of course, implies no theoretical commitment on our part.<\/span><\/p>\n<\/div>\n<div id=\"ftn2\">\n<p class=\"MsoFootnoteText\"><a name=\"_ftn2\"><\/a><span lang=\"EN-US\"> The contents of parentheses were not said, but indicated.<\/span><\/p>\n<\/div>\n<div id=\"ftn3\">\n<p class=\"MsoFootnoteText\"><a name=\"_ftn3\"><\/a><span lang=\"EN-US\"> Papafragou and Tantalou (2004) aim to show that five-year-olds can be encouraged to produce scalar inferences and at adult levels. However, we do not discuss their results here because their data are based on a non-standard paradigm in which participants are given no justifiable reason to accept the \u2018minimal\u2019 interpretation of a term such as \u2018some\u2019. In other words, the paradigm does not provide participants with two clear options. Moreover, much of the study\u2019s claims are based on children\u2019s self-reports and even these lead to the conclusion that at most 56% of Papafragou and Tantalou\u2019s participants derived scalar inferences.<\/span><\/p>\n<\/div>\n<div id=\"ftn4\">\n<p class=\"MsoFootnoteText\"><a name=\"_ftn4\"><\/a><span lang=\"EN-US\"> <\/span><span lang=\"EN-GB\">For other experimental explorations based on relevance theory, see Van der Henst &amp; Sperber 2004.<\/span><\/p>\n<\/div>\n<\/div>\n<p><!-- DEBUT \/ Cybermonitor \/ START --> <script type=\"text\/javascript\">\/\/ <![CDATA[\n     CM_RUBRIQUE = \"edit_PagesPersos_abonnes\"; CM_CLIENT = \"clubv3\"; CM_SECTION1 = \"PagesPersos\";\n\/\/ ]]><\/script> <script src=\"http:\/\/js.cybermonitor.com\/clubv3.js\">\n<\/script><\/p>\n<p><noscript> &amp;amp;amp;amp;amp;amp;lt;img src=&#8221;http:\/\/stat3.cybermonitor.com\/clubv3_v?R=edit_PagesPersos_abonnes&amp;amp;amp;amp;amp;amp;amp;S=total;PagesPersos&#8221; mce_src=&#8221;http:\/\/stat3.cybermonitor.com\/clubv3_v?R=edit_PagesPersos_abonnes&amp;amp;amp;amp;amp;amp;amp;amp;S=total;PagesPersos&#8221;&amp;amp;amp;amp;amp;amp;gt; <\/noscript> <!-- FIN \/ Cybermonitor \/ END --><\/div>\n","protected":false},"excerpt":{"rendered":"<p>[&#8230;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[19,5,6],"tags":[],"_links":{"self":[{"href":"https:\/\/www.dan.sperber.fr\/index.php?rest_route=\/wp\/v2\/posts\/123"}],"collection":[{"href":"https:\/\/www.dan.sperber.fr\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.dan.sperber.fr\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.dan.sperber.fr\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.dan.sperber.fr\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=123"}],"version-history":[{"count":20,"href":"https:\/\/www.dan.sperber.fr\/index.php?rest_route=\/wp\/v2\/posts\/123\/revisions"}],"predecessor-version":[{"id":1585,"href":"https:\/\/www.dan.sperber.fr\/index.php?rest_route=\/wp\/v2\/posts\/123\/revisions\/1585"}],"wp:attachment":[{"href":"https:\/\/www.dan.sperber.fr\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=123"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.dan.sperber.fr\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=123"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.dan.sperber.fr\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=123"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}