Word order

Authors: David Inman

Acknowledgements: Natalia Chousou-Polydouri, Marine Vuillermet, Nora Muheim

What?

This feature set encompasses some of the common typological questions about syntactic word order: the relative orders of subject, object, and verb in an unmarked matrix declarative transitive clause,1,2 the ordering of attributive and noun, and the ordering of genitive (or possessor) and (possessed) noun (Greenberg 1963, Dryer 2013a, Dryer 2013b, Dryer 2013c, Krasnoukhova 2016). Each of these language-wide comparanda have their own difficulties and must be defined narrowly to ensure cross-linguistic comparability.

In particular, the order of subject, object, and verb is fraught with issues, as has been pointed out by many previous researchers. Some of these problems are:

  • A topic/comment syntactic structure may be more salient than subject/object order (LaPolla and Poa 2008, Li and Thompson 1981).
  • Languages may vary their word order according to information structure, and it is not always clear what the “basic” or “unmarked” or "default" order is, or if one even exists (Hale 1980, Hale 1983).
  • Languages may have truly unconstrained word order with no difference in information structure. This may (or may not) correlate with other syntactic phenomena (Austin et al 2001).
  • Languages may have only a partially restricted basic word order, e.g. verb-initial, with free order between subject and object. These partially restricted orders still contain information about relative ordering (e.g. verb before subject), which some typologists argue are more significant than the full, six-way order (Dryer 1997, Dryer 2013d).

Despite these issues, which are addressed below, most languages do have a clear transitive clause word order, and several rigorous studies have yielded interesting results using typological data of this sort (Comrie 1989, Dryer 1992, Dryer 1996).

In order to compare the languages in our sample, we first defined the comparative concepts to do so: the “basic” transitive clause (WO-01), what counts as an attributive for attributive-noun constructions (WO-02), and the “basic” order in nominal possession constructions (WO-03).

The “basic” transitive clause

Our comparative concept for the basic transitive clause contains the following properties:

  1. It stands alone as a grammatical utterance
  2. It has declarative force (i.e., it is not interrogative or imperative)
  3. It has sentential focus and no argument is information structurally marked

For many languages, these constraints are sufficient to unambiguously assign a word order to the language as a whole. Even in cases that are traditionally described as having more complex word order constraints, such as the V-second order in German, the constraint of sentential focus (3 above) unambiguously forces the system to be categorized as “SVO”.

However, in some languages, the determination of the basic word order can remain difficult, typically because of difficulty determining which sentences have sentential focus. All else being equal:

  1. The word order of “out of the blue” declarative sentences reflects the basic word order.
  2. The most frequent word order of matrix declarative sentences in large corpora across many domains (conversation, instruction, narrative, news, etc) reflects the basic word order.
  3. The most typical positions of S and O when they are neither focus nor topic reflects the basic word order. To this end, the more syntactically complex S and O constituents are, the less likely they are to carry focus or topic information.

Since large corpora studies are often not available for minority languages, for consistency in our data, we a priori selected known problems with transitive clause word order typology and defined methods to approach them, as described in the How? section below.

The attributive-nominal construction

The order of attributive and noun (sometimes simply called adjective and noun) required a comparative concept of both the attributive and the construction being compared. We have used the term “attributive” rather than adjective to refer to property concept modifiers because many languages lack a well-defined syntactic category of adjectives, but there is nevertheless an ordering constraint between the modifier and noun. When modifying a noun with a property concept word, a language may have:

  • a dedicated syntactic category of adjectives and a single word order;
  • two or more dedicated syntactic categories of adjectives, each with different word orders;
  • no dedicated syntactic category of adjectives, with property concepts being encoded by nouns, verbs, bound elements, or a generic modifier class.

The comparative concept we chose, and for which the term “attributive” is reserved in this feature set, is the category of basic, qualificational property concepts, with a primary preference for the category which is dedicated to modifying nouns (if one exists), and a secondary preference for the lexically open class.

The class must include basic lexical terms, which means that they are underived. The class must also be qualificational, which means we definitionally exclude numerals and quantificational adjectives. When these properties are not enough to select a single lexical class in a language, we prefer the class dedicated to modifying nouns, meaning that when there is a closed class of adjectives, we select this over the open class of non-adjective property concepts. In most cases where a language lacks a clear class of adjectives, attributives are drawn from the class of verbs or a subclass of verbs (as mentioned in Dryer 2013c). The syntactic category to which the attributive class belongs is recorded in the Attributives feature set.

The secondary preference for an open class means that, in cases where adjectives behave differently according to sub-categories, we select the class whose lexical membership is open. Once the attributive class is selected, we narrowly code the surface word order of the attributive and noun, regardless of their syntactic properties.

The “basic” order in nominal possession constructions

Some languages have multiple categories of possession, depending on what is being possessed (e.g. alienable vs inalienable). For this comparative concept, we always selected the order used with the default (i.e. open) class of possessed nouns (alienable). If two or more constructions are used with different orders we prefer the order of the most common construction.

Why?

Word order has been claimed to have a regional areal effect in the linguistic literature, for example:

  • exclusively VO order in the Venezuelan-Antillean Linguistic Area (Umaña 1991: 125-126);
  • exclusively SOV order in the Colombian-Central American area (Umaña 1991: 127);
  • mostly SVO order in the languages of the Chaco region (Campbell et al 2012);
  • Exclusively predicate-initial order (VSO or VOS) in the Pacific Northwest (Beck 2000).
  • Possessed Noun - Possessor Noun order in Central America (Campbell 1997: 344)
  • Possessor Noun - Possessed Noun in Lowland South America (Campbell 1997: 350)
  • Noun - Attributive order in both the Andes and Lowland South America (Campbell 1997: 348-350)

How?

The typologization of word order requires solving unique and often complex problems, especially for the basic transitive clause. Prior to coding, we selected known problems and decided how exemplar problematic cases should be coded.

Problem 1: Determining “basic” ordering in a language with obligatory syntactic information structure

Some languages obligatorily mark information structure in the sentence, so that there is never an unmarked order. Thus, condition (3) of our criteria is never fully met. We still find it desirable to determine a basic word order for these languages, which we do by relaxing condition (3) and determining where each constituent occurs when it is unmarked. To the extent possible, the way we decide on the basic word order for such languages is to first determine:

  1. Where does the typical S occur when unmarked for information structure (and O is informationally-marked instead)?
  2. Where does the typical O occur when unmarked for information structure (and S is marked instead)?

The “neutral” position of S is decided according to its position in the (1) condition, and likewise the “neutral” position of O is decided according to its position in (2).

SVO: Mandarin Chinese [mand1415] (Sino-Tibetan)

Li and Thompson (1981: 15-21) call Mandarin a “topic prominent” language, rather than a subject prominent one, in that the topic must always occur first, while the subject may be completely absent. In addition, the pre- or post-verbal position is how definiteness is expressed. In the common (or “basic”) case of a definite subject and an indefinite object, an SVO word order appears, but OSV and SOV are also possible and, according to them, Mandarin Chinese has other features typical of SOV languages.

Li and Thompson (1981: 26) suggest that Mandarin is in the middle of a shift from basic SVO to basic SOV word order, and deciding between the two orders depends in part on the acceptance of the “basic” object as indefinite. Our definition of the basic transitive clause does not include definiteness, so it does not resolve the order question. However, Li and Thompson also state that their sample text yields more frequent SVO than SOV sentences, and that the more complex a sentence is, the more likely it is to have SVO. The frequency of SVO in a corpus should reflect the basic word order, and the more complex a sentence is, the more likely it is that the individual S and O noun phrases are not the bearers of information structure. Thus, both of these factors suggest that SVO is the most “basic” order in the language. From this data, we would encode Mandarin Chinese as having an SVO basic word order, but with a detailed remark about the frequency of SOV.

VSO: Machiguenga [mach1267] (Arawakan; Peru)

Emlen (2014: 22) mentions in passing that Machiguenga has SVO word order, and Michael (2014) says that the basic word order of closely-related Nanti [nant1250] (Arawakan; Peru) is arguably SVO. The subject prefix and object enclitic on the Machiguenga verb also suggest an SVO order (subj-[...]-V-[...]=obj). However, O'Hagan (2018) claims the basic word order of Machiguenga is VSO, although this is infrequent because information-structural marking typically requires one argument or another to move to the front.

In Inman’s (ms) corpus study, SVO was essentially equally common as VSO when both arguments were expressed by overt NPs. However, it was relatively rare for both arguments to be present. New participants in the discourse were always introduced post-verbally, while pre-verbal ordering was highly correlated with contrastive marking and question-answering. Inman suggests two plausible restrictions driving this distribution: (1) left-of-verb constituents must be given within the discourse context; (2) right-of-verb constituents must not carry focus or topic information (while left-of-verb constituents can, but don’t have to, be focus or topic). If true, this means that right-of-verb constituents are always unmarked. Though the fieldwork has not been done to confirm all hypotheses, it is compatible with O’Hagan’s interpretation of an unmarked VSO order, with discourse-specific information (topic, focus, and givenness) generating other orders.

Like Mandarin Chinese, the determination of word order in Machiguenga is complex, and would require a remark in the database describing alternative analyses. However, the data available suggest that Machiguenga has an information structurally neutral VSO order, which is how we would code this language.

Problem 2: Determining S and O ordering in languages with co-occurrence constraints.

Some languages do not allow both subject and object to appear overtly in the same clause as NPs, or this occurs so infrequently that it is difficult to determine what the basic order is. In these cases, subject and object are first ordered independently with respect to the verb and, if possible, the relative ordering of subject and object is inferred by determining special cases where both arguments are expressed (e.g., with pronouns). If an order between S and O cannot be determined at all, these languages should be encoded with a partially-restricted ordering, e.g. V-first.

VSO: Northern Lushootseed [lush1252] (Salishan; United States)

Lushootseed does not allow two non-oblique overt third person NPs (Beck 1996: 1-14). If both subject and object are present in the clause, the subject must be first or second person, expressed through a second position agreement enclitic. This necessarily yields a VSO order, if the subject is a speech act participant. In addition, objects that must be case-marked as oblique (as is the case for the verb ‘look for’) occur after the subject, yielding a V S oblique-O order. This also suggests a larger subject-before-complement pattern, which accords with the pattern seen for the second position elements. In this case, we would categorize the language as VSO, with a remark on the particularities of the system.

Problem 3: Determining if a language really has no basic word order

It may be the case that a language truly has no basic word order. However, we attempt to separate languages with truly free word order (where order is flexible even without information structure constraints) and languages with flexible word order (with associated, distinct information structure) that nevertheless have a basic word order.

SVO: Russian [russ1263] (Indo-European)

Russian has quite free word order depending on the discourse context and information structure, but is typically characterized as having basic SVO word order. Krupp (1983: 1-2) summarizes the reasoning for this in three points, which closely follow our assumptions about “basic” order:

  1. Approximately 80% of transitive sentences in Russian are SVO.
  2. When there is no inflectional information giving grammatical information for arguments, the ordering is SVO.
  3. Unmarked sentences, “out of the blue” sentences, and semantically nonsense sentences are given in SVO ordering.

Despite its flexibility, we would code Russian as having a basic SVO ordering.

no dominant order: Warlpiri [warl1254] (Pama-Nyungan; Australia)

Warlpiri, like Russian, has flexible word order, but unlike Russian it has no basic order. At least since Hale (1983), it has been described as “non-configurational.” In Hale’s words “it is true of Warlpiri that sentences containing the same content words in different linear arrangements count as repetitions of one another.” Simpson (1983: 140) agrees with Hale, saying “Like an intransitive sentence, [a transitive sentence] has completely free word order, except that the AUX [...] must remain in second position”.

There has been some recent work claiming the word order of Warlpiri is more constrained than it appears. Legate (2002: Chapter 4) argues that Warlpiri has “an articulated left periphery” which hosts topic and focus, which must be used in an obligatory and non-random manner. However, this is not a structure that makes reference to the status of verbal arguments (properties like subject and object), and so it cannot be categorized as a language that has any basic order of its S, V, and O elements.

Extracting data from WALS

Some data for this feature set was extracted from WALS chapters 81, 86, and 87 (Dryer 2013a, Dryer 2013b, Dryer 2013c) using custom R scripts. For ATLAs languages that were also present in one of these chapters, values were extracted directly from the WALS data. Basic transitive clause order (WO-01) was extracted from WALS 81, but given the differences in our typologization, all cases of <No dominant word order> were reviewed manually by checking grammars and changed appropriately. The order of attributive and noun (WO-02) was extracted from WALS 86, and given the differences in our typologization, all cases of <Only internally-headed relative clauses> were also reviewed manually and changed appropriately. Finally, the order of possessor and possessed noun was extracted from WALS 87. There was no difference in typological definitions in this last case, but WALS values were checked opportunistically as the coding of another of our feature set proceeded, namely Noun Possession Classes.

Features

WO-01: What is the word order for non-marked transitive clauses?

{ no dominant order | SVO | SOV | VSO | OVS | OSV | VOS | V-first | V-last | S-last }

The definition of “basic” transitive clauses, for the purposes of this feature, is given in the What? section above, with details about specific types of problems in determining the basic word order given in the How? section.

The number of possible states for this feature is rather large and encompasses all claims about complete and partial word orders found in the grammars consulted for this survey. The partial word orders encountered during coding were V-first, V-last, and S-last (the latter only in Hanis, Frachtenberg 1922: 414). There are other conceivable partially restricted orders such as “O-last” or “S-first”, but these were not encountered.

This feature was designed to be mostly compatible with SAILS NP2 and partially compatible with WALS chapter 81, from which some data was extracted (and then checked for consistency with our definitions, especially for “no dominant order”). Where this feature departs from these other databases is in allowing for the partial orders V-first, V-last and S-last.

V-first: Alsea-Yaquina [alse1251] (isolate; United States)

Alsea-Yaquina has relatively flexible word order and the earliest account states that most clauses are verb-initial, but the ordering of object and subject is flexible (Frachtenberg 1918: 421). A later corpus study confirms these observations, with all orders of VOS, VSO, SVO, and OVS attested and the two V-initial orders being the most common. No clear discourse pragmatic analysis could easily distinguish between VSO and VOS (Morgan et al 1985: 46), so we have thus coded Alsea-Yaquina as V-first.

SVO: Nanti [nant1250] (Arawakan; Peru)

Michael (2008: 342-345) notes that only one argument at a time can be expressed by a phonologically free NP, and subjects prototypically precede the verb when expressed, while objects prototypically follow the verb. However, all focused elements (including objects) are preverbal. He concludes by saying the language is basically SVO, even though the actual order Subject-Verb-Object is unobservable due to the one-NP constraint. However, the partial orderings SV and VO can allow us to construct a hypothetical SVO order, if two NPs were permitted.

WO-02: What is the common and basic word order for Attributive and Noun?

{ Attr-N | N-Attr | no dominant order }

The “attributive” category for this feature is described in the What? section above: the class of basic, qualificational property concepts, preferentially the category dedicated to modifying nouns, and preferentially the lexically open class.

The definitions for this feature are by design mostly compatible with Dryer (2013c), from which we imported some data, with the exception of Dryer’s “internally-headed relative clauses”. While Dryer assigns these internally-headed relative clauses their own distinct state, we have narrowly focused on surface word order, and encoded such cases according to the order of the attributive element and the head noun. This decision belies a potential comparative problem: the order of a head noun and a direct attributive modifier (the be.beautiful(-ADJVZ) dog) is perhaps not directly comparative with a head noun and a verbal modifier which is inside a relative clause (the dog, which be.beautiful). Ideally, modifiers which must be in a relative clause would receive their own state (such as <NA>). However, we did not collect the data to distinguish between these cases, and the determination of whether there is a relative clause or not (especially if the relative clause is headless) can be quite complex and even indeterminate. Therefore, this feature has only targeted the surface order of the attributive and head noun, even though we acknowledge this may not be the most ideal encoding.

In addition, some languages incorporate the noun and the attributive into a single word rather than expressing them independently, a phenomenon which typically correlates with attributives expressed as a verb. Analogous to how we have treated internally-headed relative clauses, we have again coded only the surface order of the attributive and noun. A complex example of such a case is described below for Tonkawa [tonk1249] (isolate; US).

N-Attr: Spanish [stan1288] (Indo-European)

Spanish has two sets of adjectives: a small, closed set which can come before the noun (typically with a different meaning than when they come after the noun): grande ‘great/big’, nuevo ‘new’, viejo ‘old’, etc. Although this Attr-N order is possible, it is not true for the open class of adjectives, and so the order would be N-Attr.

Attr-N: Nanti [nant1250] (Arawakan; Peru)

Nanti has approximately 25 underived adjectives, mainly restricted to the semantic classes of value, dimension, quantification, and number (Michael 2008: 311), which occur like all modifiers before the noun (Michael 2008: 369).

N-Attr: Tonkawa [tonk1249] (isolate; United States)

Tonkawa has predicative adjectives, which readily (and perhaps typically) are incorporated into the noun as a suffix (Wier 2016: 31). When an adjective is not incorporated, it can appear in either order, but N-Attr is the most typical (1). Incidentally, when the adjective is incorporated, the order is the same (2) (Wier 2016: 32).

(1) no-noto·-n we·ʔis-pax naxc-an-ʔa·-la wixwan-tak ye·la-k-laknoʔo
RED-touch-GER one-just build.fire-GER-DEF-NOM.SG small-very sit-PART-EVID
‘A tiny little spark lay [on] just one finger.’
(2) caka-wixwan-ʔa·-yʔik
river-small-DEF-ALL
‘the small river’

WO-03: What is the common and basic word order for Possessor (sometimes, "genitive") and Possessed (noun) for the open class of nouns?

{ NA | PSSR-PSSD | PSSD-PSSR | no dominant order }

As described in the What? section, there may be different orders for different possession constructions and/or different noun possession classes. We code for the open class of nouns (sometimes called the alienable class), and if the relative frequencies of available constructions are known, we prefer the most common one. If only information about pronominal possession is available, then this feature is coded as unknown <?>.

The state <no dominant order> refers to a single construction which has both orderings or when the grammar explicitly says that two constructions with different orders are equally common or basic. In cases where two different constructions exist for the default class with different orderings of possessor and possessed noun and there is no indication of which is more frequent or basic, we have opted to code this as <?>.

Though the ordering of these constituents is largely determined by the surface structure, there are two particular types of constructions for which we believe <NA> and <?> values are most appropriate. These are constructions in which there is an intervening lexical head between the possessor and possessed nouns: clausal possessive constructions and classifier possessive constructions.

In the rare case where the open class of nouns is possessed through a clausal construction, then the order is coded as <NA>. In this case, the subordinated verb in the construction is an intervening lexical head between the possessor and the possessed noun, which causes the two not to be sister nodes in the syntactic structure. We do not believe that the surface word order of such a construction is meaningfully comparable to constructions where the possessor and possessed are syntactic sisters.

In the case where the open class of nouns is possessed through a classifier construction (where the possessor syntactically possesses a classifier rather than the conceptually possessed noun), there is a potential for the conceptually possessed and syntactically possessed nouns to have different surface orders. For example, in a surface order of PSSD PSSR CLASSIFIER, the order of the possessor and syntactically possessed classifier is <PSSR-PSSD>, but the order of the possessor and conceptually possessed noun is <PSSD-PSSR>. In other cases, the relative order of the possessor with the syntactically possessed and conceptually possessed nouns is the same, as in the order PSSR CLASSIFIER PSSD. We made the analytical decision to encode the former case as a <?> (though no such case was encountered), and the latter according to the order of both syntactically and conceptually possessed nouns (e.g., PSSR-PSSD in the previous example).

PSSR-PSSD: Alsea-Yaquina [alse1251] (isolate; United States)

Both orders are possible in Alsea-Yaquina possession, but possessors typically occur before the possessed noun, with an agreeing element occuring on the possessed noun (Buckley 1989: 22)

(3) a-s xámni c-ɬaqúsin-ḵ
DEI-REF whale POSS-skin-POSS
‘the whale’s skin’
PSSD-PSSR: Nanti [nant1250] (Arawakan; Peru)

The possessor in Nanti follows the possessed noun, which receives a possessive prefix, as exemplified below (Michael 2008: 318, Michael p.c.).

(4) pine Soira ir-ishinto Bikotoro
you.see Soira 3mP-daughter Bikotoro
‘You see, Soira, BIkotoro’s daughter.’

Derived Features

WO-01a: Is there a dominant or fixed standard word order in main declarative transitive clauses?

{ yes | no }

no if WO-01 is <no dominant order>
yes if WO-01 is anything else

WO-01b: What is the order of S and V, if there is a dominant order?

{ NA | S>V | V>S }

NA if WO-01 is <no dominant order>
S>V if WO-01 is <SVO> OR <SOV> OR <OSV> OR <V-last>
V>S if WO-01 is <VSO> OR <VOS> OR <OVS> or <V-first> OR <S-last>

WO-01c: What is the order of S and O, if there is a dominant order?

{ NA | S>O | O>S }

NA if WO-01 is <no dominant order>
S>O if WO-01 is <SVO> OR <SOV> OR <VSO>
O>S if WO-01 is <OSV> OR <OVS> OR <VOS> OR <S-last>

WO-01d: What is the order of V and O, if there is a dominant order?

{ NA | V>O | O>V }

NA if WO-01 is <no dominant order>
V>O if WO-01 is <SVO> OR <VSO> OR <VOS> OR <V-first>
O>V if WO-01 is <SOV> OR <OSV> OR <OVS> OR <V-last>

WO-01e: What is the first element in a transitive clause?

{ V-first | S-first | O-first | no dominant order }

no dominant order if WO-01 is <no dominant order>
V-first if WO-01 is <VSO> OR <VOS> OR <V-first>
S-first if WO-01 is <SVO> OR <SOV>
O-first if WO-01 is <OVS> OR <OSV>

WO-01f: If the first element is S, what is the full word order?

{ NA | SVO | SOV }

NA if WO-01e is <O-first> OR <V-first> OR <NA>
SVO if WO-01 is <SVO>
SOV if WO-01 is <SOV>

WO-01g: If the first element is V, what is the full word order?

{ NA | VSO | VOS }

NA if WO-01e is <S-first> OR <O-first> OR <NA>
VSO if WO-01 is <VSO>
VOS if WO-01 is <VOS>

WO-01h: If the first element is O, what is the full word order?

{ NA | OVS | OSV }

NA if WO-01e is <S-first> OR <V-first> OR <NA>
OVS if WO-01 is <OVS>
OSV if WO-01 is <OSV>

Results

As expected, V-initial word orders are dominant in the Pacific Northwest and common in Central America, and VSO is much more frequent than VOS (WO-01). SOV is the most common word order in the sample, with SVO orders more common in California and Oregon, northern Central America, and parts of the Andes and the Gran Chaco. Outside of the more densely-sampled Americas, SVO is especially present in Southeast Asia and Africa. Absence of any dominant order is most frequent in Australia, and occurs with some regularity in the Americas, but without forming clear geographic clusters.

The order of attributive and noun (WO-02) shows clear macroareal patterns, with Asia largely Attr-N and Africa largely N-Attr. North and South America are split: the Pacific Northwest and the Andes largely display Attr-N order; the Amazon largely displays N-Attr order; and Central America has several well-grouped regions displaying each order. The absence of any dominant order is most common in Australia and in South America, especially in the transition region between the Attr-N Andes and the N-Attr Amazon.

The order of possessor and possessed noun (WO-03) is strongly PSSR-PSSD in the sample as a whole, with the PSSD-PSSR order clustering among the Pacific Northwest and Central American languages. These are both regions that are characterized by a V-initial transitive word order (WO-01), although a typological (rather than areal and coincidental) correlation between the two features is not apparent. As with other features, the absence of any dominant order is especially common in Australia, although even there it is a minority state.

Contributions

Conceptualization: David Inman

Data collection: David Inman, Natalia Chousou-Polydouri, Marine Vuillermet, Nora Muheim

Supervision of data collection: David Inman, Natalia Chousou-Polydouri

Computer code: David Inman

References

  • Austin, Peter K. 2001. Word Order in a Free Word Order Language: the case of Jiwarli. In Simpson, Jane and Nash, David and Laughren, Mary and Austin, Peter and Alpher, Barry (eds.), Forty Years On: Ken Hale and Australian Languages, 305–324. Canberra: Research School of Pacific and Asian Studies, Australian National University.

  • Beck, David. 1996. Subjecthood, agency, and topicality in Lushootseed. Toronto Working Papers in Linguistics 15.

  • Beck, David. 2000. Grammatical Convergence and the Genesis of Diversity in the Northwest Coast Sprachbund. Anthropological Linguistics 42(2). 147–213.

  • Buckley, Eugene. 1989. The Alsea noun phrase. Kansas Working Papers in Linguistics 14(2). 20–36.

  • Campbell, Lyle. 1997. American Indian Languages: the Historical Linguistics of Native America. (Oxford Studies in Anthropological Linguistics, 4.) Oxford: Oxford UP: Oxford University Press. xi+512pp.

  • Campbell, Lyle & Verónica Grondona. 2012. The Indigenous Languages of South America: A Comprehensive Guide. (The World of Linguistics, 2.) De Gruyter Mouton.

  • Comrie, Bernard. 1989. Language universals and linguistic typology: Syntax and morphology. University of Chicago press.

  • Dryer, Matthew S. 1992. The Greenbergian word order correlations. Language 68(1). 81–138. Linguistic Society of America.

  • Dryer, Matthew S. 1996. Word order typology. Handbook on syntax 2. 1050–1065. De Gruyter Mouton.

  • Dryer, Matthew S. 1997. On the six-way word order typology. Studies in Language 21(1). 69–103. John Benjamins.

  • Dryer, Matthew S. 2013. Order of Subject, Object and Verb. In Matthew S. Dryer and Martin Haspelmath (eds.), The World Atlas of Language Structures Online. Leipzig: Max Planck Institute for Evolutionary Anthropology.

  • Dryer, Matthew S. 2013. Order of Genitive and Noun. In Matthew S. Dryer and Martin Haspelmath (eds.), The World Atlas of Language Structures Online. Leipzig: Max Planck Institute for Evolutionary Anthropology.

  • Dryer, Matthew S. 2013. Order of Adjective and Noun. In Matthew S. Dryer and Martin Haspelmath (eds.), The World Atlas of Language Structures Online. Leipzig: Max Planck Institute for Evolutionary Anthropology.

  • Dryer, Matthew S. 2013. On the six-way word order typology, again. Studies in Language 37(2). 267–301. John Benjamins.

  • Emlen, Nicholas Q. 2014. Language and coffee in a trilingual Matsigenka-Quechua-Spanish frontier community on the Andean-Amazonian borderland of Southern Peru. University of Michigan dissertation.

  • Frachtenberg, Leo Joachim. 1918. Alsea grammar. Washington, D.C.: Smithsonian Institution, MS. 431pp.

  • Frachtenberg, Leo J. 1922. Coos. In Franz Boas (ed.), Handbook of American Indian languages: Volume 2, 297–430. Washington: Government Printing Office.

  • Greenberg, Joseph H. 1963. Universals of language. Cambridge, Ma: MIT Press. 73–113pp.

  • Hale, Ken. 1980. On the position of Walbiri in a typology of the base. Bloomington: Indiana University Linguistics Club.

  • Hale, Ken. 1983. Warlpiri and the Grammar of Non-Configurational Languages. Natural Language & Linguistic Theory 1(1). 5–47. Springer.

  • Krasnoukhova, Olga. 2016. Noun Phrase (NP). In Muysken, Pieter and Harald Hammarström and Olga Krasnoukhova and Neele Müller and Joshua Birchall and Simon van de Kerke and Loretta O'Connor and Swintha Danielsen and Rik van Gijn and George Saad (eds.), South American Indian Language Structures (SAILS) Online. Leipzig: Max Planck Institute for Evolutionary Anthropology.

  • Krupp, Martha. 1983. Word order in Russian: an experimental study. University of Alberta MA thesis. (xi+94pp.)

  • LaPolla, Randy & Dory Poa. 2008. On describing word order. In Felix K. Ameka and Alan Dench and Nicholas Evans (eds.), Catching Language: The Standing Challenge of Grammar Writing, 269–296. De Gruyter Mouton.

  • Legate, Julie Anne. 2002. Warlpiri: theoretical implications. Cambridge, MA: Massachusetts Institute of Technology dissertation. (283pp.)

  • Li, Charles & Sandra Thompson. 1981. A functional reference grammar of Mandarin Chinese. Berkeley and Los Angeles, CA: University of California Press.

  • Michael, Lev David. 2008. Nanti evidential practice: Language, knowledge, and social action in an Amazonian society. University of Texas at Austin dissertation. (xxv+462pp.)

  • Michael, Lev. 2014. The Nanti reality status system: Implications for the typological validity of the realis/irrealis contrast. Linguistic Typology 18(2). 251–288.

  • Morgan, Lawrence R, Wallace Chafe & Leanne Hinton. 1985. Northwest languages in search of a universal word order typology.

  • O'Hagan, Zachary. 2018. The Syntax of Matsigenka Object-Marking. Berkeley Papers in Formal Linguistics 1(1).

  • Simpson, Jane H. 1983. Aspects of Warlpiri Morphology and Syntax. Cambridge, MA: Massachusetts Institute of Technology dissertation. (533pp.)

  • Umaña, Adolfo Constenla. 1991. Las lenguas del área intermedia: Introducción a su estudio areal. Editorial Universidad de Costa Rica.

  • Wier, Thomas. 2016. A sketch grammar of Tonkawa. Ms. 41pp.


  1. This excludes existential (“NP is NP”) and attributive (“NP is Adj”) sentences, which often have special syntax or special marking. 

  2. The order of the intransitive clause was not investigated because of an overarching concern with feature independence. Rather than collecting both and then attempting to disentangle the dependencies between them, we only collected data on the transitive clause, following prior research emphasizing its importance.