The Areal Typology of Languages of the Americas (ATLAs) is a database of typological features targeting areal linguistic structures in North and South America, collected from language descriptions by a team of 21 authors.
ATLAs was first published in 2025 in Scientific Data. We periodically publish corrections, and thus any citation of the ATLAs data should include a link to the version used, as listed on Zenodo.
The language sample of ATLAs includes 325 languages, of which 115 are in North America and 105 in South America. Their geographic distribution and family affiliation can be seen in the map below.
The features of ATLAs were designed in order to maximize areal signal, with depth of coverage taking a priority over breadth. Some common typological features are thus not included. However, those features which are present are organized into feature sets, which as a whole are intended to cover their linguistic domain exhaustively. This design choice maximizes the chance that areally-relevant variation present within the typological domains surveyed is captured. To the extent possible, we tried not to include grammatical structures which could be identified as the result of recent contact between Indigenous American and European languages.
Features are further designed with the intention of maximizing logical independence, or making it possible to staightforwardly manipulate the data in order to attain logically independent variables. Many features contain <NA> states — not applicable — in order to avoid features that encode the same information multiple times. For example, if a language has no verbal indexation, its verbal alignment is <NA> — not applicable — rather than <neutral> — a neutral alignment.
ATLAs online requires a browser with Javascript enabled.
Data in ATLAs is organized into feature sets, which cover a typological domain or theme, and each feature set includes multiple features. Feature set descriptions provide lists of associated features, their linguistic motivation, coding methodology, and results, and can be accessed through the tab "Feature Sets" in the navigation bar. Alternatively, features can be accessed directly, without going through feature sets description, in the "Features" tab of the same navigation bar.
You can also browse and search for languages through the tab "Languages" on the navigation bar. Here, features can be viewed collectively for individual languages.
The contributors to the database are given in the "Contributors" tab, which associates individual authors and contributors with particular feature sets.
Finally, you can search for references through the "References" tab.
If you are citing the ATLAs database as a whole without using any associated data, you should cite the database as published in Scientific Data.
If you are using data from multiple feature sets of ATLAs, you should cite the database as a whole, as published in Scientific Data, as well as the specific version of the database from Zenodo you are using.
If you are citing a particular feature set description (without using any associated data), you can cite only the specific feature set, as seen in the feature set page.
If you are using data from a specific feature set, then you should cite the feature set in question, as well as the database publication in Scientific Data and the specific version of the database from Zenodo.
The content of this web site is published under a Creative Commons Licence. We invite scholars and users to use this data for further linguistic study, and look forward to comments, feedback, and questions.
ATLAs was funded by the Swiss National Science Foundation Sinergia Project "Out of Asia" (CRSII5_183578), the University of Zurich, and the Swiss National Centre of Competence in Research Evolving Language.