David Goldstein

Professor of Linguistics, Indo-European Studies, and Classics

University of California, Los Angeles

Biography

I hold a joint position in the Department of Linguistics and the Program in Indo-European Studies at UCLA, along with a courtesy appointment in the Department of Classics. My research centers on two main areas. One line of work applies computational phylogenetics and statistical modeling to the study of language change and sociocultural evolution. The other explores how language change informs linguistic theory, with a focus on syntactic and semantic change in Indo-European.

From December 2022 through July 2023, I was a Visiting Fellow at Clare Hall, University of Cambridge and during Easter Term 2023, I served as a Lewis-Gibson Fellow at the Cambridge Centre for Greek Studies. For 2025-26, I will be a Fellow at the Swedish Collegium for Advanced Study (SCAS).

I am honored to be a member of the 2021 cohort of Guggenheim Fellows.

Download my CV.

Interests

Computational phylogenetics
Syntactic, morphosyntactic, and semantic change
Indo-European
Quantitative methods
Corpus linguistics

Education

Ph.D., 2010
University of California, Berkeley
M.A., 2004
University of California, Berkeley
M.Phil., 2002
University of Oxford, Corpus Christi College
B.A., 2000
Amherst College

Recent & Upcoming Talks

From phonology to phylogeny

May 25, 2023 1:00 PM — 2:00 PM

A new approach to the diversification of ancient Greek

Apr 27, 2023 6:00 PM — 8:00 PM Room 1.11, Faculty of Classics

See all talks

Featured Publications

David Goldstein, Shawn McCreight, Éva Buchi, John Huelsenbeck

June 2024 The Evolution of Language: Proceedings of the 15th International Conference (Evolang XV), eds. Jonas Nölle, Limor Raviv, Kirstie Emma Graham, Stefan Hartmann, Yannick Jadoul, Mathilde Josserand, Theresa Matzinger, Katie Mudd, Michael Pleyer, Anita Slonimska, Sławomir Wacewicz, and Stuart Watson. Nijmegen: Max Planck Institute for Psycholinguistics, pp. 220–222.

An event-based model for linguistic phylogenetics

Linguistic phylogenies are standardly inferred from lexical cognate relationships (e.g., Bouckaert et al. 2012, Chang et al. 2015, Sagart et al. 2019). Despite the prevalence of this practice, it suffers from well-known drawbacks. First, it disregards the phylogenetic signal that exists in the form of the words themselves. Second, it limits the modeling possibilities since it relies on an arbitrary coding of the data. In this talk, I introduce a novel framework for linguistic phylogenetics that overcomes both of these shortcomings. The heart of this framework is the TKF91 model (Thorne et al. 1991), which allows phylogenetic inference to be carried out directly from cognate word-forms. This model not only opens up a new horizon in the study of linguistic phylogenetics, but allows historical linguists to investigate questions of sound change that were previously out of reach.

PDF DOI PDF

David Goldstein

March 2024 Diachronica

Divergence-time estimation in Indo-European: The case of Latin

Divergence-time estimation is one of the most important endeavors in historical linguistics. Its importance is matched only by its difficulty. As Bayesian methods of divergence-time estimation have become more common over the past two decades, a number of critical issues have come to the fore, including model sensitivity, the dependence of root-age estimates on uncertain interior-node ages, and the relationship between ancient languages and their modern counterparts. This study addresses these issues in an investigation of a particularly fraught case within Indo-European, the diversification of Latin into the Romance languages. The results of this study support a gradualist account of their formation that most likely begins after 300 CE. They also bolster the view that Classical Latin is a sampled ancestor of the Romance languages (i.e., it lies along the branch leading to the Romance languages).

PDF Code Dataset DOI

David Goldstein

June 2022 Diachronica

Correlated grammaticalization: The rise of articles in Indo-European

Grammaticalization is characterized by robust directional asymmetries (e.g., Kuteva et al. 2019). For instance, body-part nominals develop into spatial adpositions, minimizers develop into negation markers, and subject pronouns become agreement markers. Changes in the opposite direction are either rare or unattested (Garrett 2012:52). Such robust cross-linguistic asymmetries have led some scholars to reify grammaticalization trajectories as universal mechanistic forces (Heath 1998:729). One consequence of such a view is that the ambient morphosyntax of a language has little or even no relevance for grammaticalization. This paper uses Bayesian phylogenetic methods to demonstrate the critical role that pre-existing morphosyntax can play in grammaticalization. The empirical basis for this claim is the grammaticalization of definite and indefinite articles in the history of Indo-European: indefinite articles developed at a faster rate among languages in which a definite article had already emerged compared to those lacking a definite article. The two changes are thus correlated. The results of this case study suggest that there is much more to be learned about when and why grammaticalization occurs by investigating its relationship to the pre-existing linguistic system (cf. Reinöhl and Himmelmann 2017:381).

PDF Code Dataset DOI

Recent Publications

David Goldstein (2025). Bayesian phylogenetic methods overcome limitations of traditional subgrouping.

David Goldstein (2024). Diachronica at 40. Diachronica.

PDF DOI

David Goldstein (2022). Toward a non-teleological account of demonstrative reinforcement. Life cycle of language: Past, present, and future, ed. Alan C. L. Yu and Darya Kavitskaya. Oxford: Oxford University Press.

PDF DOI

David Goldstein (2022). The Old Irish article. Journal of Celtic Linguistics 23:1–34.

PDF DOI

David Goldstein (2022). There’s no escaping phylogenetics. Ha! Linguistic studies in honor of Mark R. Hale, ed. Laura Grestenberger, Charles Reiss, Hannes A. Fellner and Gabriel Z. Pantillon, 71-91. Wiesbaden: Reichert.

PDF Code Dataset

David Goldstein (2021). Review of L. Danckaert, The development of Latin clause structure: A study of the extended verb phrase (Oxford 2017). Kratylos 66:50–57.

PDF DOI

See all publications