Publications

An Information-Theoretic Characterization of Morphological Fusion

Linguistic typology generally divides synthetic languages into groups based on their morphological fusion. However, this measure has …

Simple induction of (deterministic) probabilistic finite-state automata for phonotactics by stochastic gradient descent

We introduce a simple and highly general phonotactic learner which induces a probabilistic finite-state automaton from word-form data. …

Predicting cross-linguistic adjective order with information gain

Deep Subjecthood: Higher-Order Grammatical Features in Multilingual BERT

We investigate how Multilingual BERT (mBERT) encodes grammar by examining how the high-order grammatical feature of morphosyntactic …

Word order affects the frequency of adjective use across languages

What's new? A comprehension bias in favor of informativity

The Natural Stories corpus: A reading-time corpus of English texts containing rare syntactic constructions

Sensitivity as a Complexity Measure for Sequence Classification Tasks

Abstract We introduce a theoretical framework for understanding and predicting the complexity of sequence classification tasks, using a …

Modeling word and morpheme order in natural language as an efficient tradeoff of memory and surprisal

Memory limitations are known to constrain language comprehension and production, and have been argued to account for crosslinguistic …

Do dependency lengths explain constraints on crossing dependencies?

Dependency locality as an explanatory principle for word order

Communication efficiency of color naming across languages provides a new framework for the evolution of color terms

What syntactic structures block dependencies in RNN language models?

Syntactic dependencies correspond to word pairs with high mutual information

Structural supervision improves learning of non-local grammatical dependencies

Neural language models as psycholinguistic subjects: Representations of syntactic state

How efficiency shapes human language

Are formal restrictions on crossing dependencies epiphenomenal?

What do RNN language models learn about filler--gap dependencies?

The Natural Stories Corpus

Mutual information impacts adjective ordering across languages

Comprehenders model the nature of noise in the environment

An information-theoretic explanation of adjective ordering preferences

A statistical comparison of some theories of NP word order

Generalizing dependency distance

Don't underestimate the benefits of being misunderstood

Color naming across languages reflects color use

A generative model of phonotactics

A functional theory of gender paradigms

Memory access during incremental sentence processing causes reading time latency

A meta-analysis of syntactic priming in language production

A corpus investigation of syntactic embedding in Pirahã

Rythm's role in the genitive construction choice in spoken English

Quantifying word order freedom in dependency corpora

Cross-linguistic gestures reflect typological universals: A subject-initial, verb-final bias in speakers of diverse languages

The `universal' structure of name grammars and the impact of social engineering on the evolution of natural information systems