EvoLangXI: Simple Agents Are Able To Replicate Speech Sounds Using 3d Vocal Tract Model

Piera Filippi, Jenna V. Congdon, John Hoang, Daniel Liu Bowling, Stephan Reber, Andrius Pašukonis, Marisa Hoeschele, Sebastian Ocklenburg, Bart de Boer, Christopher B. Sturdy, Albert Newen, Onur Güntürkün

Do Lab Attested Biases Predict The Structure Of A New Natural Language?

Molly Flaherty, Katelyn Stangl, Susan Goldin-Meadow

Phoneme Inventory Size Distributions And The Origins Of The Duality Of Patterning

Luke Fleming

Cooperative Communication And Communication Styles In Bonobos And Chimpanzees In The Wild: Same Same But Different?

Marlen Fröhlich, Paul H Kuchenbuch, Gudrun Müller, Barbara Fruth, Takeshi Furuichi, Roman M Wittig, Simone Pika

Integration Or Disintegration?

Koji Fujita, Haruka Fujita

Migration As A Window Into The Coevolution Between Language And Behavior

Victor Gay, Daniel Hicks, Estefania Santacreu-Vasut

Effects Of Task-specific Variables On Auditory Artificial Grammar Learning And Generalization

Andreea Geambasu, Michelle J. Spierings, Carel ten Cate, Clara C. Levelt

Intentional Meaning Of Bonobo Gestures

Kirsty Graham, Catherine Hobaiter, Richard Byrne

The Impact Of Communicative Network Structure On The Conventionalization Of Referring Expressions In Gesture

Matt Hall, Russell Richie, Marie Coppola

Plain Simple Complex Structures: The Emergence Of Overspecification In An Iterated Learning Setup

Stefan Hartmann, Peeter Tinits, Jonas Nölle, Thomas Hartmann, Michael Pleyer

Language Origins In Light Of Neuro-atypical Cognition And Speech Profiles

Wolfram Hinzen, Joana Rosselló

Deictic Tools Can Limit The Emergence Of Referential Symbol Systems

Elizabeth Irvine, Sean Roberts

Inferring The World Tree Of Languages From Word Lists

Gerhard Jaeger, Soeren Wichmann

Effort Vs. Robust Information Transfer In Language Evolution

T. Florian Jaeger, Maryia Fedzechkina

Nonlinear Biases In Articulation Constrain The Design Space Of Language

Rick Janssen, Bodo Winter, Dan Dediu, Scott Moisik, Sean Roberts

Simple Agents Are Able To Replicate Speech Sounds Using 3d Vocal Tract Model

Rick Janssen, Dan Dediu, Scott Moisik

Protolanguage Possibilities In A Construction Grammar Framework

Sverker Johansson

Modeling Language Change Triggered By Language Shift

Anna Jon-And, Elliot Aguilar

The Evolution Of Zipf’s Law Of Abbreviation

Jasmeen Kanwal, Kenny Smith, Jennifer Culbertson, Simon Kirby

The Spontaneous Emergence Of Linguistic Diversity In An Artificial Language

Deborah Kerr, Kenny Smith

Evolution Of The Language-ready Brain: Warfare Or ‘mother Tongues’?

Chris Knight, Camilla Power

A General Auditory Bias For Handling Speaker Variability In Speech? Evidence In Humans And Songbirds.

Buddhamas Kriengwatana, Paola Escudero, Anne Kerkhoven, Carel ten Cate

Cumulative Vocal Cultures In Orangutans And Their Ontogenetic Origin

Adriano Lameira, Jeremy Kendal, Marco Gamba

The Emergence Of Argument Marking

Sander Lestrade

Learnability Pressures Influence The Encoding Of Information Density In The Lexicon

Molly Lewis, Michael C. Frank

A Developmental Perspective On Language Origin: Children Are Old Hands At Gesture

Casey Lister, Tiarn Burtenshaw, Nicolas Fay, Bradley Walker, Jeneva Ohan

Emergence Of Signal Structure: Effects Of Duration Constraints

Hannah Little, Kerem Eryılmaz, Bart de Boer

Differing Signal-meaning Dimensionalities Facilitates The Emergence Of Structure

Hannah Little, Kerem Eryılmaz, Bart de Boer

Correlated Evolution Or Not? Phylogenetic Linguistics With Syntactic, Cognacy, And Phonetic Data

Giuseppe Longobardi, Armin Buch, Andrea Ceolin, Aaron Ecay, Cristina Guardiano, Monica Irimia, Dimitris Michelioudakis, Nina Radkevich, Gerhard Jaeger

The Evolution Of Redundancy In A Global Language

Gary Lupyan, Justin Sulik

Nonhuman Animals’ Use Of Ostensive Cues In An Object Choice Task

Heidi Lyn, Stephanie Jett, Megan Broadway, Mystera Samuelson

Language Adapts To Signal Disruption In Interaction

Vinicius Macuch Silva, Sean Roberts

Biological Systems Of Interest To Researchers Of Cultural Evolution

Luke McCrohon

Preliminary Results From A Computational Multi Agent Modelling Approach To Study Humpback Whale Song Cultural Transmission

Michael McLoughlin, Luca Lamoni, Ellen Garland, Simon Ingram, Alexis Kirke, Michael Noad, Luke Rendell, Eduardo Miranda

Human-like Brain Specialization In Baboons: An In Vivo Anatomical MRI Study Of Language Areas Homologs In 96 Subjects

Adrien Meguerditchian, Damien Marie, Konstantina Margiotoudi, Scott A. Love, Alice Bertello, Romain Lacoste, Muriel Roth, Bruno Nazarian, Jean-Luc Anton, Olivier Coulon

Linking The Processes Of Language Evolution And Language Change: A Five-level Hierarchy

Jérôme Michaud

Interaction For Facilitating Conventionalization: Negotiating The Silent Gesture Communication Of Noun-verb Pairs

Ashley Micklos

The Evolution Of Repair: Evidence From Online Conversations

Gregory Mills

Arbitrary Hierarchy: A Precedent For Language?

Dominic Mitchell

How Selection For Language Could Distort The Dynamics Of Human Evolution

William Mitchener

Make New With Old: Human Language In Phylogenetically Ancient Brain Regions

Marie Montant, Johannes Ziegler, Benny Briesemeister, Tila Brink, Bruno Wicker, Aurélie Ponz, Mireille Bonnard, Arthur Jacobs, Mario Braun

Frequency-dependent Regularization In Iterated Learning

Emily Morgan, Roger Levy

The Effect Of Modality On Signal Space In Natural Languages

Hope Morgan

Linguistic Structure Emerges In The Cultural Evolution Of Artificial Sign Languages

Yasamin Motamedi, Marieke Schouwstra, Kenny Smith, Simon Kirby

Self-organization In Sound Systems: A Model Of Sound Strings Processing Agents

Roland Mühlenbernd, Johannes Wahle

A Social Dimension Of Language Evolution

Albert Naccache

Edward Sapir And The Origin Of Language

Albert Naccache

Shared Basis For Language And Mathematics Revealed By Cross-domain Syntactic Priming

Tomoya Nakai, Kazuo Okanoya

Measuring Conventionalization In The Manual Modality

Savithry Namboodiripad, Daniel Lenzen, Ryan Lepic, Tessa Verhoef

Quantifying The Semantic Value Of Words

Dillon Niederhut

The Arbitrariness Of The Sign Revisited: The Role Of Phonological Similarity

Alan Nielsen, Dieuwke Hupkes, Simon Kirby, Kenny Smith

Semantic Approximation And Its Effect On The Development Of Lexical Conventions

Bill Noble, Raquel Fernández

Domestication And Evolution Of Signal Complexity In Finches

Kazuo Okanoya

Parrot "Phonological Regression": Expanding Our Understanding Of The Evolution Of Vocal Learning

Irene M. Pepperberg, Katia Zilber-Izhar, Scott Smith

Early Learned Words Are More Iconic

Lynn Perry, Marcus Perlman, Gary Lupyan, Bodo Winter, Dominic Massaro

Cooperative Communication: What Do Primates And Corvids Have To Tell?

Simone Pika

Construction Grammar For Apes

Michael Pleyer, Stefan Hartmann

The Evolution Of Im/politeness

Monika Pleyer, Michael Pleyer

What Kind Of Grammar Did Early Humans (and Neanderthals) Command? A Linguistic Reconstruction

Ljiljana Progovac

The Cultural Evolution Of Structure In Music And Language

Andrea Ravignani, Tania Delgado, Simon Kirby

Languages Support Efficient Communication About The Environment: Words For Snow Revisited

Terry Regier, Alexandra Carstensen, Charles Kemp

Strategies In Gesture And Sign For Demoting An Agent: Effects Of Language Community And Input

Lilia Rissman, Laura Horton, Molly Flaherty, Marie Coppola, Annie Senghas, Diane Brentari, Susan Goldin-Meadow

Social Biases Versus Efficient Communication: An Iterated Learning Study

Gareth Roberts, Maryia Fedzechkina

Vocal Learning And Homo Loquens

Joana Rosselló

The Cultural Evolution Of Complexity In Linguistic Structure

Carmen Saldana, Simon Kirby, Kenny Smith

Skepticism Towards Skepticism Towards Computer Simulation In Evolutionary Linguistics

Carlos Santana

From Natural Order To Convention In Silent Gesture

Marieke Schouwstra, Kenny Smith, Simon Kirby

Active Control Of Complexity Growth In Naming Games: Hearer's Choice

William Schueller, Pierre-Yves Oudeyer

Mind The Gap: Inductive Biases In Phonological Feature Learning

Klaas Seinhorst

Children's Production Of Determiners As A Test Case For Innate Syntactic Categories

Catriona Silvey, Christos Christodoulopoulos

Vocal Learning In Functionally Referential Chimpanzee Food Calls

Katie Slocombe, Stuart Watson, Anne Schel, Claudia Wilke, Emma Wallace, Leveda Cheng, Victoria West, Simon Townsend

Chimpanzees Process Structural Isomorphisms Across Sensory Modalities

Ruth Sonnweber, Andrea Ravignani

Rule Learning In Birds: Zebra Finches Generalize By Positional Similarities, Budgerigars By The Structural Rules.

Michelle Spierings, Carel ten Cate

Minimal Pressures Leading To Duality Of Patterning

Matthew Spike, Kenny Smith, Simon Kirby

Information Dynamics Of Learned Signalling Games

Matthew Spike, Simon Kirby, Kenny Smith

Metalinguistic Awareness Of Trends As A Driving Force In Language Change: An Empirical Study

Kevin Stadler, Elyse Jamieson, Kenny Smith, Simon Kirby

The Grammar Of The Body And The Emergence Of Complexity In Sign Languages

Rose Stamp, Wendy Sandler

Failures Of Perspective Taking In An Open-ended Signaling Task

Justin Sulik, Gary Lupyan

Against The Emergent View Of Language Evolution

Maggie Tallerman

Evidence Of Descent With Modification And Selection In Iterated Learning Experiments

Monica Tamariz, Joleana Shurley

What Is Unique About The Evolution Of Language Compared To Other Cultural Domains? An Experimental Study Of Language, Technology And Art

Monica Tamariz, Jon W. Carr

Learning To Learn From Similar Others: Approximate Bayesian Computation Through Babbling

Bill Thompson, Heikki Rasilo

Interpreting Silent Gesture

Bill Thompson, Marieke Schouwstra, Henriëtte de Swart

Arbitrariness Of Iconicity: The Sources (and Forces) Of (dis)similarities In Iconic Representations

Oksana Tkachman, Carla L. Hudson Kam

Experimental Evidence For Phonemic-like Contrasts In A Nonhuman Vocal System

Simon Townsend, Andrew Russell, Sabrina Engesser

Modeling The Emergence Of Creole Languages

Francesca Tria, Vittorio Loreto, Vito Servedio, Salikoko S. Mufwene

Dendrophobia In Bonobo Comprehension Of Spoken English

Robert Truswell

A Constant Rate Effect Without Stable Functions

Robert Truswell, Nikolas Gisborne

Norms For Constructing Language In Humans And Animals

Robert Ullrich

Addressees Use Zipf's Law As A Cue For Semantics

Freek Van de Velde, Dirk Pijpops

A Continuum Of Human Cognitive-linguistic Evolution

Olga Vasileva

Language Evolution In Ontogeny And Phylogeny

Olga Vasileva

Constituent Order In Pictorial Representations Of Events Is Influenced By Language

Anu Vastenius, Jordan Zlatev, Joost Van de Weijer

Iconicity, Naturalness And Systematicity In The Emergence Of Sign Language Structure

Tessa Verhoef, Carol Padden, Simon Kirby

Language Evolution And Language Origins In Teaching Linguistics At The University Level

Slawomir Wacewicz, Przemyslaw Zywiczynski, Arkadiusz Jasinski

Languages Prefer Robust Phonemes

Andrew Wedel, Bodo Winter

Rethinking Zipf’s Frequency-meaning Relationship: Implications For The Evolution Of Word Meaning

Bodo Winter, David Ardell

The Structure Of Iconicity In The English Lexicon

Bodo Winter, Lynn Perry, Marcus Perlman, Gary Lupyan

Signal Autonomy Is Shaped By Contextual Predictability

James Winters, Simon Kirby, Kenny Smith

The Cultural Co-evolution Of Language And Mindreading

Marieke Woensdregt, Kenny Smith, Chris Cummins, Simon Kirby

Genetic Drift Explains Sapir's "drift" In Semantic Change

Igor Yanovich

A Game Theoretic Account Of Semantic Subjectification In The Cultural Evolution Of Languages

Eva Zehentner, Andreas Baumann, Nikolaus Ritt, Christina Prömer

Deep Learning Models Of Language Processing And The Evolution Of Syntax

Willem Zuidema

Language-biology Coevolution Fixation Times

Bart de Boer

Category Learning In Audition, Touch, And Vision

Sabine van der Ham, Bill Thompson, Bart de Boer

Simple Agents Are Able To Replicate Speech Sounds Using 3d Vocal Tract Model

Rick Janssen¹, Dan Dediu¹ and Scott Moisik²
1 Max Planck Institute for Psycholinguistics
2 MPI

Keywords: agent modelling, anatomical biasing, evolutionary computation, neural networks

Short description: Simple neural network agents are able to replicate speech sounds using a 3D vocal tract model. Investigation of anatomical biases in population is now feasible.

Abstract:

Many factors have been proposed to explain why groups of people use different speech sounds in their language. These range from cultural, cognitive, and environmental factors (e.g., Everett et al., 2015) to anatomical ones (e.g., vocal tract (VT) morphology). How could such anatomical properties have led to the similarities and differences in speech sound distributions between human languages?

It is known that hard palate profile variation can induce different articulatory strategies in speakers (e.g., Brunner et al., 2009). That is, different hard palate profiles might induce a kind of bias on speech sound production, easing some types of sounds while impeding others. In a population of speakers in which a proportion of individuals share certain anatomical properties, even subtle VT biases might become expressed at the population level (e.g., through bias amplification; Kirby et al., 2007). However, before we look into population-level effects, we should first look at within-individual anatomical factors. For that, we have developed a computer-simulated analogue of a human speaker: an agent. Our agent is designed to replicate speech sounds in a computationally tractable manner, using a production module and a cognition module.

Previous agent models have often used more abstract (e.g., symbolic) signals (e.g., Kirby et al., 2007). We have equipped our agent with a three-dimensional model of the VT (the production module, based on Birkholz, 2005), to which we made numerous adjustments. Specifically, we used a 4th-order Bezier curve that is able to capture hard palate variation on the mid-sagittal plane (XXX, 2015). Using an evolutionary algorithm, we were able to fit the model to human hard palate MRI tracings, yielding high-accuracy fits with as few as two parameters. Finally, we show that the samples are well dispersed across the parameter space, demonstrating that the model cannot generate unrealistic profiles. We can thus use this procedure to import palate measurements into our agent’s production module to investigate the effects on acoustics. We can also exaggerate existing biases or introduce novel ones.
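As a rough illustration of the palate-fitting step described above, the Python sketch below evaluates a Bezier curve with five control points (assuming "4th-order" means degree four) and fits it to a target profile with a simple evolution strategy. The two parameters (`height` and `skew`), the control-point layout, the synthetic target standing in for an MRI tracing, and the greedy search settings are all assumptions made for illustration; the abstract does not specify the authors' parameterization or algorithm.

```python
import math
import random


def bezier(control_points, t):
    """Evaluate a Bezier curve of degree len(control_points) - 1 at t in [0, 1]."""
    n = len(control_points) - 1
    x = y = 0.0
    for i, (px, py) in enumerate(control_points):
        b = math.comb(n, i) * t**i * (1 - t) ** (n - i)  # Bernstein basis term
        x += b * px
        y += b * py
    return x, y


def palate_profile(height, skew, samples=25):
    """Hypothetical two-parameter palate: fixed endpoints, three inner points
    that share one height and one front-back offset (an assumed layout)."""
    ctrl = [(0.0, 0.0),
            (0.25 + skew, height),
            (0.50 + skew, height),
            (0.75 + skew, height),
            (1.0, 0.0)]
    return [bezier(ctrl, k / (samples - 1)) for k in range(samples)]


def fit_error(params, target):
    """Root-mean-square point distance between a candidate curve and the target."""
    curve = palate_profile(*params, samples=len(target))
    sq = sum((cx - tx) ** 2 + (cy - ty) ** 2
             for (cx, cy), (tx, ty) in zip(curve, target))
    return math.sqrt(sq / len(target))


def evolve_fit(target, generations=200, offspring=20, sigma=0.1, seed=0):
    """Greedy evolution strategy: mutate the current best, keep any improvement."""
    rng = random.Random(seed)
    best = (0.5, 0.0)
    best_err = fit_error(best, target)
    for _ in range(generations):
        for _ in range(offspring):
            child = (best[0] + rng.gauss(0, sigma), best[1] + rng.gauss(0, sigma))
            err = fit_error(child, target)
            if err < best_err:
                best, best_err = child, err
        sigma *= 0.98  # slowly narrow the mutation step
    return best, best_err


# Synthetic target generated from known parameters (not real MRI data).
target = palate_profile(0.8, 0.1)
params, err = evolve_fit(target)
print("fitted (height, skew):", params, "RMS error:", err)
```

Because the target here is generated by the same model, the search should recover parameters close to the ones used to create it; with real tracings the residual error would indicate how well two parameters capture a given palate.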

Our agent is able to control the VT model using the cognition module.

Previous research has focused on detailed neurocomputation (e.g., Kröger et al., 2014) that highlights, for example, neurobiological principles or speech recognition performance. However, the brain is not the focus of our current study. Furthermore, present-day computing throughput likely does not allow for large-scale deployment of these architectures, as required by the population model we are developing. Thus, the question of whether a very simple cognition module is able to replicate sounds in a computationally tractable manner, and even generalize over novel stimuli, is one worthy of attention in its own right.

Our agent’s cognition module is based on running an evolutionary algorithm on a large population of feed-forward neural networks (NNs). As such, (anatomical) bias strength can be thought of as an attractor basin area within the parameter space the agent has to explore. The NN we used is a three-layer, fully connected, directed graph. The input layer (three neurons) receives the formant frequencies of a target sound. The output layer (12 neurons) projects to the articulators in the production module. A hidden layer (seven neurons) enables the network to deal with nonlinear dependencies. The Euclidean distance (over the first three formants) between target and replication is used as the fitness measure. Results show that sound replication is indeed possible, with the Euclidean distance quickly approaching a close-to-zero asymptote.

Statistical analysis should reveal whether the agent can also: a) Generalize: Can it replicate sounds it was not exposed to during learning? b) Replicate consistently: Do different, isolated agents always converge on the same sounds? c) Deal with consolidation: Can it still learn new sounds after an extended learning phase (‘infancy’) has been terminated? Finally, a comparison with more complex models will be used to demonstrate robustness.
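The cognition module described above can be sketched in Python as follows: a population of 3-7-12 feed-forward networks is evolved so that the sound produced from each network's articulator outputs lies close, in Euclidean distance, to the target formants. This is a minimal sketch under stated assumptions. `toy_vocal_tract` is a hypothetical stand-in for the 3D VT model (it simply averages groups of articulator values), the targets are made-up normalized values rather than formants in Hz, and the tanh units, bias terms, population size, and mutation settings are not taken from the abstract.

```python
import math
import random

N_IN, N_HID, N_OUT = 3, 7, 12
# One bias per hidden and output neuron (the biases are an assumption).
N_WEIGHTS = (N_IN + 1) * N_HID + (N_HID + 1) * N_OUT


def forward(weights, formants):
    """3-7-12 feed-forward pass with tanh units; returns 12 articulator values."""
    idx = 0
    hidden = []
    for _ in range(N_HID):
        s = weights[idx + N_IN]  # bias term
        for i in range(N_IN):
            s += weights[idx + i] * formants[i]
        hidden.append(math.tanh(s))
        idx += N_IN + 1
    outputs = []
    for _ in range(N_OUT):
        s = weights[idx + N_HID]  # bias term
        for h in range(N_HID):
            s += weights[idx + h] * hidden[h]
        outputs.append(math.tanh(s))
        idx += N_HID + 1
    return outputs


def toy_vocal_tract(articulators):
    """Hypothetical production module: 12 articulator values -> 3 'formants'."""
    return [sum(articulators[4 * k:4 * k + 4]) / 4.0 for k in range(3)]


def distance(weights, targets):
    """Mean Euclidean distance between target and replicated formants."""
    total = 0.0
    for target in targets:
        produced = toy_vocal_tract(forward(weights, target))
        total += math.dist(target, produced)
    return total / len(targets)


def evolve(targets, pop_size=30, generations=150, sigma=0.2, seed=1):
    """Truncation selection with elitism and Gaussian mutation."""
    rng = random.Random(seed)
    pop = [[rng.gauss(0, 0.5) for _ in range(N_WEIGHTS)] for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        pop.sort(key=lambda w: distance(w, targets))
        history.append(distance(pop[0], targets))
        parents = pop[:pop_size // 5]  # the best fifth survives unchanged
        children = [[w + rng.gauss(0, sigma) for w in rng.choice(parents)]
                    for _ in range(pop_size - len(parents))]
        pop = parents + children
    best = min(pop, key=lambda w: distance(w, targets))
    history.append(distance(best, targets))
    return best, history


# Made-up normalized target formants (F1, F2, F3), not measurements.
TARGETS = [(0.3, -0.2, 0.5), (-0.4, 0.6, 0.1), (0.7, 0.2, -0.3)]
best, history = evolve(TARGETS)
print("distance at start:", history[0], "distance at end:", history[-1])
```

Because the best networks are carried over unchanged, the best distance in `history` can never increase from one generation to the next; how close it gets to zero depends on the stand-in production model and the settings chosen here.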


Citation:

Janssen R., Dediu D. and Moisik S. (2016). Simple Agents Are Able To Replicate Speech Sounds Using 3d Vocal Tract Model. In S.G. Roberts, C. Cuskley, L. McCrohon, L. Barceló-Coblijn, O. Fehér & T. Verhoef (eds.) The Evolution of Language: Proceedings of the 11th International Conference (EVOLANG11). Available online: http://evolang.org/neworleans/papers/97.html
