SCHROERAND YU 7of8
thanpreviouswork(e.g.,Yu&Smith,2012)–whichmayhaveincreased
the learning demands on our participants. Second, although we chose
target words that are unlikely to be in early vocabulary, it is possi-
ble that infants in the study had prior exposure to the 10 words. The
effect of prior knowledge can be difficult to predict – prior exposure
maybeafactorthatcontributestothelearningoutcomesmeasuredat
test,butpreviousworkalsosuggeststhatinfantshaveworseretention
of novel object-label mappings when learning in the context of well-
knownwords(Kuckeret al.,2020).Furthermore,despitethe wide age
rangeof ourparticipants,neitherthe infant participant’s agenortheir
concurrent vocabulary size predicted their performance at the test
(correlations with MCDI scores reported in Table S2), suggesting that
prior knowledge may not have had a direct impact on word learning
in the present study. Lastly, we did not collect comprehensive demo-
graphic information from our participants, including whether infants
were learning English as their first language. Nonetheless, we anno-
tated and examined non-English words used in the play session and
found that the three infants that heard any non-English words per-
formed similarly to other subjects at test (these infants still heard the
targetobjectlabelsinEnglish;seeTableS1).Futureworkthatconsiders
whether the microlevelbehaviors examined in the present study vary
across different demographic groups would undoubtedly be a major
contributiontothefield.
5 CONCLUSION
We examined multimodal and social factors that support infant word
learning in naturalistic parent–infant play when learning is not an
exogenous goal of the interaction. Our findings suggest that only
studying infant-looking behavior and parent speech is not enough.
Considering how infants’ bodies shape their visual input will foster a
richer,mechanisticunderstandingofearlylanguageacquisition.
ACKNOWLEDGMENTS
This work was supported by NIH R01HD074601 and R01HD093792
to CY. SES was supported by the NSF GRFP (DGE-1610403) and NIH
T32HD007475. We thank the Computational Cognition and Learning
LabatIndianaUniversityandtheDevelopmentalIntelligenceLabatUT
Austin,especiallyDanielPearcy,HannahBurrell,DianZhi,Tian(Linger)
Xu, Julia Yurkovic-Harding,Drew Abney, Andrei Amatuni, and Jeremy
Borjon for their support in data collection, coding, and many fruitful
discussions.
CONFLICT OF INTEREST
Theauthorsdeclarenoconflictofinterest.
DATA AVAILABILITY STATEMENT
Thedataarenotpubliclyavailableduetoprivacyorethicalrestrictions,
butareavailableuponrequestfromthecorrespondingauthor.
ORCID
Sara E Schroer
https://orcid.org/0000-0002-6139-060X
ENDNOTE
1
To confirm our results, we also used stricter criteria to score trials as
correct/incorrect, for example, the infant must look at target for at least
250 ms longer than the distractor to be correct. This increases the num-
ber of not learned items and decreases the number of learned items.
Nonetheless,themainresultsofthepaperstillhold–showingthatinfants’
multimodalattentionisthestrongestpredictoroflearningandthatvisual
attentionaloneinthe3sbeforenamingnegativelypredictslearning.
REFERENCES
Amatuni, A., Schroer, S. E., Peters, R. E., Reza, M. A., Zhang, Y., Crandall,
D., & Yu, C. (2021). In the-moment visual information from the infant’s
egocentricview determines thesuccess ofinfant word learning:A com-
putational study. In Proceedings of the 43rd Annual Meeting of the
CognitiveScienceSociety.
Baldwin, D. A. (1993). Early referential understanding: Infants’ ability to
recognize referential acts for what they are. Developmental Psychology,
29(5),832–843.
Bambach, S., Crandall, D. J., Smith, L. B., & Yu, C. (2018). Toddler-inspired
visualobject learning. InProceedings of the Advances in NeuralInforma-
tionProcessingSystems(NeurIPS),(pp.31).
Bunce, J. P., & Scott, R. M. (2017). Finding meaning in a noisy world:
Exploring the effects of referential ambiguity and competition on 2.5-
year-olds’ cross-situational word learning. Journal of Child Language,
44(3),650–676.
Chang,L.,deBarbaro,K.,&Deák,G.(2016).Contingenciesbetweeninfants’
gaze, vocal,and manual actions and mothers’ object-naming: Longitudi-
nalchangesfrom4to9months.Developmental Neuropsychology,41(5–8),
342–361.
Chang, L. M., & Deák, G. O. (2019). Maternal discourse continuity and
infants’ actions organize 12-month-olds’ language exposure during
objectplay.Developmental Science,22(3),e12770.
Cheung, R. W., Hartley, C., & Monaghan, P. (2021). Caregivers use ges-
ture contingently to support word learning. Developmental Science, 24,
e13098.
Davoli, C. C., & Brockmole, J. R. (2012). The hands shield attention from
visual interference. Attention, Perception, & Psychophysics, 74(7), 1386–
1390.
de Villiers Rader, N., & Zukow-Goldring, P. (2012). Caregivers’ gestures
direct infant attention during early word learning: The importance of
dynamicsynchrony.Language Sciences,34(5),559–568.
Fenson,L.,Dale,P.S.,Reznick,J.S.,Thal,D.,Bates,E.,Hartung,J.P.,&Reilly,
J.S.(1993).MacArthur communicative development inventories: User’s guide
and technical manual.PaulH.Brookes.
Gogate, L. J., Bahrick, L. E., & Watson, J. D. (2000). A study of multimodal
motherese: The role of temporal synchrony between verbal labels and
gestures.Child Development,71(4),878–894.
Kelly, S. P., & Brockmole, J. R. (2014). Hand proximity differentially affects
visual working memory for color and orientation in a binding task.
Frontiers in Psychology,5,318.
Kretch, K. S., Franchak, J.M., & Adolph, K. E. (2014). Crawling and walking
infantsseetheworlddifferently.Child Development,85(4),1503–1518.
Kucker,S. C.,McMurray,B.,& Samuelson,L.K. (2020).Sometimes itis bet-
ter to know less: How known words influence referent selection and
retention in 18- to 24-month-old children. Journal of Experimental Child
Psychology,189,104705.
Kuznetsova, A., Brockhoff, P. B., & Christensen, R. H. B. (2017). lmerTest
package: Tests in linear mixed effects models. JournalofStatistical
Software,82(13),1–26.
Libertus,K.,Joh,A.S.,&Needham,A.W.(2016).Motortrainingat3months
affectsobjectexploration12monthslater.Developmental Science,19(6),
1058–1066.
MacDonald, K., Yurovsky, D., & Frank, M. C. (2017). Social cues modu-
late therepresentations underlying cross-situational learning. Cognitive
Psychology,94,67–84.
14677687, 0, Downloaded from https://onlinelibrary.wiley.com/doi/10.1111/desc.13290 by University Of Texas Libraries, Wiley Online Library on [09/01/2023]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License