The Shaping of a Standard Voice: Sonic and Sociotechnical Imaginaries in Smart Speakers

Domenico Napolitano

Abstract


This article deals with the issue of sound imaginaries, using artificial voice as a case study. Specifically, it focuses on the project of creating a voice standard associated with the 1980s speech synthesis device DECTalk. Using a media-archaeological methodology which draws on heterogeneous empirical material about technological materialities and discursive representations, the article investigates the imaginary formative principles governing the definition of a vocal sound ideal type such as that incorporated in speech synthesis systems of the ‘80s. Referring to the most recent theories on sonic imagination, the article argues that these formative principles are still at work in current smart speakers, as they refer to the imaginary of a voice “in general”, which recalls ideas of authority, fidelity and transcendence. As the “standard” of artificial voice is an ideal type created at the crossroads between cultural ideas of voice within European philosophical tradition and the techno-scientific knowledge through which voice has been measured, reified and adopted by organizations, the article argues that sonic imaginary can be related to the framework of sociotechnical imaginary theory.


Keywords


Sonic imaginary; Sociotechnical imaginary | Voice; Speech synthesis; Media archaeology

Full Text:

PDF

References


Abruzzese A. (2001), L’intelligenza del mondo. Fondamenti di storia e teoria dell’immaginario, Roma, Meltemi.

Allen, J., Hunnicutt S. M., Klatt D. (1987), From Text to Speech: The MITalk system, Cambridge, Cambridge University Press.

Altman R. (ed.) (1992), Sound Theory, Sound Practice, New York, Routledge.

Berger P. L., Luckmann T. (1966), The social construction of reality: a treatise in sociology of knowledge, Garden City, NY, Anchor Books.

Bory S., Bory P. (2015), “I nuovi immaginari dell’intelligenza artificiale”, Im@go, 6, 1: 66-85.

Cavarero A. (2005), For more than one voice: toward a philosophy of vocal expression, Redwood City, Stanford University Press. Original edition A più voci. Filosofia dell’espressione vocale, 1998.

Connor S. (2000), Dumbstruck: A Cultural History of Ventriloquism, Oxford, Oxford University Press.

Connor S. (2004), “The Strains of Voice”, in Felderer B. (ed.), Phonorama: Eine Kulturgeschichte der Stimme als Medium, Berlin, Matthes and Seitz, 158-72.

Cox C. (2011), “Beyond Representation and Signification: Toward a Sonic Materialism”, Journal of Visual Culture, 10, 2: 145-161.

Davis, E. (1998), TechGnosis: Myth, Magic, and Mysticism in the Age of Information, New York, Harmony Books.

Derrida J. (1967), La voix et le phénomène, Parigi, PUF.

Di Scipio A. (2013), “Sound object? Sound event! Ideologies of sound and the biopolitics of music”, Soundscape. Journal of acoustic ecology, 13: 10-14.

Dolar M. (2006), A voice and Nothing More, Cambridge MA, MIT Press.

Durand G. (1999), The Anthropological Structures of the Imaginary, Brisbane, Boombana Publications. Original edition Les Structures Antropologiques de l’Imaginaire, 1960.

Dyson F. (2010), Sounding New Media: Immersion and Embodiement in the Arts and Culture, Berkeley, University of California Press.

Dyson F. (2014), The Tone of Our Times: Sound, Sense, Economy and Ecology, Cambridge MA, MIT Press.

Eidsheim N. S. (2018), The Race of Sound. Listening, Timbre and Vocality in African American Music, Durham and London, Duke University Press.

Erlmann V. (2004), Reason and Resonance. A History of Modern Aurality, New York, Zone Books.

Ernst W. (2012), Digital Memory and The Archive, Minneapolis, University of Minnesota Press.

Ernst W. (2018), “Radical Media Archaeology: Its Epistemology, Aesthetics and Case Studies”, Artnodes, 21: 35-43.

Ernst W. (2021), Technológos in being: Radical Media Archaeology & the Computational Machine, New York, Bloomsbury.

Faber L. (2020), The Computer’s Voice: From Star Trek to Siri, Minneapolis, University of Minnesota Press.

Flanagan J. L. (1972a), Speech Analysis, Synthesis and Perception, Berlin, Springer-Verlag.

Flanagan J. L., Rabiner L. R. (eds.) (1973), Speech Synthesis, Murray Hill (NJ), Bell Telephone Labs.

Flanagan J. L. (1982), “Talking with computers: synthesis and recognition of speech by computers”, IEEE Transactions on Biomedical Engineering, vol. BME-29, n. 4: 223-232.

Grimshaw-Aagaard M., Walther-Hansen M., Knakkergaard M. (eds.) (2019), The Oxford Handbook of Sound and Imagination, 2 volumes, Oxford Univeristy Press.

Garcia C. (2018), “Bringing a new voice to genius: MITalk, the Calltext 5010 and Stephen Hawkins’ wheelchair”, Computer History Museum, 26 March, https://computerhistory.org/blog/how-dectalk-gave-voice-to-a-genius-engineering-stephen-hawkings-wheelchair/ (retrieved 15/10/2021).

Gehl R.W., Bakardjieva M. (eds.) (2017), Socialbots and Their Friends. Digital Media and the Automation of Sociality, London and New York, Routledge.

Guzman A. L. (2015), Imagining the Voice in the Machine: The Ontology of Digital Social Agents, PhD Dissertation, Chicago, University of Illinois.

Hankins T. L., Silverman R. J. (1995), Instruments and the imagination, Princenton, Princenton University Press.

Hester H. (2016), “Technically female: Women, machines and hyperemployment”, Salvage Magazine, https://salvage.zone/in-print/technically-female-women-machines-and-hyperemployment/ (retrieved 19/11/2020).

Humphry J., Chesher C. (2021), “Preparing for smart voice assistants: Cultural histories and media innovations”, New Media & Society, 23, 7: 1971–1988.

Ingold T. (2011), “Four Objections to the Concept of Soundscape”, in T. Ingold, Being Alive. Essays on Movement, Knowledge and Description, New York, Routledge, 136-139.

Jasanoff S. (2015), “Future Imperfect: Science, Technology, and the Imaginations of Modernity”, in Jasanoff S. and Kim S.-H. (eds.), Dreamscapes of Modernity. Sociotechnical Imaginaries and the Fabrication of Power, Chicago and London, The University of Chicago Press, 1-33.

Jasanoff S., Kim S.-H. (2015), Dreamscapes of Modernity. Sociotechnical Imaginaries and the Fabrication of Power, Chicago and London, The University of Chicago Press.

Kluitenberg E. (2006), Book of Imaginary Media: Excavating the Dream of the Ultimate Communication Medium, Rotterdam, Nai Publishers.

Lingel J., Crawford K. (2020), “«Alexa, Tell Me About Your Mother»: The History of Secretary and the End of Secrecy”, Catalyst: Feminism, Theory, Technoscience, 6, 1: 1-25.

Lorenzo-Trueba J., Klimkov V. (2019), “Neural text-to-speech makes speech synthesizers much more versatile”, Amazon Science, https://www.amazon.science/blog/neural-text-to-speech-makes-speech-synthesizers-much-more-versatile (retrieved 10/01/2021).

Männistö-Funk T., Sihvonen T. (2018), “Voices from the Uncanny Valley: How Robots and Artificial Intelligences Talk Back to Us”, Digital Culture & Society, 4, 1: 45–64.

Marzo P. L. (2019), “La natura immaginaria del sociale: un percorso morfologico”, in Marzo P.L., Mori L. (eds.), Le vie sociali dell’immaginario. Per una sociologia del profondo, Milano, Mimesis.

McLuhan M. (1964), Understanding Media, New York, McGraw Hill.

Mills M. (2012), “Media and Prosthesis. The Vocoder, the Artificial Larynx, and the History of Signal Processing”, Qui Parle, 21, 1: 107-149.

Mooney J., Pinch T. (2021), “Sonic Imaginaries. How Hugh Davies and David Van Koevering performed electronic music’s future”, in Hennion A., Levaux C. (eds.), Rethinking Music Through Sound and Technology Studies, New York and London, Routledge.

Mosco V. (2004), The Digital Sublime. Myth, Power and the Cyberspace, Cambridge MA, The MIT Press.

Musso M. G. (2019), “Immaginario, tecnologia e mutamento sociale”, in Marzo P. L., Mori L. (eds.), Le vie sociali dell’immaginario. Per una sociologia del profondo, Milano, Mimesis.

Napolitano D. (2020a), “«Where’s the voice of the machine?» An ethnography of artificial voice socio-technical networks”, Etnografia e ricerca qualitativa, 3: 351-372.

Napolitano D. (2020b), “The Cultural Origins of Voice Cloning”, in Verdicchio M., Carvalhais M., Ribas L., Rangel A. (eds.), xCoAx 2020 Proceedings of the Eighth Conference on Computation, Communication, Aesthetics & X, 59-73.

Napolitano D. (2021), “Reuniting speech-impaired people with their voices: Sound technologies for disability and why they matter for organization studies”, puntOorg International Journal, 7, 1, 6-21.

Nass C., Brave S. (2005), Wired for Speech. How Voice Activates and Advances the Human-Computer Relationship, Cambridge (MA), MIT Press.

Natale S. (2021), Deceitful Media. Artificial Intelligence and Social Life after the Turing Test, New York, Oxford University Press.

Natale S., Pasulka D. (2019), Believing in Bits: Digital Media and the Supernatural, New York, Oxford University Press.

Parikka J. (2012), What is Media Archaeology?, Cambridge, Polity Press.

Peters J. D. (2004), “Helmholtz, Edison and Sound History”, in Rabinovitz L., Geil A. (eds.), Memory Bytes: History, Technology and Digital Culture, Durham and London, Duke University Press, 177-198.

Peters J. D. (2015), The Marvelous Clouds, Chicago, University of Chicago Press.

Phan T. (2019), “Amazon Echo and the Aesthetics of Whiteness”, Catalyst: Feminism, Theory, Technoscience, 5, 1, 1-39.

Poizat M. (2001), Vox populi, vox dei, Paris, Editions Métailié.

Scha R. (1992), “Virtual Voices”, Mediamatic, 7, 1, 27-42.

Schafer R. M. (1977), Soundscape: Our Sonic Environment and the Tuning of the World, Toronto, McClelland & Stewart Ltd.

Sterne J. (2003), The Audible Past. Origins of Sound Reproduction, Durham and London, Duke University Press.

Sterne J. (2011), “Theology of Sound: A Critique of Orality”, Canadian Journal of Communication, 36, 207-225.

Sterne J. (2012), “Sonic Imaginations”, in Sterne J. (ed.), The Sound Studies Reader, New York, Routledge, 1-17.

Strengers Y., Kennedy J. (2020), The Smart Wife: Why Siri, Alexa, and Other Smart Home Devices Need a Feminist Reboot, Cambridge MA, The MIT Press.

Taylor C. (2004), Modern Social Imaginaries, Durham and London, Duke University Press.

Vallee M. (2017). “Technology, Embodiment, and Affect in Voice Sciences: The Voice Is an Imaginary Organ”, Body and Society, 23, 2, 83-105.

Velasco-Pufleau L. (2021), “Music, Noise and Conflict: Sociotechnical Imaginaries, Acoustic Agency and Ontological Assumptions about Sound”, Journal of the Royal Musical Association, 146, 2, 501-508.

Woods H.S. (2018), Asking more of Siri and Alexa: feminine persona in service of surveillance capitalism, Critical Studies in Media Communication, 35, 4, 334-349.

Wunenburger J.-J. (1991), L’imaginaire, Paris, PUF.

Zuboff S. (2019), The Age of Surveillance Capitalism, New York, Public Affairs.




DOI: https://doi.org/10.7413/228181381947

Refbacks

  • There are currently no refbacks.


Im@go. A Journal of the Social Imaginary - Biannual - Edizioni Mimesis