Sonnets written by women in the Spanish Golden Age: a stylometric approach
DOI:
https://doi.org/10.24197/cel.16.2025.175-198Keywords:
stylometry, women writers, poetry, spanish Golden Age, topic modellingAbstract
This study presents a first quantitative approach to the poetic writing of female authorship in the Golden Age through the application of computational and stylometric methodologies, following the trail of international relevant works in the field of Computational Stylistics. The results obtained suggest that it is possible to differentiate between female and male writing in this genre and period. Additionally, a metapoetic motif in women’s poetry related to female writing was detected.
Downloads
References
Argamon, Shlomo, et al. (2003). “Gender, Genre, and Writing Style in Formal Written Texts”. Text & Talk, 23. 3, pp. 321-46.
Baranda Leturio, Nieves (2014). “Isabel de Vega, poeta con musa (Alcalá, 1558, 1568)”. Epos, 30, pp. 99-112.
Baranda, Nieves, et al. (2019) “BIESES. Escritoras de la Edad Moderna, desde la bibliografía a las redes”. En Leticia Sánchez Hernández (ed.), Mujeres en la Corte de los Austrias: una red social, cultural, religiosa y política. Madrid: Polifemo, pp. 55-82.
Bazzaco, Stefano (2021). “Experimentos de estilometría en el ámbito de los libros de caballerías. El caso de atribución de un original italiano: Il terzo libro di Palmerino d’Inghilterra (Portonari, 1559)”. En Meritxell Simó et al. (eds.).“Prenga xascú ço qui millor li és de mon dit”. Creació, recepció i representació de la literatura medieval. San Millán de la Cogolla: Cilengua. Centro Internacional de Investigación de la Lengua Española, pp. 149-166.
Bermúdez Sabel, Helena (2023). “Exploring genderlect markers in a corpus of Nineteenth century Spanish novels”. En Anne Baillot et al. (eds.). Digital Humanities 2023: Book of Abstracts. Graz: ADHO, pp. 121-123. DOI: https://doi.org/10.5281/zenodo.7961822.
Blasco Pascual, Francisco Javier y Ruiz Urbón, Cristina (2023). Análisis de textos desde la Estilometría. Salamanca: Ediciones Universidad de Salamanca.
Blei, David M. (2012). “Probabilistics Topic Models”. Communications of the ACM, 55, pp. 77-84.
Burrows, John (2007). “All the Way Through: Testing for Authorship in Different Frequency Strata”. Literary and Linguistic Computing, 22. 1, pp. 27-47.
Calvo Tello, José (2021). The Novel in the Spanish Silver Age. A Digital Analysis of Genre using Machine Learning. Bielefeld University Press.
Carvajal y Mendoza, Luisa de (1990). Poesías completas. María Luisa García-Nieto Onrrubia (ed.). Badajoz: Diputación Provincial de Badajoz.
Craig, Hugh y Kinney, Arthur F. (eds) (2009). Shakespeare, Computers, and the Mystery of Authorship. Cambridge: Cambridge University Press.
Cuéllar, Álvaro y Vega García-Luengos, Germán (2023). “La francesa Laura. El hallazgo de una nueva comedia del Lope de Vega último”. Anuario Lope de Vega Texto literatura cultura, 29, pp. 131-98. DOI: https://doi.org/10.5565/rev/anuariolopedevega.492.
Eder, Maciej (2017). “Short samples in authorship attribution: a new approach”. Digital Humanities 2017. Conference abstracts. McGill University & Université de Montréal. August 8-11, 2017. Montréal: ADHO, https://dh2017.adho.org/abstracts/341/341.pdf [16/01/2025].
Eder, Maciej et al. (2016). “Stylometry with R: A Package for Computational Text Analysis”. The R Journal, 8. 1, pp. 107-121, https://journal.r-project.org/archive/2016/RJ-2016-007/index.html [16/01/2025].
Evert, Stefan, et al. (2017). “Understanding and explaining Delta measures for authorship attribution”. Digital Scholarship in the Humanities, 32, pp. ii4-16.
Fernández López, María (2015). Obra poética completa. Martina Vinatea Recoba (ed.). New York: IDEA.
Fox, Gwyn (2008). Subtle Subversions: Reading Golden Age Sonnets by Iberian Women. Washington: Catholic University of America Press.
Fradejas Rueda, José Manuel (2016). “El análisis estilométrico aplicado a la literatura española: las novelas policiacas históricas”. Caracteres. Estudios culturales y críticos de la esfera digital, 5. 2, pp. 196-246, http://revistacaracteres.net/revista/vol5n2noviembre2016/analisis-estilometrico/ [16/01/2025].
García-Reidy, Alejandro (2019). “Deconstructing the Authorship of Siempre ayuda la verdad: A Play by Lope de Vega?” Neophilologus, 103, 4, pp. 493-510.
Gorría, Ana (ed.) (2018). Antología de poetas españolas: de la generación del 27 al siglo XV. Barcelona: Alba.
Grün, Bettina, y Hornik, Kurt (2011). “Topicmodels: An R Package for Fitting Topic Models”. Journal of Statistical Software, 40. 1, pp. 1-30. DOI: https://doi.org/10.18637/jss.v040.i13.
Hernández-Lorenzo, Laura (2019). “Poesía áurea, estilometría y fiabilidad: Métodos supervisados de atribución de autoría atendiendo al tamaño de las muestras”. Caracteres. Estudios culturales y críticos de la esfera digital, 8. 1, pp. 189-228, http://revistacaracteres.net/wp-content/uploads/2019/06/Caracteresvol8n1mayo2019-estilometria.pdf [16/01/2025].
Hernández-Lorenzo, Laura (2022). “Stylistic Change in Early Modern Spanish Poetry Through Network Analysis (with an Especial Focus on Fernando de Herrera’s Role)”. Neophilologus, 106, pp. 397-417. DOI: https://doi.org/10.1007/s11061-021-09717-2.
Hernández-Lorenzo, Laura, y Byszuk, Joanna (2023). “Challenging Stylometry: the authorship of the baroque play La Segunda Celestina”. Digital Scholarship in the Humanities, 38. 2, pp. 544-58. DOI: https://doi.org/10.1093/llc/fqac063.
Hoover, David L. (2013). “Textual Analysis”. En Kenneth M. Price y Ray Siemens (eds.). Literary Studies in the Digital Age. Modern Language Association of America, https://dlsanthology.mla.hcommons.org/textual-analysis/ [16/01/2025].
Jannidis, Fotis, et al., (eds.). Digital Humanities. Eine Einführung. Stuttgart: J.B. Metzler Verlag, 2017.
Jockers, Matthew L. (2013). Macroanalysis: Digital Methods and Literary History. Urbana: University of Illinois Press.
Jockers, Matthew L., y Mimno, David (2013). “Significant Themes in 19th-Century Literature”. Poetics, 41. 6, pp. 750-69.
Jockers, Matthew L., y Underwood, Ted (2016). “Text-Mining the Humanities”. En Susan Schreibman et al., A New Companion to Digital Humanities. Wiley-Blackwell, pp. 291-306.
Koolen, Corina W. (2018). Reading beyond the Female. The Relationship between Perception of Author Gender and Literary Quality. Amsterdam: Universiteit van Amsterdam.
Koppel, Moshe, et al. (2022) “Automatically Categorizing Written Texts by Author Gender”. Literary and Linguistic Computing, 17, 4, pp. 401-12. DOI: https://doi.org/10.1093/llc/17.4.401.
Kroll, Simon, y Sanz-Lázaro, Fernando (2022). “Romances teatrales entre Mira de Amescua, Calderón y Lope: Ritmo, asonancia y cuestiones de autoría”. Revista de Humanidades Digitales, 7, pp. 1-18. DOI: https://doi.org/10.5944/rhd.vol.7.2022.31620.
Lou, Andrés, et al. (2015) “Multilabel Subject-Based Classification of Poetry”. Proceedings of the Twenty-Eighth International Florida Artificial Intelligence Research Society Conference. Association for the Advancement of Artificial Intelligence, https://aaai.org/papers/187-flairs-2015-10372/ [16/01/2025].
Marcos, Mercedes (2022). Un mar de sed donde me anego: la obra poética de Ana de la Trinidad. Burgos: Grupo editorial Fonte.
Martos, María Dolores (2018). “La voz poética”. En Nieves Baranda y Anne J. Cruz (eds.). Las escritoras españolas de la Edad Moderna: historia y guía para la investigación. Madrid: UNED, pp. 225-48.
Mimno, David, et al. (2011). “Optimizing Semantic Coherence in Topic Models”. Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing. Edinburgh: Association for Computational Linguistics, pp. 262-72, https://aclanthology.org/D11-1024 [16/01/2025].
Moretti, Franco (2013). Distant Reading. London: Verso.
Navarro-Colorado, Borja (2018). “On Poetic Topic Modeling: Extracting Themes and Motifs From a Corpus of Spanish Poetry”. Frontiers in Digital Humanities, 5, https://www.frontiersin.org/article/10.3389/fdigh.2018.00015 [16/01/2025].
Navarro-Colorado, Borja, et al. (2016). “Metrical Annotation of a Large Corpus of Spanish Sonnets: Representation, Scansion and Evaluation.” En Nicoletta Calzolari et al. (eds.), Proceedings of the 10th edition of the Language Resources and Evaluation Conference, 23-28 May 2016, Portorož (Slovenia). Portorož: European Language Resources Association, pp. 4360-64, https://aclanthology.org/L16-1691/ [16/01/2025].
Olivares, Julián y Elizabeth Boyle (eds.) (2021). Tras el espejo la musa escribe: lírica femenina de los Siglos de Oro. Madrid: Siglo Veintiuno.
Pennebaker, James (2011). The Secret Life of Pronouns: What Our Words Say about Us. New York: Bloomsbury Press.
Piper, Andrew (2016). “There Will Be Numbers”. Cultural Analytics, 1. 1, pp. 1-10. DOI: https://doi.org/10.22148/16.006.
Ramírez de Guzmán, Catalina (2004). Obras poéticas. Joaquín de Entrambasaguas (ed.). Brenes: Muñoz Moya Editores Extremeños.
Savoy, Jacques (2018). “Is Starnone Really the Author behind Ferrante?” Digital Scholarship in the Humanities, 33. 4, pp. 902–918. DOI: https://doi.org/10.1093/llc/fqy016.
Schöch, Christof (2017). “Topic Modeling Genre: An Exploration of French Classical and Enlightenment Drama”. Digital Humanities Quarterly, 11, 2, http://www.digitalhumanities.org/dhq/vol/11/2/000291/000291.html [16/01/2025].
Sinclair, Stéfan y Geoffrey Rockwell (2016). “Text Analyzing and Visualization: Making the Meaning Count”. En Susan Schreibman, Ray Siemens y John Unsworth (eds.). A New Companion to Digital Humanities. Wiley-Blackwell, pp. 274-290.
Zayas, María de (2014). La traición en la amistad. Teresa Ferrer Valls (ed.). Alicante: Biblioteca Virtual Miguel de Cervantes, https://www.cervantesvirtual.com/obra/la-traicion-en-amistad/ [16/01/2025].
Zayas, María de (2022). Novelas amorosas y ejemplares. Julián Olivares (ed.). Madrid: Cátedra.
Weidman, Sean G. y O’Sullivan, James (2018). “The limits of distinctive words: Re-evaluating literature’s gender marker debate”. Digital Scholarship in the Humanities, 33. 2, pp. 374-390.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Laura Hernández-Lorenzo

This work is licensed under a Creative Commons Attribution 4.0 International License.
This journal enables free and immediate access to its content to foster global knowledge.

The articles published at Castilla. Estudios de Literatura will have a Creative Commons Attribution 4.0 International License (CC BY 4.0).
The authors continue as owners of their works, and can republish their articles in another medium without having to request authorization, as long as they indicate that the work was originally published in Castilla. Estudios de Literatura.

